christoph/fail - fail - Gitea: Git with a cup of tea

Author	SHA1	Message	Date
Horst Schirmeier	2c31bf79b0	jobclient: expect communication failures This change makes the JobClient act properly on communication aborts. Change-Id: I0a76489f117e9721546215e3b627002605e25452	2014-01-20 22:48:08 +01:00
Horst Schirmeier	882d4f381b	jobclient: bugfix: faster shutdown at campaign end The JobClient currently waits a LONG time until it really shuts down after not having reached the server in sendResultsToServer() (which is unfortunately the by far most probable point in the code to determine this): - A different bug (fixed in the previous commit) provoked the situation that a (way) too large amount of jobs was fetched before. - sendResult() (called after each experiment iteration) realized that CLIENT_JOB_REQUEST_SEC seconds are over, and tried to prematurely call home to send first results (without planning to get new jobs yet). - If the server was gone (done, or aborted), connect in sendResultsToServer() failed after several retries and timeouts. - All subsequent calls to sendResult() retried connecting to the server (again, with retries and timeouts), once for each remaining job. - When all jobs were done, getParam() tries to connect a last time, finally telling the experiment that nobody's home. This resulted in client shutdown times of up to four hours (for the default CLIENT_JOB_LIMIT of 1000) after the campaign server terminated. This change solves the issue by not handing out new (cached) jobs after the connect failed once, making the experiment terminate quickly. Change-Id: I0d8cb2e084d783aca74c51a503fa72eb2b2eb0b7	2014-01-20 22:48:08 +01:00
Horst Schirmeier	ee7bc23d85	jobclient: bugfix: initialize timing statistics If we don't properly initialize the job timing statistics, the number of jobs to be requested in the second request to the server is based on the wrong timings. In our test case, CLIENT_JOB_LIMIT jobs were requested at once. Change-Id: I7e9d8ab6fe14e4488b3a74baf061d9a07f3a77c4	2014-01-20 22:48:08 +01:00
Horst Schirmeier	1f6e275e5e	jobserver: bugfix: potential race Delay insertion of to-be-sent jobs into m_runningJobs until they are really sent, as getMessage() won't work anymore (as in: segfault) if this job is concurrently re-sent (due to campaign end), its result is received, and deleted in the campaign. This becomes non-hypothetical with larger values for CLIENT_JOB_LIMIT and CLIENT_JOB_REQUEST_SEC. Additionally, reinsert the remaining jobs into the input queue if communication fails, instead of inefficiently delaying redistribution until the campaign end. Change-Id: If85e3c8261deda86beb8d4d93343429223753f22	2014-01-20 22:48:08 +01:00
Horst Schirmeier	128b54b045	jobserver: outgoing jobqueue bounded by default Bounding the outgoing queue is always a good idea: If the campaign has separate threads for outgoing and incoming jobs (true for the DatabaseCampaign), this keeps memory requirements reasonable. If the campaign works in a single thread, this is not disadvantageous either. Change-Id: Ic75272daa8266f051adf7b23e2ffe87f5c965b86	2014-01-20 22:48:08 +01:00
Horst Schirmeier	73adc71437	jobserver: use non-blocking accept To allow the JobServer to shutdown properly, the accept() loop in JobServer::run() needs to regularly check whether we're done. This change introduces a timed, non-blocking variant of accept() into SocketComm to achieve this. Change-Id: Id411096be816c4ed6c7b0b37674410e22152eb22	2014-01-20 22:48:08 +01:00
Horst Schirmeier	8671669053	jobserver: join remaining threads on shutdown To avoid accessing destroyed resources in CommThreads talking to clients, we need to properly join them on shutdown. The m_CommMutex becomes a JobServer member to make sure it isn't destroyed before the JobServer itself. Change-Id: I35b9fb93ace08a7a9476650f8f5e93597a3a8aa0	2014-01-20 22:48:08 +01:00
Horst Schirmeier	8505ddbb04	jobserver: synchronization cleanup This change cleans up in/out queue synchronization in the job server. End-of-jobs conditions are now properly signaled through the SynchronizedQueue, allowing to resume and abort blocked readers when no more input is expected. Change-Id: I3eaf37115ccf8c5b5afe3d971c7109cd62b68906	2014-01-20 22:48:08 +01:00
Horst Schirmeier	5ac108ea4b	Merge branch 'mysql-concurrency-fixes'	2014-01-20 18:35:35 +01:00
Horst Schirmeier	84aac60a70	use libmysqlclient_r to ensure thread safety According to <http://dev.mysql.com/doc/refman/5.5/en/c-api-threaded-clients.html>, (potentially) threaded clients should use the reentrant libmysqlclient_r. This is just a precaution, I haven't seen any issues with the normal libmysqlclient. Change-Id: Icb29df6dd54eb666e3b43b73fbda406acccd11cb	2014-01-20 18:34:51 +01:00
Horst Schirmeier	8f9ee3fddd	DatabaseCampaign: run statistics update when finished Change-Id: Ib68e54ba82e988db0d2d74ffafa6dc9bd54cd272	2014-01-20 18:34:51 +01:00
Horst Schirmeier	33b63651ae	DatabaseCampaign: MySQL / concurrency fixes According to <http://dev.mysql.com/doc/refman/5.5/en/c-api-threaded-clients.html>, a MySQL connection handle must not be used concurrently with an open result set and mysql_use_result() in one thread (DatabaseCampaign::run()), and mysql_query() in another (DatabaseCampaign::collect_result_thread()). This indeed leads to crashes when bounding the outgoing job queue (SERVER_OUT_QUEUE_SIZE), and maybe even more insidious effects in other cases. The solution is to create separate connections for both threads. Additionally, call mysql_library_init() before spawning any threads. Change-Id: I2981f2fdc67c9a2cbe8781f1a21654418f621aeb	2014-01-20 18:34:51 +01:00
Michael Lenz	0534b503a6	Merge branch 'use_size_prefix-REMOVED'	2014-01-15 13:54:25 +01:00
Michael Lenz	9c984b9704	fail/cpn: (Database)Campaign no longer loses jobs Up until now the JobServer was silently losing jobs and only claiming to be finished - a workaround for this was to restart the campaign until all jobs were finished according to the database and the campaign's output. This change fixes the underlying problem, so a single campaign-run suffices and does no longer lose any jobs. Debugging this was awful and took us quite some time... Change-Id: Ie6c982cc3b2ce11128941f1f13be563bae22565c	2014-01-15 12:59:13 +01:00
Michael Lenz	abd9decf0b	fail/cpn: removed USE_SIZE_PREFIX from SocketComm This removes the ability to directly parse protobufs from the socket, because google::protobuf::Message::ParseFromFileDescriptor() needs a EOF after each message; thus preventing us from sending multiple Message objects over a single socket. Change-Id: I67c0f631071470d6e0ae597e42848036a6db3656	2014-01-15 12:56:38 +01:00
Christoph Borchert	0a5e54e9aa	ecos_kernel_test experiment bugfix: don't resume if 'experiment reached finish() before FI' Change-Id: Id0bb9400b8aa28307ed385a8c32b91b17254ba1c	2014-01-15 12:52:44 +01:00
Richard Hellwig	c0fe64ecd6	Merge "gem5: don't count instruction fetch as mem access"	2014-01-14 13:26:16 +01:00
Richard Hellwig	efbb6c6831	Merge "sal/gem5: getTimerTicks(), getTimerTicksPerSecond() implemented"	2014-01-14 12:45:13 +01:00
Richard Hellwig	f359364888	sal/gem5: getTimerTicks(), getTimerTicksPerSecond() implemented Change-Id: I01fdb5e4bdd61fc761e93ef77904c830131c9ed6	2014-01-14 12:13:55 +01:00
Richard Hellwig	f41247b143	gem5: don't count instruction fetch as mem access Change-Id: I6ea9811c132ef7c235d5a03486ca08afc842b51f	2014-01-06 15:54:02 +01:00
Richard Hellwig	34065fea60	weather-monitor: command line parameters are forwarded now Parameters that are specified on the command line are now also forwarded. Change-Id: I0e636f14dba43ef7877ce6e6deca1abb1f00a8a6	2014-01-03 16:38:02 +01:00
Michael Lenz	0907dfb0ae	weather-monitor: now is a DatabaseCampaign "removed" unnecessary memory-mapping ("Step 0"); cleaned out ExperimentData - now consists only of fsppilot and resultset; resultset now contains bitoffset which is part of result-table's primary key; adapted code to work with msg.fsppilot() instead of ExperimentData-values Change-Id: I3b310e7a71d4b28479028250cd5722b3b2ce9f8c	2013-12-11 14:38:01 +01:00
Martin Hoffmann	839913592a	Merge "Coding Guideline: Fixes."	2013-12-06 20:18:06 +01:00
Horst Schirmeier	ab9c0edf10	DatabaseCampaign: run jobs for known-outcome exps, too Although we know that a known_outcome=1 pilot does not exhibit behavior different from the golden run, the database schema does not yet know what this behavior looks like (in terms of result-table column values). In order to be able to JOIN valid results for all memory writes in the trace table (fspgroup maps them all onto one pilot per variant), we need to run these experiments, too. Additionally, don't join the fspgroup table; we only need this one for result calculations afterwards. Change-Id: Idcd2991274fede84526b1eee68a231774625d11a	2013-12-05 19:27:44 +01:00
Lars Rademacher	8b6d744a3e	import-trace: fix for using non-gzipped traces As non-gzipped trace files cause import-trace to always import zero events, the input file is now opened as in the dump-trace tool, where opening non-gzipped files obviously works fine. In the medium term we should find a centralized solution for this, instead of re-implementing it all over the place. Change-Id: I75845c03c0bbdc2b6b578b83d492b7dbbb40f051	2013-12-04 12:00:21 +01:00
Horst Schirmeier	85fffe007e	tracing: bugfix for enabled memory maps With the recent updates to record one additional instruction at the trace start, I broke memory-map handling (restrictMemoryAddresses() and restrictInstructionAddresses()). This change repairs this functionality. Change-Id: I0daf9f474d0efe3f8e30a168c0ccc1e993e7ddc6	2013-11-18 15:49:06 +01:00
Richard Hellwig	bd91549367	Merge "gem5: restore works now"	2013-11-13 17:20:53 +01:00
Richard Hellwig	45e0b41022	gem5: restore works now The function restore(PATH) can now be used to restore a checkpoint. Change-Id: I25faf9f6335261d2b3ade4185eae93983ece9f97	2013-11-13 17:15:19 +01:00
Richard Hellwig	f31548c026	Merge "core/sal: register issue fixed"	2013-11-13 16:08:40 +01:00
Martin Hoffmann	6fa0ae970b	Coding Guideline: Fixes. Sorry, for the small changesets. Change-Id: I12e7b1b4efff0c63020613e399f8185ace97aec7	2013-11-11 13:27:43 +01:00
Martin Hoffmann	cf95437e65	RealtimeLogger: Fixed coding guideline issues. Change-Id: I1172e0c60e2d6e895b4d3f99eb1a023c348bd3b3	2013-11-11 13:18:26 +01:00
Martin Hoffmann	8f0db45dfe	Exp: Base system for the real time systems lecture Change-Id: I3e5b8c6e60b57e6ec03500e9ee109fd5fb322cb2	2013-11-11 13:08:26 +01:00
Martin Hoffmann	4c7fcae6ad	plugins: A simple signal generator Listens on a configurable SUT's global variable. On read access a signal pattern value is calculated and sent back to the SUT. Currently, only a superimposable sine wave signal form is implemented. Further signal forms can be implemented by inheriting from the abstract SignalForm class. Change-Id: I2e6cf49cd44797999691c9e9cf0c54dd3c96875e	2013-11-11 13:07:42 +01:00
Martin Hoffmann	5d867be83b	plugins: RealtimeLogger plugin Logs access to a given global variable of the SUT, given by a symbol name, and outputs value when variable is written to file. Format: <Simulation time>;<Value of variable> Change-Id: I81b581e571be4255a1a2200c41e7c16657ddfd3d	2013-11-11 12:30:52 +01:00
Bjoern Doebel	443b3e4919	L4Sys: termination shortcuts Add two new breakpoints to L4Sys experiment that allow detecting that execution terminated with an error: vga_console_blink() is called by the kernel if JDB was entered (meaning we are hanging, e.g., due to an assertion); also longjmp() is only used by PF handling code after no valid page fault handling could be performed Change-Id: Ice61039c4bd07815a316bbc0bdb39f3483d9a1da	2013-11-06 17:37:20 +01:00
Bjoern Doebel	d4f22a38ff	L4Sys: EIP deviation tracking * after injecting a fault, track how many instructions it takes until execution deviates from original execution * also track what the first deviating EIP value is Change-Id: I18a9250517ca90214728c2c4b036b412f5dbf224	2013-11-06 17:37:20 +01:00
Bjoern Doebel	3db5d034a2	InstructionFilter: make unused EIP a default 0 parameter Change-Id: I972be71af70934eef98bc67b27e32a98ecb8be3b	2013-11-06 17:37:20 +01:00
Bjoern Doebel	a5866a68a2	add l4-sys ignore file Change-Id: Iea2228d8bafc2a3ecb4b6e26e2552813821a3d0b	2013-11-06 17:37:19 +01:00
Bjoern Doebel	71170145e0	Adapt l4-sys experiment to importer fix no need to decrement instruction offset before setting bp anymore Change-Id: I7f9c02349663899fa8f496a46bcb357bd567ac5c	2013-11-06 17:37:19 +01:00
Bjoern Doebel	63610d0652	L4sys: build fix experiment.hpp is parsed before l4sys.ph.h is generated -> remove dependency Change-Id: I128108e562877caca732ad43fdb65b12e56951f8	2013-11-06 17:37:19 +01:00
Richard Hellwig	3bf64351a4	core/sal: register issue fixed Before, it was not possible to add registers in arbitrary order. Change-Id: I952c03ea4339da2cdaf34bd4546c76c33cecd4cd	2013-11-01 17:26:26 +01:00
Horst Schirmeier	c000b50101	Merge branch 'tracing-off-by-one'	2013-10-28 18:37:07 +01:00
Christian Dietrich	5171645d9a	plugin/tracing: fix extended trace on unmapped memory areas When a register in the extended trace was dereferenced and the value was smaller than the memory pool size, but the address was not mapped, an assertion occurred and the tracing plugin terminated the simulator. Now the dereferenced memory address is checked for being mapped and not being smaller than the memory pool. Change-Id: I9ac954988ef860969679f9f360814c5e4b66f473	2013-10-28 15:09:35 +01:00
Christian Dietrich	148b09be2e	tools/import-trace: added ElfImporter The ElfImporter is not a real trace importer, but we locate it into the import-trace utility, since here the infrastructure is already in place to import things related to an elf binary into the database. The ElfImporter calls objdump and disassembles an elf binary and imports the results into the database. Change-Id: I6e35673c8dbee3b7e8dfc7549d10e5dca9b55935	2013-10-24 15:30:17 +02:00
Horst Schirmeier	c87075e598	Merge branch 'importtrace-reparse-parameters'	2013-10-23 15:51:21 +02:00
Bjoern Doebel	d97e3dfa8f	revert out-of-l4sys change Change-Id: I86b27aae6fa30992b485af79e767ec23949d1e62	2013-10-21 15:38:15 +02:00
Bjoern Doebel	a65c64791e	L4Sys experiment: add CR3 detection to prep run Change-Id: Iebbc0309695ee6a7bb8c68fd6ffa24b73ffd7ee5	2013-10-21 15:28:07 +02:00
Bjoern Doebel	77b2e208d0	L4Sys Experiment: more on address space tracing * introduce L4SYS_ADDRESS_SPACE_TRACE to indicate that we want to trace instructions in a different AS from the one we are starting the experiment in * add CR3Run() to determine address space ID Change-Id: I7bdaf1e858a6dd369af5175bd56e1b4e2d5f05ef	2013-10-21 15:28:07 +02:00
Bjoern Doebel	523f4a465b	add injection address to results Change-Id: I7966f97b8c09bbd6510ca6066dd40be398b54de3	2013-10-21 15:28:07 +02:00
Horst Schirmeier	f2d0919553	tracing: simplify confusing iponly/memonly configuration The internal m_iponly / m_memonly bools are a bit hackish; especially it's unclear what should happen if both are set. The m_tracetype enum now encompasses all possible configurations, while the plugin's user interface remains unchanged. Change-Id: Ibdd872b5cc5781836428b27bfb2db3825700e671	2013-10-17 19:09:54 +02:00
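
Note on the jobserver queue commits listed above (8505ddbb04, 128b54b045): the two ideas they describe, a bounded outgoing queue and an explicit end-of-jobs signal that wakes blocked readers, can be illustrated with a minimal sketch. This is not the project's SynchronizedQueue; the class name BoundedQueue and the methods enqueue(), dequeue() and close() are hypothetical names chosen for this illustration, and the real interface may differ.

```cpp
#include <cassert>
#include <condition_variable>
#include <cstddef>
#include <deque>
#include <mutex>
#include <thread>
#include <utility>

// Hypothetical illustration only; not FAIL*'s actual SynchronizedQueue.
template <typename T>
class BoundedQueue {
public:
    explicit BoundedQueue(std::size_t capacity) : m_capacity(capacity) {
        assert(capacity > 0);
    }

    // Blocks while the queue is full. Returns false if the queue was closed.
    bool enqueue(T item) {
        std::unique_lock<std::mutex> lock(m_mutex);
        m_notFull.wait(lock, [this] { return m_closed || m_items.size() < m_capacity; });
        if (m_closed) return false;
        m_items.push_back(std::move(item));
        m_notEmpty.notify_one();
        return true;
    }

    // Blocks while the queue is empty and still open. Returns false once the
    // queue is closed and drained, so a blocked reader can terminate.
    bool dequeue(T& out) {
        std::unique_lock<std::mutex> lock(m_mutex);
        m_notEmpty.wait(lock, [this] { return m_closed || !m_items.empty(); });
        if (m_items.empty()) return false;
        out = std::move(m_items.front());
        m_items.pop_front();
        m_notFull.notify_one();
        return true;
    }

    // Signals "no more input" and wakes every blocked reader and writer.
    void close() {
        std::lock_guard<std::mutex> lock(m_mutex);
        m_closed = true;
        m_notEmpty.notify_all();
        m_notFull.notify_all();
    }

private:
    const std::size_t m_capacity;
    bool m_closed = false;
    std::deque<T> m_items;
    std::mutex m_mutex;
    std::condition_variable m_notFull;
    std::condition_variable m_notEmpty;
};

// Usage: a producer pushes jobs 1..n through a queue of capacity 4 and then
// closes it; the consumer loop ends on its own once the queue is drained.
inline long sumJobs(int n) {
    BoundedQueue<int> queue(4);
    std::thread producer([&queue, n] {
        for (int i = 1; i <= n; ++i) queue.enqueue(i);
        queue.close();
    });
    long sum = 0;
    int job = 0;
    while (queue.dequeue(job)) sum += job;
    producer.join();
    return sum;
}
```

The capacity bound keeps a fast producer thread from buffering an unbounded number of jobs, and close() gives readers a definite termination condition instead of leaving them blocked when no more input is expected.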

961 Commits