fail/cpn: (Database)Campaign no longer loses jobs

Up until now the JobServer was silently losing jobs and only claiming to be
finished - a workaround for this was to restart the campaign until all jobs
were finished according to the database and the campaign's output.
This change fixes the underlying problem, so a single campaign-run suffices
and does no longer lose any jobs.
Debugging this was awful and took us quite some time...

Change-Id: Ie6c982cc3b2ce11128941f1f13be563bae22565c
This commit is contained in:
Michael Lenz
2014-01-15 12:45:23 +01:00
parent abd9decf0b
commit 9c984b9704
3 changed files with 41 additions and 12 deletions

View File

@ -357,7 +357,7 @@ void CommThread::receiveExperimentResults(Minion& minion, FailControlMessage& ct
cout << "[Server] Received another result for workload id ["
<< ctrlmsg.workloadid(i) << "] -- ignored." << endl;
// TODO: Any need for error-handling here?
SocketComm::dropMsg(minion.getSocketDescriptor());
}
}