Back to the main page.
Bug 139 - smartcpu is triggered simultaneously on multiple slaves that share a linux node
Status | CLOSED FIXED |
Reported | 2010-09-02 10:19:00 +0200 |
Modified | 2011-01-05 12:01:00 +0100 |
Product: | FieldTrip |
Component: | peer |
Version: | unspecified |
Hardware: | PC |
Operating System: | Mac OS |
Importance: | P1 minor |
Assigned to: | Robert Oostenveld |
URL: | |
Tags: | |
Depends on: | |
Blocks: | |
See also: |
Robert Oostenveld - 2010-09-02 10:19:28 +0200
if I do n = 100; peercellfun(@pause, repmat({3}, 1, n) ) in one matlab, and while(1); peerlist; pause(1); end in another, I observe multiple peerslaves switching from idle to zombie. This suggests that smartcpu is triggered simultaneously. Previously I had also observed a race condition between slaves, which caused them to alternate between idle and zombie.
Robert Oostenveld - 2010-09-07 17:18:36 +0200
this has been resolved by accumulating evidence over multiple calls prior to switching mode. Furthermore, there is now a freeze flag in smartcpu and smartmem, which is used by tcpsocket to prevent the switching of mode and/or memory available while a new job is arriving.
Robert Oostenveld - 2011-01-05 11:56:55 +0100
selected a long list of resolved bugs from roboos and changed the status into "RESOLVED"