![]() Please wait while updating issue type...
Could not save your changes
This issue has been changed since you started editing it
Data that has been changed is highlighted in red below. Undo your changes to see the updated information
You have changed this issue, but haven't saved your changes yet. To save it, press the Save changes button to the right
This issue is blocking the next release
![]() There are no comments
There is nothing attached to this issue
This issue has no duplicates
There are no code checkins for this issue |
|||||||||||||||||||||||||||||||||||||||
Really delete this comment?
Really delete this comment?
Really delete this comment?
Really delete this comment?
Really delete this comment?
After this, the node reconnects properly after the driver is restarted, and the job is resubmitted and ends normally.
There is however one remaining problem: the socket error is only detected when the results are written. This means the node may wait for a very long time that the execution of the current job is complete. A possible mitigation would be to use the recovery mechanism, however I feel this isn't very satisfying.
The problem is that, while the job is being executed in the node, we are neither reading from, nor writing to the socket connection, so we can't detect that the connection is closed.
Really delete this comment?
Changes committed to SVN trunk revision 2547
The issue was updated with the following change(s):