Work unit way too slow, I think.

Questions and Answers : Unix/Linux : Work unit way too slow, I think.

To post messages, you must log in.

AuthorMessage
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 188
Credit: 6,431,332
RAC: 4,520
Message 13609 - Posted: 13 Apr 2006, 1:49:33 UTC
Last modified: 13 Apr 2006, 1:50:21 UTC

Work unit TRUNCATE_TERMINI_FULLRELAX_1ptq_433_996_0 is taking far too long. It has used 17:07:13 as I type this and it seems to be at 1.04% complete with 20:52:22 remaining. Normally, a work unit is complete long before this. Should I kill it, or what? And if so, how?
ID: 13609 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jean-David Beyer

Send message
Joined: 2 Nov 05
Posts: 188
Credit: 6,431,332
RAC: 4,520
Message 13627 - Posted: 13 Apr 2006, 10:46:57 UTC

P.S.:

If the Work Unit is really Hung -


1. suspend the Work Unit, BOINC Manager -> Work (tab) -> click on the Work Unit click the Suspend button (on the left hand side) then Resume button, wait for the computer to re-start the Work Unit (it will need to finish the new Work Unit it started, if it had another available) and see if it's still stuck, give it about 20min.

It took more than 20 minutes because BOINC client downloaded about
7 Predictor work units and had to do them first.

But after that the rosetta process got up to about 24 hours and still
did not progress.

2. Shutdown BOINC, restart BOINC see if the Work Unit is still stuck, give it about 20min.

After shutting down BOINC, the rosetta process kept on running,
with init as the parent. I killed it and then restarted BOINC.
In less than 60 seconds, the rosetta process got up to 1.01% but
I do not have hope for it.

3. Reboot your computer. See if the Work Unit is still stuck, give it about 20min.

I am not prepared to reboot the computer. What good would that do
that shutting down BOINC, killing any leftover BOINC-owned processes,
and restarting BOINC would already do?

4. Abort the Work Unit, BOINC Manager -> Work (tab) -> click on the Work Unit that's stuck click the Abort button (on the left hand side).

I will consider this if it is still stuck tomorrow.
ID: 13627 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 13648 - Posted: 13 Apr 2006, 14:29:09 UTC - in response to Message 13627.  

P.S.:

If the Work Unit is really Hung -


1. suspend the Work Unit, BOINC Manager -> Work (tab) -> click on the Work Unit click the Suspend button (on the left hand side) then Resume button, wait for the computer to re-start the Work Unit (it will need to finish the new Work Unit it started, if it had another available) and see if it's still stuck, give it about 20min.

It took more than 20 minutes because BOINC client downloaded about
7 Predictor work units and had to do them first.

But after that the rosetta process got up to about 24 hours and still
did not progress.

2. Shutdown BOINC, restart BOINC see if the Work Unit is still stuck, give it about 20min.

After shutting down BOINC, the rosetta process kept on running,
with init as the parent. I killed it and then restarted BOINC.
In less than 60 seconds, the rosetta process got up to 1.01% but
I do not have hope for it.

3. Reboot your computer. See if the Work Unit is still stuck, give it about 20min.

I am not prepared to reboot the computer. What good would that do
that shutting down BOINC, killing any leftover BOINC-owned processes,
and restarting BOINC would already do?

4. Abort the Work Unit, BOINC Manager -> Work (tab) -> click on the Work Unit that's stuck click the Abort button (on the left hand side).

I will consider this if it is still stuck tomorrow.


There was a problem with this class of WU, and they have been cancelled. See the thread in the number crunching forum on this subject.

Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 13648 · Rating: 1 · rate: Rate + / Rate - Report as offensive    Reply Quote

Questions and Answers : Unix/Linux : Work unit way too slow, I think.



©2024 University of Washington
https://www.bakerlab.org