Unrecoverable error???

Message boards : Number crunching : Unrecoverable error???

To post messages, you must log in.

Previous · 1 · 2

AuthorMessage
Moderator9
Volunteer moderator

Send message
Joined: 22 Jan 06
Posts: 1014
Credit: 0
RAC: 0
Message 11525 - Posted: 1 Mar 2006, 16:15:29 UTC - in response to Message 11491.  

A follow up on this. After making my screen saver go for 700 minutes before coming on and reducing both Rosetta and Ralph to 4 hours cpu time, Rosetta now appears to be running ok. Ralph still give Unrecoverable errors saying running out of disk space (with 210 GB available and 85% total usable), so have increased disk to 220 GB and 95% total usable and dropped the RALPH cpu time down to 2 hours to see what happens.

See http://ralph.bakerlab.org/workunit.php?wuid=10305 for the latest failed workunit.



I have reported the screen saver issue to David Kim, he may simply fix whatever is wrong or he may send a note about the issue after he looks at it.

The Ralph problem should be solved now. There was a problem being caused by the way BOINC does error reporting for certain kinds of errors. This should not happen with version 4.90 and above.


Moderator9
ROSETTA@home FAQ
Moderator Contact
ID: 11525 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
humanoid

Send message
Joined: 22 Dec 05
Posts: 4
Credit: 31,590
RAC: 0
Message 11551 - Posted: 2 Mar 2006, 6:05:47 UTC - in response to Message 11215.  

If you are using a recent version of the client, you can attach via the boinc manager by selecting "Attach to project" from the "Projects" pull down menu and go from there. The project url is:

http://ralph.bakerlab.org

or you can go to this web site and click on "Join RALPH@home".


RALPH is our new alpha testing project for R@h. We are using this project to help fix bugs, test new work units, and application updates, etc. If you attach a host that is consistently having problems with R@h, we will be able to try to trouble shoot and debug via RALPH.


OK, I've attached it to Ralph, anything else I need to do?




Wait for the next batch of Ralph test Work Units. Currently Ralph is between tests, so it may be a few daays before you see any workunits. When they arrive let the system run them, and report any errors in the RALPH error reporting forums.

{NOTE: This thread will soon be moved to the NUMBER CRUNCHING forum.}




Still getting "unrecoverable error" messages. Getting them when Ralph finishes up a work unit as well. This is really starting to piss me off...
ID: 11551 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Beringse
Avatar

Send message
Joined: 10 Oct 06
Posts: 20
Credit: 401,284
RAC: 0
Message 31444 - Posted: 20 Nov 2006, 0:00:44 UTC

Hi all,
Got this one a few minutes ago...

11/19/2006 3:40:37 PM|rosetta@home|Unrecoverable error for result PSH_0056_looprlx_GP120_OD1_138_148_6434_1404_3_0 ( - exit code 1073807364 (0x40010004))

Thanks for the input
ID: 31444 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 · 2

Message boards : Number crunching : Unrecoverable error???



©2024 University of Washington
https://www.bakerlab.org