Unrecoverable error

Gordon Hartman
Joined: 3 Nov 05
Posts: 4
Credit: 118,571
RAC: 0
Message 2568 - Posted: 7 Nov 2005, 12:35:52 UTC

I just started crunching and I've gotten this same error several times. Is this common?

11/7/2005 6:22:59 AM|rosetta@home|Unrecoverable error for result 1hz7A_abrelaxmode_random_gauss_length30_jitter02_21173_0 ( - exit code -164 (0xffffff5c))

stephan_t
Joined: 20 Oct 05
Posts: 129
Credit: 35,464
RAC: 0
Message 2572 - Posted: 7 Nov 2005, 14:47:59 UTC

See this thread:
here

On my boxes I never had any errors until the day one of my VMs decided to hog 100% cpu usage too... Rosetta didn't like that at all. But under normal usage (even gaming) it never errored. My two cents.
Team CFVault.com
http://www.cfvault.com

Doug Worrall
Joined: 19 Sep 05
Posts: 60
Credit: 58,445
RAC: 0
Message 2586 - Posted: 7 Nov 2005, 19:26:08 UTC - in response to Message 2568.  

I just started crunching and I've gotten this same error several times. Is this common?

11/7/2005 6:22:59 AM|rosetta@home|Unrecoverable error for result 1hz7A_abrelaxmode_random_gauss_length30_jitter02_21173_0 ( - exit code -164 (0xffffff5c))



Have only had errors for 2 days. UOD yesterday after turfing Rosetta, then reapplied
yesterday. Another 5 hours of errors. NO THANKS, the RAM usage will kill an XP box.
Thank God for Linux!

Gordon Hartman
Joined: 3 Nov 05
Posts: 4
Credit: 118,571
RAC: 0
Message 3627 - Posted: 18 Nov 2005, 20:52:56 UTC

WUs that were just downloaded gave these errors. Any ideas?

11/18/2005 8:39:32 AM|rosetta@home|Unrecoverable error for result 1n0u__abrelaxmode_random_length20_jitter02_omega_sim_aneal_08477_0 ( - exit code -1073741819 (0xc0000005))
11/18/2005 8:39:32 AM|rosetta@home|Unrecoverable error for result 1n0u__abrelaxmode_random_length20_jitter02_omega_sim_aneal_08432_0 ( - exit code -1073741819 (0xc0000005))
11/18/2005 3:38:03 PM|rosetta@home|Unrecoverable error for result 1n0u__abrelaxmode_random_length20_jitter02_omega_sim_aneal_08561_0 ( - exit code -1073741819 (0xc0000005))
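
A note on reading these codes: the decimal exit code and the hex value in parentheses are the same 32-bit number, printed once as a signed integer and once as unsigned hex. The Python sketch below only illustrates that conversion. The function names and the small lookup table are made up for this example and are not part of BOINC or Rosetta. The one entry in the table, 0xC0000005, is the Windows status code for an access violation; other codes seen in this thread (such as -164) are deliberately left unlabelled because the thread does not say what they mean.

```python
def to_unsigned_hex(exit_code: int) -> str:
    """Reinterpret a signed 32-bit exit code as unsigned and format it as hex."""
    return f"0x{exit_code & 0xFFFFFFFF:08x}"


def to_signed(hex_text: str) -> int:
    """Reinterpret a 32-bit hex string (e.g. '0xc0000005') as a signed integer."""
    value = int(hex_text, 16)
    # If the top bit is set, the value is negative in two's complement.
    return value - (1 << 32) if value & 0x80000000 else value


# Hypothetical lookup table for illustration; only codes we are sure of go here.
KNOWN_CODES = {
    0xC0000005: "Windows STATUS_ACCESS_VIOLATION",
}


def describe(exit_code: int) -> str:
    """Return a label for a known code, or 'unknown' otherwise."""
    return KNOWN_CODES.get(exit_code & 0xFFFFFFFF, "unknown")


if __name__ == "__main__":
    for code in (-1073741819, -164):
        print(code, to_unsigned_hex(code), describe(code))
```

Running it on the two codes reported so far shows that -1073741819 and 0xc0000005 are the same value, as are -164 and 0xffffff5c.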

themule
Joined: 6 Nov 05
Posts: 1
Credit: 150,575
RAC: 0
Message 3764 - Posted: 20 Nov 2005, 18:01:19 UTC - in response to Message 3627.  

Count me in also...Last 2 WUs errored out the same way.

Andrew Fuller
Joined: 2 Nov 05
Posts: 3
Credit: 23,692
RAC: 0
Message 3843 - Posted: 21 Nov 2005, 23:32:30 UTC

Looks like my Rosetta WU crunches are erroring out since Nov 19th.

Mac G5 dual, OSX 10.3.9, with BOINC Superbench menubar 4.44

G5 dual 2GHz OSX 10.3.9
G4 dual 867MHz OSX 10.4.3

dgnuff
Joined: 1 Nov 05
Posts: 350
Credit: 24,773,605
RAC: 0
Message 3853 - Posted: 22 Nov 2005, 0:50:04 UTC - in response to Message 2586.  

I just started crunching and I've gotten this same error several times. Is this common?

11/7/2005 6:22:59 AM|rosetta@home|Unrecoverable error for result 1hz7A_abrelaxmode_random_gauss_length30_jitter02_21173_0 ( - exit code -164 (0xffffff5c))



Have only had errors for 2 days. UOD yesterday after turfing Rosetta, then reapplied
yesterday. Another 5 hours of errors. NO THANKS, the RAM usage will kill an XP box.
Thank God for Linux!


As it happens, the Win32 client uses about 1/3 the memory that the Linux one does, because it only has one copy of the Rosetta .exe loaded at once, not three like my Linux box does.

So I'd have to say you've got it backwards. That should read "The Ram usage will kill a Linux Box. Thank God for Windows."

Seriously.

Gordon Hartman
Joined: 3 Nov 05
Posts: 4
Credit: 118,571
RAC: 0
Message 3905 - Posted: 22 Nov 2005, 11:55:39 UTC

Well, I quit crunching and detached Rosetta. I'm wasting CPU time; all I get is errors. I will check back from time to time!

Ulrich Metzner
Joined: 17 Sep 05
Posts: 22
Credit: 405,640
RAC: 0
Message 3907 - Posted: 22 Nov 2005, 11:58:13 UTC - in response to Message 3905.  

Well, I quit crunching and detached Rosetta. I'm wasting CPU time; all I get is errors. I will check back from time to time!
Same here: Nothing but errors... Just changed the resource share for Predictor instead :/
greetz, Uli

Andrew Fuller
Joined: 2 Nov 05
Posts: 3
Credit: 23,692
RAC: 0
Message 4311 - Posted: 26 Nov 2005, 1:59:19 UTC

It seems all my WUs are erroring out for the past 5 days. I saw a thread speculating that 5.25 was required to make Rosetta@home work on some Macs, but I don't want to stop using Altivec BOINC clients on my other projects. Whatever the case, I'll be detaching from this project until there's an answer or a fix.

G5 dual 2GHz OSX 10.3.9
G4 dual 867MHz OSX 10.4.3

Tern
Joined: 25 Oct 05
Posts: 576
Credit: 4,695,450
RAC: 5
Message 4315 - Posted: 26 Nov 2005, 3:08:22 UTC - in response to Message 4311.  

I saw a thread speculating that 5.25 was required to make Rosetta@home work on some Macs, but I don't want to stop using Altivec BOINC clients on my other projects. Whatever the case, I'll be detaching from this project until there's an answer or a fix.


I was running Rosetta on 4.72 on two Macs, with no problems - I detached those because they weren't fast enough, and the occasional WU came along that took 30+ hours, but they worked without any other errors. I'm having a problem now because I want to put Rosetta BACK on one of those Macs, but like you, I don't want to give up using an optimized BOINC for SETI and Einstein with the Altivec-optimized apps. It looks like I have a choice of not running Rosetta on that machine, or "cheating" on Rosetta, or accepting getting low credit on SETI and Einstein...

chrisjej
Joined: 24 Sep 05
Posts: 2
Credit: 28,127
RAC: 0
Message 4347 - Posted: 26 Nov 2005, 12:01:59 UTC

Well I'm using Windows 2000 and I seem to be 90% errors. I have suspended work units until I get some feeling I'm not just wasting CPU that could be better spent on other projects.

Regards
Chris

Webmaster Yoda
Joined: 17 Sep 05
Posts: 161
Credit: 162,253
RAC: 0
Message 4355 - Posted: 26 Nov 2005, 12:45:28 UTC - in response to Message 4347.  
Last modified: 26 Nov 2005, 12:46:13 UTC

Well I'm using Windows 2000 and I seem to be 90% errors.


What kinds of errors are you getting, Chris? Could you post a few lines from the BOINC log so we get some idea of what is happening? I get hardly any errors at all with my Windows 2000 host.

Contact me via the (your and my) team forum if you like.

*** Join BOINC@Australia today ***

mags
Joined: 22 Nov 05
Posts: 33
Credit: 108,630
RAC: 0
Message 4394 - Posted: 26 Nov 2005, 21:38:32 UTC

I too am getting these errors, which is very disappointing to say the least. In 4 days of [24/7] running Rosetta/BOINC, only 1 successful WU was returned.

Tern
Joined: 25 Oct 05
Posts: 576
Credit: 4,695,450
RAC: 5
Message 4396 - Posted: 26 Nov 2005, 22:49:53 UTC
Last modified: 26 Nov 2005, 22:51:38 UTC

Since no one is posting the error messages, I pulled one from mags' latest result:

<core_client_version>5.2.7</core_client_version>
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# =====================================
# random seed: 1062701
# =====================================
No heartbeat from core client for 31 sec - exiting
# =====================================
# random seed: 1081761
# =====================================
# =====================================
# random seed: 1389881
# =====================================
No heartbeat from core client for 31 sec - exiting
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x7C910F29 read attempt to address 0xBFE9EBF1
1: 11/26/05 20:55:52
</stderr_txt>

To track this down completely, we'll need to see the relevant part of either stderrdae.txt or stdouterr.txt - I'm running an AMD/Windows PC on Rosetta and have done a large number of results named similar to the one that errored out above, but have never had an error... I'm running 5.2.8 instead of 5.2.7, but I would be surprised if that was the problem; are others that are seeing this running 5.2.7?

EDIT::: No, not a 5.2.7 problem; chris is on 5.2.10 in this latest result:

<core_client_version>5.2.10</core_client_version>
<message>
- exit code -164 (0xffffff5c)
</message>
<stderr_txt>
# =====================================
# random seed: 1243341
# =====================================
# =====================================
# random seed: 1459201
# =====================================
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x77F9D45A read attempt to address 0xBF33E549
</stderr_txt>

Spectre
Joined: 1 Nov 05
Posts: 20
Credit: 177,671
RAC: 0
Message 4397 - Posted: 26 Nov 2005, 22:50:36 UTC - in response to Message 4394.  

I too am getting these errors, which is very disappointing to say the least. In 4 days of [24/7] running Rosetta/BOINC, only 1 successful WU was returned.


Same here... all of my boxen are getting client errors... sometimes it's 2-3 hours into a workunit before it bombs out and starts another one...

Someone needs to fix this ASAP.

Spectre

Tern
Joined: 25 Oct 05
Posts: 576
Credit: 4,695,450
RAC: 5
Message 4399 - Posted: 26 Nov 2005, 22:52:51 UTC - in response to Message 4397.  

Same here..


Here is Spectre's latest:

<core_client_version>5.2.7</core_client_version>
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
# =====================================
# random seed: 1013501
# =====================================
***UNHANDLED EXCEPTION****
Reason: Access Violation (0xc0000005) at address 0x00404BC6 read attempt to address 0x6470402E
1: 11/26/05 16:31:31
</stderr_txt>
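
For anyone comparing several of these results side by side, the exception line in stderr_txt has a regular shape, so the addresses can be pulled out mechanically. The following is a rough Python sketch, not anything the project provides. The pattern is inferred only from the three samples posted in this thread, the names `EXCEPTION_RE` and `parse_exception` are invented for illustration, and accepting "write" as well as "read" is an assumption, since only "read" appears here.

```python
import re
from typing import Optional

# Pattern inferred from the stderr samples in this thread (an assumption,
# not a documented format). "write" is allowed as a guess; only "read" was seen.
EXCEPTION_RE = re.compile(
    r"Reason: (?P<reason>.+?) \((?P<code>0x[0-9a-fA-F]+)\) "
    r"at address (?P<at>0x[0-9a-fA-F]+) "
    r"(?P<access>read|write) attempt to address (?P<target>0x[0-9a-fA-F]+)"
)


def parse_exception(stderr_text: str) -> Optional[dict]:
    """Return the fields of the first exception line found, or None."""
    match = EXCEPTION_RE.search(stderr_text)
    return match.groupdict() if match else None


if __name__ == "__main__":
    sample = (
        "***UNHANDLED EXCEPTION**** Reason: Access Violation (0xc0000005) "
        "at address 0x00404BC6 read attempt to address 0x6470402E"
    )
    print(parse_exception(sample))
```

Collecting the "at address" values this way makes it easy to see that the three results posted so far faulted at three different addresses.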


Spectre
Joined: 1 Nov 05
Posts: 20
Credit: 177,671
RAC: 0
Message 4400 - Posted: 26 Nov 2005, 22:53:00 UTC - in response to Message 4397.  

2005-11-25 11:06:21 [rosetta@home] Unrecoverable error for result 1dcj__abrelax_rand_len10_jit02_omega_sim_14068_0 (There are no child processes to wait for. (0x80) - exit code 128 (0x80))
2005-11-25 16:22:42 [rosetta@home] Unrecoverable error for result 1dcj__abrelax_rand_len10_jit02_omega_sim_29809_0 ( - exit code -1073741819 (0xc0000005))
2005-11-25 18:53:07 [rosetta@home] Unrecoverable error for result 1dcj__abrelax_rand_len10_jit02_omega_sim_04919_0 (There are no child processes to wait for. (0x80) - exit code 128 (0x80))
2005-11-25 22:17:23 [rosetta@home] Unrecoverable error for result 1ogw__abrelax_rand_len10_jit02_omega_sim_02007_0 ( - exit code -1073741819 (0xc0000005))
2005-11-26 14:03:51 [rosetta@home] Unrecoverable error for result 1dtj__abrelax_rand_len10_jit02_omega_sim_04980_0 ( - exit code -1073741819 (0xc0000005))
2005-11-26 16:31:32 [rosetta@home] Unrecoverable error for result 1ogw__abrelax_rand_len10_jit02_omega_sim_05727_0 ( - exit code -1073741819 (0xc0000005))
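
With a longer log, it can help to count how often each exit code shows up before posting. Below is a small Python sketch of that idea, under the assumption that the log lines look like the ones quoted above. The regular expression and the names `LINE_RE` and `tally_exit_codes` are made up for this example and are not a BOINC tool.

```python
import re
from collections import Counter

# Matches the "Unrecoverable error" lines as they appear in this thread.
# The optional text before "- exit code" (e.g. "There are no child
# processes to wait for. (0x80)") is skipped by the ".*".
LINE_RE = re.compile(
    r"Unrecoverable error for result (?P<result>\S+) "
    r"\(.*- exit code (?P<code>-?\d+) \((?P<hex>0x[0-9a-fA-F]+)\)\)"
)


def tally_exit_codes(log_text: str) -> Counter:
    """Count the occurrences of each decimal exit code in the log text."""
    counts = Counter()
    for line in log_text.splitlines():
        match = LINE_RE.search(line)
        if match:
            counts[int(match.group("code"))] += 1
    return counts


if __name__ == "__main__":
    import sys

    # Usage (file name is whatever log you saved): python tally.py < log.txt
    for code, count in tally_exit_codes(sys.stdin.read()).most_common():
        print(f"{count:4d}  exit code {code}")
```

Applied to the six lines above, this would report exit code -1073741819 four times and exit code 128 twice.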

mags
Joined: 22 Nov 05
Posts: 33
Credit: 108,630
RAC: 0
Message 4423 - Posted: 27 Nov 2005, 8:47:47 UTC

Can anyone translate these errors for me?

I had another 18 hours of wasted crunching ............:(

The Colourful Jester
Joined: 26 Sep 05
Posts: 1
Credit: 46,150
RAC: 0
Message 4430 - Posted: 27 Nov 2005, 11:20:00 UTC

27/11/2005 5:44:10 PM|rosetta@home|Unrecoverable error for result 1dtj__abrelax_rand_len10_jit02_omega_sim_29630_0 ( - exit code -1073741819 (0xc0000005))


That's pretty much all I get as well. Has anyone from the project said anything about it yet?



©2024 University of Washington
https://www.bakerlab.org