Message boards : Number crunching : NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE ERRORS !!
Author | Message |
---|---|
Hoelder1in Send message Joined: 30 Sep 05 Posts: 169 Credit: 3,915,947 RAC: 0 |
I am sure someone must have noticed that all the NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE* WUs are erroring out after 30 seconds or so. This looks exactly like the "random seed error" we were having in late December shortly before the Holidays. I guess it would be best if the team would cancel this batch as soon as possible. |
Moderator9 Volunteer moderator Send message Joined: 22 Jan 06 Posts: 1014 Credit: 0 RAC: 0 |
I am sure someone must have noticed that all the NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE* WUs are erroring out after 30 seconds or so. This looks exactly like the "random seed error" we were having in late December shortly before the Holidays. I guess it would be best if the team would cancel this batch as soon as possible. I have notified David Kim and David baker about this issue and am making this a sticky until they can address it. Moderator9 ROSETTA@home FAQ Moderator Contact |
nasher Send message Joined: 5 Nov 05 Posts: 98 Credit: 618,288 RAC: 0 |
i am curently at sea so i cant go directly to my computer but lookin at my results i see errors on DEFAULT_RLX_NATIVE_1r69_280_178 DEFAULT_RLX_NATIVE_1hz6_280_146 OMEGA_WT_1.0_2tif_282_24 NO_SIM_ANNEAL_BARCODE_30_1n0u_251_9144 and also these that were completed sucesfully by another user NO_SIM_ANNEAL_BARCODE_30_1r69_251_19233 NO_SIM_ANNEAL_BARCODE_30_1n0u_251_9144 in my returned results... only errors in the past month or so i think hopefully we can sort out whatever errors are out there and hopefully we dont get many problems.. but i understand that they will always occour. wish i was at home so i could walk up to the computers and physicaly check the work units as oposed to useing the results for user like i am now |
nasher Send message Joined: 5 Nov 05 Posts: 98 Credit: 618,288 RAC: 0 |
another thing i just noticed the errors i had were all on the same computer.. this one while my other computers seem fine including an identical computer that runs the same software and such.. not sure what to make of it.. but i though it might be of note |
Rebel Alliance Send message Joined: 4 Nov 05 Posts: 50 Credit: 3,579,531 RAC: 0 |
I've had 5 in a row of these fail on my dual Opty 165 running stock at 1.8 NO_SIM_ANNEAL_BARCODE_30 On another machine I've had 3 of these fail OMEGA_WT_1.0_2tif_275_623 After checking I have found other machines that has had both of these work units failed. |
David Baker Volunteer moderator Project administrator Project developer Project scientist Send message Joined: 17 Sep 05 Posts: 705 Credit: 559,847 RAC: 0 |
I am sure someone must have noticed that all the NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE* WUs are erroring out after 30 seconds or so. This looks exactly like the "random seed error" we were having in late December shortly before the Holidays. I guess it would be best if the team would cancel this batch as soon as possible. we are trying to figure this out. in any event, only a small number of these were sent out, so the problem should disappear shortly. sorry! |
AMD_is_logical Send message Joined: 20 Dec 05 Posts: 299 Credit: 31,460,681 RAC: 0 |
I had the following WUs error out quickly, and they errored out for everyone else they were sent to: NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2tif_281_200 NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dcj_281_34 NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_18 DEFAULT_RLX_NATIVE_1r69_280_24 DEFAULT_RLX_NATIVE_1b72_280_76 DEFAULT_RLX_NATIVE_1dcj_280_54 DEFAULT_RLX_NATIVE_1b72_280_55 One of my linux nodes had two other WUs with a different problem. As both WUs were on the same node I can't be sure it isn't a problem with that node. However, my nodes run at stock, and have been checked with memtest86 and a brief run with super pi. I found the node hung at 60%. BOINC was responding to the manager, but wasn't incrementing the current time for the WU. The CPU was idle. Stopping and restarting BOINC allowed the WU to successfully finish and produce this result: NO_SIM_ANNEAL_BARCODE_30_1hz6_251_21191_0 https://boinc.bakerlab.org/rosetta/result.php?resultid=7794890 I then noticed that the node had failed an earlier WU: NO_SIM_ANNEAL_BARCODE_30_1dcj_251_6998_0 https://boinc.bakerlab.org/rosetta/result.php?resultid=7559941 These two results have similar entries in their stderr files, but I don't know what those numbers mean. |
Hoelder1in Send message Joined: 30 Sep 05 Posts: 169 Credit: 3,915,947 RAC: 0 |
DEFAULT_RLX_NATIVE_1hz6_285_2 completed successfully on my computer. So in my case, only the units with the name given in the title of the thread errored out after about 30 seconds. Oh, just noticed that this is a different batch number, _285_ instead of _280_, so may be this is the reason for the different behaviour... |
Angus Send message Joined: 17 Sep 05 Posts: 412 Credit: 321,053 RAC: 0 |
Today: 1/27/2006 4:35:10 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_80_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:35:10 PM||Rescheduling CPU: application exited 1/27/2006 4:35:10 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_80_0 finished 1/27/2006 4:35:11 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1n0u_281_74_0 using rosetta version 481 1/27/2006 4:35:40 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1n0u_281_74_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:35:40 PM||Rescheduling CPU: application exited 1/27/2006 4:35:40 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1n0u_281_74_0 finished 1/27/2006 4:35:40 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_74_0 using rosetta version 481 1/27/2006 4:36:07 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_74_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:36:07 PM||Rescheduling CPU: application exited 1/27/2006 4:36:07 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_74_0 finished 1/27/2006 4:36:07 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1di2_281_87_0 using rosetta version 481 1/27/2006 4:36:39 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1di2_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:36:39 PM||Rescheduling CPU: application exited 1/27/2006 4:36:39 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1di2_281_87_0 finished 1/27/2006 4:36:39 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_2tif_280_82_0 using rosetta version 481 1/27/2006 4:37:09 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_2tif_280_82_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:37:09 PM||Rescheduling CPU: application exited 1/27/2006 4:37:09 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_2tif_280_82_0 finished 1/27/2006 4:37:09 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_84_0 using rosetta version 481 1/27/2006 4:37:36 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_84_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:37:36 PM||Rescheduling CPU: application exited 1/27/2006 4:37:36 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1hz6_281_84_0 finished 1/27/2006 4:37:36 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_86_0 using rosetta version 481 1/27/2006 4:38:08 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_86_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:38:08 PM||Rescheduling CPU: application exited 1/27/2006 4:38:08 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_86_0 finished 1/27/2006 4:38:08 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_87_0 using rosetta version 481 1/27/2006 4:38:41 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:38:41 PM||Rescheduling CPU: application exited 1/27/2006 4:38:41 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_87_0 finished 1/27/2006 4:38:41 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_1hz6_280_83_0 using rosetta version 481 1/27/2006 4:39:08 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_1hz6_280_83_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:39:08 PM||Rescheduling CPU: application exited 1/27/2006 4:39:08 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_1hz6_280_83_0 finished 1/27/2006 4:39:08 PM||Allowing work fetch again. 1/27/2006 4:39:08 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_1dcj_280_86_0 using rosetta version 481 1/27/2006 4:39:43 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_1dcj_280_86_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:39:43 PM||Rescheduling CPU: application exited 1/27/2006 4:39:43 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_1dcj_280_86_0 finished 1/27/2006 4:39:43 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dcj_281_87_0 using rosetta version 481 1/27/2006 4:40:17 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dcj_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:40:17 PM||Rescheduling CPU: application exited 1/27/2006 4:40:17 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dcj_281_87_0 finished 1/27/2006 4:40:18 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_1mky_280_82_0 using rosetta version 481 1/27/2006 4:40:51 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_1mky_280_82_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:40:51 PM||Rescheduling CPU: application exited 1/27/2006 4:40:51 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_1mky_280_82_0 finished 1/27/2006 4:40:51 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_83_0 using rosetta version 481 1/27/2006 4:41:23 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_83_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:41:23 PM||Rescheduling CPU: application exited 1/27/2006 4:41:23 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1dtj_281_83_0 finished 1/27/2006 4:41:23 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_80_0 using rosetta version 481 1/27/2006 4:41:51 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_80_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:41:51 PM||Rescheduling CPU: application exited 1/27/2006 4:41:51 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_80_0 finished 1/27/2006 4:41:52 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1ogw_281_87_0 using rosetta version 481 1/27/2006 4:42:25 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1ogw_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:42:25 PM||Rescheduling CPU: application exited 1/27/2006 4:42:25 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1ogw_281_87_0 finished 1/27/2006 4:42:25 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_87_0 using rosetta version 481 1/27/2006 4:42:52 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:42:52 PM||Rescheduling CPU: application exited 1/27/2006 4:42:52 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_87_0 finished 1/27/2006 4:42:52 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_79_0 using rosetta version 481 1/27/2006 4:43:19 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_79_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:43:19 PM||Rescheduling CPU: application exited 1/27/2006 4:43:19 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_79_0 finished 1/27/2006 4:43:20 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_2tif_280_88_0 using rosetta version 481 1/27/2006 4:43:48 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_2tif_280_88_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:43:48 PM||Rescheduling CPU: application exited 1/27/2006 4:43:48 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_2tif_280_88_0 finished 1/27/2006 4:43:48 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2tif_281_87_0 using rosetta version 481 1/27/2006 4:44:16 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2tif_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:44:16 PM||Rescheduling CPU: application exited 1/27/2006 4:44:16 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2tif_281_87_0 finished 1/27/2006 4:44:16 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1mky_281_87_0 using rosetta version 481 1/27/2006 4:44:48 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1mky_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:44:48 PM||Rescheduling CPU: application exited 1/27/2006 4:44:48 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1mky_281_87_0 finished 1/27/2006 4:44:49 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1b72_281_87_0 using rosetta version 481 1/27/2006 4:45:15 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1b72_281_87_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:45:15 PM||Rescheduling CPU: application exited 1/27/2006 4:45:15 PM|rosetta@home|Computation for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_1b72_281_87_0 finished 1/27/2006 4:45:15 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_1mky_280_88_0 using rosetta version 481 1/27/2006 4:45:47 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_1mky_280_88_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:45:47 PM||Rescheduling CPU: application exited 1/27/2006 4:45:47 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_1mky_280_88_0 finished 1/27/2006 4:45:47 PM|rosetta@home|Starting result DEFAULT_RLX_NATIVE_1n0u_280_84_0 using rosetta version 481 1/27/2006 4:46:15 PM|rosetta@home|Unrecoverable error for result DEFAULT_RLX_NATIVE_1n0u_280_84_0 (Incorrect function. (0x1) - exit code 1 (0x1)) 1/27/2006 4:46:15 PM||Rescheduling CPU: application exited 1/27/2006 4:46:15 PM|rosetta@home|Computation for result DEFAULT_RLX_NATIVE_1n0u_280_84_0 finished 1/27/2006 4:46:15 PM|rosetta@home|Starting result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_86_0 using rosetta version 481 1/27/2006 4:46:43 PM|rosetta@home|Unrecoverable error for result NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE_2reb_281_86_0 (Incorrect function. (0x1) - exit code 1 (0x1)) Time for a <RESET> and off to a different project. Proudly Banned from Predictator@Home and now Cosmology@home as well. Added SETI to the list today. Temporary ban only - so need to work harder :) "You can't fix stupid" (Ron White) |
Rebel Alliance Send message Joined: 4 Nov 05 Posts: 50 Credit: 3,579,531 RAC: 0 |
NO_SIM_ANNEAL_BARCODE_30_2reb_278_2151 25 Jan 2006 4:40:09 UTC 28 Jan 2006 5:25:43 UTC Over Client error Computing 260,224.27 1,133.72 --- and OMEGA_WT_1.0_2reb_275_2901 24 Jan 2006 11:04:19 UTC 28 Jan 2006 5:26:21 UTC Over Client error Computing 224,124.79 883.46 Two different machines. Had to abort both |
Kevin Send message Joined: 15 Jan 06 Posts: 21 Credit: 109,496 RAC: 0 |
|
Steve Shedroff Send message Joined: 7 Nov 05 Posts: 11 Credit: 250,657 RAC: 0 |
NO_SIM_ANNEAL_BARCODE_30_2reb_286_4473_1 has run for over 20:13 hours and is but 1% complete. Looks like I should abort this one. |
Carlos_Pfitzner Send message Joined: 22 Dec 05 Posts: 71 Credit: 138,867 RAC: 0 |
NO_SIM_ANNEAL_BARCODE_30_2reb_286_4473_1 has run for over 20:13 hours and is but 1% complete. Looks like I should abort this one. *It is running ? Did u used taskinfo2000 -> (windows98) & all windows versions top -> linux windows security -> windows XP cntlr-alt-del (task manger) -> windows 2000 *and sorted by cpu usage -> to be sure that it is really "run for over x hours"? Mine WUs when stuck, does *not* use CPU -> 99.99% of IDLE time for the system The ones that are using CPU, will eventually finish OK Be patient ! Click signature for global team stats |
Message boards :
Number crunching :
NO_SIM_ANNEAL_BARCODE_30_RLX_NATIVE ERRORS !!
©2024 University of Washington
https://www.bakerlab.org