Wheresatom
[H]ard|Gawd
- Joined
- Mar 20, 2007
- Messages
- 1,390
I think I got a bad work unit. I haven't had any problems until this morning, though I won't rule out the possibility that my machine is unstable. Here is the log to help you guys advise my next step.
Code:
[01:00:53] + Attempting to get work packet
[01:00:53] - Connecting to assignment server
[01:00:54] - Successful: assigned to (171.67.108.11).
[01:00:54] + News From Folding@Home: GPU folding beta
[01:00:54] Loaded queue successfully.
[01:00:55] + Could not connect to Work Server
[01:00:55] - Attempt #2 to get work failed, and no other work to do.
Waiting before retry.
[01:01:11] + Attempting to get work packet
[01:01:11] - Connecting to assignment server
[01:01:11] - Successful: assigned to (171.67.108.11).
[01:01:11] + News From Folding@Home: GPU folding beta
[01:01:12] Loaded queue successfully.
[01:01:14] + Closed connections
[01:01:14]
[01:01:14] + Processing work unit
[01:01:14] Core required: FahCore_11.exe
[01:01:14] Core found.
[01:01:14] Working on queue slot 04 [January 14 01:01:14 UTC]
[01:01:14] + Working ...
[01:01:14]
[01:01:14] *------------------------------*
[01:01:14] Folding@Home GPU Core - Beta
[01:01:14] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:01:14]
[01:01:14] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:01:14] Build host: amoeba
[01:01:14] Board Type: Nvidia
[01:01:14] Core :
[01:01:14] Preparing to commence simulation
[01:01:14] - Looking at optimizations...
[01:01:14] - Created dyn
[01:01:14] - Files status OK
[01:01:14] - Expanded 43861 -> 252912 (decompressed 576.6 percent)
[01:01:14] Called DecompressByteArray: compressed_data_size=43861 data_size=252912, decompressed_data_size=252912 diff=0
[01:01:14] - Digital signature verified
[01:01:14]
[01:01:14] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:01:14]
[01:01:14] Assembly optimizations on if available.
[01:01:14] Entering M.D.
[01:01:20] Working on Protein
[01:01:21] Client config found, loading data.
[01:01:21] mdrun_gpu returned
[01:01:21] NANs detected on GPU
[01:01:21]
[01:01:21] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:01:24] CoreStatus = 7A (122)
[01:01:24] Sending work to server
[01:01:24] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:01:24] - Read packet limit of 540015616... Set to 524286976.
[01:01:24] - Error: Could not get length of results file work/wuresults_04.dat
[01:01:24] - Error: Could not read unit 04 file. Removing from queue.
[01:01:24] - Preparing to get new work unit...
[01:01:24] + Attempting to get work packet
[01:01:24] - Connecting to assignment server
[01:01:25] - Successful: assigned to (171.67.108.11).
[01:01:25] + News From Folding@Home: GPU folding beta
[01:01:25] Loaded queue successfully.
[01:01:27] + Closed connections
[01:01:32]
[01:01:32] + Processing work unit
[01:01:32] Core required: FahCore_11.exe
[01:01:32] Core found.
[01:01:32] Working on queue slot 05 [January 14 01:01:32 UTC]
[01:01:32] + Working ...
[01:01:32]
[01:01:32] *------------------------------*
[01:01:32] Folding@Home GPU Core - Beta
[01:01:32] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:01:32]
[01:01:32] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:01:32] Build host: amoeba
[01:01:32] Board Type: Nvidia
[01:01:32] Core :
[01:01:32] Preparing to commence simulation
[01:01:32] - Looking at optimizations...
[01:01:32] - Created dyn
[01:01:32] - Files status OK
[01:01:32] - Expanded 43861 -> 252912 (decompressed 576.6 percent)
[01:01:32] Called DecompressByteArray: compressed_data_size=43861 data_size=252912, decompressed_data_size=252912 diff=0
[01:01:32] - Digital signature verified
[01:01:32]
[01:01:32] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:01:32]
[01:01:32] Assembly optimizations on if available.
[01:01:32] Entering M.D.
[01:01:38] Working on Protein
[01:01:40] Client config found, loading data.
[01:01:40] Starting GUI Server
[01:01:40] mdrun_gpu returned
[01:01:40] NANs detected on GPU
[01:01:40]
[01:01:40] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:01:42] CoreStatus = 7A (122)
[01:01:42] Sending work to server
[01:01:42] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:01:42] - Read packet limit of 540015616... Set to 524286976.
[01:01:42] - Error: Could not get length of results file work/wuresults_05.dat
[01:01:42] - Error: Could not read unit 05 file. Removing from queue.
[01:01:42] - Preparing to get new work unit...
[01:01:42] + Attempting to get work packet
[01:01:42] - Connecting to assignment server
[01:01:43] - Successful: assigned to (171.67.108.11).
[01:01:43] + News From Folding@Home: GPU folding beta
[01:01:43] Loaded queue successfully.
[01:01:44] + Closed connections
[01:01:49]
[01:01:49] + Processing work unit
[01:01:49] Core required: FahCore_11.exe
[01:01:49] Core found.
[01:01:49] Working on queue slot 06 [January 14 01:01:49 UTC]
[01:01:49] + Working ...
[01:01:50]
[01:01:50] *------------------------------*
[01:01:50] Folding@Home GPU Core - Beta
[01:01:50] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:01:50]
[01:01:50] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:01:50] Build host: amoeba
[01:01:50] Board Type: Nvidia
[01:01:50] Core :
[01:01:50] Preparing to commence simulation
[01:01:50] - Looking at optimizations...
[01:01:50] - Created dyn
[01:01:50] - Files status OK
[01:01:50] - Expanded 43861 -> 252912 (decompressed 576.6 percent)
[01:01:50] Called DecompressByteArray: compressed_data_size=43861 data_size=252912, decompressed_data_size=252912 diff=0
[01:01:50] - Digital signature verified
[01:01:50]
[01:01:50] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:01:50]
[01:01:50] Assembly optimizations on if available.
[01:01:50] Entering M.D.
[01:01:56] Working on Protein
[01:01:57] Client config found, loading data.
[01:01:57] mdrun_gpu returned
[01:01:57] NANs detected on GPU
[01:01:57]
[01:01:57] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:02:00] CoreStatus = 7A (122)
[01:02:00] Sending work to server
[01:02:00] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:02:00] - Read packet limit of 540015616... Set to 524286976.
[01:02:00] - Error: Could not get length of results file work/wuresults_06.dat
[01:02:00] - Error: Could not read unit 06 file. Removing from queue.
[01:02:00] - Preparing to get new work unit...
[01:02:00] + Attempting to get work packet
[01:02:00] - Connecting to assignment server
[01:02:00] - Successful: assigned to (171.67.108.11).
[01:02:00] + News From Folding@Home: GPU folding beta
[01:02:01] Loaded queue successfully.
[01:02:02] + Closed connections
[01:02:07]
[01:02:07] + Processing work unit
[01:02:07] Core required: FahCore_11.exe
[01:02:07] Core found.
[01:02:07] Working on queue slot 07 [January 14 01:02:07 UTC]
[01:02:07] + Working ...
[01:02:08]
[01:02:08] *------------------------------*
[01:02:08] Folding@Home GPU Core - Beta
[01:02:08] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:02:08]
[01:02:08] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:02:08] Build host: amoeba
[01:02:08] Board Type: Nvidia
[01:02:08] Core :
[01:02:08] Preparing to commence simulation
[01:02:08] - Looking at optimizations...
[01:02:08] - Created dyn
[01:02:08] - Files status OK
[01:02:08] - Expanded 43861 -> 252912 (decompressed 576.6 percent)
[01:02:08] Called DecompressByteArray: compressed_data_size=43861 data_size=252912, decompressed_data_size=252912 diff=0
[01:02:08] - Digital signature verified
[01:02:08]
[01:02:08] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:02:08]
[01:02:08] Assembly optimizations on if available.
[01:02:08] Entering M.D.
[01:02:14] Working on Protein
[01:02:15] Client config found, loading data.
[01:02:15] mdrun_gpu returned
[01:02:15] NANs detected on GPU
[01:02:15]
[01:02:15] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:02:18] CoreStatus = 7A (122)
[01:02:18] Sending work to server
[01:02:18] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:02:18] - Read packet limit of 540015616... Set to 524286976.
[01:02:18] - Error: Could not get length of results file work/wuresults_07.dat
[01:02:18] - Error: Could not read unit 07 file. Removing from queue.
[01:02:18] - Preparing to get new work unit...
[01:02:18] + Attempting to get work packet
[01:02:18] - Connecting to assignment server
[01:02:18] - Successful: assigned to (171.67.108.11).
[01:02:18] + News From Folding@Home: GPU folding beta
[01:02:19] Loaded queue successfully.
[01:02:21] + Closed connections
[01:02:26]
[01:02:26] + Processing work unit
[01:02:26] Core required: FahCore_11.exe
[01:02:26] Core found.
[01:02:26] Working on queue slot 08 [January 14 01:02:26 UTC]
[01:02:26] + Working ...
[01:02:26]
[01:02:26] *------------------------------*
[01:02:26] Folding@Home GPU Core - Beta
[01:02:26] Version 1.19 (Mon Nov 3 09:34:13 PST 2008)
[01:02:26]
[01:02:26] Compiler : Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 14.00.50727.762 for 80x86
[01:02:26] Build host: amoeba
[01:02:26] Board Type: Nvidia
[01:02:26] Core :
[01:02:26] Preparing to commence simulation
[01:02:26] - Looking at optimizations...
[01:02:26] - Created dyn
[01:02:26] - Files status OK
[01:02:26] - Expanded 43861 -> 252912 (decompressed 576.6 percent)
[01:02:26] Called DecompressByteArray: compressed_data_size=43861 data_size=252912, decompressed_data_size=252912 diff=0
[01:02:26] - Digital signature verified
[01:02:26]
[01:02:26] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:02:26]
[01:02:26] Assembly optimizations on if available.
[01:02:26] Entering M.D.
[01:02:33] Working on Protein
[01:02:33] Client config found, loading data.
[01:02:34] mdrun_gpu returned
[01:02:34] NANs detected on GPU
[01:02:34]
[01:02:34] Folding@home Core Shutdown: UNSTABLE_MACHINE
[01:02:37] CoreStatus = 7A (122)
[01:02:37] Sending work to server
[01:02:37] Project: 5766 (Run 0, Clone 288, Gen 0)
[01:02:37] - Read packet limit of 540015616... Set to 524286976.
[01:02:37] - Error: Could not get length of results file work/wuresults_08.dat
[01:02:37] - Error: Could not read unit 08 file. Removing from queue.
[01:02:37] EUE limit exceeded. Pausing 24 hours.
[04:04:59] + Working...
[10:04:56] + Working...
So, since it didn't even get out of the gate, I figure that's a bad work unit, right? What do I do now? Restart the client? Do I have to delete the queue and work folder? It has been a while since I had a problem, so I forget how to handle it.
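In case it helps anyone checking a longer log, here is a rough Python sketch that tallies UNSTABLE_MACHINE shutdowns per work unit. It assumes log lines shaped like the ones pasted above. The function name `count_unstable_shutdowns` and the `sample` list are made up for illustration and are not part of the Folding@Home client.

```python
import re
from collections import Counter

# Matches lines such as "[01:01:14] Project: 5766 (Run 0, Clone 288, Gen 0)"
PROJECT_RE = re.compile(r"Project: (\d+) \(Run (\d+), Clone (\d+), Gen (\d+)\)")


def count_unstable_shutdowns(log_lines):
    """Count UNSTABLE_MACHINE shutdowns per (project, run, clone, gen).

    Hypothetical helper: it only assumes the log format shown in this post.
    """
    failures = Counter()
    current = None  # most recent work unit named in the log
    for line in log_lines:
        match = PROJECT_RE.search(line)
        if match:
            current = tuple(int(group) for group in match.groups())
        elif "Core Shutdown: UNSTABLE_MACHINE" in line and current is not None:
            failures[current] += 1
    return failures


# A few lines copied from the log above, used as sample input.
sample = [
    "[01:01:14] Project: 5766 (Run 0, Clone 288, Gen 0)",
    "[01:01:21] NANs detected on GPU",
    "[01:01:21] Folding@home Core Shutdown: UNSTABLE_MACHINE",
    "[01:01:32] Project: 5766 (Run 0, Clone 288, Gen 0)",
    "[01:01:40] NANs detected on GPU",
    "[01:01:40] Folding@home Core Shutdown: UNSTABLE_MACHINE",
]

for unit, count in count_unstable_shutdowns(sample).items():
    print(unit, count)
```

In the log above, all five shutdowns are on the same unit: Project 5766 (Run 0, Clone 288, Gen 0). My guess, and it is only a guess, is that repeated failures on one unit are more consistent with a bad work unit, while failures spread across different units would point more toward the machine.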