File IO error, can't fold

amdgamer

Supreme [H]ardness
Joined
Oct 27, 2004
Messages
4,880
Hey guys, my machine started doing this after it apparently finished its last bigadv, and it kept going all night long. I've tried everything, including redownloading the client and deleting all information, but nothing fixes it; it just keeps doing this over and over again.

This went on all night, and I will be so sad if it knocks me below the 80% completion requirement, as it would take me a very long time to get back up to it. I suspect I have over a hundred failed work units, since it restarted over and over all night long.

Code:
[14:25:19] 
[14:25:19] Folding@home Core Shutdown: FILE_IO_ERROR
[14:25:24] CoreStatus = 75 (117)
[14:25:24] Error opening or reading from a file.
[14:25:24] Deleting current work unit & continuing...
[14:25:28] Trying to send all finished work units
[14:25:28] + No unsent completed units remaining.
[14:25:28] - Preparing to get new work unit...
[14:25:28] Cleaning up work directory
[14:25:28] + Attempting to get work packet
[14:25:28] Passkey found
[14:25:28] - Will indicate memory of 6142 MB
[14:25:28] - Connecting to assignment server
[14:25:28] Connecting to http://assign.stanford.edu:8080/
[14:25:28] Posted data.
[14:25:28] Initial: ED82; - Successful: assigned to (130.237.232.141).
[14:25:28] + News From Folding@Home: Welcome to Folding@Home
[14:25:28] Loaded queue successfully.
[14:25:28] Sent data
[14:25:28] Connecting to http://130.237.232.141:8080/
[14:25:29] Posted data.
[14:25:29] Initial: 0000; - Receiving payload (expected size: 512)
[14:25:29] Conversation time very short, giving reduced weight in bandwidth avg
[14:25:29] - Downloaded at ~1 kB/s
[14:25:29] - Averaged speed for that direction ~1 kB/s
[14:25:29] + Received work.
[14:25:29] + Closed connections
[14:25:34] 
[14:25:34] + Processing work unit
[14:25:34] Core required: FahCore_a5.exe
[14:25:34] Core found.
[14:25:34] Working on queue slot 03 [September 3 14:25:34 UTC]
[14:25:34] + Working ...
[14:25:34] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 12 -priority 96 -checkpoint 15 -verbose -lifeline 3408 -version 634'

[14:25:34] 
[14:25:34] *------------------------------*
[14:25:34] Folding@Home Gromacs SMP Core
[14:25:34] Version 2.27 (Mar 12, 2010)
[14:25:34] 
[14:25:34] Preparing to commence simulation
[14:25:34] - Looking at optimizations...
[14:25:34] - Created dyn
[14:25:34] - Files status OK
[14:25:34] Couldn't Decompress
[14:25:34] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[14:25:34] -Error: Couldn't update checksum variables
[14:25:34] Error: Could not open work file
[14:25:34] 
[14:25:34] Folding@home Core Shutdown: FILE_IO_ERROR
[14:25:38] CoreStatus = 75 (117)
[14:25:38] Error opening or reading from a file.
[14:25:38] Deleting current work unit & continuing...
[14:25:42] Trying to send all finished work units
[14:25:42] + No unsent completed units remaining.
[14:25:42] - Preparing to get new work unit...
[14:25:42] Cleaning up work directory
[14:25:42] + Attempting to get work packet
[14:25:42] Passkey found
[14:25:42] - Will indicate memory of 6142 MB
[14:25:42] - Connecting to assignment server
[14:25:42] Connecting to http://assign.stanford.edu:8080/
[14:25:43] Posted data.
[14:25:43] Initial: ED82; - Successful: assigned to (130.237.232.141).
[14:25:43] + News From Folding@Home: Welcome to Folding@Home
[14:25:43] Loaded queue successfully.
[14:25:43] Sent data
[14:25:43] Connecting to http://130.237.232.141:8080/
[14:25:43] Posted data.
[14:25:43] Initial: 0000; - Receiving payload (expected size: 512)
[14:25:43] Conversation time very short, giving reduced weight in bandwidth avg
[14:25:43] - Downloaded at ~1 kB/s
[14:25:43] - Averaged speed for that direction ~1 kB/s
[14:25:43] + Received work.
[14:25:43] + Closed connections
[14:25:48] 
[14:25:48] + Processing work unit
[14:25:48] Core required: FahCore_a5.exe
[14:25:48] Core found.
[14:25:48] Working on queue slot 04 [September 3 14:25:48 UTC]
[14:25:48] + Working ...
[14:25:48] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 12 -priority 96 -checkpoint 15 -verbose -lifeline 3408 -version 634'

[14:25:48] 
[14:25:48] *------------------------------*
[14:25:48] Folding@Home Gromacs SMP Core
[14:25:48] Version 2.27 (Mar 12, 2010)
[14:25:48] 
[14:25:48] Preparing to commence simulation
[14:25:48] - Looking at optimizations...
[14:25:48] - Created dyn
[14:25:48] - Files status OK
[14:25:48] Couldn't Decompress
[14:25:48] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[14:25:48] -Error: Couldn't update checksum variables
[14:25:48] Error: Could not open work file
[14:25:48] 
[14:25:48] Folding@home Core Shutdown: FILE_IO_ERROR
[14:25:53] CoreStatus = 75 (117)
[14:25:53] Error opening or reading from a file.
[14:25:53] Deleting current work unit & continuing...
[14:25:57] Trying to send all finished work units
[14:25:57] + No unsent completed units remaining.
[14:25:57] - Preparing to get new work unit...
[14:25:57] Cleaning up work directory
[14:25:57] + Attempting to get work packet
[14:25:57] Passkey found
[14:25:57] - Will indicate memory of 6142 MB
[14:25:57] - Connecting to assignment server
[14:25:57] Connecting to http://assign.stanford.edu:8080/
[14:25:57] Posted data.
[14:25:57] Initial: ED82; - Successful: assigned to (130.237.232.141).
[14:25:57] + News From Folding@Home: Welcome to Folding@Home
[14:25:57] Loaded queue successfully.
[14:25:57] Sent data
[14:25:57] Connecting to http://130.237.232.141:8080/
[14:25:58] Posted data.
[14:25:58] Initial: 0000; - Receiving payload (expected size: 512)
[14:25:58] Conversation time very short, giving reduced weight in bandwidth avg
[14:25:58] - Downloaded at ~1 kB/s
[14:25:58] - Averaged speed for that direction ~1 kB/s
[14:25:58] + Received work.
[14:25:58] + Closed connections
[14:26:03] 
[14:26:03] + Processing work unit
[14:26:03] Core required: FahCore_a5.exe
[14:26:03] Core found.
[14:26:03] Working on queue slot 05 [September 3 14:26:03 UTC]
[14:26:03] + Working ...
[14:26:03] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 05 -np 12 -priority 96 -checkpoint 15 -verbose -lifeline 3408 -version 634'

[14:26:03] 
[14:26:03] *------------------------------*
[14:26:03] Folding@Home Gromacs SMP Core
[14:26:03] Version 2.27 (Mar 12, 2010)
[14:26:03] 
[14:26:03] Preparing to commence simulation
[14:26:03] - Looking at optimizations...
[14:26:03] - Created dyn
[14:26:03] - Files status OK
[14:26:03] Couldn't Decompress
[14:26:03] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[14:26:03] -Error: Couldn't update checksum variables
[14:26:03] Error: Could not open work file
[14:26:03] 
[14:26:03] Folding@home Core Shutdown: FILE_IO_ERROR
[14:26:07] CoreStatus = 75 (117)
[14:26:07] Error opening or reading from a file.
[14:26:07] Too many errors during run. Purging queue.
[14:26:47] 
[14:26:47] + Processing work unit
[14:26:47] Core required: FahCore_a5.exe
[14:26:47] Core found.
[14:26:47] Working on queue slot 05 [September 3 14:26:47 UTC]
[14:26:47] + Working ...
[14:26:47] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 05 -np 12 -priority 96 -checkpoint 15 -verbose -lifeline 3408 -version 634'

[14:26:47] 
[14:26:47] *------------------------------*
[14:26:47] Folding@Home Gromacs SMP Core
[14:26:47] Version 2.27 (Mar 12, 2010)
[14:26:47] 
[14:26:47] Preparing to commence simulation
[14:26:47] - Looking at optimizations...
[14:26:47] - Created dyn
[14:26:47] - Files status OK
[14:26:47] Error: Missing work file=<>
[14:26:47] 
[14:26:47] Folding@home Core Shutdown: MISSING_WORK_FILES
[14:26:52] CoreStatus = 74 (116)
[14:26:52] The core could not find the work files specified. Removing from queue
[14:26:52] Deleting current work unit & continuing...
[14:26:56] Trying to send all finished work units
[14:26:56] + No unsent completed units remaining.
[14:26:56] - Preparing to get new work unit...
[14:26:56] Cleaning up work directory
[14:26:56] + Attempting to get work packet
[14:26:56] Passkey found
[14:26:56] - Will indicate memory of 6142 MB
[14:26:56] - Connecting to assignment server
[14:26:56] Connecting to http://assign.stanford.edu:8080/
[14:26:56] Posted data.
[14:26:56] Initial: ED82; - Successful: assigned to (130.237.232.141).
[14:26:56] + News From Folding@Home: Welcome to Folding@Home
[14:26:56] Loaded queue successfully.
[14:26:56] Sent data
[14:26:56] Connecting to http://130.237.232.141:8080/
[14:26:57] Posted data.
[14:26:57] Initial: 0000; - Receiving payload (expected size: 512)
[14:26:57] Conversation time very short, giving reduced weight in bandwidth avg
[14:26:57] - Downloaded at ~1 kB/s
[14:26:57] - Averaged speed for that direction ~1 kB/s
[14:26:57] + Received work.
[14:26:57] + Closed connections
[14:27:02] 
[14:27:02] + Processing work unit
[14:27:02] Core required: FahCore_a5.exe
[14:27:02] Core found.
[14:27:02] Working on queue slot 06 [September 3 14:27:02 UTC]
[14:27:02] + Working ...
[14:27:02] - Calling '.\FahCore_a5.exe -dir work/ -nice 19 -suffix 06 -np 12 -priority 96 -checkpoint 15 -verbose -lifeline 3408 -version 634'

[14:27:02] 
[14:27:02] *------------------------------*
[14:27:02] Folding@Home Gromacs SMP Core
[14:27:02] Version 2.27 (Mar 12, 2010)
[14:27:02] 
[14:27:02] Preparing to commence simulation
[14:27:02] - Looking at optimizations...
[14:27:02] - Created dyn
[14:27:02] - Files status OK
[14:27:02] Couldn't Decompress
[14:27:02] Called DecompressByteArray: compressed_data_size=0 data_size=0, decompressed_data_size=0 diff=0
[14:27:02] -Error: Couldn't update checksum variables
[14:27:02] Error: Could not open work file
[14:27:02] 
[14:27:02] Folding@home Core Shutdown: FILE_IO_ERROR
[14:27:06] CoreStatus = 75 (117)
[14:27:06] Error opening or reading from a file.
[14:27:06] Deleting current work unit & continuing...
[14:27:10] Trying to send all finished work units
[14:27:10] + No unsent completed units remaining.
[14:27:10] - Preparing to get new work unit...
[14:27:10] Cleaning up work directory
[14:27:10] + Attempting to get work packet
[14:27:10] Passkey found
[14:27:10] - Will indicate memory of 6142 MB
[14:27:10] - Connecting to assignment server
[14:27:10] Connecting to http://assign.stanford.edu:8080/
[14:27:11] Posted data.
[14:27:11] Initial: ED82; - Successful: assigned to (130.237.232.141).
[14:27:11] + News From Folding@Home: Welcome to Folding@Home
[14:27:11] Loaded queue successfully.
[14:27:11] Sent data
[14:27:11] Connecting to http://130.237.232.141:8080/
[14:27:11] Posted data.
[14:27:11] Initial: 0000; - Receiving payload (expected size: 512)
[14:27:11] Conversation time very short, giving reduced weight in bandwidth avg
[14:27:11] - Downloaded at ~1 kB/s
[14:27:11] - Averaged speed for that direction ~1 kB/s
[14:27:11] + Received work.
[14:27:11] + Closed connections
[14:27:16]
 
Oh crap... I got the same thing here. I thought it was Windows corruption (after multiple power outages)... reinstalling Windows now... lolz

P.S.: Holy sh*t... my other systems have the same issue too...
 
Nope, this is something wrong on Stanford's end. I just took the -bigadv flag out and now it is running standard SMP stuff just fine. It looks like I probably failed close to a hundred work units overnight, so I will have no choice but to do regular SMP for a long time to come.

This actually really pisses me off.
 
You guys rock.
A problem is discovered, reported, confirmed and a simple work-around is posted all in 10 minutes.
I hope that when PG straightens this out, they also zero out the bigadv failures for the affected project/server so as not to corrupt their bonus point system. Yeah, I am almost sure they will have thought of that aspect and proactively fix it. Most definitely. :rolleyes:
 
It looks like I probably failed close to a hundred work units overnight, so I will have no choice but to do regular SMP for a long time to come.
Maybe not. I had this happen on one of my -bigadv folders a couple weeks ago. I did not need to fold standard SMP units to get the 80% success ratio restored. I do not know whether this was attributable to me having a very large number of successful units documented or the newly failed units not counting against me. You might want to run a single -bigadv unit, just to test.

I don't think there's anywhere to check the ratio and counts of successful versus unsuccessful work units, is there?
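
For a rough local count of how many units failed overnight, the client log itself can be tallied. This is only a sketch: it counts the "Folding@home Core Shutdown" lines that appear in the log posted above, and it says nothing about the success ratio Stanford tracks on the server side. The function name is made up for illustration.

```python
import re
from collections import Counter

# Matches lines such as "[14:25:19] Folding@home Core Shutdown: FILE_IO_ERROR"
SHUTDOWN_RE = re.compile(r"Folding@home Core Shutdown: (\w+)")


def tally_shutdowns(log_text):
    """Count each core shutdown status that appears in the log text."""
    return Counter(SHUTDOWN_RE.findall(log_text))
```

Read the client's log file into a string (FAHlog.txt in the client folder is an assumption here) and pass it to `tally_shutdowns`; the count under `FILE_IO_ERROR` approximates the number of units dumped by this problem.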
 
My 970 had this last night. I ended up stopping folding and deleting the work directory, queue.dat and the a5 core. I restarted the machine and all was well: it downloaded the a5 core and a new WU and got to crunching.
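
The cleanup above could be scripted roughly like this. It is a sketch, not an official procedure: the names `work`, `queue.dat` and `FahCore_a5.exe` come from the post and the log earlier in the thread, while the client folder path and the `reset_client` helper are assumptions. Stop the client before running it. It defaults to a dry run that only reports what it would remove.

```python
import shutil
from pathlib import Path

# Names taken from the post and the log; the client folder is supplied by the caller.
TARGETS = ("work", "queue.dat", "FahCore_a5.exe")


def reset_client(client_dir, dry_run=True):
    """Remove the work directory, queue.dat and the a5 core.

    Returns the names that were removed (or would be, in a dry run).
    """
    removed = []
    for name in TARGETS:
        path = Path(client_dir) / name
        if not path.exists():
            continue  # nothing to clean up for this entry
        removed.append(name)
        if dry_run:
            continue  # report only, leave the file or directory in place
        if path.is_dir():
            shutil.rmtree(path)
        else:
            path.unlink()
    return removed
```

Call `reset_client(folder)` first to see what it finds, then `reset_client(folder, dry_run=False)` to actually delete. The client should fetch the core and a new WU again on the next start, as described above.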
 
It looks like it is a specific server that is having issues. I started having this problem earlier this afternoon. Kasson may pull that server, which may lead to bigadv shortages:

The downside of that is that we'll probably run out of bigadv work (except for the 12+ core server), although that may get hit harder as well.
 
Additional info from my end: all the WUs that had the trouble were 6900s. The one that downloaded after restarting was a 2685, but my SB rig pulled a 6900 with no trouble this morning.
 
So far, it appears I haven't lost my bonus yet. I'm hoping Stanford is not going to hold all of those failures against us. If all of these failures were held against me, I probably would start folding under a new username, as it would take forever to fold enough SMPs to get above 80%.
 
If it ever got to that point, you could still keep your username. You would just need to get and qualify a new passkey using a different email address.
 
Actually, it wouldn't. Switch your rigs to standard SMP for a few days and you would be set.
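
To put rough numbers on how long recovery would take, here is a small sketch. It assumes the 80% requirement is a plain ratio of successful units to all units on the passkey; nobody in this thread confirms how Stanford actually computes it, so treat the formula and the `extra_successes_needed` helper as illustrative only. Under that assumption, you need x more good units such that (s + x) / (s + f + x) >= 0.8, which works out to x >= 4f - s.

```python
def extra_successes_needed(successes, failures, target_pct=80):
    """Smallest x with (successes + x) / (successes + failures + x) >= target_pct / 100.

    Assumes a simple lifetime ratio, which is a guess about the bonus rule.
    """
    if not 0 < target_pct < 100:
        raise ValueError("target_pct must be between 0 and 100 (exclusive)")
    shortfall = target_pct * (successes + failures) - 100 * successes
    if shortfall <= 0:
        return 0  # already at or above the target ratio
    # Integer ceiling division, to avoid floating-point rounding.
    return -(-shortfall // (100 - target_pct))
```

With made-up figures: someone with 300 good units and 100 failures would need 100 more good units, while someone with 500 good units and the same 100 failures would need none, which would fit the earlier report of the ratio surviving a batch of failures.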
 