WTF

sl1982

[18:55:04] Completed 250000 out of 250000 steps (100%)
[18:56:07]
[18:56:07] Finished Work Unit:
[18:56:07] - Reading up to 21128112 from "work/wudata_02.trr": Read 21128112
[18:56:08] trr file hash check passed.
[18:56:08] - Reading up to 27598852 from "work/wudata_02.xtc": Read 27598852
[18:56:08] xtc file hash check passed.
[18:56:08] edr file hash check passed.
[18:56:08] logfile size: 180610
[18:56:08] Leaving Run
[18:56:11] - Writing 49149958 bytes of core data to disk...
[18:56:20] CoreStatus = 0 (0)
[18:56:20] Client-core communications error: ERROR 0x0
[18:56:20] Deleting current work unit & continuing...
[18:56:35] - Warning: Could not delete all work unit files (2): Core file absent
[18:56:35] Trying to send all finished work units
[18:56:35] + No unsent completed units remaining.



Bah, why is it deleting my WUs before it sends them?
 
Client-core communications error: ERROR 0x0

Something is wrong.
 
The WU was completed though. Anyone think it might have something to do with notfred?
 


"The 0x0 and 0x1 errors are unknown errors - all errors that are known will end with some other error code and message, but those errors that Pande Group hasn't seen before or did not know about, will end with error 0x0 or 0x1."

http://fahwiki.net/index.php/Error_0x0_and_0x1
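To check whether this has happened more than once, something like the following could scan a saved client log for a 0x0 or 0x1 error that shows up right after a WU reaches 100%. It is only a rough sketch: the patterns are copied from the log lines quoted in this thread, and the function name `find_lost_units` is made up for illustration. Other client versions may word these lines differently.

```python
import re

# Patterns taken from the log lines quoted in this thread (assumed stable).
TIMESTAMP = re.compile(r"^\[(\d\d:\d\d:\d\d)\]")
DONE = re.compile(r"Completed (\d+) out of (\d+) steps")
ERROR = re.compile(r"Client-core communications error: ERROR (0x[01])\b")

def find_lost_units(lines):
    """Return (timestamp, code) for each 0x0/0x1 error that follows a 100% WU."""
    finished = False
    hits = []
    for line in lines:
        done = DONE.search(line)
        if done:
            # Finished only when the step count equals the total.
            finished = done.group(1) == done.group(2)
            continue
        err = ERROR.search(line)
        if err:
            if finished:
                ts = TIMESTAMP.match(line)
                hits.append((ts.group(1) if ts else None, err.group(1)))
            finished = False
    return hits

sample = [
    "[18:55:04] Completed 250000 out of 250000 steps (100%)",
    "[18:56:20] CoreStatus = 0 (0)",
    "[18:56:20] Client-core communications error: ERROR 0x0",
]
print(find_lost_units(sample))
```

Feeding it the lines of a whole log file instead of `sample` would show how often a finished unit was thrown away.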
 
You might have to delete the core and let the client download it again.
 
I can't. Notfred has no controls that I know of. Maybe I'll install Ubuntu and see if I can learn me some Linux.
 
There has to be a folder that it's using. I've never used the notfred client, so I don't know, but your task manager should show the core exe that's being used.
 
Well, it runs in a VM, so I can't really access it. I could kill the VM, but I don't think that would help. Anyone else running notfred in a VM?
 
You can access your diskless folder through the web interface, do a manual backup which rolls everything up into a zip file, then unpack it on another computer, run qfix, and maybe have something to send back.
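For the unpacking step, a small Python sketch like this could extract the backup zip and list any work unit files that survived, so you know whether there is anything for qfix to work with. The layout of the backup is an assumption here; the only thing taken from the thread is the `wudata_` file name prefix seen in the log. The function name `unpack_backup` is hypothetical, and qfix itself is not invoked.

```python
import zipfile
from pathlib import Path

def unpack_backup(zip_path, dest_dir):
    """Extract the backup zip and return the relative paths of wudata_* files."""
    dest = Path(dest_dir)
    with zipfile.ZipFile(zip_path) as zf:
        zf.extractall(dest)  # creates dest_dir if it does not exist
    # Assumption: work unit files keep the "wudata_" prefix seen in the log.
    return sorted(
        p.relative_to(dest).as_posix()
        for p in dest.rglob("*")
        if p.is_file() and p.name.startswith("wudata_")
    )
```

Calling `unpack_backup("backup.zip", "restored")` (both names are placeholders) returns something like `work/wudata_02.trr` entries if the unit's files are still in the backup; an empty list means the client already deleted them.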

I had this start happening a lot on my main rig running LinSMP. It would fail right after 100%, as it was trying to write to disk. Zip-tying a fan to blow on my HDD fixed it. But since you're running diskless, it could be memory related.

I don't have a big farm, so I'll never give up on a WU. :D
 