• Some users have recently had their accounts hijacked. It seems that the now defunct EVGA forums might have compromised your password there and seems many are using the same PW here. We would suggest you UPDATE YOUR PASSWORD and TURN ON 2FA for your account here to further secure it. None of the compromised accounts had 2FA turned on.
    Once you have enabled 2FA, your account will be updated soon to show a badge, letting other members know that you use 2FA to protect your account. This should be beneficial for everyone that uses FSFT.

Server reports problem with unit.

tkam

[H]ard|DCer of the Month - Dec. 2012
Joined
Dec 18, 2007
Messages
436
I've had my last two bigadv units get this "Server reports problem with unit." error message after hitting 100% completion and then doing the normal upload procedure. It's happened on my 4P AMD server and my 2P E5 server. One 8101 and one 8102 - only thing I can find in common is they both had the same assignment server: 128.143.231.201

Neither box has ever had a failed WU before that I'm aware of and neither are OC'd.
 
I have 3 BA WUs with this problem in langouste. Can anyone config they have been sent or not? If not, what needs to be done to send them. Our wonderful weekend breakdown.....

It at least looks like the WU results are still in the work directory. Hopefully they can be resent on Stanford gets their shit together...
 
Last edited:
I had this problem recently with two P8101.

I thought that it was due to bad overclocking .
 
The servers appear to be accepting results again. I think we are all goig to be screwed on results that tried to upload during the server issues.
 
What a crock. Nothing but these shit 8101s and now can't even upload the completed ones.
I have 4 total that now are giving this error.
Talk about piling on......
 
These WUs appear to be permanently un-returnable (already tried a few things) :-(
 
Jeanjean, I'm sure asking won't hurt.

At least not right away :rolleyes:
 
Standford can do nothing ? :confused:

[pissandmoanmode]
Of course not. There is no one never available on the weekends to fix these things. They have millions on millions of dollars of resources donated to their cause and they can not get a couple of under graduate students to baby sit servers over the weekend.
[/pissandmoanmode]
 
Last edited:
Yep, it took a lot of EVGA-bucks to entice PG to pull this one on [H] this particular weekend, but we finally got them to do it. How ya like it? :D
 
Yep, it took a lot of EVGA-bucks to entice PG to pull this one on [H] this particular weekend, but we finally got them to do it. How ya like it? :D

That must be the PR-friendly way of saying evga put thier boy friend boots on:)
 
Server reports problem with unit.
The same happen to 4 of my rigs, I lost 2 x 8102 and 2 x 8101.

1455p all together down the drain!
 
Its got me again, i had the same problem earlier in the week, this time its Project: 8101 (Run 20, Clone 1, Gen 45) thats affected. I have another different WU processing now, if that fails i'm going back to SMP for a while. No point spending 2 days and 14Kwh of power for no reward
 
There were no known outages earlier this week, I suspect something else may be going on.
 
Well the way I see it is just move on, It is just a little bump in the road we just need to grab another gear put the pedal to the metal and smoke them thar fellars over at evga. After all they just go a little false hope adorned upon them now lets crush it. :D
 
It happened last weekend, at the time it was blamed on an unstable overclocked machine which is difficult to do on an asus server mobo.

The rig is fine as its completed 2 BA wu this week without issue (both 8101), we are either getting duff WU to fold or a borked server is dealing with the uploads
 
Were the symptoms exactly the same last weekend? (Server reports problem with unit.)
Do you have a log?
 
I'm still seeing every BA WU fail to upload correctly. Then it gets the same WU and starts processing again.
 
I got this just a few minutes ago on my SR-2?

Code:
[19:58:22] - Autosending finished units... [September 30 19:58:22 UTC]
[19:58:22] + Processing work unit
[19:58:22] Trying to send all finished work units
[19:58:22] Project: 8101 (Run 3, Clone 12, Gen 19)
[19:58:22] Core required: FahCore_a5.exe
[19:58:22] Core found.


[19:58:22] + Attempting to send results [September 30 19:58:22 UTC]
[19:58:22] - Reading file work/wuresults_04.dat from core
[19:58:23] Working on queue slot 08 [September 30 19:58:23 UTC]
[19:58:23] + Working ...
[19:58:23] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 08 -np 24 -checkpoint 3 -verbose -lifeline 1830 -version 634'

[19:58:23]   (Read 91631428 bytes from disk)
[19:58:23] Connecting to http://128.143.231.201:8080/
[19:58:23] 
[19:58:23] *------------------------------*
[19:58:23] Folding@Home Gromacs SMP Core
[19:58:23] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[19:58:23] 
[19:58:23] Preparing to commence simulation
[19:58:23] - Ensuring status. Please wait.
[19:58:33] - Looking at optimizations...
[19:58:33] - Working with standard loops on this execution.
[19:58:33] - Previous termination of core was improper.
[19:58:33] - Files status OK
[19:58:35] - Expanded 24863922 -> 30796292 (decompressed 123.8 percent)
[19:58:35] Called DecompressByteArray: compressed_data_size=24863922 data_size=30796292, decompressed_data_size=30796292 diff=0
[19:58:35] - Digital signature verified
[19:58:35] 
[19:58:35] Project: 6901 (Run 14, Clone 0, Gen 317)
[19:58:35] 
[19:58:35] Entering M.D.
[19:58:42] Mapping NT from 24 to 24 
[19:58:44] Completed 0 out of 250000 steps  (0%)
[20:00:52] Posted data.
[20:00:52] Initial: 0000; - Uploaded at ~600 kB/s
[20:00:52] - Averaged speed for that direction ~564 kB/s

[20:00:52] - Server reports problem with unit.

[20:00:52] + Sent 0 of 1 completed units to the server
[20:00:52] - Autosend completed
 
These may be stale units -- if you check slot numbers they all should carry same slot number.
 
I have at least 3 big adv got stuck since yesterday 6PM (EST) update.


this is sketchy to me ...
kasson said:
» Sun Sep 30, 2012 12:29 pm
Yes--I see something weird going on. Nothing has changed with the work server, but I think some of the people at Stanford may have changed the assignment server without telling me. I'm investigating.
 
They won't be returned. If you look at queue entries, they have been marked as "finished" (uploaded).

I tried re-marking them for upload but server seems to have lost context of all outstanding WUs and
is not accepting any of 'old' units.

Just leave the clients running, they will eventually recover.
 
Agree. It happened in the past and no matter what I tried, I couldn't send WUs manually.
Just leave the client running and hope SF fixes the issue.
 
Aye, that is sketchy, sbinh... at best :)
But I gave up hoping for full disclosures... just glad it's fixed.
 
Last edited:
It is. But that's a topic for whole 'nother discussion ;)
 
This is crap. I'm burning hundreds of watts for nothing.
This is not even getting the science done at this point, even if it's ok to screw the points.
At my electric bill rate I will have to consider shutting these rigs down until someone at Stanford shows a bit of concern.
 
From Dr. Kasson this morning...

"We have identified and fixed a WS-CS communication issue. This problem should be taken care of going forward; we are continuing to review the logs to analyze the impact of the problem on rejected work units."
 
From Dr. Kasson this morning...

"We have identified and fixed a WS-CS communication issue. This problem should be taken care of going forward; we are continuing to review the logs to analyze the impact of the problem on rejected work units."

Translated to "hey we fucked up, but we think it *might* be fixed, it is now coffee break":p

Speaking of which, I need more coffee...
 
I've had two 8101's go bad for half a million points. I'll let this one finish and if it fails, I'll be shutting down my folding rigs until Stanford fixes their "problem". My latest one just failed this morning.
 
Back
Top