Why does Linux hate me?

Code:
[22:23:29] Folding@Home Gromacs SMP Core
[22:23:29] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[22:23:29] 
[22:23:29] Preparing to commence simulation
[22:23:29] - Looking at optimizations...
[22:23:29] - Files status OK
[22:23:38] - Expanded 57242250 -> 71846524 (decompressed 50.4 percent)
[22:23:38] Called DecompressByteArray: compressed_data_size=57242250 data_size=71846524, decompressed_data_size=71846524 diff=0
[22:23:39] - Digital signature verified
[22:23:39] 
[22:23:39] Project: 6903 (Run 9, Clone 12, Gen 99)
[22:23:39] 
[22:23:39] Assembly optimizations on if available.
[22:23:39] Entering M.D.
[22:23:45] Using Gromacs checkpoints
[22:23:56] Mapping NT from 16 to 16 
[22:24:50] Resuming from checkpoint
[22:24:53] Verified work/wudata_03.log
[22:24:54] Verified work/wudata_03.trr
[22:24:56] Verified work/wudata_03.xtc
[22:24:56] Verified work/wudata_03.edr
[22:24:57] Completed 239715 out of 250000 steps  (95%)
[22:32:07] Completed 240000 out of 250000 steps  (96%)
[23:24:11] Completed 242500 out of 250000 steps  (97%)
[00:08:52] Completed 245000 out of 250000 steps  (98%)
[00:53:32] Completed 247500 out of 250000 steps  (99%)
[01:38:26] Completed 250000 out of 250000 steps  (100%)
[01:38:51] DynamicWrapper: Finished Work Unit: sleep=10000
[01:39:01] 
[01:39:01] Finished Work Unit:
[01:39:01] - Reading up to 121622496 from "work/wudata_03.trr": Read 121622496
[01:39:03] trr file hash check passed.
[01:39:03] - Reading up to 108849268 from "work/wudata_03.xtc": Read 108849268
[01:39:04] xtc file hash check passed.
[01:39:04] edr file hash check passed.
[01:39:04] logfile size: 227886
[01:39:04] Leaving Run
[01:39:06] - Writing 230872642 bytes of core data to disk...
[01:40:26] Done: 230872130 -> 222500063 (compressed to 3.3 percent)
[01:40:27]   ... Done.
[01:40:48] - Shutting down core
[01:40:48] 
[01:40:48] Folding@home Core Shutdown: FINISHED_UNIT
[01:40:51] CoreStatus = 64 (100)
[01:40:51] Unit 3 finished with 70 percent of time to deadline remaining.
[01:40:51] Updated performance fraction: 0.697060
[01:40:51] Sending work to server
[01:40:51] Project: 6903 (Run 9, Clone 12, Gen 99)


[01:40:51] + Attempting to send results [July 5 01:40:51 UTC]
[01:40:51] - Reading file work/wuresults_03.dat from core
[01:40:51]   (Read 222500575 bytes from disk)
[01:40:51] Connecting to http://130.237.232.237:8080/
[03:40:53] - Couldn't send HTTP request to server
[03:40:53] + Could not connect to Work Server (results)
[03:40:53]     (130.237.232.237:8080)
[03:40:53] + Retrying using alternative port
[03:40:53] Connecting to http://130.237.232.237:80/
[04:15:57] - Couldn't send HTTP request to server
[04:15:57] + Could not connect to Work Server (results)
[04:15:57]     (130.237.232.237:80)
[04:15:57] - Error: Could not transmit unit 03 (completed July 5) to work server.
[04:15:57] - 1 failed uploads of this unit.
[04:15:57]   Keeping unit 03 in queue.
[04:15:57] Trying to send all finished work units
[04:15:57] Project: 6903 (Run 9, Clone 12, Gen 99)


[04:15:57] + Attempting to send results [July 5 04:15:57 UTC]
[04:15:57] - Reading file work/wuresults_03.dat from core
[04:15:57]   (Read 222500575 bytes from disk)
[04:15:57] Connecting to http://130.237.232.237:8080/
[04:16:00] - Couldn't send HTTP request to server
[04:16:00] + Could not connect to Work Server (results)
[04:16:00]     (130.237.232.237:8080)
[04:16:00] + Retrying using alternative port
[04:16:00] Connecting to http://130.237.232.237:80/
[04:16:03] - Couldn't send HTTP request to server
[04:16:03] + Could not connect to Work Server (results)
[04:16:03]     (130.237.232.237:80)
[04:16:03] - Error: Could not transmit unit 03 (completed July 5) to work server.
[04:16:03] - 2 failed uploads of this unit.
[04:16:03]   Keeping unit 03 in queue.
[04:16:03] + Sent 0 of 1 completed units to the server
[04:16:03] - Preparing to get new work unit...
[04:16:03] Cleaning up work directory
[04:16:03] + Attempting to get work packet
[04:16:03] Passkey found
[04:16:03] - Will indicate memory of 16048 MB
[04:16:03] - Connecting to assignment server
[04:16:03] Connecting to http://assign.stanford.edu:8080/
[04:16:43] - Could not CosmHTTPOpen
[04:16:43] + Could not connect to Assignment Server
[04:16:43] Connecting to http://assign2.stanford.edu:80/
[04:17:23] - Could not CosmHTTPOpen
[04:17:23] + Could not connect to Assignment Server 2
[04:17:23] + Couldn't get work instructions.
[04:17:23] - Attempt #1  to get work failed, and no other work to do.
Waiting before retry.
[04:17:38] + Attempting to get work packet
[04:17:38] Passkey found
[04:17:38] - Will indicate memory of 16048 MB
[04:17:38] - Connecting to assignment server
[04:17:38] Connecting to http://assign.stanford.edu:8080/
[04:18:18] - Could not CosmHTTPOpen
[04:18:18] + Could not connect to Assignment Server
[04:18:18] Connecting to http://assign2.stanford.edu:80/
[04:18:58] - Could not CosmHTTPOpen
[04:18:58] + Could not connect to Assignment Server 2
[04:18:58] + Couldn't get work instructions.
[04:18:58] - Attempt #2  to get work failed, and no other work to do.
Waiting before retry.
[04:19:18] + Attempting to get work packet
[04:19:18] Passkey found
[04:19:18] - Will indicate memory of 16048 MB
[04:19:18] - Connecting to assignment server
[04:19:18] Connecting to http://assign.stanford.edu:8080/
[04:19:58] - Could not CosmHTTPOpen
[04:19:58] + Could not connect to Assignment Server
[04:19:58] Connecting to http://assign2.stanford.edu:80/
[04:20:38] - Could not CosmHTTPOpen
[04:20:38] + Could not connect to Assignment Server 2
[04:20:38] + Couldn't get work instructions.
[04:20:38] - Attempt #3  to get work failed, and no other work to do.
Waiting before retry.
[04:21:08] + Attempting to get work packet
[04:21:08] Passkey found
[04:21:08] - Will indicate memory of 16048 MB
[04:21:08] - Connecting to assignment server
[04:21:08] Connecting to http://assign.stanford.edu:8080/
[04:21:48] - Could not CosmHTTPOpen
[04:21:48] + Could not connect to Assignment Server
[04:21:48] Connecting to http://assign2.stanford.edu:80/
[04:22:28] - Could not CosmHTTPOpen
[04:22:28] + Could not connect to Assignment Server 2
[04:22:28] + Couldn't get work instructions.
[04:22:28] - Attempt #4  to get work failed, and no other work to do.
Waiting before retry.
[04:23:14] + Attempting to get work packet
[04:23:14] Passkey found
[04:23:14] - Will indicate memory of 16048 MB
[04:23:14] - Connecting to assignment server
[04:23:14] Connecting to http://assign.stanford.edu:8080/
[04:23:29] - Autosending finished units... [July 5 04:23:29 UTC]
[04:23:29] Trying to send all finished work units
[04:23:29] Project: 6903 (Run 9, Clone 12, Gen 99)


[04:23:29] + Attempting to send results [July 5 04:23:29 UTC]
[04:23:29] - Reading file work/wuresults_03.dat from core
[04:23:29]   (Read 222500575 bytes from disk)
[04:23:29] Connecting to http://130.237.232.237:8080/
[04:23:32] - Couldn't send HTTP request to server
[04:23:32] + Could not connect to Work Server (results)
[04:23:32]     (130.237.232.237:8080)
[04:23:32] + Retrying using alternative port
[04:23:32] Connecting to http://130.237.232.237:80/
[04:23:35] - Couldn't send HTTP request to server
[04:23:35] + Could not connect to Work Server (results)
[04:23:35]     (130.237.232.237:80)
[04:23:35] - Error: Could not transmit unit 03 (completed July 5) to work server.
[04:23:35] - 3 failed uploads of this unit.
[04:23:35]   Keeping unit 03 in queue.
[04:23:35] + Sent 0 of 1 completed units to the server
[04:23:35] - Autosend completed
[04:23:54] - Could not CosmHTTPOpen
[04:23:54] + Could not connect to Assignment Server
[04:23:54] Connecting to http://assign2.stanford.edu:80/
[04:24:34] - Could not CosmHTTPOpen
[04:24:34] + Could not connect to Assignment Server 2
[04:24:34] + Couldn't get work instructions.
[04:24:34] - Attempt #5  to get work failed, and no other work to do.
Waiting before retry.
[04:26:02] + Attempting to get work packet
[04:26:02] Passkey found
[04:26:02] - Will indicate memory of 16048 MB
[04:26:02] - Connecting to assignment server
[04:26:02] Connecting to http://assign.stanford.edu:8080/
[04:26:42] - Could not CosmHTTPOpen
[04:26:42] + Could not connect to Assignment Server
[04:26:42] Connecting to http://assign2.stanford.edu:80/
[04:27:22] - Could not CosmHTTPOpen
[04:27:22] + Could not connect to Assignment Server 2
[04:27:22] + Couldn't get work instructions.
[04:27:22] - Attempt #6  to get work failed, and no other work to do.
Waiting before retry.
[04:30:14] + Attempting to get work packet
[04:30:14] Passkey found
[04:30:14] - Will indicate memory of 16048 MB
[04:30:14] - Connecting to assignment server
[04:30:14] Connecting to http://assign.stanford.edu:8080/
[04:30:55] - Could not CosmHTTPOpen
[04:30:55] + Could not connect to Assignment Server
[04:30:55] Connecting to http://assign2.stanford.edu:80/
[04:31:35] - Could not CosmHTTPOpen
[04:31:35] + Could not connect to Assignment Server 2
[04:31:35] + Couldn't get work instructions.
[04:31:35] - Attempt #7  to get work failed, and no other work to do.
Waiting before retry.
[04:37:03] + Attempting to get work packet
[04:37:03] Passkey found
[04:37:03] - Will indicate memory of 16048 MB
[04:37:03] - Connecting to assignment server
[04:37:03] Connecting to http://assign.stanford.edu:8080/
[04:37:43] - Could not CosmHTTPOpen
[04:37:43] + Could not connect to Assignment Server
[04:37:43] Connecting to http://assign2.stanford.edu:80/
[04:38:23] - Could not CosmHTTPOpen
[04:38:23] + Could not connect to Assignment Server 2
[04:38:23] + Couldn't get work instructions.
[04:38:23] - Attempt #8  to get work failed, and no other work to do.
Waiting before retry.
[04:49:13] + Attempting to get work packet
[04:49:13] Passkey found
[04:49:13] - Will indicate memory of 16048 MB
[04:49:13] - Connecting to assignment server
[04:49:13] Connecting to http://assign.stanford.edu:8080/
[04:49:53] - Could not CosmHTTPOpen
[04:49:53] + Could not connect to Assignment Server
[04:49:53] Connecting to http://assign2.stanford.edu:80/
[04:50:34] - Could not CosmHTTPOpen
[04:50:34] + Could not connect to Assignment Server 2
[04:50:34] + Couldn't get work instructions.
[04:50:34] - Attempt #9  to get work failed, and no other work to do.
Waiting before retry.
[05:11:55] + Attempting to get work packet
[05:11:55] Passkey found
[05:11:55] - Will indicate memory of 16048 MB
[05:11:55] - Connecting to assignment server
[05:11:55] Connecting to http://assign.stanford.edu:8080/
[05:11:55] - Could not CosmHTTPOpen
[05:11:55] + Could not connect to Assignment Server
[05:11:55] Connecting to http://assign2.stanford.edu:80/
[05:11:55] - Could not CosmHTTPOpen
[05:11:55] + Could not connect to Assignment Server 2
[05:11:55] + Couldn't get work instructions.
[05:11:55] - Attempt #10  to get work failed, and no other work to do.
Waiting before retry.
I realized my internet connection was down last night. So when I stopped and restarted the client this morning, I got this...

Code:
Launch directory: /home/mike/fah
Executable: ./fah6
Arguments: -smp 16 -bigbeta -verbosity 9 

[09:35:05] - Ask before connecting: No
[09:35:05] - User name: freeloader (Team 33)
[09:35:05] - User ID: 499F34AC40272FCC
[09:35:05] - Machine ID: 1
[09:35:05] 
[09:35:06] Loaded queue successfully.
[09:35:06] Deleting incompletely fetched item (4) from queue position #4
[09:35:06] - Warning: Could not delete all work unit files (4): Core file absent
[09:35:06] - Preparing to get new work unit...
[09:35:06] Cleaning up work directory
[09:35:06] - Autosending finished units... [July 5 09:35:06 UTC]
[09:35:06] + Attempting to get work packet
[09:35:06] Trying to send all finished work units
[09:35:06] Project: 6903 (Run 9, Clone 12, Gen 99)
[09:35:06] Passkey found


[09:35:06] - Will indicate memory of 16048 MB
[09:35:06] - Connecting to assignment server
[09:35:06] + Attempting to send results [July 5 09:35:06 UTC]
[09:35:06] Connecting to http://assign.stanford.edu:8080/
[09:35:06] - Reading file work/wuresults_03.dat from core
[09:35:06] Posted data.
[09:35:09]   (Read 222500575 bytes from disk)
[09:35:09] Initial: ED82; [09:35:09] Connecting to http://130.237.232.237:8080/
[09:35:09] - Successful: assigned to (130.237.232.237).
[09:35:09] + News From Folding@Home: Welcome to Folding@Home
[09:35:09] Loaded queue successfully.
[09:35:09] Sent data
[09:35:09] Connecting to http://130.237.232.237:8080/
[09:35:23] Posted data.
[09:35:23] Initial: 0000; - Receiving payload (expected size: 57238663)
[09:39:03] - Downloaded at ~254 kB/s
[09:39:03] - Averaged speed for that direction ~300 kB/s
[09:39:03] + Received work.
[09:39:03] + Closed connections
[09:39:03] 
[09:39:03] + Processing work unit
[09:39:03] Core required: FahCore_a5.exe
[09:39:03] Core found.
[09:39:03] Working on queue slot 04 [July 5 09:39:03 UTC]
[09:39:03] + Working ...
[09:39:03] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 16 -checkpoint 15 -verbose -lifeline 2151 -version 634'

[09:39:03] 
[09:39:03] *------------------------------*
[09:39:03] Folding@Home Gromacs SMP Core
[09:39:03] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[09:39:03] 
[09:39:03] Preparing to commence simulation
[09:39:03] - Looking at optimizations...
[09:39:03] - Created dyn
[09:39:03] - Files status OK
[09:39:11] - Expanded 57238151 -> 71846524 (decompressed 50.4 percent)
[09:39:11] Called DecompressByteArray: compressed_data_size=57238151 data_size=71846524, decompressed_data_size=71846524 diff=0
[09:39:12] - Digital signature verified
[09:39:12] 
[09:39:12] Project: 6903 (Run 8, Clone 11, Gen 120)
[09:39:12] 
[09:39:12] Assembly optimizations on if available.
[09:39:12] Entering M.D.
[09:39:21] Mapping NT from 16 to 16 
[09:39:28] Completed 0 out of 250000 steps  (0%)
[10:24:26] Posted data.
[10:24:27] Initial: 0000; - Uploaded at ~73 kB/s
[10:24:27] - Averaged speed for that direction ~64 kB/s
[10:24:27] - Server has already received unit.
[10:24:27] + Sent 0 of 1 completed units to the server
[10:24:27] - Autosend completed
[10:34:30] Completed 2500 out of 250000 steps  (1%)
[11:04:44] ***** Got an Activate signal (2)
[11:04:44] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [July 5 11:04:52 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/mike/fah
Executable: ./fah6
Arguments: -smp 16 -bigbeta -verbosity 9 

[11:04:52] - Ask before connecting: No
[11:04:52] - User name: freeloader (Team 33)
[11:04:52] - User ID: 499F34AC40272FCC
[11:04:52] - Machine ID: 1
[11:04:52] 
[11:04:52] Loaded queue successfully.
[11:04:52] 
[11:04:52] + Processing work unit
[11:04:52] - Autosending finished units... [July 5 11:04:52 UTC]
[11:04:52] Core required: FahCore_a5.exe
[11:04:52] Trying to send all finished work units
[11:04:52] Core found.
[11:04:52] + No unsent completed units remaining.
[11:04:52] - Autosend completed
[11:04:52] Working on queue slot 04 [July 5 11:04:52 UTC]
[11:04:52] + Working ...
[11:04:52] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 16 -checkpoint 15 -verbose -lifeline 7817 -version 634'

[11:04:52] 
[11:04:52] *------------------------------*
[11:04:52] Folding@Home Gromacs SMP Core
[11:04:52] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[11:04:52] 
[11:04:52] Preparing to commence simulation
[11:04:52] - Looking at optimizations...
[11:04:52] - Files status OK
[11:05:00] - Expanded 57238151 -> 71846524 (decompressed 50.4 percent)
[11:05:00] Called DecompressByteArray: compressed_data_size=57238151 data_size=71846524, decompressed_data_size=71846524 diff=0
[11:05:00] - Digital signature verified
[11:05:00] 
[11:05:00] Project: 6903 (Run 8, Clone 11, Gen 120)
[11:05:00] 
[11:05:01] Assembly optimizations on if available.
[11:05:01] Entering M.D.
[11:05:07] Using Gromacs checkpoints
[11:05:16] Mapping NT from 16 to 16 
[11:06:10] Resuming from checkpoint
[11:06:14] Verified work/wudata_04.log
[11:06:15] Verified work/wudata_04.trr
[11:06:15] Verified work/wudata_04.xtc
[11:06:15] Verified work/wudata_04.edr
[11:06:17] Completed 3575 out of 250000 steps  (1%)
[11:28:56] ***** Got an Activate signal (2)
[11:28:56] Killing all core threads

Folding@Home Client Shutdown.


--- Opening Log file [July 5 11:31:42 UTC] 


# Linux SMP Console Edition ###################################################
###############################################################################

                       Folding@Home Client Version 6.34

                          http://folding.stanford.edu

###############################################################################
###############################################################################

Launch directory: /home/mike/fah
Executable: ./fah6
Arguments: -smp 16 -bigbeta -verbosity 9 

[11:31:42] - Ask before connecting: No
[11:31:42] - User name: freeloader (Team 33)
[11:31:42] - User ID: 499F34AC40272FCC
[11:31:42] - Machine ID: 1
[11:31:42] 
[11:31:42] Loaded queue successfully.
[11:31:42] 
[11:31:42] + Processing work unit
[11:31:42] - Autosending finished units... [July 5 11:31:42 UTC]
[11:31:42] Trying to send all finished work units
[11:31:42] Core required: FahCore_a5.exe
[11:31:42] + No unsent completed units remaining.
[11:31:42] - Autosend completed
[11:31:42] Core found.
[11:31:42] Working on queue slot 04 [July 5 11:31:42 UTC]
[11:31:42] + Working ...
[11:31:42] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 16 -checkpoint 15 -verbose -lifeline 8419 -version 634'

[11:31:42] 
[11:31:42] *------------------------------*
[11:31:42] Folding@Home Gromacs SMP Core
[11:31:42] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[11:31:42] 
[11:31:42] Preparing to commence simulation
[11:31:42] - Looking at optimizations...
[11:31:42] - Files status OK
[11:31:50] - Expanded 57238151 -> 71846524 (decompressed 50.4 percent)
[11:31:50] Called DecompressByteArray: compressed_data_size=57238151 data_size=71846524, decompressed_data_size=71846524 diff=0
[11:31:51] - Digital signature verified
[11:31:51] 
[11:31:51] Project: 6903 (Run 8, Clone 11, Gen 120)
[11:31:51] 
[11:31:51] Assembly optimizations on if available.
[11:31:51] Entering M.D.
[11:31:57] Using Gromacs checkpoints
[11:32:07] Mapping NT from 16 to 16 
[11:33:01] Resuming from checkpoint
[11:33:05] Verified work/wudata_04.log
[11:33:06] Verified work/wudata_04.trr
[11:33:06] Verified work/wudata_04.xtc
[11:33:06] Verified work/wudata_04.edr
[11:33:08] Completed 4180 out of 250000 steps  (1%)
Is there any way to know for sure whether Stanford has actually received the unit? My stats aren't showing the unit as received. The fight goes on....

Also, is there any way to have the client's time match my system time? The client is always four hours ahead of my local time.
 
Hi freeloader (team 33),
Your WU (P6903 R9 C12 G99) was added to the stats database on 2012-07-01 11:08:30 for 238731 points of credit.

The times in the log are UTC. It's supposed to be that way and can't be changed. Unless you want to set your system clock to UTC it will never match your log.
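
If you just want to read the log in local time, the stamps can be converted after the fact. This is only a rough sketch: it assumes a fixed UTC-4 offset (the "four hours ahead" mentioned above), and because the `[HH:MM:SS]` stamps carry no date, the UTC date has to be supplied by hand (e.g. from the "Opening Log file" line). The function name is made up for illustration.

```python
from datetime import datetime, timedelta, timezone

# Assumption: local time is a fixed UTC-4, per the "four hours ahead" in the thread.
LOCAL = timezone(timedelta(hours=-4))

def log_time_to_local(stamp: str, utc_date: str) -> datetime:
    """Convert a '[HH:MM:SS]' log stamp on a given UTC date (YYYY-MM-DD) to local time."""
    hms = stamp.strip("[]")
    utc = datetime.strptime(f"{utc_date} {hms}", "%Y-%m-%d %H:%M:%S")
    return utc.replace(tzinfo=timezone.utc).astimezone(LOCAL)

# The restart at [09:35:06] UTC on July 5:
print(log_time_to_local("[09:35:06]", "2012-07-05").strftime("%Y-%m-%d %H:%M:%S"))
# prints 2012-07-05 05:35:06
```

Note that a stamp shortly after midnight UTC lands on the previous local day.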
 
Odd, this shouldn't normally happen.

Can you check your logs for instances of:
Code:
Reading file work/wuresults_03.dat from core
at times other than 09:35:06 ?


Looks like the successfully returned unit wasn't deleted and its queue entry
didn't get updated either. Hence this happened --
Code:
[10:24:26] Posted data.
[10:24:27] Initial: 0000; - Uploaded at ~73 kB/s
[10:24:27] - Averaged speed for that direction ~64 kB/s
[10:24:27] - Server has already received unit.

You should delete wuresults_03.dat manually so the client ceases
return attempts (otherwise it will be retrying "indefinitely"*).

*) until slot 3 gets reused (read: days)

You said you killed the client and started it again... -- Q: how did you stop the client?
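
The log check asked for above could be sketched roughly like this. It is only an illustration: the function name is made up, and the sample lines are copied from the log earlier in the thread. To run it against the real log, read the text from FAHlog.txt instead of the sample string.

```python
import re

NEEDLE = "Reading file work/wuresults_03.dat from core"
STAMP = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\]")

def find_reads(log_text: str, needle: str = NEEDLE) -> list[str]:
    """Return the timestamp of every log line that contains `needle`."""
    hits = []
    for line in log_text.splitlines():
        if needle in line:
            m = STAMP.match(line)
            hits.append(m.group(1) if m else "?")  # "?" for lines without a stamp
    return hits

# Sample lines taken from the log posted above.
sample = """\
[01:40:51] - Reading file work/wuresults_03.dat from core
[01:40:51]   (Read 222500575 bytes from disk)
[04:15:57] - Reading file work/wuresults_03.dat from core
[04:23:29] - Reading file work/wuresults_03.dat from core
[09:35:06] - Reading file work/wuresults_03.dat from core
"""

print(find_reads(sample))
# prints ['01:40:51', '04:15:57', '04:23:29', '09:35:06']
```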
 
Tear, I used Ctrl+C to stop the client in the terminal window. I still have work results 2 and 3 in my work folder. For whatever reason, the client is not deleting the units once it has uploaded them to the results server. :confused: The line "Reading file work/wuresults_03.dat" shows up several times in my log: at least three that I can see, at 1:40, 4:15 and 4:23. I'm only 7 frames into my current unit. Should I delete the existing work folder and start fresh?



ChelseaOilman...where do you find that info for work units?
 
Very odd. (Ctrl+C is absolutely fine)

Did you set up langouste on your machine? (just trying to determine all the variables)

Also, can you paste your client.cfg? (with the exception of the passkey, ofc)
 
Deleting returned wuresults_XX.dat should be enough.

I suspect that deleting work/ directory is not going to cure the problem (of the client not deleting
returned results and not updating queue.dat accordingly) and we may need to dig in... again ;-)
 
No langouste on my machine that I'm aware of, and here's my client.cfg file. Is there any way to tell whether I will get credit for work unit 3 before I manually delete the file?
Code:
[settings]
username=freeloader
team=33
passkey=********************************
asknet=no
machineid=1
bigpackets=big
extra_parms=-smp 16 -bigbeta -verbosity 9
local=1

[http]
active=no
host=localhost
port=8080

[core]
addr=
 
Per Chelsea, your unit 03 (P6903 R9 C12 G99) has been credited, so it looks like
the issue has already occurred twice (unit returned successfully but not marked as such).

I'd say it's safe to remove results from both slots (02 and 03).

My proposal is to watch what happens w/ your current unit (04) -- please avoid restarting
the client so we can see what the client does w/o intervention.
 
So is it possible that my client processed the same unit twice? Both have identical PRCG elements: 6903 (Run 9, Clone 12, Gen 99). One was done and credited on July 1st, and the other just finished this morning, as you can see in my logs above. Very strange indeed. My current work folder now looks like this...

[Attached image: work_folder.png]
 
I have seen this happen before on a few occasions, always when there was a connection problem on my end. The F@H log would say it could not connect to the assignment server, but F@H was connected to the collection server over a very slow connection, between 0kb and 90kb, and I would not notice for hours. I would then shut F@H down, reboot the computer and start F@H again, at which point the connection problem would be gone. F@H would send the WU again and the collection server would respond that it had already received this unit.

If I remember correctly, it would attempt to send the WU a few more times and the CS would refuse, but upon completion and transmission of the next WU the work directory would get cleaned up.
 
A-ha... so one of the symptoms (slot 3 WU already received) could be explained by unit
getting re-issued. Question is why it (slot 3 got same WU as slot 2) happened...

Ok, if you're up for more digging --
1. Delete wuresults_02.dat and wuresults_03.dat -- they're of no use (EDIT -- ok, seems this one's taken care of already)
2. Keep the client running and also make sure there's only one
  client instance running (sanity check) -- ps auxw | grep fah6
3. Download, build and run qd, then paste results please:
Code:
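# Fetch and build qd (prints the client's queue) in your home directory
cd ~
wget http://linuxminded.nl/software/qd-tools/source/qd/qd.c
gcc -o qd qd.c -lm
# Run it from the client's directory (replace fahdirectory with your own, e.g. ~/fah)
cd fahdirectory
~/qd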
4. No matter what issue manifests next, don't shut the client down -- paste complete
  FAHlog.txt (e.g. to pastebin.com or by means of pastebinit app) or just e-mail it
  to me (tear@braxis.org)
 
I will not shut the client down no matter what.

Results from grep...


Code:
mike@MilkyWayG34:~$ ps auxw | grep fah6
mike      2127  0.0  0.0 233064   936 pts/2    Sl+  Jul05   0:06 ./fah6
mike      9289  0.0  0.0  13580   888 pts/3    S+   07:01   0:00 grep --color=auto fah6
mike@MilkyWayG34:~$





Here's the results from the last command.

Code:
mike@MilkyWayG34:~/fah$ ./qd
qd released 26 June 2012 (fr 086)
qd executed Fri Jul 06 06:55:03 EDT 2012 (Fri Jul 06 10:55:03 UTC 2012)
Queue version 6.00
Current index: 4
 Index 5: empty
 Index 6: empty
 Index 7: empty
 Index 8: empty
 Index 9: empty
 Index 0: empty
 Index 1: deleted 68.00 pts
  server: 171.67.108.52:8080; project: 6872
  Folding: run 535, clone 1, generation 265; benchmark 0; misc: 500, 200, 6 (be)
  issue: Wed Jun 27 10:08:44 2012; begin: Wed Jun 27 10:08:45 2012
  end: ZERO; due: Wed Jul 18 10:08:45 2012 (21 days)
  preferred: Wed Jul 11 10:08:45 2012 (14 days)
  core URL: http://www.stanford.edu/~pande/Linux/AMD64/Core_78.fah
  core number: 0x78; core name: GROMACS
  CPU: 16,0 AMD64; OS: 4,0 Linux
  memory: 16048 MB
  assignment info (be): Wed Jun 27 10:06:34 2012; BC27E421
  CS: 171.67.108.49; P limit: 524286976
  user: freeloader; team: 33; ID: BDA63F28C6F62F3E; mach ID: 1
  work/wudata_01.dat file size: 346604; WU type: Folding@home
 Index 2: finished 22706.00 pts (272.531 pt/hr, 6534.21 ppd) 3.46 X min speed
  bonus pts: 260276.42 (3120.872 pt/hr, 74900.94 ppd); bonus factor: 11.46; kfactor: 38.05
  server: 130.237.232.237:8080; project: 6903
  Folding: run 9, clone 12, generation 99; benchmark 0; misc: 500, 634, 12 (be)
  issue: Wed Jun 27 10:22:23 2012; begin: Wed Jun 27 10:27:23 2012
  end: Sat Jun 30 21:46:18 2012; due: Mon Jul  9 10:27:23 2012 (12 days)
  preferred: Mon Jul  2 10:27:23 2012 (5 days)
  core URL: http://www.stanford.edu/~pande/Linux/AMD64/beta/Core_a5.fah (V2.27)
  core number: 0xa5; core name: GRO-A5
  CPU: 16,0 AMD64; OS: 4,0 Linux
  smp cores: 16; cores to use: 16
  memory: 16048 MB
  client type: 6 Big Beta
  assignment info (be): Wed Jun 27 10:20:13 2012; 95897C0F
  P limit: 524286976
  user: freeloader; team: 33; ID: CD2F2740AC349F49; mach ID: 1
  work/wudata_02.dat file size: 57242762; WU type: Folding@home
 Index 3: finished 22706.00 pts (260.251 pt/hr, 6243.09 ppd) 3.3 X min speed
  bonus pts: 254412.34 (2914.648 pt/hr, 69951.55 ppd); bonus factor: 11.20; kfactor: 38.05
  server: 130.237.232.237:8080; project: 6903
  Folding: run 9, clone 12, generation 99; benchmark 0; misc: 500, 634, 12 (be)
  issue: Sun Jul  1 06:23:36 2012; begin: Sun Jul  1 06:26:03 2012
  end: Wed Jul  4 21:40:51 2012; due: Fri Jul 13 06:26:03 2012 (12 days)
  preferred: Fri Jul  6 06:26:03 2012 (5 days)
  core URL: http://www.stanford.edu/~pande/Linux/AMD64/beta/Core_a5.fah (V2.27)
  core number: 0xa5; core name: GRO-A5
  CPU: 16,0 AMD64; OS: 4,0 Linux
  smp cores: 16; cores to use: 16
  memory: 16048 MB
  client type: 6 Big Beta
  assignment info (be): Sun Jul  1 06:21:26 2012; 95764E34
  P limit: 524286976
  user: freeloader; team: 33; ID: CD2F2740AC349F49; mach ID: 1
  work/wudata_03.dat file size: 57242762; WU type: Folding@home
 Index 4: folding now 22706.00 pts (278.911 pt/hr, 6688.55 ppd) 3.54 X min speed; 31% complete
  bonus pts: 263332.34 (1001.950 pt/hr, 77570.31 ppd); bonus factor: 11.60; kfactor: 38.05
  server: 130.237.232.237:8080; project: 6903
  Folding: run 8, clone 11, generation 120; benchmark 0; misc: 500, 634, 12 (be)
  issue: Thu Jul  5 05:35:10 2012; begin: Thu Jul  5 05:39:03 2012
  expect: Sun Jul  8 15:03:37 2012; due: Tue Jul 17 05:39:03 2012 (12 days)
  preferred: Tue Jul 10 05:39:03 2012 (5 days)
  core URL: http://www.stanford.edu/~pande/Linux/AMD64/beta/Core_a5.fah (V2.27)
  core number: 0xa5; core name: GRO-A5
  CPU: 16,0 AMD64; OS: 4,0 Linux
  smp cores: 16; cores to use: 16
  flops: 1060270729 (1060.270729 megaflops)
  memory: 16048 MB
  client type: 6 Big Beta
  assignment info (be): Thu Jul  5 05:32:57 2012; 957CB4DB
  P limit: 524286976
  user: freeloader; team: 33; ID: CD2F2740AC349F49; mach ID: 1
  work/wudata_04.dat file size: 57238663; WU type: Folding@home
Average download rate 307.562 KB/s (u=4); upload rate 66.157 KB/s (u=2)
Performance fraction 0.697060 (u=1)
Average pph: 267.881, ppd: 6429.15, ppw: 45004.1, ppy: 2348183
Average bonus pph: 2364.852, ppd: 56756.45, ppw: 397295.1, ppy: 20729725
Average alternate pph: 270.138, ppd: 6483.31, ppw: 45383.1, ppy: 2367962
Average alternate bonus pph: 3085.423, ppd: 74050.15, ppw: 518351.1, ppy: 27046077
mike@MilkyWayG34:~/fah$
 
Thank you!

qd output confirms slots 2 and 3 getting same WU... even though issue dates
are several days apart... :eek:

Don't have a better idea than to continue observations...
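
For anyone wanting to spot this without reading the whole dump, the duplicate check could be sketched as below. This is a hypothetical helper, not part of qd: it assumes the text layout shown in the output above ("Index N:", "project: N", "Folding: run N, clone N, generation N") and reports every PRCG that appears in more than one queue slot.

```python
import re
from collections import defaultdict

INDEX = re.compile(r"^\s*Index (\d+):")
PROJECT = re.compile(r"project: (\d+)")
FOLDING = re.compile(r"Folding: run (\d+), clone (\d+), generation (\d+)")

def duplicate_units(qd_text: str) -> dict:
    """Map each (project, run, clone, gen) seen in 2+ slots to its slot indices."""
    slots = defaultdict(list)
    index = project = None
    for line in qd_text.splitlines():
        m = INDEX.match(line)
        if m:  # a new queue slot starts here
            index, project = int(m.group(1)), None
            continue
        m = PROJECT.search(line)
        if m:
            project = int(m.group(1))
        m = FOLDING.search(line)
        if m and index is not None and project is not None:
            run, clone, gen = map(int, m.groups())
            slots[(project, run, clone, gen)].append(index)
    return {prcg: idx for prcg, idx in slots.items() if len(idx) > 1}

# Abbreviated excerpt of the qd output posted above.
sample = """\
 Index 0: empty
 Index 1: deleted 68.00 pts
  server: 171.67.108.52:8080; project: 6872
  Folding: run 535, clone 1, generation 265; benchmark 0; misc: 500, 200, 6 (be)
 Index 2: finished 22706.00 pts
  server: 130.237.232.237:8080; project: 6903
  Folding: run 9, clone 12, generation 99; benchmark 0; misc: 500, 634, 12 (be)
 Index 3: finished 22706.00 pts
  server: 130.237.232.237:8080; project: 6903
  Folding: run 9, clone 12, generation 99; benchmark 0; misc: 500, 634, 12 (be)
 Index 4: folding now 22706.00 pts
  server: 130.237.232.237:8080; project: 6903
  Folding: run 8, clone 11, generation 120; benchmark 0; misc: 500, 634, 12 (be)
"""

print(duplicate_units(sample))
# prints {(6903, 9, 12, 99): [2, 3]}
```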
 
Thanks Tear. This current unit is at 42% so I'll be able to see what happens tomorrow night sometime. I've also switched over my connection from wireless to a cable to see if the wireless was causing problems during the transmission of a completed unit.
 