FAH message: "Server reports problem with unit"

Pocatello

DC Moderator and [H]ard DCOTM x6
Staff member
Joined
Jun 15, 2005
Messages
6,703
Code:
                       1d01h42:59
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:   1167.152     61.696      0.933     25.716

Thanx for Using GROMACS - Have a Nice Day

[18:30:00] DynamicWrapper: Finished Work Unit: sleep=10000
[18:30:10] 
[18:30:10] Finished Work Unit:
[18:30:10] - Reading up to 64206000 from "work/wudata_02.trr": Read 64206000
[18:30:10] trr file hash check passed.
[18:30:10] - Reading up to 31551092 from "work/wudata_02.xtc": Read 31551092
[18:30:10] xtc file hash check passed.
[18:30:10] edr file hash check passed.
[18:30:10] logfile size: 206379
[18:30:10] Leaving Run
[18:30:13] - Writing 96124347 bytes of core data to disk...
[18:30:30] Done: 96123835 -> 91350651 (compressed to 5.6 percent)
[18:30:30]   ... Done.
[18:30:37] - Shutting down core
[18:30:37] 
[18:30:37] Folding@home Core Shutdown: FINISHED_UNIT
[18:30:38] CoreStatus = 64 (100)
[18:30:38] Unit 2 finished with 64 percent of time to deadline remaining.
[18:30:38] Updated performance fraction: 0.590027
[18:30:38] Sending work to server
[18:30:38] Project: 8104 (Run 0, Clone 3, Gen 158)


[18:30:38] + Attempting to send results [August 14 18:30:38 UTC]
[18:30:38] - Reading file work/wuresults_02.dat from core
[18:30:38]   (Read 91351163 bytes from disk)
[18:30:38] Connecting to http://128.143.231.201:8080/
[18:35:46] Posted data.
[18:35:46] Initial: 0000; - Uploaded at ~289 kB/s
[18:35:46] - Averaged speed for that direction ~280 kB/s
[18:35:46] - Server reports problem with unit.
[18:35:46] Trying to send all finished work units
[18:35:46] + No unsent completed units remaining.
[18:35:46] - Preparing to get new work unit...
[18:35:46] Cleaning up work directory
[18:35:47] + Attempting to get work packet
[18:35:47] Passkey found
[18:35:47] - Will indicate memory of 8000 MB
[18:35:47] - Connecting to assignment server
[18:35:47] Connecting to http://assign.stanford.edu:8080/
[18:35:47] Posted data.
[18:35:47] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[18:35:47] + News From Folding@Home: Welcome to Folding@Home
[18:35:48] Loaded queue successfully.
[18:35:48] Sent data
[18:35:48] Connecting to http://128.143.231.201:8080/
[18:35:54] Posted data.
[18:35:54] Initial: 0000; - Receiving payload (expected size: 30302961)
[18:36:05] - Downloaded at ~2690 kB/s
[18:36:05] - Averaged speed for that direction ~2867 kB/s
[18:36:05] + Received work.
[18:36:05] Trying to send all finished work units
[18:36:05] + No unsent completed units remaining.
[18:36:05] + Closed connections
[18:36:05] 
[18:36:05] + Processing work unit
[18:36:05] Core required: FahCore_a5.exe
[18:36:05] Core found.
[18:36:05] Working on queue slot 03 [August 14 18:36:05 UTC]
[18:36:05] + Working ...
[18:36:05] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 03 -np 24 -checkpoint 5 -verbose -lifeline 2176 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Jul 21 12:20:31 MDT 2012 by h@theater)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 4334
thekraken: Logging to thekraken.log
[18:36:05] 
[18:36:05] *------------------------------*
[18:36:05] Folding@Home Gromacs SMP Core
[18:36:05] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[18:36:05] 
[18:36:05] Preparing to commence simulation
[18:36:05] - Looking at optimizations...
[18:36:05] - Created dyn
[18:36:05] - Files status OK
[18:36:07] - Expanded 30302449 -> 33158020 (decompressed 109.4 percent)
[18:36:07] Called DecompressByteArray: compressed_data_size=30302449 data_size=33158020, decompressed_data_size=33158020 diff=0
[18:36:07] - Digital signature verified
[18:36:07] 
[18:36:07] Project: 8101 (Run 1, Clone 4, Gen 302)
[18:36:07] 
[18:36:07] Assembly optimizations on if available.
[18:36:07] Entering M.D.
                         :-)  G  R  O  M  A  C  S  (-:

                   Groningen Machine for Chemical Simulation

                            :-)  VERSION 4.5.3  (-:

        Written by Emile Apol, Rossen Apostolov, Herman J.C. Berendsen,
      Aldert van Buuren, Pär Bjelkmar, Rudi van Drunen, Anton Feenstra, 
        Gerrit Groenhof, Peter Kasson, Per Larsson, Pieter Meulenhoff, 
           Teemu Murtola, Szilard Pall, Sander Pronk, Roland Schulz, 
                Michael Shirts, Alfons Sijbers, Peter Tieleman,

               Berk Hess, David van der Spoel, and Erik Lindahl.

       Copyright (c) 1991-2000, University of Groningen, The Netherlands.
            Copyright (c) 2001-2010, The GROMACS development team at
        Uppsala University & The Royal Institute of Technology, Sweden.
            check out http://www.gromacs.org for more information.


                               :-)  Gromacs  (-:

Reading file work/wudata_03.tpr, VERSION 4.5.5-dev-20120903-d64b9e3 (single precision)
[18:36:14] Mapping NT from 24 to 24 
Starting 24 threads
Making 2D domain decomposition 6 x 4 x 1
starting mdrun 'FP_membrane in water'
75750000 steps, 303000.0 ps (continuing from step 75500000, 302000.0 ps).
[18:36:20] Completed 0 out of 250000 steps  (0%)

NOTE: Turning on dynamic load balancing

[19:03:25] Completed 2500 out of 250000 steps  (1%)
[19:29:59] Completed 5000 out of 250000 steps  (2%)
[19:56:35] Completed 7500 out of 250000 steps  (3%)
[20:16:11] - Autosending finished units... [August 14 20:16:11 UTC]
[20:16:11] Trying to send all finished work units
[20:16:11] + No unsent completed units remaining.
[20:16:11] - Autosend completed
[20:23:09] Completed 10000 out of 250000 steps  (4%)
[20:49:41] Completed 12500 out of 250000 steps  (5%)
[21:16:13] Completed 15000 out of 250000 steps  (6%)
[21:42:48] Completed 17500 out of 250000 steps  (7%)
[22:09:21] Completed 20000 out of 250000 steps  (8%)
[22:35:53] Completed 22500 out of 250000 steps  (9%)
[23:02:28] Completed 25000 out of 250000 steps  (10%)
[23:29:01] Completed 27500 out of 250000 steps  (11%)
[23:55:35] Completed 30000 out of 250000 steps  (12%)
[00:22:11] Completed 32500 out of 250000 steps  (13%)
[00:48:45] Completed 35000 out of 250000 steps  (14%)
[01:15:20] Completed 37500 out of 250000 steps  (15%)
[01:41:57] Completed 40000 out of 250000 steps  (16%)
[02:08:33] Completed 42500 out of 250000 steps  (17%)
[02:16:11] - Autosending finished units... [August 15 02:16:11 UTC]
[02:16:11] Trying to send all finished work units
[02:16:11] + No unsent completed units remaining.
[02:16:11] - Autosend completed
[02:35:09] Completed 45000 out of 250000 steps  (18%)
[03:01:44] Completed 47500 out of 250000 steps  (19%)
[03:28:20] Completed 50000 out of 250000 steps  (20%)
[03:54:54] Completed 52500 out of 250000 steps  (21%)
[04:21:30] Completed 55000 out of 250000 steps  (22%)
[04:48:06] Completed 57500 out of 250000 steps  (23%)
[05:14:40] Completed 60000 out of 250000 steps  (24%)
[05:41:16] Completed 62500 out of 250000 steps  (25%)
[06:07:54] Completed 65000 out of 250000 steps  (26%)
[06:34:28] Completed 67500 out of 250000 steps  (27%)
[07:01:03] Completed 70000 out of 250000 steps  (28%)
[07:27:40] Completed 72500 out of 250000 steps  (29%)
[07:54:14] Completed 75000 out of 250000 steps  (30%)
[08:16:11] - Autosending finished units... [August 15 08:16:11 UTC]
[08:16:11] Trying to send all finished work units
[08:16:11] + No unsent completed units remaining.
[08:16:11] - Autosend completed
[08:20:48] Completed 77500 out of 250000 steps  (31%)
[08:47:23] Completed 80000 out of 250000 steps  (32%)
[09:13:57] Completed 82500 out of 250000 steps  (33%)
[09:40:32] Completed 85000 out of 250000 steps  (34%)
[10:07:08] Completed 87500 out of 250000 steps  (35%)
[10:33:41] Completed 90000 out of 250000 steps  (36%)
[11:00:15] Completed 92500 out of 250000 steps  (37%)
[11:26:51] Completed 95000 out of 250000 steps  (38%)
[11:53:25] Completed 97500 out of 250000 steps  (39%)
[12:20:00] Completed 100000 out of 250000 steps  (40%)
[12:46:36] Completed 102500 out of 250000 steps  (41%)
[13:13:11] Completed 105000 out of 250000 steps  (42%)
[13:39:46] Completed 107500 out of 250000 steps  (43%)
[14:06:21] Completed 110000 out of 250000 steps  (44%)
[14:16:11] - Autosending finished units... [August 15 14:16:11 UTC]
[14:16:11] Trying to send all finished work units
[14:16:11] + No unsent completed units remaining.
[14:16:11] - Autosend completed
[14:32:57] Completed 112500 out of 250000 steps  (45%)
[14:59:31] Completed 115000 out of 250000 steps  (46%)
[15:26:06] Completed 117500 out of 250000 steps  (47%)
[15:52:43] Completed 120000 out of 250000 steps  (48%)
[16:19:17] Completed 122500 out of 250000 steps  (49%)
[16:45:54] Completed 125000 out of 250000 steps  (50%)
[17:12:31] Completed 127500 out of 250000 steps  (51%)
[17:39:06] Completed 130000 out of 250000 steps  (52%)
[18:05:42] Completed 132500 out of 250000 steps  (53%)
[18:32:21] Completed 135000 out of 250000 steps  (54%)
[18:58:57] Completed 137500 out of 250000 steps  (55%)
[19:25:33] Completed 140000 out of 250000 steps  (56%)
[19:52:11] Completed 142500 out of 250000 steps  (57%)
[20:16:11] - Autosending finished units... [August 15 20:16:11 UTC]
[20:16:11] Trying to send all finished work units
[20:16:11] + No unsent completed units remaining.
[20:16:11] - Autosend completed
[20:18:48] Completed 145000 out of 250000 steps  (58%)
[20:45:22] Completed 147500 out of 250000 steps  (59%)
[21:12:00] Completed 150000 out of 250000 steps  (60%)
[21:38:37] Completed 152500 out of 250000 steps  (61%)
[22:05:11] Completed 155000 out of 250000 steps  (62%)
[22:31:47] Completed 157500 out of 250000 steps  (63%)
[22:58:22] Completed 160000 out of 250000 steps  (64%)
[23:24:55] Completed 162500 out of 250000 steps  (65%)
[23:51:31] Completed 165000 out of 250000 steps  (66%)
[00:18:05] Completed 167500 out of 250000 steps  (67%)
[00:44:41] Completed 170000 out of 250000 steps  (68%)
[01:11:16] Completed 172500 out of 250000 steps  (69%)
[01:37:54] Completed 175000 out of 250000 steps  (70%)
[02:04:32] Completed 177500 out of 250000 steps  (71%)
[02:16:11] - Autosending finished units... [August 16 02:16:11 UTC]
[02:16:11] Trying to send all finished work units
[02:16:11] + No unsent completed units remaining.
[02:16:11] - Autosend completed
[02:31:09] Completed 180000 out of 250000 steps  (72%)
[02:57:45] Completed 182500 out of 250000 steps  (73%)
[03:24:21] Completed 185000 out of 250000 steps  (74%)
[03:50:59] Completed 187500 out of 250000 steps  (75%)
[04:17:36] Completed 190000 out of 250000 steps  (76%)
[04:44:14] Completed 192500 out of 250000 steps  (77%)
[05:10:51] Completed 195000 out of 250000 steps  (78%)
[05:37:29] Completed 197500 out of 250000 steps  (79%)
[06:04:06] Completed 200000 out of 250000 steps  (80%)
[06:30:45] Completed 202500 out of 250000 steps  (81%)
[06:57:22] Completed 205000 out of 250000 steps  (82%)
[07:23:58] Completed 207500 out of 250000 steps  (83%)
[07:50:34] Completed 210000 out of 250000 steps  (84%)
[08:16:11] - Autosending finished units... [August 16 08:16:11 UTC]
[08:16:11] Trying to send all finished work units
[08:16:11] + No unsent completed units remaining.
[08:16:11] - Autosend completed
[08:17:12] Completed 212500 out of 250000 steps  (85%)
[08:43:48] Completed 215000 out of 250000 steps  (86%)
[09:10:25] Completed 217500 out of 250000 steps  (87%)
[09:37:03] Completed 220000 out of 250000 steps  (88%)
[10:03:41] Completed 222500 out of 250000 steps  (89%)
[10:30:18] Completed 225000 out of 250000 steps  (90%)
[10:56:56] Completed 227500 out of 250000 steps  (91%)
[11:23:33] Completed 230000 out of 250000 steps  (92%)
[11:50:10] Completed 232500 out of 250000 steps  (93%)
[12:16:47] Completed 235000 out of 250000 steps  (94%)
[12:43:23] Completed 237500 out of 250000 steps  (95%)
[13:09:58] Completed 240000 out of 250000 steps  (96%)
[13:36:37] Completed 242500 out of 250000 steps  (97%)
[14:03:13] Completed 245000 out of 250000 steps  (98%)
[14:16:11] - Autosending finished units... [August 16 14:16:11 UTC]
[14:16:11] Trying to send all finished work units
[14:16:11] + No unsent completed units remaining.
[14:16:11] - Autosend completed
[14:29:49] Completed 247500 out of 250000 steps  (99%)
[14:56:25] Completed 250000 out of 250000 steps  (100%)

Writing final coordinates.

 Average load imbalance: 0.2 %
 Part of the total run time spent waiting due to load imbalance: 0.0 %
 Steps where the load balancing was limited by -rdd, -rcon and/or -dds: X 0 % Y 0 %


	Parallel run - timing based on wallclock.

               NODE (s)   Real (s)      (%)
       Time: 159615.192 159615.192    100.0
                       1d20h20:15
               (Mnbf/s)   (GFlops)   (ns/day)  (hour/ns)
Performance:    681.044     35.285      0.541     44.337

Thanx for Using GROMACS - Have a Nice Day

[14:56:34] DynamicWrapper: Finished Work Unit: sleep=10000
[14:56:44] 
[14:56:44] Finished Work Unit:
[14:56:44] - Reading up to 64340496 from "work/wudata_03.trr": Read 64340496
[14:56:45] trr file hash check passed.
[14:56:45] - Reading up to 31618600 from "work/wudata_03.xtc": Read 31618600
[14:56:45] xtc file hash check passed.
[14:56:45] edr file hash check passed.
[14:56:45] logfile size: 220647
[14:56:45] Leaving Run
[14:56:49] - Writing 96340619 bytes of core data to disk...
[14:57:07] Done: 96340107 -> 91562097 (compressed to 5.8 percent)
[14:57:07]   ... Done.
[14:57:14] - Shutting down core
[14:57:14] 
[14:57:14] Folding@home Core Shutdown: FINISHED_UNIT
[14:57:15] CoreStatus = 64 (100)
[14:57:15] Unit 3 finished with 54 percent of time to deadline remaining.
[14:57:15] Updated performance fraction: 0.572682
[14:57:15] Sending work to server
[14:57:15] Project: 8101 (Run 1, Clone 4, Gen 302)


[14:57:15] + Attempting to send results [August 16 14:57:15 UTC]
[14:57:15] - Reading file work/wuresults_03.dat from core
[14:57:15]   (Read 91562609 bytes from disk)
[14:57:15] Connecting to http://128.143.231.201:8080/
[15:02:24] Posted data.
[15:02:24] Initial: 0000; - Uploaded at ~289 kB/s
[15:02:24] - Averaged speed for that direction ~283 kB/s



[COLOR="Red"][B][15:02:24] - Server reports problem with unit.[/B][/COLOR]


[15:02:24] Trying to send all finished work units
[15:02:24] + No unsent completed units remaining.
[15:02:24] - Preparing to get new work unit...
[15:02:24] Cleaning up work directory
[15:02:25] + Attempting to get work packet
[15:02:25] Passkey found
[15:02:25] - Will indicate memory of 8000 MB
[15:02:25] - Connecting to assignment server
[15:02:25] Connecting to http://assign.stanford.edu:8080/
[15:02:26] Posted data.
[15:02:26] Initial: 8F80; - Successful: assigned to (128.143.231.201).
[15:02:26] + News From Folding@Home: Welcome to Folding@Home
[15:02:26] Loaded queue successfully.
[15:02:26] Sent data
[15:02:26] Connecting to http://128.143.231.201:8080/
[15:02:34] Posted data.
[15:02:34] Initial: 0000; - Receiving payload (expected size: 30340239)
[15:02:43] - Downloaded at ~3292 kB/s
[15:02:43] - Averaged speed for that direction ~2973 kB/s
[15:02:43] + Received work.
[15:02:43] Trying to send all finished work units
[15:02:43] + No unsent completed units remaining.
[15:02:43] + Closed connections
[15:02:43] 
[15:02:43] + Processing work unit
[15:02:43] Core required: FahCore_a5.exe
[15:02:43] Core found.
[15:02:43] Working on queue slot 04 [August 16 15:02:43 UTC]
[15:02:43] + Working ...
[15:02:43] - Calling './FahCore_a5.exe -dir work/ -nice 19 -suffix 04 -np 24 -checkpoint 5 -verbose -lifeline 2176 -version 634'

thekraken: The Kraken 0.7-pre15 (compiled Sat Jul 21 12:20:31 MDT 2012 by h@theater)
thekraken: Processor affinity wrapper for Folding@Home
thekraken: The Kraken comes with ABSOLUTELY NO WARRANTY; licensed under GPLv2
thekraken: PID: 6041
thekraken: Logging to thekraken.log
[15:02:43] 
[15:02:43] *------------------------------*
[15:02:43] Folding@Home Gromacs SMP Core
[15:02:43] Version 2.27 (Thu Feb 10 09:46:40 PST 2011)
[15:02:43] 
[15:02:43] Preparing to commence simulation
[15:02:43] - Looking at optimizations...
[15:02:43] - Created dyn
[15:02:43] - Files status OK
[15:02:45] - Expanded 30339727 -> 33163648 (decompressed 109.3 percent)
[15:02:45] Called DecompressByteArray: compressed_data_size=30339727 data_size=33163648, decompressed_data_size=33163648 diff=0
[15:02:45] - Digital signature verified
[15:02:45] 
[15:02:45] Project: 8103 (Run 1, Clone 42, Gen 139)
[15:02:45] 
[15:02:45] Assembly optimizations on if available.
[15:02:45] Entering M.D.
                         :-)  G  R  O  M  A  C  S  (-:

"Server reports problem with unit"

I put some blank lines in the code above to make the error message stand out better.

Let me know if I left any secrets in there that should not be posted.

I did not get any points for this WU, I did not even get credit for one WU.

Any ideas what is up? Everything seems fine on my end.

system stats:

SR2 with two 6-core Intel ES chips running at 3.15 Ghz.
12 GB memory
Linux ( I think 10.0)
Dedicated folding rig.

Thanks.
 
i used the FAH Ubuntu installation guide. This is the first time I have seen this message.
 
Snip
Thanks.


Have you tried another WU since then and had the same results? If not, Id go ahead and try another WU before I decided to reinstall. It doesnt seem like it is something on your end because the WU actually completed according to those logs.

Sounds like it may be something to do with the receiving server and nothing you could have prevented. Maybe a duplicate WU or something to that effect.
 
I'll go with m33p on this one. I have seen these, and they piss me off also. But, there is nothing you can do about them. Keep an eye on the next unit and see if it has any issues.
 
I will wait.

The current WU is going to finish later today. Everything seems fine.
 
I will wait.

The current WU is going to finish later today. Everything seems fine.

I know Ive heard people say in the past that when someone decides the dont like a WU (IE and 8101) and they go and delete it to try and get a new one, that it will often times screw quite a few people down the line. Has something to do with it being assigned to the first person and then reassigned and completed by you.
 
Back
Top