Linux versus Windows - bigadv folding results

I suppose I'll just need to try to get GPUs working again, then, as the benefit to -bigadv still leaves me 10k ppd short of where i was with windows and 2x 460s
 
Got this compiled, and I also downloaded and compiled source from ck-sources. "Eselect" sees all three sources so I'm good to pick and choose.

If you want, I could try both out and let you know if I see any differences on the SR-2.

When I generated the "genkernel" it also asked me to add an "initrd" (initialize root disk) entry into GRUB.CONF under /boot/grub/grub.conf after the initial "kernel" entry. When I do a manual compiling of the kernel it doesn't list this as required in grub.conf, nor the "real_root" entry.

Is this ok, or does the kernel with manual config have a built-in module for this? I just want to make sure I don't have to use the liveCD again because I'm a n00b.

Thanks much!

Once you have a kernel with BFS, you will see better gains. When you configure the kernel, make sure you set the following:

General Setup ->
Select BFS
Zen-Tune Profile = Server

Processor Type and Features ->
Processor Family = Core 2/newer Xeon
Preemption Model = No Forced Preemption (Server)
Timer Frequency = 100 Hz.

I also do a bunch of stuff such as removing all drivers except for what I need, removing AMD support (since it's an Intel box), etc., etc. Just be careful and you'll be fine.
 
I suppose I'll just need to try to get GPUs working again, then, as the benefit to -bigadv still leaves me 10k ppd short of where i was with windows and 2x 460s
Two GTX 460s will net you approaching 30K PPD all by themselves provided with good OCs. It would be difficult to extract that much more from an i7 with 8 threads, four of which are real cores, to match or exceed the production of two Fermis to be advantageous in removing them, even with the most efficient OS possible.
 
Got this compiled, and I also downloaded and compiled source from ck-sources. "Eselect" sees all three sources so I'm good to pick and choose.

If you want, I could try both out and let you know if I see any differences on the SR-2.

When I generated the "genkernel" it also asked me to add an "initrd" (initialize root disk) entry into GRUB.CONF under /boot/grub/grub.conf after the initial "kernel" entry. When I do a manual compiling of the kernel it doesn't list this as required in grub.conf, nor the "real_root" entry.

Is this ok, or does the kernel with manual config have a built-in module for this? I just want to make sure I don't have to use the liveCD again because I'm a n00b.

Thanks much!

Only Genkernel kernels needs the initrd. If you compiled a kernel from source without using Genkernel, you can just comment out the initrd line.
 
Two GTX 460s will net you approaching 30K PPD all by themselves provided with good OCs. It would be difficult to extract that much more from an i7 with 8 threads, four of which are real cores, to match or exceed the production of two Fermis to be advantageous in removing them, even with the most efficient OS possible.

I realize... I'm trying to get the 460s running under wine with the gpu3 client. I'd say I'm about half way there... then I need to figure out overclocking under linux for those..... should be fun =)
 
I realize... I'm trying to get the 460s running under wine with the gpu3 client. I'd say I'm about half way there... then I need to figure out overclocking under linux for those..... should be fun =)
Now that would be very interesting. I might try this on one system that could absolutely use the benefits of Linux and it has one GPU client running.
 
getting tpfs of 29:41 on an i7 920 at 3.9ghz with c-states enabled on gentobit. compared to my old windows times of 36:15 with -smp 7, that doesn't seem like as much of a gain as I should be seeing. Any suggestions?
Are you running zen-kernel and did you set the "Zen Profile" to server? It makes a big difference compared to the other options.
 
Tobit, the kernel config I'm using is the one you did for musky, with a couple of drivers added in for my hardware... so yes.
 
Tobit, the kernel config I'm using is the one you did for musky, with a couple of drivers added in for my hardware... so yes.
Then no, I had tried telling you in IRC that we had to make one change to his config after you downloaded it but you kept ping timing out. Switching the zen-profile to server increased his performance considerably, saving upwards of 3m per frame.

Run "make menuconfig" on your kernel tree and set zen-profile to server and recompile.
 
trying now... presumably you mean the one that was set to "Desktop" before? will post updated tpfs in an hour or two once they've stabilised.
 
Probably dumb on my part to use genkernel and core2 CFLags, but that's how one learns, and this is more exciting than my job right now which has me learning a 20 year O/S again. That's right, Novell :confused:

If you need any help with that, just shout. I know it very well :)

H.
 
I realize... I'm trying to get the 460s running under wine with the gpu3 client. I'd say I'm about half way there... then I need to figure out overclocking under linux for those..... should be fun =)

the tool available when I did it last couldn't separate the shader from core clock...

I will look around for you though...


the basic way...
Log in as root and open /etc/X11/xorg.conf in your favorite editor.
vi /etc/X11/xorg.conf

scroll down with keypad and Under Device add:

Option "Coolbits" "1"

your looking to make it look like this
Section "Device"
Identifier "Videocard0"
Driver "nvidia"
VendorName "Videocard vendor"
BoardName "nVidia Corporation Unknown device 0391"
Option "Coolbits" "1"
EndSection

to input in vi ...
type i (that puts u in edit mode...)
then what you want "Coolbits" "1"
then hit escape
then type :wq (write quit)

restart and when it comes back up you can overclock in the nvidia control panel...
 
Last edited:
the tool available when I did it last couldn't separate the shader from core clock...

I will look around for you though...
IINM, the core and shader clocks are linked on these cards and they cannot be adjusted independently. My GTS 450 is also linked. I believe it's that way with all the 400-series cards.
 
IINM, the core and shader clocks are linked on these cards and they cannot be adjusted independently. I believe it's that way with all the 400-series cards. My GTS 450 is also linked.

oh... well then coolbits should be enough...
 
I'm not seeing it... this should be in nvidia-settings? coolbits are enabled, 260.19.36 drivers
 
I'm not seeing it... this should be in nvidia-settings? coolbits are enabled, 260.19.36 drivers

should be in your nvidia display settings... after you manually edit that file... save it reboot is should appear...
 
Update - SR2#3 - Ubuntu 10 Musky Mix DFS kernal. Running at 4.3Ghz.

Changed checkpoint from 3 to 12 mins -saves 5 seconds a frame - which I can't recall being the case on windows.Nothing to sneeze at.

So that takes me to consistent 9mins 54sec (Windows average p6901 - 11mins 12sec)

Numa disabled. Cstates enabled. Captured p6901 from Musky.

which takes me to 197,700 ppd = 20.3% more than windows.

Interesting test: c-states has no effect on the x5680 running with turbo off. On the x5660 with turbo on, c-states has a large effect and should be left on.

Musky is a legend.
 
Update - SR2#3 - Ubuntu 10 Musky Mix DFS kernal. Running at 4.3Ghz.

Changed checkpoint from 3 to 12 mins -saves 5 seconds a frame - which I can't recall being the case on windows.Nothing to sneeze at.

So that takes me to consistent 9mins 54sec (Windows average p6901 - 11mins 12sec)

Numa disabled. Cstates enabled. Captured p6901 from Musky.

which takes me to 197,700 ppd = 20.3% more than windows.

Interesting test: c-states has no effect on the x5680 running with turbo off. On the x5660 with turbo on, c-states has a large effect and should be left on. You cut 2 hours off your BigAdv processing time too, compared to Windows. Nice.

Musky is a legend.

Crazy Aussie, I set it to 15 minutes and watch it BURN!!!!!!!!!!!!!! I don't use turbo, but I'll try C-States as well and turn NUMA off (as I mention below).

If you need any help with that, just shout. I know it very well :)

H.

Thanks fellow Opteronite.

My zen kernel died with a kernel panic on boot, crying about mounting /dev/sda3 and not finding dev8, sda3 (or something to that effect) which means that I left it without some important module, I guess. I did follow Tobit's earlier instructions using the "performance server profile" 100hz tick, etc... I also made sure it was multilib, etc...

I will re-check the logs and see what the heck didn't load. My /boot > /dev/sda1 partition is ext2 and / > /dev/sda3 is ext3, and I'm sure those were starred (ie built-in and activated, not set as modules) in make so I'm not sure exactly what the issue is. I'll try and check the logs tonight.

Not wanting to go back to junky genkernel I decided to re-run using CK-sources to have BFS in there. I used the same /boot/grub/grub.conf for the CK kernel so it wasn't that causing issues with Zen Kernel.

That's what's folding right now.

I noticed when I ran "make && make modules_install" it didn't actually run the "make modules_install" part showing an error of "no modules_install" so I just ran it manually using "make modules_install" and that worked. I didn't think this was an issue because I didn't add any modules to the kernel autoload.d file with either Zen or CK.

Since I did not add the "numa emulation" module I will try it with NUMA off to see if there is an improvement on the SR-2. Hopefully there won't be fireworks :)

So that's where it stands.

The AMD skankbox48 went down 3 minutes TPF for a 120k PPD rise with p6901 and the SR-2 went up about 10K. I will continue to play around after the next BigAdv drops on the SR-2. I have to also add BFS to the AMD running Ubuntu. I might try Gentoo on it as well at a later time.

Thanks!
 
Hey all,

A quick update to Musky's post here on how to install CK source/BFS(cheduler):

http://hardforum.com/showpost.php?p=1036935472&postcount=259

If you get a "command not found" error when running "sudo add-apt-repository ppa:chogydan/ppa && sudo apt-get update"

You need to do:

sudo apt-get install python-software-properties

FYI - and thanks Musky for your fine post. Otherwise worked perfectly fine for me on SkankBox48.
 
So I seem to have the exact opposite results with BFS so far. My TPF has gone up by roughly 35 seconds. I think I'm going to save this WU and try a few frames on CFS if my next WU sees similar TPFs.
 
Hey all,

A quick update to Musky's post here on how to install CK source/BFS(cheduler):

http://hardforum.com/showpost.php?p=1036935472&postcount=259

If you get a "command not found" error when running "sudo add-apt-repository ppa:chogydan/ppa && sudo apt-get update"

You need to do:

sudo apt-get install python-software-properties

FYI - and thanks Musky for your fine post. Otherwise worked perfectly fine for me on SkankBox48.

You shouldn't need to install python-software-properties with Desktop. You do with Server. I purposely left it out, but it won't hurt anything to run the command in and Ubuntu release.
 
You shouldn't need to install python-software-properties with Desktop. You do with Server. I purposely left it out, but it won't hurt anything to run the command in and Ubuntu release.

I put it in just in case someone is either running server, or they see this error..

So here's how it stands for Unit p6901 with my two "heavies":

SkankBox 48 under Ubuntu 10.10 Server gets 7:43 frame times consistently with stock (generic) kernel and NUMA on, nearly 10 minutes with the CK patches added and NUMA on (????) and much worse with NUMA off, which is the exact opposite of Windows Server 2k8 R2 x64 which shows great speed upgrades when NUMA support is OFF but still 3 minutes off Ubuntu.

The SR-2 under Gentoo with CK kernel gets 11:40 minute TPFs with NUMA on, and 11:02 with NUMA off.

The AMD shows over 100K PPD rise with Ubuntu vs. Windows from 167K PPD to 290(!), while the SR-2 shows more normal gains of about 20K PPD from 145 to 167K PPD.

Time to swap out drives and put Gentoo on the AMD with Zen Kernel methinks. Hopefully that will be more fun.

I don't know why Ubuntu with the BFS shows more consistent, but MUCH slower frame times on AMD. Maybe something is not optimized there.. This is something Vediovus mentioned earlier. Weird.
 
Looking forward to SkankyBox on Gentoo.
 
7:43[/COLOR] frame times consistently with stock (generic) kernel and NUMA on, nearly 10 minutes with the CK patches added and NUMA on (????) and much worse with NUMA off, which is the exact opposite of Windows Server 2k8 R2 x64 which shows great speed upgrades when NUMA support is OFF but still 3 minutes off Ubuntu.

Think I need to do some more testing on the Dual Hex - Ubuntu Server 10.10 was showing a potential improvment, even though on a slower drive ....

Using OpenSuSE 11.2, CFS scheduler I'm seeing something a little odd though .. the last WU it completed was a 6901 (R9,C12,G3) it chewed through 100 frames at a consistent 22:01 - 22:04. The next WU was another 6901 (R14,C16,G3), first 7 frames were at 22:50, then dropped to 21:36 - 21:39 for the last 27 frames.


H.
 
Last edited:
Think I need to do some more testing on the Dual Hex - Ubuntu Server 10.10 was showing a potential improvment, even though on a slower drive ....

Using OpenSuSE 11.2, CFS scheduler I'm seeing something a little odd though .. the last WU it completed was a 6901 (R9,C12,G3) it chewed through 100 frames at a consistent 22:01 - 22:04. The next WU was another 6901 (R14,C16,G3), first 7 frames were at 22:50, then dropped to 21:36 - 21:39 for the last 27 frames.


H.
that is the inconsistency that musky saw... cept his would start low and finish slow... and have spikes and other stuff...

I have to figure out how to patch rhel kernels so I can get it bfs working... meh....
 
that is the inconsistency that musky saw... cept his would start low and finish slow... and have spikes and other stuff...
but he is no longer seeing this with BFS.
 
correct...that and ext3

This is my 12 core box - it's running ext3 and CFS - going to migrate it to Ubuntu. I just don't get how it does a complete WU with rock solid 22:03 +/- 3 second tpf for 100% of frames, then starts a new WU slower by 45 seconds, then kicks over to a 30 second quicker than previous observed tpf. Gut instinct says some background process - but one that runs for two hours after thirty+ hours? WTF? Box has not been touched - it just folds ......

H.
 
but he is no longer seeing this with BFS.
correct...that and ext3

BFS fixed the CPU not loading 100%/inconsistent frame times issue. Switching from ext4 (Ubuntu's default) to ext3 fixed the issue where the client would wait 45 minutes between finishing a unit and starting to upload the results. This held true for an Intel single hex, an Intel dual quad, and a few Intel dual hex machines. I feel confident in saying that you should use the BFS for any Linux install on an Intel machine. The jury is still out for higher end AMD machines. You should use the ext3 file system regardless of platform.
 
BFS fixed the CPU not loading 100%/inconsistent frame times issue. Switching from ext4 (Ubuntu's default) to ext3 fixed the issue where the client would wait 45 minutes between finishing a unit and starting to upload the results. This held true for an Intel single hex, an Intel dual quad, and a few Intel dual hex machines. I feel confident in saying that you should use the BFS for any Linux install on an Intel machine. The jury is still out for higher end AMD machines. You should use the ext3 file system regardless of platform.

I am curious as to how long between end of unit and start...
my default setup takes 4min to upload... a total of ~15min between 100% , uploaded, dl new wu and be started 0%
 
BFS fixed the CPU not loading 100%/inconsistent frame times issue. Switching from ext4 (Ubuntu's default) to ext3 fixed the issue where the client would wait 45 minutes between finishing a unit and starting to upload the results. This held true for an Intel single hex, an Intel dual quad, and a few Intel dual hex machines. I feel confident in saying that you should use the BFS for any Linux install on an Intel machine. The jury is still out for higher end AMD machines. You should use the ext3 file system regardless of platform.

I'll let you know. My Gentoo on Skankbox48 is nearly done. I'm installing with Zen Kernel so that BFS should be the default scheduler. I know on Ubuntu on the 48-core AMD (and yes, SkankBox48 is the hostname) going to BFS caused my time-per-frame to go from 7:35 - 7:40 to over 10:15, so initially I would say no, but we'll see with Gentoo.

Thanks!

Update: Gosh Darnit somehow my Intel 82576 drivers didn't actuate in the kernel(!!!!!!!!!!!!) Everything else went ok but it whines about two things:

1) ext2 being an unknown file system so it won't see /boot (???????????) < Fixed by re-doing kernel and selecting this in make. Apparently ext2 is not a standard supported file system LOL.
2) net.eth0 needing a kernel module driver

Ugh. Thank Dog for IPMI.
 
Last edited:
Hello from the Land of OZ.
what a mighty impressive thread this one is along with some amazing PC's :eek: congratulations!
I have recently started folding with ubuntu 10.10 also and wanted to see if any of our friends from across the globe were having trouble receiving -bigadv units under linux? Is there a different way to force -bigadv under this OS?
I personally am referring to my 2600K and have received all A3 smp units so far despite the appropriate -smp 8 -bigadv settings.
Curious also to know what kind of ppd any of you are receiving on the smp units - currently im about 26K ppd


Click to view full size!


Recently i have stepped up to run some comps at OCAU head quarters in hopes to get some more output. Things will get better over time i hope :)
I would like to add its great to see [H] going so strongly. Your output and team support in the forums is something to admire. I pray that you crush the evil forces that are EVGA (whom i secretly name DVDA - not sure if that computes over there) :D
 
Welcome RETARDOCAU. nice to have you on the forums. You might want to make sure that you're using the latest client (6.34 I think) and are using a passkey as well...I didn't see that mentioned in your post.
 
Hey RETARDOCAU, I am the lone Aussie on [H] here... welcome. I could borrow your username, because when I started folding I didn't even realise ocau had a team. (don't hate me). I spent one naive day with EVGA (don't hate me #2) before moving over here, where it is nice and warm.

I think there are server issues with giving out bigadv on linux atm. see here.

I have been testing linux builds for a week, and now that I have settled on musky's mix as my best so far, I have started folding on my 2 slaves. The one that has had the HDD/install that did all the testing has yet to get a bigadv in about 2 days folding over the last few days. The fresh install I did last night on my 3rd rig scored a bigadv 6901 out of the box. Between units just now on my 2nd box I just did a fresh install of FAH, but no, still cursed, drew a SMP. On non-6701 units I am seeing exactly half the PPD I see on 6901s. - on windows it is more like 2/3.

Interestingly the SMP frame times (still Core A3) on linux seem to be within 2 seconds of the times they always were on windows, whereas Core A5 bigadv is running 11-12% faster = 19-20% better PPD. :D

My main rig is my workstation and has to stay windows, but all I want to do is see all three pumping away at max before I have to shut down tomorrow. Might not happen.

My HFM is in my sig.

Hello from the Land of OZ.
what a mighty impressive thread this one is along with some amazing PC's :eek: congratulations!
I have recently started folding with ubuntu 10.10 also and wanted to see if any of our friends from across the globe were having trouble receiving -bigadv units under linux? Is there a different way to force -bigadv under this OS?
I personally am referring to my 2600K and have received all A3 smp units so far despite the appropriate -smp 8 -bigadv settings.
Curious also to know what kind of ppd any of you are receiving on the smp units - currently im about 26K ppd


Click to view full size!


Recently i have stepped up to run some comps at OCAU head quarters in hopes to get some more output. Things will get better over time i hope :)
I would like to add its great to see [H] going so strongly. Your output and team support in the forums is something to admire. I pray that you crush the evil forces that are EVGA (whom i secretly name DVDA - not sure if that computes over there) :D
 
Back
Top