FlyinBrian
Gawd
- Joined
- Jul 1, 2004
- Messages
- 733
deleted
Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
...FRAPS can only give you the rough numbers. It can be off from 1% to 30%(the human factor). Your mistake is to take it too seriously, that is why you have turned this site into a joke. What makes it even funnier is that you seem to be very proud of that "real world" thing.![]()
The run-throughs and framerate data you see represented on our pages is not used to draw conclusions on by the author. We take a video card, play through our suite of games and make notes indicating the highest playable settings we could muster with the card while checking image quality along the way,
Oh man, ANOVA, that goes back....With cards like the 8800 Nvidia series, and ATi's HD38xx series, doing tests at 1280X1024 is pretty pointless (note my 19" monitor). All cards at these price levels are going to be just fine, and produce framerates well above what you can notice the difference between. In addition, those resolutions just don't stress the card. Sure, if you turn on 16x AA, you might fill up the frame buffer, but the GPU will be yawning.If HardOCP reviewed cars, they would say things like "the Mustang is the best car because it has four seats compared to the Corvette's 11 second quarter mile." Meanwhile, Anandtech would report top speed, skidpad tests, and acceleration/braking tests, and get mocked for "not testing real-world driving experiences." Anandtech's review method is simply better; it provides more information and lets the reader draw their own conclusion from the collected data. It also lets us compare cards at a fixed selection of settings, allowing real comparisons to be made.
I'm not asking for factorial ANOVAs or anything, or even any real statistical analysis. I'm just saying that it's impossible to reliably draw any sort of conclusion about the relative rank of all these video cards when you cherrypick what settings to run at, what settings are important and what is acceptable gameplay, and then report back something that is essentially based on the reviewer's gut feeling.
Because it removes the randomness of the USER. I don't play the same level teh same way every run-through. Look at [H]'s graphs. They are remarkably consistent when it comes to highs and lows.why not create a benchmark that throws random events into the mix... so that videocard companies cannot optimize their drivers for benchmarks... since they know the exact events/stress/loads etc...
like.... random explosions, quantities of objects, random effects, random flashes etc... that are not static... set in stone... maybe even use random ai for living npcs in the benchmark so they too can put different stresses into the benchmark like a real world experience?
ps: anandtech still sucks
Good article.
Curious - why do you guys use only 2GB of ram with a 64-bit OS? seems like a weird pairing to me.
I agree I think its time to start using 4GB of RAM for benchmarking with vista 64. It made a big difference over 2 gb in games like the Witcher for me.
Benchmarking the Benchmarks Article said:Do you want video card reviews that suggest “relative performance of a graphics card” based on timedemo benchmarks when some cards benchmark better than others, or do you want an evaluation of those video cards' in-game performance in the latest and greatest computer games that you are going to be playing with it?
I agree I think its time to start using 4GB of RAM for benchmarking with vista 64. It made a big difference over 2 gb in games like the Witcher for me.
AbsintheCommando makes a good point, but there's something else [H] screwed up in this article.
When comparing your results to Anandtech's numbers, you leave out one glaring inconsistency: You're running your benchmarks on an inferior system. Anandtech is running their rig with twice the RAM and a better processor than you are. How much of an effect is that going to have on the framerates of a game that is so stressful on your system resources?
Trying to compare your benchmark numbers to those given by Anandtech is utterly meaningless unless you can configure the systems used in the tests at least somewhat similarly.
I'll assert that people are more interested in knowing the relative performance of a card, with results that they can verify and reproduce, than they are in knowing what settings someone else would play the game with.
AbsintheCommando makes a good point, but there's something else [H] screwed up in this article.
When comparing your results to Anandtech's numbers, you leave out one glaring inconsistency: You're running your benchmarks on an inferior system. Anandtech is running their rig with twice the RAM and a better processor than you are. How much of an effect is that going to have on the framerates of a game that is so stressful on your system resources?
Trying to compare your benchmark numbers to those given by Anandtech is utterly meaningless unless you can configure the systems used in the tests at least somewhat similarly.
N.B. I know one of the resident [H] minions (actually, probably Kyle himself) is going to try and jump down my throat stating that the main emphasis of the article was not comparing numbers with Anandtech over canned benchmarks, but rather the difference between timedemos and actual gameplay, so let me just go ahead and tell you that I don't care; that's not the part of the article I'm referencing.
Also, speaking on the topic of your comparisons between the timedemo numbers and real-time numbers, Anandtech shows performance deltas that are much greater than the diffference between the performance increases each of the cards when moving to the timedemos in your comparison.
(Island Test Results) So it would seem that depending on the settings used, it is quite possible for the 3870 X2 to benchmark much better than the 8800 GTX in this example.
(Rescue Map) So, had we used even our custom demos to timedemo benchmark our two cards in our 3870 X2 evaluation, the 3870 X2 would have enjoyed a benchmark advantage over the 8800 GTX when compared to real world gameplay.
Now we're at a bit of a sticking point. Which is more accurate? Testing with accurate tools in situations where the results can't be "exactly" reproduced; or testing with known flawed tools but which can be reproduced exactly?
Those are the questions you have to ask yourself.
If the game can only address 2 gb of ram anyway (if the OS is 64 bit, the app still has to be written to allow for more than 2gb) why add more? Also, NAME a game other than supreme commander that is CPU limited...
But if it's applicable to the games you will be actively playing, who cares?
Maybe I'm missing something, but your argument seem irrelevant. Yes, if you're comparing Anandtech's review with 4GB and [H]'s review with 2GB or RAM there might be some FPS differences. However, when you look at a review and see a 3870x2 not performing as well as a GTX, how is having 4GB of RAM going to make the X2 perform better? If you have a system with 2GB of RAM and you test a X2 and a GTX on the same system and the GTX performs better then it shouldn't matter if you upgrade the RAM to 4GB and do the same tests again. You should still have the GTX performing better.Ok, Lost Planet, and that's just off the top of my head.
Also, are you going to tell me that you're going to see absolutely NO increase in performance when doubling the amount of RAM in the system? If so, then you're going to need to turn over that forum title, because you have no idea what you're talking about.
Not a large user base has vista 64 with 4gig ram, but when it does, I bet you will see it added to the evaluation.(not benchmark ;0)
I agree I think its time to start using 4GB of RAM for benchmarking with vista 64. It made a big difference over 2 gb in games like the Witcher for me.
I think this question was asked as if it was meant to be rhetorical; but yes, in fact, I do want a review which indicates the relative performance of a graphics card and I want the results to be reproducible, otherwise I just have to have faith in the reviewer's methodology and honesty; and that's bad science.
Ironically, H's "real time" vs. "traditional timedemo" results seem to reinforce their validity as benchmarking tools. Both cards are ~35% faster in traditonal timedemos over their real-time counterparts, which shows that the results are comparable.
I'll assert that people are more interested in knowing the relative performance of a card, with results that they can verify and reproduce, than they are in knowing what settings someone else would play the game with.
You probably wouldn't. However, for those that play other games (or plan on using the card to play currently unreleased games in the future), "best playable settings" is significantly less useful than establishing an overall relative performance of the cards. That aside, it still ignores that issue of the subjectivity of "best playable settings" which we're made painfully aware of every time H reviews something.But if it's applicable to the games you will be actively playing, who cares?
There is not one shred or semblance of scientific testing with HardOcp. You are merely taking Brent or Kyle at their words it's that simple.
I'm ok with that and so are many others because as readers we trust them. But lets stop petting our asses on this one, their testing is in no way scientific because their simply is no definite control.
I think it really comes down like it always has read everyone's reviews and make your own choices.
You will see some improvement, but not as much as a video card upgrade, by far. My experience has been that once you hit 2 gb of ram, you see a very quick series of diminishing returns with more ram.Ok, Lost Planet, and that's just off the top of my head.
Also, are you going to tell me that you're going to see absolutely NO increase in performance when doubling the amount of RAM in the system? If so, then you're going to need to turn over that forum title, because you have no idea what you're talking about.
But it's a subjective industry! Everything about gaming is based on personal preferences and individual hardware and software settings.You probably wouldn't. However, for those that play other games (or plan on using the card to play currently unreleased games in the future), "best playable settings" is significantly less useful than establishing an overall relative performance of the cards. That aside, it still ignores that issue of the subjectivity of "best playable settings" which we're made painfully aware of every time H reviews something.
You might also care that there's no way to scientifically verify the results.
There is not one shred or semblance of scientific testing with HardOcp. You are merely taking Brent or Kyle at their words it's that simple.
I'm ok with that and so are many others because as readers we trust them. But lets stop petting our asses on this one, their testing is in no way scientific because their simply is no definite control.
I think it really comes down like it always has read everyone's reviews and make your own choices.
You don't simply get it? It's just not [H] is making these claims anymore. There are other sites that have tested with similar methods and gues what? They get similar results! That's what makes it more intrestingThere is not one shred or semblance of scientific testing with HardOcp. You are merely taking Brent or Kyle at their words it's that simple.
I'm ok with that and so are many others because as readers we trust them. But lets stop petting our asses on this one, their testing is in no way scientific because their simply is no definite control.
I think it really comes down like it always has read everyone's reviews and make your own choices.
I am not interested in benchmarks because of the numbers they provide in and of themselves, but the numbers they provide relative to other cards. If I get 100 FPS on Card A on Benchmark X, and 130 FPS on Card B on Benchmark X, then all else being equal, I can assume that Card A is faster. I am not interested in the bench scores in and of themselves, but in how they stack up against other scores. I use them to provide a rank order of scores. I don't use the raw scores themselves.