Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
DryFire said:Prescott has a rather inefficient 31 stage pipeline while A64's have a 10 stage pipeline.
Prescott's L2 cache latency is greater then that of A64's. (IIRC it's also asynchronous)
Josh_B said:Correct me if I'm wrong, but I am fairly certain the Athlon 64 architecture has 12/17 stages. (ALU and FPU, respectively.)
Also, I remember reading that the P4 had quite a low latency on the L1/L2, as compared to any of AMD's processors. Where did you find that information? (Honestly, I'd be interested to see some figures.)
To summarize my feelings about the Athlon 64, and finish my thought:
AMD chose to take another approach in achieving maximum performance - a small pipeline, and a lower overall clock speed. IPC has traditionally (here I go on a slippery slope...) been more catered to RISC-like CPUs than to CISC. Take for example, the four stage G4. As a post-RISC CPU, it has an extremely short pipeline, and relatively low clockspeeds. Yet somehow, it manages to attain similar levels of performance to many modern x86 CPUs clocked at much higher rates. Given that many of AMD's current engineers are ex-Alpha, it is hardly inconceiveable that they would favor this type of efficient, parallel, and scalable architecture. I believe AMD was quite smart to take this approach. AMD must've known that Moore's law is coming to an end, and that processor scaling could not continue at the same pre-P4 rate, given that as I mentioned above, Silicon does have some inherent limitations. As you shrink the die to smaller and smaller gate widths, current leakage increasingly becomes an issue, as does heat. How would AMD counter heat and current leakage? Enter SOI. SOI allows a certain amount of protection against current leakage by providing insulation around leakage areas. (Can someone who is more versed in this topic perhaps elaborate? I would love to hear more about this.) Intel has since admitted SOI will eventually be implemented on their CPUs, and in the meantime, has begun using strained silicon, which is another way of reducing leakage. To my knowledge, reducing leakage can help solve heat and power problems found on many modern CPUs. G5 CPUs and by virtue, the Power series from IBM, have also made use of SOI.
The end result of AMD's planning - a cool running, extremely powerful CPU with a three pipeline design. This design was mainly borrowed from the K7 (Athlon), in the same way the Pentium M borrowed design elements from the PIII. AMD's planning must've started YEARS ago, when they knew the K7 was only the beginning of good things to come. Someone at AMD has excellent vision...
The short to medium triple pipeline ALU/FPU design of the Athlon allowed it to outperform the PIII, and for a long time, the P4 as well. AMD added two stages to each of the pipelines in order to allow the Athlon 64 to scale slightly better than the Athlon XP. This brought the stages for ALU to 12, and the FPU to 17. Additionally, AMD increased IPC significantly by adding an on-die DDR PC3200 controller. As others have mentioned, by adding the memory controller right into the CPU, there is no need to implement a northbridge onto the motherboard. (The AGP GART is only on die - this should eliminate the source of MANY chipset issues, hopefully!) Not only did this reduce memory latencies beyond what any P4 could match, but also serves to reduce the requirements for chipset vendors to create a northbridge. (If you are so inclined, AMD does state there are ways of bypassing the onboard memory controller. Future chipsets may choose to do this if there is a way of delivering similar or improved performance, although at this point, it is unlikely.)
AMD's implementation of SIMD (MMX, SSE, SSE2, 3DNow!, and recently SSE3) are relatively strong, and do show marginal performance improvements in many areas. Because of the Athlon 64's powerful triple pipeline FPU, it does not necessitate as many tweaks to program code to achieve excellent levels of performance. Additionally, AMD improved the Athlon's TLB's, added 64-bit capabilities which double the number of general purpose registers, and allow complex functions that include large numbers not easily handled by 32-bit IA32 CPUs. All of these features
In many respects, the P4 is a better, more forward-thinking design, but when most code is poorly, if at all optimized, the brute force approach the Athlon 64 provides is sometimes more efficient. Besides, foundries just don't have the technology to crank out reliable 4GHz P4's yet. Intel will likely have to wait a few months for the process to catch up before this will happen. Until Intel does this, AMD will likely gain marketshare, as their CPUs feature roughly the same or more performance in most respects. (Don't get me wrong however; one should always choose the right CPU for the job, whether it is a P4 or other.)
The Athlon 64 is like a muscle car engine - lots of torque at low RPM, with little need to run at the same RPM as a 'P4' engine to achieve the same or better performance.
yeah, i don't know for sure on the fpu, but everything i've seen says 12 stages.Josh_B said:Correct me if I'm wrong, but I am fairly certain the Athlon 64 architecture has 12/17 stages. (ALU and FPU, respectively.)
Also, I remember reading that the P4 had quite a low latency on the L1/L2, as compared to any of AMD's processors. Where did you find that information? (Honestly, I'd be interested to see some figures.)
Josh_B said:Correct me if I'm wrong, but I am fairly certain the Athlon 64 architecture has 12/17 stages. (ALU and FPU, respectively.)
Also, I remember reading that the P4 had quite a low latency on the L1/L2, as compared to any of AMD's processors. Where did you find that information? (Honestly, I'd be interested to see some figures.)
Dude8383 said:Why are AMD's top processor speeds lower than Intel's?? Isn't that bad for competition? or are the newest amd processor's better even with their lower processor speeds.
Noob question i guess
MD_Willington said:Performance Rating:
AMD chips differ architecturally from Intel Chips in a number of ways. Intel chips use longer, narrow pipelines, while AMD chips use shorter, wider pipelines. Intel chips perform 6 Instructions Per clock Cycle (IPC), while the Athlon XP line performs 9. This means that an Athlon XP can theoretically be 33% slower in overall speed, yet do the same amount of work as a P4 @ the same rating.
Snoop dog would put it lis this:
AMD chips shot calla architecturally F-R-to-tha-izzom Intel Chips in a numba of ways . Chill as I take you on a trip. Intel chips use longa, narrow pipelizzles while AMD chips use gangsta motherfucka pipelizzles. Intel chips perform 6 Instructions Per clizzock Cycle (IPC), while tha Athlon XP line performs 9 fo' sho'. This means tizzy an Athlon XP can theoretically be 33% brotha in overall speed, yet do tha same amount of wizzle as a P4 @ tha same rat'n.
MD_Willington said:AMD chips shot calla architecturally F-R-to-tha-izzom Intel Chips in a numba of ways . Chill as I take you on a trip. Intel chips use longa, narrow pipelizzles while AMD chips use gangsta motherfucka pipelizzles. Intel chips perform 6 Instructions Per clizzock Cycle (IPC), while tha Athlon XP line performs 9 fo' sho'. This means tizzy an Athlon XP can theoretically be 33% brotha in overall speed, yet do tha same amount of wizzle as a P4 @ tha same rat'n.
(cf)Eclipse said:is it that the software isn't optimized, or the tasks that need to be done cannot take advantage of the streaming nature of the netburst architecture?
i'm thinking the latter![]()
cfish said:so how far would you go, could you say a non-oced athlon 64 3400+ would outperform an intel 3.4c when it comes to things like gaming?
cfish said:
why is the 3000/3200 outperforming the 3400 in source?
CastleBravo said:Dual channel vs. single channel memory. Says so on the chart.![]()