Intel Yonah Performance Preview - Part I: The Exclusive First Look at Yonah
by Anand Lal Shimpi on November 30, 2005 2:50 AM EST- Posted in
- CPUs
Same Size, but Twice the Cores
Obviously the biggest improvement Yonah offers over Dothan is the fact that it’s dual core. But where things get interesting is that thanks to Yonah’s 65nm process, a dual core Yonah die is about the same size as a single core Dothan die - in other words, it costs Intel just as much to make a dual core Yonah, as it did for them to make a single core Dothan.
A major reason the die size didn’t really go up much is because although Yonah has two cores, its L2 cache size remains unchanged at 2MB. Unlike the Pentium D, Yonah’s 2MB L2 cache is not split into two discrete 1MB caches, it is actually one whole 2MB cache that is shared by both cores. This is a very important distinction, as it means that Yonah is far from just two Dothans stuck together.
There are other architectural enhancements to Yonah that will give it a performance advantage over Dothan. Below is an excerpt from our IDF coverage of Yonah, detailing what we know about the new CPU:
Making Pentium M more "Media Friendly"
All of the major performance improvements to each of Yonah's cores seem to revolve around SIMD FP and FP performance, two of the Pentium M's present day weaknesses in comparison to the Pentium 4.
The first improvement is that now all three of Yonah's decoders can decode SSE instructions, regardless of the type of instruction. Improving the decode width of the processor is a quick way to improve performance.
Next, SSE/SSE2 operations (not sure if all can be, but at least some) can now be fused using the Micro Ops Fusion engine of Yonah. At a high level, the benefit here is increased performance and lower power consumption, we'll get into architectural details of why that is when we eventually sink our teeth into Yonah next year.
Each of the two cores in Yonah have also received support for SSE3 instructions much like the Pentium 4 E [Prescott].
And finally there have been some improvements to Yonah's floating point performance, although Mooly would not say exactly what's been done. Curiously, Mooly referred to the floating point performance improvements as specifically made to improve gaming performance. Intel may have grander plans for Yonah than once thought...
The SSE/FP optimizations are all being grouped into what Intel is calling their Digital Media Boost technology, yes the names seem to get worse and worse as time goes on - but at least the functionality should be good.
We started out this article talking about the Pentium M’s shortcomings in digital media applications, and Intel has begun to address them with the architecture of Yonah, but the real question is - how effective have their efforts been?
135 Comments
View All Comments
edwardhchan - Wednesday, November 30, 2005 - link
For those who care, this is probably the chip that will make it into the Macs next year. Drooling over a new powerbook that dualboots....bldckstark - Wednesday, November 30, 2005 - link
I suppose you will be drowning yourself when you start dreaming of a triple boot with the GoogleOS? On Cell processors? With SkyFi wireless bouncing signals off of the moon (system outage during new moon transition)?Dreams of vaporware let you down more often than a topless dancer when you run out of ones.
I do however often dream about my robotically driven car so that I may play PC games from the back seat on my 60" OLED TV during the drive to work. Imagine internet access on the road then. All the mobile PRON. Gives a new meaning to the word carjack doesn't it?
dougSF30 - Wednesday, November 30, 2005 - link
and on the next generation process.So really, one could say it this way:
AMD's IMC allows their 90nm DC part with 1/2 to 1/4 the L2 cache to perform better than Intel's new 65nm part running at the same clock speed.
fitten - Thursday, December 1, 2005 - link
Only fanbois would say that (or folks who just don't know about computer architectures).a) The AMD Athlon64 has a faster FPU than the Yonah. Benchmarks doing heavy FPU will do better aon the Athlon64.
2) The Athlon64 has more memory bandwidth available to it than the Yonah. While chipsets for Yonah are dual channel, the chip itself still seems to be single channel (like it's predecessors - for power reasons). This configuration does give better performance than all single channel but it isn't as good as all dual channel.
D) Despite these two disadvantages (there are a few others), Yonah still manages to stay close to the Athlon64 parts on average. Some benchmarks it's actually a little faster and some it's a fair bit slower.
One of the best unsung things about the Athlon64 X2s is that a single core Athlon64 only benefits marginally in real benchmarks with dual channel memory compared to single channel memory (ignore synthetic benchmarks). The best I've seen is 20% faster but it is typically on the order of 7% faster. This means that adding a second core, which can double the bandwidth requirements to main memory in really bad situations, can be almost completely satisfied by dual channel. Using the best cases above, the benchmark would need 1.20x the bandwidth of single channel, so this same with a dual core would need 2.4x single channel bandwidth. That's only lacking by 20%.
Compare this to the Pentium4 which gets very large speedups (60% or more sometimes) from dual channel over single channel. Two of those cores doing that would need 3.2x single channel, but dual channel only gives 2.0x meaning that the Pentium4 dual core may have a deficit of 60% of the bandwidth needed to run both cores full-tilt.
Leper Messiah - Thursday, December 1, 2005 - link
But what does the above have to do with Yonah's performance. Architecturally, Yonah is far closer to K8 than to netburst. It should have more than enough bandwidth provided with the "667" FSB, esp. if its dual channel.Anemone - Wednesday, November 30, 2005 - link
I'd agree, the basic core of the A64 chip is quite superior to the P-M, meaning when you take out the cache factor (remember that also applies to programs whose requirements exceed the 2M cache as well so think ahead on that).And yes someone mentioned power requirements going up for Merom - which might require a moderating of clockspeeds to keep it in check in the mobile arena.
Basically for AMD and Nvidia a big opportunity exists here to deliver (and deliver soon) the M2 and take advantage of a very good, but not superb chip from Intel. It remains to be seen if that can be done.
Yonah is good, and would have been great if it had delivered when originally promised (Aug 05, from very early roadmaps), however it is not stellar. But as an overall mobile platform is is the current best. Kudos Intel.
dougSF30 - Wednesday, November 30, 2005 - link
And as far as power goes, Intel admitted that 64-bit would've pushed the power way up on Yonah, so you need to consider that when looking at Yonah TDP vs. X2 TDP.Amagus - Wednesday, November 30, 2005 - link
I'm assuming the load tests were for both cores being fully stressed. What I would have like to have seen were power numbers on a similarly configured Dothan desktop. That would give us a better idea of how expensive that 2nd core is and how good Intel's 65nm process is.
-Amagus
phaxmohdem - Wednesday, November 30, 2005 - link
Well, I'm fairly impressed with this chip personally. It's definately not the "spooge your pants" object of Techno-Lust many of us were hoping for, but as a man truely impressed with the performance of his Banias Pentium M 1.6GHZ CPU, I can only imagine how awesome that 2.0 Dually would be in a laptop.If Conroe can pull off slightly better performance per clock + higher clock speeds + add an integrated Mem/PCIe controller.... We might see some good shiznat in the year to come.
NullSubroutine - Wednesday, November 30, 2005 - link
Intel at least has something to offer if you want a decent notebook. I think it all comes down to cost. It probably wont be as equally fast as an AMD dual core Turion, but if it costs alot more, the whole "less power" really doesnt sell me on the extra money I would have to spend for the entire Intel setup.The product is a decent product, it doesnt blow you away, but it is something that you can get in a new system and not be disappointed (unless you encode/decode). I personally plan to buy a dual core laptop sometime at the end of spring, if the price is right, then it might be what I am getting (unless they produce that Monroe by then).