Floating Point Performance

Just about a year ago, our own Johan De Gelas made an extremely interesting point about one of the weaknesses of the Pentium M - floating point performance. The theory is this - the Pentium 4, Athlon 64 and Pentium M all have very different platforms, with equally different characteristics. Unfortunately, as we've already shown, the Pentium M is quite possibly the worst off with only a single channel 333MHz DDR memory bus. It's also widely known that most floating point intensive applications are highly memory bandwidth limited, meaning that the Pentium M already has an excuse for poor floating point performance - it doesn't have enough memory bandwidth.

But what if we are able to take memory bandwidth out of the equation? This is where a little benchmark called "flops" comes into play. The beauty of flops is that it executes entirely within the L1 cache of the Pentium M, meaning that the benchmark is limited by two things: the performance of the Pentium M's L1 cache, and more importantly, the performance of the Pentium M's floating point and SSE units.

The actual tests that flops runs are a mixture of floating point add, subtract, multiply and divide operations. The mix of ADD/SUB, MUL and DIV operations is listed next to each test in the table below.

We compiled flops using the latest Intel C compilers to give the Pentium M as solid of a foundation as possible using the /O3 and architecture specific flags under Visual Studio .NET. All of the results are expressed in MFLOPs, higher scores being better:

 Test (% ADD, SUB, MUL, DIV)  AMD Athlon 64 3200+ (2.0GHz)  AMD Athlon 64 FX-55 (2.6GHz)  Intel Pentium 4 3.2GHz  Intel Pentium M 755 (2.0GHz)
1 (50,0,43,7) 1576 2057 1274 899
2 (43,29,14,14) 856 1118 790 492
3 (35,12,53,0) 1388 1802 2476 1470
4 (47,0,53,0) 1244 1622 2792 1601
5 (45,0,52,3) 1477 1923 2351 1019
6 (45,0,55,0) 1466 1908 2762 1607
7 (25,25,25,25) 458 595 365 252
8 (43,0,57,0) 1585 2065 2566 1572
Average 1256 1636 1922 1114

The first comparison to look at is the Athlon 64 3000+ vs the Pentium M 755, since both CPUs run at the same clock speed. Despite the Pentium M's improvements to enhance IPC, the Athlon 64 is still able to outperform it at a core level (without the aid of its memory controller) by almost 13%. But here's where the next Athlon 64 score comes into play - while the Pentium M will hit 2.26GHz by the end of this year, the Athlon 64 will be at or above 3.0GHz. So, the headroom of the Athlon 64's architecture gives it a huge performance advantage here in flops as you can see by the Athlon 64 FX-55 results (remember that the larger L2 cache of the FX-55 has no effect on the flops results as the program runs entirely out of L1).

Next, we have one of the slower Pentium 4s vs. the Pentium M 755. Why not compare to a 3.6GHz or the new 3.8GHz Pentium 4? Well, look at how much the Pentium 4 3.2GHz outperforms the Pentium M 755 - 72% using Intel's 8.1 C++ compiler. When running optimized SSE2/3 code, the Pentium 4 is a much stronger FP performer than what the Pentium M ever could be, which is very important for the following reason: the future of desktop applications is in very floating-point intensive media transcoding tasks, and for those applications, the Pentium M just won't cut it. So, to those who feel that Intel will soon ditch Net Burst in favor of the Pentium M's architecture, the results speak for themselves. While elements of the Pentium M architecture will undoubtedly make an appearance in the Pentium 4's successor, its dated P6 execution core will not.

Memory Latency and Bandwidth The Motherboards
Comments Locked

77 Comments

View All Comments

  • bluesdoggy - Tuesday, February 8, 2005 - link

    ...in the mobile world, the Pentium 4 and Athlon 64 are often castrated or limited either by low clock speeds...

    Mommy, is that processor a steer?
  • valnar - Tuesday, February 8, 2005 - link

    As usual, an unfair review. Comparing a 2.0Ghz 400FSB laptop CPU against 3.0Ghz desktop heatmonsters? Of course it won't beat them. But look at how well it does, and probably would do (if reviewed correctly) against Pentium 4 2.4-2.8Ghz CPU's. Considering the ultralow power it needs and lack of heat it generates, this WILL be the hot (err... cool) ticket for Shuttle XPC's and the like in the near future. For anyone who doesn't need the fastest processor at the moment, the Banias designers did a fantastic job.
  • EODetroit - Tuesday, February 8, 2005 - link

    Great article, its about time that you did this one. And you compared both P-M motherboards on the market, I don't remember the other web sites doing that.

    You stated that the P-M won't scale, and that's the reason this isn't Intel's desktop future. One thing though... Intel's other desktop CPUs aren't going to scale much this year either. In fact, on a percentage basis, the P-M might actually scale more this year than the various P4-Kiln edition CPUs after all.

    Combine that with a mobile-915 chipset for the desktop, and therefore the elimination of the huge memory bottlenecks (and hopefully a little more voltage adjustments) and all of the sudden we may see all those Losses and Ties turn into Ties and Wins.

    Whatever happens, don't be the last enthusiast site to review the mobile-915 desktop motherboards when they arrive, like you were with this. We need a trusted source to know what to buy.
  • mickyb - Tuesday, February 8, 2005 - link

    The performance per watt is awesome. Great for SFF. The article is good, but until there is a newer chipset for this CPU, we won't be able to determine a final performance ruling. I am dissappointed in the lack of desktop MB offerings. This will be the challenger to the MAC mini in near future. Someone will be putting laptop components in a box and call it done.

    I found a couple of things interesting. Taking the memory out of play, it seems the A64 is still better optimized. L1 cache of Northwood is pretty impressive. AMD has an opportunity to improve performance just by improving the L2 cache latency.

    I really don't think the Pentium-M limits are around 2.6 GHz by the end of the year. At 22W, this could probably reach higher speeds. I think the upper limit that Intel is publishing is in context of a laptop and the cooling challenges in that platform. If you put a chip in a DT, then it is a different story.
  • AtaStrumf - Tuesday, February 8, 2005 - link

    A great article! Another Anand classic :-)

    I'd just like you to add an Athlon XP 3200 to the lineup and at least one more Newcastle (which is just the most popular A64 at the moment ;-) May I suggest a 3000+ 2,0 GHz/512/1CH? With just one dot on the graph extrapolating anything becomes a nightmare :-(

    As for P-M it's one hell of a CPU considering it's limitations and we just can't stop wondering what it could become if Intel decided to remove them. Sonoma will party answer that question, but unfortunately the ultra low voltage cap will still remain, so we may never really know.

    On the other hand I think an A64 will still be a nice enough desktop CPU so we really have no need for P-M on the desktop side of things. With Lancaster-Turion supposedly on S754 we may be in for a very nice successor to 2500+ Mobile, so to hell with P-M >;-)
  • bob661 - Tuesday, February 8, 2005 - link

    C'mon guys. These tests aren't showing that the P-M is crap, just not what we originally thought it was. I am surprised as hell at these results. For a laptop CPU, it still kicks ass. And with two A64 and three AXP machines, I am no Intel fanboy.
  • paulsiu - Tuesday, February 8, 2005 - link

    For folks who want to have a mobile chip lower power solution, why not just go to the mobile Athlon 64? The CPU performance should be about the same as their desktop counterpart (at least the socket 754 version) and you can often use the same motherboard as the desktop.

    The Pentium Mobile idea seems nice, but I can't imagine spending $300 on a board that contains outdated technology.

  • MIDIman - Tuesday, February 8, 2005 - link

    Superb article.

    Granted, this is a "desktop" review, but I think the P-M is a completely different world from the P4-775 and A64, and I'm not entiely sure how people can compare them. This was built to be a portable solution and has been moved to desktop. Put that into account, and you have an extremely capable system that is silent, passive, and can be extremely small (matx here, but ITX is out there). I'm just trying to figure out why I didn't just read a Sonoma-based review, since it is out and being made (i.e. Dell's new 6000 laptop), or at least a 2.2ghz Dothan.

    I think Sonoma will bridge a bit of this performance gap, but consdering that these types of chipsets and CPUs will always be low voltage, I think we'll always see places where its performance is maybe not up to par, but well worth every penny for small and silent with desktop performance. THey'll only get smaller and faster, and IMHO, this is pretty damn close to desktop performance.
  • muddocktor - Tuesday, February 8, 2005 - link

    I agree withpost #36 about the benchmarks seemingly being picked to go for the P-M's weaknesses, but I guess that's how you get article hits. ;) I do fully agree that the present motherboards and chipsets they use hold back the perfromnace quite a bit; it might be a different story when the new mobile 915 chipset mATX boards come out for desktop use though.

    One glaring weakness in this comprehensive test though is the utter lack of numbers on system power usage and noise. If I were deploying a whole bunch of new systems for a corporation, I would give serious thought to a P-M setup even though the initial outlay would be more than a comparable P4 setup due to the decreased wattage used by the P-M system and the resultant heat from operation being much less, leading to lower environmental costs. Face it, in typical office applications the P-M is more than powerful enough for 90% of the users for the forseeable future and if your company has hundred or thousands of computers, the power saving should more than compensate for the higher pricetag of aquiring the P-M systems.

    Anand, when the new mobos based on the mobile 915 chipset come out, you need to revisit Dothan and it's performance.
  • msva124 - Tuesday, February 8, 2005 - link

    What were people expecting out of the pentium m? I have always multiplied the Mhz by 1.5 and used that number as the speed rating. So for instance the 2.0Ghz Dothan would be 3000+. The benchmarks confirm this - with the exception of one or two tests, it met or exceeded the performance of the Athlon 64 3000+.

    Whenever it was discussed as a desktop alternative I always assumed the implication was that this would be way off in the future, once clock speeds were ramped up.

Log in

Don't have an account? Sign up now