AnandTech Tests GPU Accelerated Flash 10.1 Prerelease
by Anand Lal Shimpi on November 19, 2009 12:00 AM EST- Posted in
- GPUs
Huge Improvements under OS X
The release notes for the Flash 10.1 preview say the following about cross-platform hardware accelerated H.264 decoding support:
In Flash Player 10.1, H.264 hardware acceleration is not supported under Linux and Mac OS. Linux currently lacks a developed standard API that supports H.264 hardware video decoding, and Mac OS X does not expose access to the required APIs. We will continue to evaluate adding the feature to Linux and Mac OS in future releases.
Ouch. Linux isn’t ready and Apple isn’t open enough. That’s not to say that there aren’t major performance gains to be had.
I took the same Office clip I’d been using for all of the other tests and ran it on my Mac Pro at full screen (2560 x 1600). Using Activity Monitor I looked at the CPU utilization of the Flash Player plug-in. I compared both versions of Flash and saw a significant drop in CPU utilization:
Hulu Full Screen (2560 x 1600) Average CPU Utilization | Flash 10.0.32.18 | Flash 10.1.51.45 |
Hulu 480p - The Office - Murder | 450% | 190% |
Going from roughly 450% down to 190% (or a bit over 10% of total CPU utilization across 16 threads) made full-screen Hulu playable on my machine. In the past I always had to run it in a smaller window, but thanks to Flash 10.1 I don’t have to any longer.
With actual GPU-accelerated H.264 decoding I’m guessing those CPU utilization numbers could drop to a remotely reasonable value. But it’s up to Apple to expose the appropriate hooks to allow Adobe to (eventually) enable that functionality.
Until then, even OS X users have something to look forward to with the Flash 10.1 upgrade.
Final Words
It's finally here. GPU accelerated video decode for Adobe Flash. Grab the preview and let us know how it fares on your system in the comments.
135 Comments
View All Comments
LoneWolf15 - Thursday, November 19, 2009 - link
At work, our systems with discrete graphics use Radeon 2400 class GPUs. A few workstations use NV40 or NV41-based Quadro cards.Is there any chance Adobe will go a little further back in their support? We only update every 3-5 years, and it seems like there are a lot more GPUs that could be capable of this (e.g., Geforce 6/7 series, Radeon X16 and up, etc.).
JarredWalton - Thursday, November 19, 2009 - link
I'm pretty sure you need a GPU with VP1/VP2 from the NVIDIA world, which means GeForce 8800 GT as a bare minimum. GF6/7 users are out of luck as far as H.264 video decode acceleration.NX3 - Thursday, November 19, 2009 - link
AMD IGP 3200 (780g) with Cats 9.11 + beta Flash 10.1 + FirefoxBBC iPlayer HD, its definetly working as cpu usage is way down below 30% from 80-100% for the same episode of Top Gear.
YouTube HD, its not working at all as CPU time is still 80-100%, even after following the advice in the article.
ira176 - Wednesday, November 18, 2009 - link
Just because us HD3870 owners only have UVD (1) doesn't mean we should be left out of the GPU acceleration. I hope Adobe adds support for for those cards, as they have h264 acceleration as well and should be more than powerful enough.jmurbank - Wednesday, November 18, 2009 - link
The Linux version of the latest Flash player plugin (10.1) still does not use the GPU at all even though Adobe said it works. From reading their blogs about them implementing OpenGL for acceleration, it still seems that Adobe does not include extras correctly or they fail to understand how stuff should work. The following is what Adobe tried doing and failed to handle OpenGL correctly.http://blogs.adobe.com/penguin.swf/2008/05/flash_u...">http://blogs.adobe.com/penguin.swf/2008/05/flash_u...
I even tried using other web browsers and it still does not do what you said it should. I also included the following lines in /etc/adobe/mms.cnf.
WindowlessDisable = 1
OverrideGPUValidation = 1
Adobe needs to get their act together and understand that they need to make a more efficient plug-in and does better checks. Using OpenGL and VA API does not help to provide efficiency. It just puts a band-aid. How come Xara can make (a lot) more efficient vector graphics program compared to Adobe.
Flash player plug-in version 9 was better while 10 has gotten horrible. If I watched any videos on Hulu with version 10.0, frames skips all the time and with out any programs or windows open. If I use 10.1 with Hulu, it is the same. I know it is a beta version, but it is still poor. Sure Linux is the only OS that contains a 64-bit version of the Adobe Flash player plug-in, but from I heard or read that it still is poor or has no noticable increase in performance.
My computer contains T7300 and a GeForce8 8400M GS. It decodes H.264 fine with the help with VDPAU (nVidia's way of VA API) and it helps by a huge CPU usage reduction that is around 10% instead of 100% by just the processor alone decode. Flash player plug-in 10.0 or 10.1 is around a crazy CPU usage around 120%. Does anybody think that Adobe is doing things right because I do not think so.
ZPedro - Wednesday, November 18, 2009 - link
Okay, I must admit that as a longtime Mac user, I was skeptical of the claimed performance improvements on the Mac; Macromedia/Adobe never optimized Flash on the Mac in a satisfying way, and if you combine that with the fact that, up until recently, I was using the original MacBook, so the fan would kick in at the first YouTube video… I was not going to suddenly start trusting Adobe. So I measured.Contrary to Anand, I'm not going to concentrate on video, as to me Flash is a superfluous middleman in the web video equation, and I can't wait to get rid of it (there's also the fact I don't have access to Hulu). However, I have a healthy respect of Flash as a way to create and deliver (interactive) content. So here are my tests.
- A: 1 group and 9 Dailymotion videos playing at the same time (cacophony!)
- B: the same, but all videos paused (including the three flash ads)
- C: YouTube HD video playback ( http://www.youtube.com/watch?v=1TdaoOluq0A">http://www.youtube.com/watch?v=1TdaoOluq0A )
- D: The same YouTube HD video in fullscreen playback
- E: Scrolling Strong Bad emails ( http://www.homestarrunner.com/sbemail.html">http://www.homestarrunner.com/sbemail.html )
- F: Playback of Strong Bad email: Dragon (aka Trogdor, at http://www.homestarrunner.com/sbemail58.html">http://www.homestarrunner.com/sbemail58.html )
- G: Playback of Ultimate Showdown ( http://www.weebls-stuff.com/toons/ultimate+showdow...">http://www.weebls-stuff.com/toons/ultimate+showdow... )
- H: Playback of Magical Trevor 3 ( http://www.weebls-stuff.com/toons/magical+trevor+3...">http://www.weebls-stuff.com/toons/magical+trevor+3... )
- I: Playback of Internet Killed the Video Star ( http://www.koreus.com/files/200407/internet_killed...">http://www.koreus.com/files/200407/internet_killed... ) at the size it was intended: 540x330
- J: The same, this time the flash file opened directly and as big as possible given the Safari chrome on a 1680x1050 screen.
- K: Game of Magenta Kong ( http://thedailywtf.com/Articles/Magenta-Kong.aspx">http://thedailywtf.com/Articles/Magenta-Kong.aspx )
CPU is the processor usage percent of the WebKitPluginHost process for Flash of 64-bit Safari as reported by top (you'll remember that on Mac/Unix, 100% means one core fully used, my machine only maxes up at 400%, so it's normal there is stuff that's more than 100%), either interactively or (for fullscreen and Magenta Kong cases) logged to a file. Tests performed on a Mac pro early 2009 (aka Nehalem) single proc (4 core) with Mac OS X 10.6.2, Safari, and Shockwave Flash 10.0 r32 ("10.0"), Shockwave Flash 10.1 d51 ("10.1"). Scientific accuracy not guaranteed. Less CPU used is obviously better.
A: with 10.0, 200% CPU; with 10.1, 230-250% CPU (yep, a regression)
B: with 10.0, 15-30%; same with 10.1
C: with 10.0, 50-60% with one peak to 84%; with 10.1, 30-40% with one peak to 47%
D: with 10.0, 36% with one peak to 62%; with 10.1, 25-33% with one peak to 43%
E: with 10.0 max 60%; with 10.1 max 40%
F: with 10.0 mean 10%, max usually 15%, peaked to 17%, with 10.1 barely touched 10% at worst
G: with 10.0 around 30% but a busy time peaked 66%; with 10.1 20-25% but the busy time peaked 57%
H: with 10.0 max 100%; with 10.1 max 82%
I: with 10.0 max 50%; with 10.1 max 32%
J: with 10.0 max 163%: with 10.1 max 92%
K: Due to different playthroughs, could not compare CPU usage; however, the game felt more fluid with 10.1; I'd need to do a double blind to be sure.
So yeah. Overall, it's a definitive improvement. I think it could be made better still, but an effort has definitely been made. Not bad Adobe. Not bad.
Note: This "benchmark" needs more Flash games, but I'm no specialist. Suggestions?
SnM77 - Wednesday, November 18, 2009 - link
Hulu improved dramatically! Sadly, I do nto have an application monitoring FPS, but the same SD video clip that played with 90 to 95 percent CPU on my Dell GX280 with XP Media center edition and Flash 10.0, play with CPU use of between 35 and 40 percent with flash 10.1.I was running this out to my Samsung LCD TV over HDMI. I also trie don a different 280 running a 1440x900 monitor on an OLD 1950 ATI card, it made no difference.
One odd thing I noted - video was much smoother with 10.1, but it was like someone turned up the contrast a bit too high and the SD video was very grainy. Also, the color washed out a bit on the particular clip i was watching. But it was now watchable. I will play with Catalyst driver settings tonight to see if that makes any difference, and also see if I can test HD. I may have missed it, but what software can I use to actually measure FPS in Hulu? Is there an option I can turn on in Hulu itself?
beginner99 - Wednesday, November 18, 2009 - link
It's sad how poor flash perfroms. But if you look over the whole Adobe stuff, no wonder. Everythings seems blotted, especially Acrobat. Like a 1 Gb install for creating and playing around with PDF's...(Was it already that bad when it was not Adobe?I have the feeling not but I'm probably mistaken)
I'm glad flash is now usable at least on some weaker hardware, ideally also normal netbooks should be able to run it...But here we habe the opppsite trend to the car industry. Instead of getting more efficient we just add a second engine to satisfy the customers.
I mean cool I could run hd flash on a ION netbook. But probably battery live will we soemthign like halfed...nothing is free.
- Tuesday, November 17, 2009 - link
Anyone think the ATI Radeon HD 3xxx standalone cards will work with some kind of driver hack?There is support for integrated Radeon HD 3xxx series however there is no mention of Radeon HD 3xxx standalone cards. I'm thinking this must be to encourage people to upgrade to the HD 4xxx series cards. ATI however supports their HD 3xxx series integrated video, so I suppose the capability must be there for a standalone HD 3xxx card.
JarredWalton - Tuesday, November 17, 2009 - link
The 9.11 drivers do NOT list HD 3200/3300 anymore, oddly enough. Remember that the RS780 chipset did get a bit of the advanced video decoding features even though it didn't have the shader and gaming performance. I think IGP 3000 will get support, but I wouldn't expect the same level of acceleration on the normal discrete 3000 series.Also interesting is how HD 3200 (RS780) became HD 4200 (RS785). They're practically the same, I think, but one got the 4000 series moniker just to make it sound newer. Just like NVIDIA with the GTS 250, etc. cards.
At present, the HD 3200 on a laptop definitely isn't accelerating Flash to any noticeable extent. With a dual-core 2.0GHz processor, it almost doesn't need help from the GPU, but it would be nice to go from 60% CPU and some occasional frame dropping to 20% CPU and no dropping. Atom went from playing only 10% (or less) of frames and 90%+ CPU to less than 70% CPU and playing 100% of frames, with the ION chipset. That shows what is possible, once all the various aspects are dealt with.