Haswell GT3 uses shaders to save power

More is less in a parallel universe

Sep 7, 2012 by Charlie Demerjian

Haswell may have the shaders to be a graphics monster, but it isn’t going to use them that way. SemiAccurate is hearing that the added shaders and memory are there to save power.

Haswell has three variants for consumers, GT1, GT2, and GT3, with 10, 20, and 40 shaders respectively. The shaders are derivatives of the Ivy Bridge shaders, so clock for clock, they should be a little faster than the current parts. The highest end GT3, currently slated for laptops only, has optional on-package memory called Crystalwell. When we broke the news about the 40 shaders in Haswell, we called it a “graphics monster” because of the massive shader count.

GT3 is a graphics monster, or at least it could be, but Intel is not going to use it in that fashion. Instead of really fast graphics with marginally functional drivers (sorry, broken and unfixable due to Intel’s moronic internal policies), they are going to use the added shaders to save power. If you downclock a CPU, you save power. How much energy you save for a given slowdown depends entirely on the part in question, the starting point, the ending point, and a lot of black magic. Let’s just say it can be better than a linear relationship, worse than linear, or both depending on where you stop. Performance, meanwhile, is proportional to the clocks the chip runs at.

With GPUs, things are a little easier because you have multiple copies of the same units. Performance is proportional to the clocks times the shader count, something that is not true with CPUs. In a GPU, if you double the shader count, you can halve the clocks and end up with the same performance. Pick your counts right, and you can save a lot of energy.
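To see why trading clocks for shaders can save energy, here is a minimal sketch using the textbook approximation that dynamic power scales with unit count times voltage squared times frequency. That model, the assumption that voltage can drop in step with the clock, and the function names are ours for illustration, not anything from Intel or our sources. Real silicon has leakage and voltage floors, which is where the black magic comes in.

```python
def relative_performance(unit_scale, clock_scale):
    """Throughput relative to baseline, assuming perf ~ units * clock."""
    return unit_scale * clock_scale


def relative_dynamic_power(unit_scale, clock_scale, voltage_tracks_clock=True):
    """Dynamic power relative to baseline under the idealized P ~ units * V^2 * f.

    Assumption: if voltage_tracks_clock is True, voltage scales down in
    proportion to the clock. Leakage and minimum-voltage limits are ignored.
    """
    voltage_scale = clock_scale if voltage_tracks_clock else 1.0
    return unit_scale * voltage_scale ** 2 * clock_scale


if __name__ == "__main__":
    # Double the shaders, halve the clock.
    print("performance:", relative_performance(2, 0.5))
    print("power, voltage scaled:", relative_dynamic_power(2, 0.5))
    print("power, voltage fixed:", relative_dynamic_power(2, 0.5, voltage_tracks_clock=False))
```

Under these idealized assumptions the doubled, half-clocked part matches the baseline on performance and only comes out ahead on power if the voltage actually drops with the clock. How far it can drop is specific to the part and the process.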

This is exactly what Intel is doing with Haswell GT3. GT1 and GT2 run at ~1200MHz for the top turbo clock, while SemiAccurate’s sources tell us that GT3 will run at 800MHz or so peak. 10 shaders at 1200MHz equals 12K ‘GPU work units’ (GWU – a mythical term we just made up), GT2 would be 24K GWU, and GT3 would be at 32K GWU. Note the almost linear progression 1, 2, 2.67, a number that rounds to three.
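The GWU arithmetic above is simple enough to check in a few lines. This is only a sketch: the shader counts and approximate peak clocks are the ones quoted in this article, while the `PARTS` table and function names are made up for the example.

```python
# Shader count and approximate peak clock (MHz) per variant, as quoted above.
PARTS = {
    "GT1": (10, 1200),
    "GT2": (20, 1200),
    "GT3": (40, 800),
}


def gwu(shaders, clock_mhz):
    """'GPU work units': shader count times clock in MHz."""
    return shaders * clock_mhz


def relative_throughput(parts, baseline="GT1"):
    """Each part's GWU divided by the baseline part's GWU."""
    base = gwu(*parts[baseline])
    return {name: gwu(*spec) / base for name, spec in parts.items()}


if __name__ == "__main__":
    for name, ratio in relative_throughput(PARTS).items():
        print(f"{name}: {gwu(*PARTS[name])} GWU, {ratio:.2f}x GT1")
```

Running it reproduces the 12K, 24K, and 32K figures and the 1, 2, 2.67 progression.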

If Intel ran the numbers right, GT3 will probably burn less wattage than the GT2 while handily outperforming it. Since the GT3 variant is not currently slated for desktop use, we will never see the full 40 shaders at 1200MHz, but that is simply a marketing choice, not a technical problem. If anyone was concerned about how die area would be put to good use over the next few process shrinks, here is a really big pointer to the direction Intel is taking. The radically different Sky Lake and Skymont architectures will fit very well with this paradigm, very well indeed. S|A

