AMD Teases First Navi GPU Products: RX 5700 Series Launches in July, 25% Improved Perf-Per-Clockby Ryan Smith on May 26, 2019 11:30 PM EST
- Posted in
- PCIe 4.0
- Computex 2019
- RX 5700 Series
While the bulk of this morning’s AMD Computex keynote has been on AMD’s 3rd generation Ryzen CPUs and their underlying Zen 2 architecture, the company also took a moment to briefly touch upon its highly anticipated Navi GPU architecture and associated family of products. AMD didn’t go too deep here, but they have given us just enough to be tantalized ahead of a full reveal in the not too distant future. The first Navi cards will be the Radeon RX 5700 series, which are launching in July and on an architectural level will offer 25% better performance per clock per core and 50% better power efficiency than AMD’s current-generation Vega architecture. The products will also be AMD's first video cards using faster GDDR6 memory. Meanwhile AMD isn’t offering much in the way of concrete details on performance, but they are showing it off versus NVIDIA’s GeForce RTX 2070 in the AMD-favorable game Strange Brigade.
A Peek At the Navi GPU Architecture
Compared to today’s Ryzen information AMD is being a lot more modest – there’s no specific SKU information for a start – but they are offering us our first architectural details on Navi since we learned in 2016 that it would bring “next gen memory” support. First and foremost, AMD has been tinkering with their GPU architecture to improve both the effective IPC and memory bandwidth efficiency. In what the company is calling their Radeon DNA (RNDA) archtiecture – I should note that it’s not clear if this is a branding exercise to downplay the GCN family name or if it's a more heavily overhauled architecture – AMD has revealed that Navi’s compute units have been redesigned to improve their efficiency. AMD's press materials also note that, regardless of the above changes, the size hasn't changed: a single CU is still 64 stream processors.
Altogether, a Navi core/CU should be 25% faster than a Vega core on a clock-for-clock basis. The devil is in the details of course – AMD's data is based off of their internal testing, taking the geomean of 30 games tested at 3840x2160 with Ultra settings and 4x AA, so it's not a holistic view of just the core architecture – but it’s still potentially one of AMD’s largest GPU IPC gains in the last several years. AMD’s presentation also noted that this was part of a larger streamlining of the graphics pipeline, which is designed for both higher IPCs and high clockspeeds. For reference, on the same process AMD has been able to push Radeon VII well past its comfort/efficiency point to 1750MHz, so it will be interesting to see how Navi compares.
Feeding the beast is a new multi-level cache hierarchy. AMD is touting that Navi’s cache subsystem offers both higher performance and lower latency than Vega’s, all for less power consumption. AMD has always been hamstrung a bit by memory/cache bottlenecks, so this would be a promising development for AMD’s GPU architecture. Meanwhile for a bit of reference, Vega already implemented a more modern cache hierarchy, so it would seem unlikely that AMD is changing their cache levels or what blocks are clients of which caches.
AMD's memory controllers themselves have also been updated. Long expected, and confirmed in the press release that went out after Dr. Lisa Su's keynote, the upcoming RX 5700 series cards use GDDR6 memory, which should give AMD's cards a hearty bandwidth bump over their comparable GDDR5 Polaris cards.
It's also worth noting that, fittingly, the new Navi parts support PCI Express 4.0 as well. This will actually be AMD's second GPU with PCIe 4.0 functionality – Vega 20 was first, but it's only enabled in the Radeon Instinct parts – so this is the first time it's enabled in a consumer part. Of course to make full use of it you'll need a PCI 4.0-capable host, which AMD is happy to sell you as well. As for the performance impacts, these remain to be seen. Thus far high-end AMD/NVIDIA parts haven't been significantly bottlenecked in games by PCIe 3.0 x16 (or even x8), so it will be interesting to see how much the extra bus bandwidth helps these first generation of parts.
Last but certainly not least of course is overall power efficacy. Thanks to the combination of AMD’s architectural improvements and TSMC’s 7nm process, AMD is promoting a 50% increase in performance per watt for Navi. Power efficiency is AMD’s Achilles heel relative to NVIDIA, so this is another area where we’re eager to see AMD catch up. It should be noted however that this statement didn’t come with a qualifier – if it’s 50% more efficient at the same clockspeeds as Vega or a given total card TDP – which can impact the meaningfulness somewhat. For Polaris/Vega, AMD opted to push their cards well up the voltage/frequency curve in order to maximize performance at a cost to power consumption, so power efficiency is fluid based on what clockspeeds AMD ships at.
Update: Since there have been some questions about what the efficiency number is in reference to – if it's against the 7nm Vega 20 GPU or the 14nm Vega 10 GPU – I went back over AMD's keynote presentation and transcribed it. Here is what Dr. Lisa Su specifically said on the subject (emphasis mine).
And then, when you put that together, both the architecture – the design capability – as well as the process technology, we're seeing 1.5x or higher performance per watt capability on the new Navi products
So while I will hold off on calling this the definitive word until we've had a full technology briefing, at first glance it would seem that the 50% efficiency gain is a combination of architecture and the move from GloFo 14nm to TSMC 7nm.
Radeon RX 5700 Series: Coming in July
Alongside the architecture teaser, AMD also offered up some high-level details about the first Navi video cards. Navi will be sold under AMD’s RX 5000 series of video cards – that’s right, after Vega and Radeon VII, they’re going back to distinct series numbers. The RX 5000 series means that AMD is making a big jump in their numbering system, going back to 4 digits and back to the 5000 series. Officially, this is because AMD is celebrating its 50th year in business this year – so of course the product numbers need to start with a 50. However long-time observers will note that it’s been 10 years now since AMD’s previous 5000 series of video cards, the well-received Radeon HD 5000 series, and AMD may be trying to capture a bit of that.
Of the RX 5000 cards, the first series will be the RX 5700. These will be launching in July. AMD is also showing off the specific Navi GPU that will be used here, though as far as AMD’s numbering system goes, we don’t know whether this is Navi 10 or another GPU model number.
This is a traditional, monolithic die with a significant pin count package. So if I were a betting man (ed: you still owe me a steak), then I’d expect it to be paired up with GDDR6. (ed again: this has since been confirmed by AMD's press release)
Update: Thanks to our own Andrei Frumusanu for doing the leg work, we now have a die size estimate based on these and some unpublished photos. Our working guess right now is 275mm2, give or take a few percent. Though that's going to be less precise than what a proper micro-caliper measurement will turn up, so this is very much still just an estimate.
Finally, while AMD isn’t giving us concrete performance information, the company did use its Computex keynote to briefly demonstrate performance on the card versus NVIDIA’s GeForce RTX 2070, a $500 video card. While this is a game that favors AMD to begin with, it’s a promising sign that they’re able to pull ahead of mid-to-high-end NVIDIA card by 10%. Which means we’re all going to be eagerly awaiting more information on the video card as AMD ramps up for their July launch.
AMD's next big gaming event will be their E3 2019 Next Horizing Gaming Event, and AMD tells us that we'll find out more about the RX 5000 series there. So stay tuned.
Post Your CommentPlease log in or sign up to comment.
View All Comments
brakdoo - Monday, May 27, 2019 - linkHow wide is that thumb? Die area?
Ryan Smith - Monday, May 27, 2019 - linkWe didn't have a ruler with us for the GPU shot, unfortunately.
As for the thumb, all I can tell you is that it's 1 thumb in width, plus or minus a few layers of epidermis.
ats - Monday, May 27, 2019 - linkAlso, hope you don't use your thumbprint for security...
LemmingOverlord - Tuesday, May 28, 2019 - linkDo like Charlie, always put your business card next to it when taking photos. Not only is it "good branding technique", it's also intelligent. You can later measure things up in Photoshop or some other image editor.
ET - Monday, May 27, 2019 - linkPrecisely my thought. To see this picture without a die size estimate was disappointing.
Ryan Smith - Monday, May 27, 2019 - linkWe now have an estimate from some additional photos: 275mm2.=)
zepi - Monday, May 27, 2019 - linkModern phones should be able to run all kinds of AR-measurement apps, you should have tried some of those.
mode_13h - Wednesday, May 29, 2019 - linkThey're not terribly accurate, though.
CiccioB - Monday, May 27, 2019 - link1.5x efficiency is not that high, seen the upgrade from 14nm to 7nm PP.
If true, however, it is better than what happened with Polaris, where they stated the same and finished with a beefly OC chip to have performances in the range of the small GP106 chip rendering those claims false.
Moreover that 1.25x more performance per clock may just come from the release of memory bottlenecks with the use of GDDR6. In fact, all test used for the mean were done in 4K resolution, putting more stress on memory bandwidth than other parts of the architecture.
BTW, I have now counted at least 4 cache hierarchy redesign in AMD architectures. It seems they are not able to use caching appropriately and keeps on experimenting on it. Hope this is the right scheme.
jabbadap - Monday, May 27, 2019 - linkWell yeah you have obviously read the press release footnotes, so let's give the link to it for all of us:
So for that up-to 1.5x perf/W, they gave their test as:
12. Testing done by AMD performance labs 5/23/19, using the Division 2 @ 25x14 Ultra settings. Performance may vary based on use of latest drivers. RX-325
And for the 1.25x perf/clock:
11. Testing done by AMD performance labs 5/23/19, showing a geomean of 1.25x per/clock across 30 different games @ 4K Ultra, 4xAA settings. Performance may vary based on use of latest drivers. RX-327