
Take a look at our newest merchandise
AMD RDNA 4 And Radeon RX 9070 Collection: MSRP TBD AMD has taken the wraps off of its RDNA 4 GPU structure and formally revealed the Radeon RX 9070 sequence’ speeds and feeds.
|
|||
![]()
|
![]()
|
After many months of rumors and wild hypothesis, AMD simply formally unveiled its RDNA 4 GPU structure and gave a glimpse into the GPU that may energy the upcoming Radeon RX 9070 sequence graphics playing cards.
At a excessive stage, RDNA 4 is AMD’s newest shopper GPU structure, which was designed to reinforce effectivity over the earlier era, whereas additionally optimizing efficiency for immediately’s more-taxing gaming and AI workloads. RDNA 4 options next-gen Ray Tracing engines, devoted {hardware} for AI and ML workloads, higher bandwidth utilization, and multimedia enhancements for each gaming and content material creation. With this newest structure, AMD is claiming a big generational efficiency uplift per Compute Unit for higher rasterization, ray tracing, and machine studying efficiency, whereas additionally supporting forward-looking options and applied sciences. There’s lots to get to, and never a whole lot of time to do it, so let’s get began…
Enter The Radeon RX 9070 XT
By now, most of you studying it will have heard of the Radeon RX 9070 and 9070 XT and should have digested among the many rumors which have leaked over the previous couple of months, but it surely’s time for the actual deal.
The Radeon RX 9070 XT and 9070 are each constructed across the identical Navi 48 GPU – it’s simply scaled again considerably on the non-XT playing cards to hit a lower cost level. Mentioned GPU has a PCI Gen 5 native, 16 lane interface and is manufactured on TSMC’s 4nm node. The chip is comprised of roughly 53.9B transistors, with a die dimension of about 356mm2. That’s considerably smaller than the combination die sizes of the Radeon RX 7900 XT/XTX (~550mm2), which makes use of a number of chiplets – one bigger chiplet for the compute die, with a number of, smaller reminiscence controller chiplets throughout. The Radeon RX 9070 and 9070 XT, in distinction, use a single, monolithic die.

Tunneling a little deeper, the GPU itself is organized into 4 shader engines, every with 8 workgroup processors, that are outfitted with two compute items every. It is a comparable construction to earlier gen, RDNA 3 GPUs, however there are lots of enhancements within the design, which enhance efficiency and effectivity and add new options.
Inside every CU there are up to date third era Ray Accelerators that supply 2x the height throughput of RDNA 3. RDNA 4’s Ray Accelerators supply double the ray intersection charges, with improved BVH compression, accelerated ray traversal and shading, and add help for a brand new characteristic known as Oriented Bounding Containers – extra on that final one later. third Technology Matrix Accelerators are additionally current, which supply improved efficiency, together with help for 8-bit float knowledge varieties, with structured sparsity help as effectively.

The GPU options 2MB of combination CU cache, together with 8MB of L2 cache, and 64MB of third era Infinity Cache located out on the edges, between the reminiscence controllers and shader engines. The L2 cache surrounds an up to date, centrally positioned Command Processor, with enhanced packet acceleration. The up to date Command Processor additionally helps some new directions and options improved department prediction, to raised predict knowledge fetches and preserve caches higher utilized.
The GPU connects to 16GB of GDDR6 reminiscence working at an efficient knowledge charge of 20Gbps, over a 256-bit interface (4, 4×16 reminiscence controllers). The reminiscence controllers additionally characteristic enhanced reminiscence compression expertise, to make higher use of accessible bandwidth.

The Radeon RX 9070 sequence’ precise specs are outlined above. For this launch, though they’re pictured within the slide above, AMD will not be releasing playing cards of its personal, however relatively leaving the entire designs as much as its board companions. The Radeon RX 9070 XT is the complete implementation of the Navi 48 GPU, with 64 whole CUs, 64 Ray Accelerators, and 128 AI (Matrix) Acceleratprs. It affords as much as 1,557 peak TOPs (Int4 with Sparsity) with a lift clock of two.97GHz and a complete board energy of 304 watts. The Radeon RX 9070 scales the CUs, and Ray and AI accelerators again to 56, 56, and 112, respectively, with a 2.52GHz enhance clock and solely a 220 watt board energy. The Radeon RX 9070 will supply 1,165 peak TOPs.
For those who have been anticipating a GeForce RTX 5090 killer, the Radeon RX 9070 seires shouldn’t be it. Quite, AMD is focusing on extra reasonably priced worth factors with these playing cards and efficiency ranges within the neighborhood of the GeForce RTX 4070 Ti – RTX 5070 Ti, relying on the workload.

Twin media engines and an up to date Radiance Show Engine are current on the GPU as effectively. The media engines help H.264, HEVC, and AV1 accelerated encode and decode, and supply higher output high quality and elevated efficiency (as much as 30% at 720p), at decrease energy. There’s a roughly 25% improve in H.264 low latency encode high quality, and 11% enchancment with HEVC. Principally, RDNA’s new media engine can extra shortly produce cleaner, much less blocky output, from the identical enter knowledge, because of upgrades to the movement estimation algo and multi-frame reference capabilities. AMD can also be claiming a 50% efficiency uplift for AV1 and VP9 decode, with diminished context switching overhead and reminiscence write entry, each of which finally save energy.

The Radiance Show Engine has additionally been up to date to reinforce idle energy when twin FreeSync shows are linked and it helps the newest flip mannequin in Home windows, with {hardware} Flip Queue help. This enables body pacing to be managed by the GPU, relatively than the CPU for video playback.
The Radiance Show Engine additionally incorporates Radeon Picture Sharpening 2, which is applied previous to the output being despatched to a show. RIS2 enhances the sharpness of photographs despatched to the monitor. As a result of RIS 2 is now part of the show engine, it really works throughout all APIs and might be enabled or disabled with a single toggle within the drivers.
AMD RDNA 4: New Options And Capabilities
AMD’s RDNA 4 GPU structure permits a variety of new options and applied sciences, which ought to finally improve efficiency and effectivity and make the Radeon RX 9070 sequence higher fitted to the newest video games and AI workloads.

One characteristic coming with RDNA 4 that ought to improve efficiency and guarantee optimum GPU utilization is dynamic registers. With RNDA 3, the shaders allotted registers for the worst case situation, however that led to important variance in reside registers. With RDNA 4, nonetheless, the shaders and dynamically allocate shaders from the out there pool when wanted, and launch them again to the pool when work is full. In the end this enables the structure to raised make the most of avilable registers and probably have extra waves in-flight at any given time.

We talked about one other new characteristic, Oriented Bounding Containers, a bit earlier on this piece. At the moment, Bounding Quantity Hierarchies (BVH) are generally used acceleration constructions for ray tracing on the GPU, due to their comparatively low reminiscence footprint and suppleness in adapting to temporal adjustments in scene geometry. With earlier generations, the bounding packing containers used to construct the BVH have been aligned to the vertical and horizontal axis in a scene, which may end in a lot bigger packing containers than technically essential to sure the mandatory geometry. With OOB in RDNA 4, nonetheless, the bounding field might be rotated and higher aligned to any off-axis geometry. This leads to a discount of traversal steps and extra environment friendly use of GPU and reminiscence sources, which ought to enhance efficiency.

RDNA 4 additionally helps Out Of Order reminiscence operations. With RDNA 3, reminiscence operations have been carried out within the order by which they have been made, which may end in extra latency ought to one wave require requesting new knowledge, whereas one other wave is ready on its subsequent step. RDNA 4 permits for requests from totally different shaders to be carried out out of order, which permits for extra environment friendly execution and diminished latency total.

As talked about, RDNA 4 is a signifcant step ahead for GPU Ray Tracing as effectively. RDNA 4 doubles throughput and options devoted {hardware} for Occasion rework and stack administration acceleration. Though RDNA 4 additionally helps an 8-wide BVH, new primitive node compression expertise truly reduces the reminiscence necessities.

How all of those new developments have an effect on ray tracing efficiency is printed within the slide above. The elevated throughput and BVH8 help account for the overwhelming majority of the rise, however OBB, OOO Reminiscence, the brand new BVH compression, and occasion rework all contribute as effectively.
FSR 4 And HYPR-RX Updates For RDNA 4

Arriving alongside the Radeon RX 9070 sequence is the newest iteration of FidelityFX Tremendous Decision – FSR4. Enhancements to HYPR-RX are coming as effectively. For these unaware, FSR 4 options ML-powered upscaling, to enhance picture high quality and cut back frequent points related to present upscalers, like ghosting, incorrect anti-aliasing, particle artifacts, moiré results, and extra. AMD claims close to native – and typically higher – picture high quality with FSR 4 upscaling enabled, however clearly at a lot greater perceived efficiency, as a result of the video games are literally rendered at a decrease enter decision.

In contrast to earlier variations of FSR, which didn’t leverage any form of machine studying, FSR 4 advantages from an AMD-built customized sport ML fashions and a brand new, RDNA 4-accelerated AI upscaling algorithm. FSR 4 was particularly developed for RDNA 4 and works alongside FSR Body Technology and Radeon Anti-Lag latency discount. FSR Body Technology will inject a single generated body at the moment (although extra are potential sooner or later). We also needs to point out that FSR 4 makes use of FSR 3.1’s upgradable API, so for any video games which are already FSR 3.1 enabled, FSR 4 is basically a drop-in replace.

Which brings as much as HYPR-RX. For those who recall, HYPR-RX is not a single characteristic, however relatively refers to a toggle in AMD’s driver management panel that serves to activate a number of options directly, together with Radeon Tremendous Decision (with or with out Fluid Movement Frames 2), Radeon Enhance, and Radeon Anti-Lag(+). When enabled, HYPR-RX toggles on Radeon Tremendous Decision, Radeon Enhance, and Radeon Anti-Lag(+), however be aware that Radeon Enhance and Anti-Lag(+) don’t work in each sport. In video games that are not supported by these two applied sciences, nonetheless, customers will nonetheless profit from the efficiency enhancements that come by the use of Radeon Tremendous Decision. HYPR-RX is mainly a one-click efficiency enhancing answer for players that is probably not savvy sufficient, or just don’t need to, handle the person applied sciences on their very own.

AMD’s in-driver body era expertise, Fluid Movement Frames, can also be getting up to date for this launch. AMD Fluid Movement Body 2.1 will supply elevated picture high quality for generated frames, with diminished ghosting and higher temporal monitoring, particularly in fast-paced motion. FMF 2.1 may also higher deal with overlays and resolve and render wonderful particulars. FMF 2.1, nonetheless, shouldn’t be an RDNA 4 unique. Will probably be out there with Radeon RX 6000 sequence GPUs, and newer, and it’ll work on AMD Ryzen AI 300 sequence APUs as effectively.
AMD Software program Adrenalin Version Updates For RDNA 4
There are a mess of updates coming to AMD’s Adrenalin Version software program too, lots of which might be AI-infused.

The beforehand talked about Radeon Picture Sharpening 2 replace is coming particularly for the Radeon RX 9000 sequence, which affords stronger and extra responsive sharpening with all APIs, and even on the Home windows desktop, as a result of it’s now a part of the show engine. A brand new toggle will enable RIS2 to be utilized throughout all the desktop, whereas the earlier model was restricted to only particular person functions.
AMD can also be introducing a brand new AI Apps Supervisor. You may consider the AMD AI Apps Supervisor as a one-stop-shop for entry to the entire AI-enabled apps on a system. It’s just like AMD’s sport supervisor, however for AI-enabled functions, and it affords comparable options, like the flexibility to trace metrics from inside the functions.
AMD Set up Supervisor is one other new addition. AMD Set up supervisor is mainly a software that offers customers the flexibility to handle AMD-specific software program. It affords auto-updating, uninstallation choices, product info, and so forth. and is supported on all merchandise that also work with AMD Adrenaline Version software program.
Subsequent is AMD Chat. AMD Chat is GPU-accelerated, native, offline Chat Bot with picture era capabilities for Radeon RX 9000 graphics playing cards. Along with conventional Chat Bot options, AMD Chat will have the ability to toggle Adrenalin Version options on or off by merely asking the chat bot to take action and it may assist players perceive what every setting or GPU characteristic does as effectively. It makes use of a LLM to generate textual content responses and assist with immediate processing, together with picture era, with out having to explicitly ask the chat to ‘generate a picture of…’. Right now, AMD Chat helps English and Simplified Chinese language. It’s utilizing Llama 3.1 8B for English and QN2 7B for Simplified Chinese language. Picture era relies on Steady Diffusion XL 1.0, although that’s solely out there for the English model for now.

AMD may also be utilizing AI to extend driver high quality. The corporate is now automating some elements of the sport testing course of, utilizing an in-house created AI mannequin. The AI-enhanced instruments are able to detecting crashes, along with quite a few visible anomalies, like shader or form artifacts, sq. patches, popping or lacking textures, stutters, discoloration and numerous different corruption or rendering points.
For finish customers, the corporate can also be introducing AMD Picture Inspector. AMD Picture Inspector leverages AMD’s customized AI mannequin and is designed to drive sport high quality enhancements, by detecting corruption in gameplay. Customers should opt-in to make use of the characteristic, and in the event that they do, it captures and sends diagnostic knowledge within the type of textual content and pictures again to AMD for extra detailed diagnostics and (hopefully) faster fixes. The textual content info is comprised of diagnostic outcomes generated by the mannequin from a specified session, together with related nameless system info. Gameplay photographs are captured each 3 seconds when GPU utilization is under 90% and the sport is working in full display screen mode. The mannequin then stories any situations of visible artifacts or instability to AMD, so the corporate can extra shortly react to any potential driver points.
Radeon RX 9070 XT And 9070 Anticipated Efficiency
AMD supplied some anticipated efficiency knowledge for each the Radeon RX 9070 and extra highly effective 9070 XT, relative to the earlier era Radeon RX 7900 GRE…


With the video games, settings amd resolutions utilized by AMD, the Radeon RX 9070 is shaping as much as be about 20% sooner than the Radeon RX 7900 GRE, on common — in some video games the uplift is a little bit greater and in others it’s a bit decrease.


Utilizing the identical set of video games (and settings, and so forth.), the Radeon RX 9070 XT’s efficiency relative to the Radeon RX 7900 GRE is anticipated to be a lot better. The common uplift at 1440p seems be about 38%. At 4K issues look even higher, with a few 42% efficiency uplift. We do not suspect AMD would put out any shoddy numbers, however in fact, take all vendor supplied knowledge with a grain of salt. It will not be lengthy till independant critiques hit anyway.
AMD RDNA 4 And Radeon RX 9070 Collection: The Wait Is Nearly Over
All instructed, RDNA seems to be a big step ahead for AMD. Though the corporate hasn’t produced a monster GPU to compete on the ultra-high-end of the market, the Radeon RX 9070 and 9070 goal the majority of the PC gaming fanatic house and seem poised to make a giant splash, assuming efficiency in the actual world mirrors what AMD has disclosed. Fortunately, we received’t have to attend for much longer to search out out.
Subsequent week goes to be very busy round right here. NVIDIA is launching the ultimate, beforehand introduced member of the RTX 50 sequence and AMD will unleash the 9070 and 9070 XT. We’re instructed availability of the Radeon RX 9070 and 9070 XT may also be good, so right here’s hoping the following few weeks see some a lot wanted stabilization within the shopper GPU market and players which have been itching to improve can get their fingers on a shiny, new GPU, at – or near – MSRP.