Gaming PC

Nvidia Reveals Ada Lovelace GPU Secrets: Extreme Transistor Counts at High Clocks

When Nvidia unveiled its Ada Lovelace family of graphics processing units earlier this week, it mostly focused on its top-end AD102 GPU and flagship GeForce RTX 4090 graphics card. Not many details have been released about the AD103 and AD104 graphics chips. Luckily, Nvidia has today uploaded an Ada Lovelace whitepaper that contains a ton of data about their new GPUs and fills in many gaps. We’ve updated all known hubs for the RTX 40-series GPUs with new details, but here’s a quick rundown of the new and interesting information.

Big GPU for big games

We already know that Nvidia’s top-of-the-line AD102 is a 608 mm^2 GPU containing 76.3 billion transistors, 18,432 CUDA cores and 96MB of L2 cache. We also know that the AD103 is a 378.6 mm^2 graphics processor with 45.9 billion transistors, 10,240 CUDA cores and 64MB of L2 cache. As for the AD104, it has a die size of 294.5 mm^2, 35.8 billion transistors, 7680 CUDA cores and 48 MB of L2.

Nvidia Ada Specs and Amps
GPU/graphic card Full AD102 RTX4090 RTX4080 16GB RTX4080 12GB RTX 3090 Ti
architecture AD102 AD102 AD103 AD104 GA102
process technology TSMC 4N TSMC 4N TSMC 4N TSMC 4N Samsung 8LPP
Transistor (billion) 76.3 76.3 45.9 35.8 28.3
Die size (mm^2) 608 608 378.6 294.5 628.4
streaming multiprocessor 144 128 76 60 84
GPU core (shader) 18432 16384 9728 7680 10752
Tensor cores 576 512 320 240 336
Ray Tracing Core 144 144 80 60 84
TMU 512 512 304? 240 336
ROP 192 192 112 80 112
L2 Cache (MB) 96 96 64 48 6
Boost Clock (MHz) ? 2520 2505 2600 1860
TFLOPS FP32 (Boost) ? 82.6 48.7 40.1 40.0
TFLOPS FP16 (FP8) ? 661 (1321) 390 (780) 319 (639) 320 (none)
TFLOPS Ray Tracing ? 191 113 82 78.1
memory interface (bits) 384 384 256 192 384
Memory speed (GT/s) ? twenty one 22.4 twenty one twenty one
Bandwidth (GBps) ? 1008 736 504 1008
TDP (Watts) ? 450 320 285 450
Release date ? October 12, 2022 November 2022? November 2022? March 2022
launch price ? $1,599 $1,199 $899 $1,999

One of the interesting things Nvidia mentions in its whitepaper is that the Ada Lovelace GPU uses fast transistors in the critical path to boost maximum clock speeds. As a result, a fully capable AD102 GPU with 18,432 of his CUDA cores “can be clocked above 2.5 GHz while maintaining the same 450W TGP.” With this in mind, it should come as no surprise that the company is talking about the 3.0 GHz clock of the GeForce RTX 4090 (which has 16,384 CUDA cores) reached in the lab. Definitely tops the list of cards.

(Image credit: Nvidia)

In addition to high clock speeds, Nvidia’s Ada Lovelace GPUs also feature a large L2 cache to boost performance for computationally intensive workloads (ray tracing, path tracing, simulations, etc.) and reduce memory bandwidth requirements. reduce Basically, Nvidia’s Ada GPUs take a page from the RDNA 2 Infinity Cache book (here) as a reference, but the general target for the new architecture is pretty much where AMD’s Radeon RX 6000 series products debut in 2020. I believe it was set before.

Related Articles

Back to top button