Gaming PC

Intel Details Inner Workings of XeSS

Intel has released an explainer video of its upcoming XeSS AI upscaling technology, showcasing how the technology will work on its Arc Alchemist GPUs, which are almost ready to go public. We used the fastest Arc A770 for demonstration, but it’s hard to say how the performance will match the best graphics cards based on the limited performance details provided.

If you’re even slightly familiar with Nvidia’s DLSS, it’s been around for four years in one form or another, but this video should inspire Deja Vu’s keen senses. Tom Petersen, who previously worked at Nvidia and did some of his previous DLSS presentations, explains the fundamentals of XeSS. In a nutshell, XeSS is very similar to Nvidia’s mirrored version of his DLSS, except that it’s designed to run on Intel’s deep learning XMX cores instead of Nvidia’s tensor cores. . However, the tech works with other GPUs as well, but using DP4a mode could be an interesting alternative to AMD’s FSR 2.0 upscaler.

In the demo shown by Intel, XeSS seemed to work well. Of course, it’s difficult to say for certain when the source video is a 1080p compressed version of the actual content, but I’ll save a detailed picture quality comparison for another time. Performance improvements appear to be similar to those seen with DLSS, with over 100% improvement in

Usage

If you already know how DLSS works, Intel’s solution is pretty much the same, with a few minor tweaks. XeSS is an AI-accelerated resolution upscaling algorithm designed to boost frame rates in video games.

It starts with training, the first step of most deep learning algorithms. The AI ​​network takes low resolution sample frames from the game and processes them to produce what needs to be upscaled in the output image. The network then compares the result to the desired target image and backpropagates weight adjustments to try to fix the “error”. The resulting image doesn’t look good at first, but the AI ​​algorithm slowly learns from its mistakes. After thousands (or more) of training images, the network will eventually converge towards the ideal weights that “magically” produce the desired result.

Once the algorithm is fully trained using examples from different games, it is theoretically possible to take any image input from any video game and upscale it almost perfectly. Similar to DLSS (and FSR 2.0), the XeSS algorithm also plays a role in anti-aliasing and replaces traditional solutions such as temporal AA.

(Image credit: Intel)

Again, nothing particularly noteworthy so far. DLSS and FSR 2.0, and even standard temporal AA algorithms, have many of the same core features (minus the AI ​​elements of FSR and TAA). The game integrates XeSS into its rendering pipeline. This is typically after the main rendering and initial effects are complete, but before any post-processing effects and GUI/HUD elements are drawn. That way, the UI stays crisp even when doing the difficult task of 3D rendering at lower resolutions.

XeSS runs on Intel’s Arc XMX cores, but can also run on other GPUs in slightly different modes. A DP4a instruction is basically four INT8 (8-bit integer) computations performed using a single 32-bit register, typically accessible via a GPU shader core. The XMX core, on the other hand, natively supports INT8 and can manipulate 128 values ​​at once.

This may seem very biased, but as an example, the Arc A380 has 1024 shader cores, each capable of doing 4 INT8 operations at the same time. Alternatively, the A380 has 128 MXM units, each capable of 128 INT8 operations. This makes the MXM’s throughput 4x faster than his DP4a’s, but clearly he should be good enough to get some XeSS goodness even in DP4a mode.

DP4a is Wrong A trained network, possibly a less computationally intensive network. How that translates into actual performance and image quality remains to be seen.If game developers want to support non-Arc GPUs, they’ll need to explicitly include support for both XMX and DP4a modes. There seems to be

Intel XeSS Performance Expectations

Related Articles

Back to top button