• 0 Posts
  • 11 Comments
Joined 11 months ago
cake
Cake day: October 25th, 2023

help-circle
  • In Windows, access to the GPU by multiple processes is scheduled, much like access to the CPU is. Without HAGS, all of this scheduling is performed by the CPU. With HAGS, some of the scheduling in particular situations can be offloaded to the GPU.

    It has nothing whatsoever to do with how work on the GPU is scheduled across the compute resources.

    It will have no measurable impact on gaming performance, and it’s not supposed to, if done correctly. It’s only going to potentially affect performance when multiple applications are trying to use the GPU at the same time to a significant degree.

    So if you’re trying to play a game at the same time as you’re using your GPU to render something, then HAGS might slightly reduce the overhead of both tasks sharing the GPU. You still won’t actually notice a difference.







  • It wasn’t about bus width, it was about nVidia’s fictitious CUDA core counts with Ampere.

    At the last minute, the 4352 CUDA cores of the 3080 (same as the 2080 Ti) was changed to 8704 “CUDA cores”, because the INT32 ALU was replaced with a dual-function INT32/FP32 ALU. People who didn’t understand that (i.e. basically everyone who didn’t call out nVidia’s dishonesty in marketing those figures) thought, from the leaks, that it’d be 8704 shaders against 4608 shaders. It wasn’t. It was more like ~5200-5400 shaders, depending on resolution, against 4608, with the latter running at a substantially higher clock speed.

    Ironically, the reverse happened with RDNA 3, as the values leaked were incorrect - they said 12,288 ALU’s for Navi 31, without mentioning that it was really 6144 FP32 ALU’s with 6144 INT32/FP32 ALU’s that could be partially used. So people thought it was 12,288 on the side of Navi 31 versus 16,384 on the side of the 4090, with those numbers meaning the same as they did with 5120 for the 6900 XT versus 10,496 for the 3090. But they didn’t mean the same thing at all. It was ~7400 effective shaders for Navi 31 versus ~10,240 effective shaders for the 4090. With no real clock speed advantage.

    As it turns out, the 4090 scales pretty poorly though, so it’s not as far ahead of the 7900 XTX as it should be base on raw compute.