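RTX GPUs gain NVFP4 support, making image generation up to 4.6x faster.
The update applies to RTX 30-, 40-, and 50-series cards.
According to the chart, enabling native NVFP4 support in Qwen Image delivers up to a 4.6x performance gain.
It also reduces VRAM usage.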
| Feature | FP4 (E2M1) | MXFP4 | NVFP4 |
|---|---|---|---|
| Format structure | 4 bits (1 sign, 2 exponent, 1 mantissa) plus software scaling factor | 4 bits (1 sign, 2 exponent, 1 mantissa) plus 1 shared power-of-two scale per 32-value block | 4 bits (1 sign, 2 exponent, 1 mantissa) plus 1 shared FP8 scale per 16-value block |
| Accelerated Hardware Scaling | No | Yes | Yes |
| Memory | Up to 4x less memory than FP16 | Up to 4x less memory than FP16 | Up to 4x less memory than FP16 |
| Accuracy | Risk of noticeable accuracy drop compared to FP8 | Risk of noticeable accuracy drop compared to FP8 | Lower risk of noticeable accuracy drop particularly for larger models |

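To make the "4-bit value plus one shared scale per block" structure in the table concrete, here is a minimal Python sketch of block-scaled FP4 quantization. It is an illustration under stated assumptions, not NVIDIA's implementation: the function names are hypothetical, the shared scale is kept as an ordinary Python float instead of being rounded to FP8, and the scale is assumed to take 8 bits of storage in both MXFP4 and NVFP4. The `bits_per_value` helper shows why the saving is "up to" 4x: the shared scale adds a small per-value overhead on top of the 4 bits.

```python
# Magnitudes representable by a 4-bit E2M1 value (1 sign, 2 exponent, 1 mantissa bit).
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
E2M1_MAX = 6.0


def quantize_block(values):
    """Quantize one block to (scale, codes), where codes are signed E2M1 grid values."""
    amax = max(abs(v) for v in values)
    if amax == 0.0:
        return 1.0, [0.0] * len(values)
    # Map the largest magnitude in the block onto the largest E2M1 value.
    # Assumption: the scale stays a Python float; hardware would store it in FP8.
    scale = amax / E2M1_MAX
    codes = []
    for v in values:
        target = abs(v) / scale
        nearest = min(E2M1_GRID, key=lambda g: abs(g - target))
        codes.append(nearest if v >= 0 else -nearest)
    return scale, codes


def dequantize_block(scale, codes):
    """Reconstruct approximate values from a shared scale and E2M1 codes."""
    return [scale * c for c in codes]


def quantize(values, block_size=16):
    """Split a flat list into blocks (16 values for the NVFP4 layout) and quantize each."""
    return [
        quantize_block(values[i:i + block_size])
        for i in range(0, len(values), block_size)
    ]


def bits_per_value(block_size, scale_bits=8, element_bits=4):
    """Average storage per value, counting the shared per-block scale."""
    return element_bits + scale_bits / block_size


if __name__ == "__main__":
    data = [0.1 * i for i in range(-8, 8)]
    scale, codes = quantize(data)[0]
    print("codes:", codes)
    print("restored:", dequantize_block(scale, codes))
    print("bits/value, 16-value blocks:", bits_per_value(16))
    print("bits/value, 32-value blocks:", bits_per_value(32))
```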
See NVIDIA's blog for the full details. At CES 2026, NVIDIA also open-sourced what is described as the world's largest dataset.


