DeepSeek V4: Nvidia Is Supporting It with Blackwell Before Everyone Else!
TECH NEWS – On 1.6T models, Jensen Huang’s company is already pushing as much as 3,500 tokens per second out of the Chinese AI model. DeepSeek V4 has arrived, bringing major optimizations with it, including model sizes of up to 1.6T, and Nvidia is already offering Day-0 support for it on Blackwell GPUs using NVFP4. The updated AI model uses only 27% of the inference FLOPs per token and just 10% of the KV cache when operating with a one-million-token context window. Two new models have also been introduced: a Pro model with 1.6 trillion parameters and a Flash version with 284 billion parameters. Nvidia says Blackwell GPUs provide both the scale and the low-latency performance required to run the long-context, one-million-token inference and… Olvasd tovább... DeepSeek V4: Nvidia Is Supporting It with Blackwell Before Everyone Else!
- Hirdetés -
