Compared to DeepSeek R1, Llama-3.1-Nemotron-Ultra-253B shows competitive results despite having less than half the parameters.


VentureBeat

Nvidia has released Llama-3.1-Nemotron-Ultra-253B, an open-source large language model optimized for advanced reasoning and instruction following. The 253-billion-parameter model outperforms the larger DeepSeek R1 on several benchmarks while being more memory- and compute-efficient. Refined in post-training with fine-tuning and reinforcement learning, it supports a range of AI workflows, offers multilingual capabilities, and is available for commercial use under an open license.

Nvidia’s new Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 at half the size

Post-training for reasoning and alignment

Improved performance across numerous domains and benchmarks