The 120B parameter model aims to improve compute efficiency and accuracy for complex multi-agent workloads such as software development and cybersecurity triage.

InfoWorld is a source of news, analysis, and commentary on technology trends, IT strategies, and business innovation. With a focus on enterprise technology and digital transformation, InfoWorld offers insights and guidance for IT decision-makers, software developers, and technology professionals. From  articles on cloud computing and cybersecurity to product reviews and industry trends, InfoWorld helps readers navigate the complexities of modern IT environments and make informed decisions to drive business success.

InfoWorld

Nvidia has launched Nemotron 3 Super, a 120B total / 12B active-parameter hybrid AI model designed for enterprise agentic workloads. The model combines Mamba sequence modeling, transformer attention, and Mixture-of-Experts routing to address 'context explosion' in multi-agent systems, which can generate up to 15x more tokens than standard chat. Released with open weights, datasets, and training recipes, it targets use cases like software development and cybersecurity triage. Analysts highlight its potential for lower TCO, better GPU utilization, and suitability for regulated industries requiring fine-tuning, on-prem deployment, and regulatory compliance.

Nvidia launches Nemotron 3 Super to power enterprise AI agents