Nvidia has launched Nemotron 3 Super, a 120B total / 12B active-parameter hybrid AI model designed for enterprise agentic workloads. The model combines Mamba sequence modeling, transformer attention, and Mixture-of-Experts routing to address 'context explosion' in multi-agent systems, which can generate up to 15x more tokens than standard chat. Released with open weights, datasets, and training recipes, it targets use cases like software development and cybersecurity triage. Analysts highlight its potential for lower TCO, better GPU utilization, and suitability for regulated industries requiring fine-tuning, on-prem deployment, and regulatory compliance.
Sort: