Best of NVIDIAJanuary 2026

  1. 1
    Article
    Avatar of modal_labsModal·20w

    Keeping 20,000 GPUs healthy

    Modal manages over 20,000 GPUs across AWS, GCP, Azure, and OCI, encountering significant reliability and performance differences between cloud providers. Their GPU health system includes instance type benchmarking and selection, machine image preparation with automated testing, boot-time validation, and continuous passive monitoring (via DCGM and dmesg) plus weekly active healthchecks (DCGM diag, GPUBurn, NCCL tests). Key findings: Cloud providers vary dramatically in H100 performance (up to 50% differences), temperature management (some reaching 94°C), and ECC error rates. GPUs account for 58.7% of training failures in Meta's LLaMA 3 development, compared to just 0.5% for CPUs, highlighting the reliability gap.

  2. 2
    Article
    Avatar of 80lv80 LEVEL·19w

    NVIDIA CEO Doesn't Want "Well-Respected People" to Criticize AI

    NVIDIA CEO Jensen Huang criticized well-respected tech leaders for expressing concerns about AI, calling their warnings a "doomer narrative" that could lead to regulations stifling startups. He questioned the intentions of PhDs and CEOs who warn governments about dystopian AI scenarios. The comments appear directed at figures like Anthropic CEO Dario Amodei, who has supported AI regulations and warned about potential job displacement. The article questions Huang's own motivations, noting NVIDIA's position as a major AI boom beneficiary with military ties.

  3. 3
    Video
    Avatar of techlinkedTechLinked·20w

    Laptops Are So Back...

    Dell admits AI PCs failed to resonate with consumers and revives the XPS brand after a failed rebrand. Wi-Fi 8 products are already appearing despite the standard not releasing until 2028. Storage manufacturers showcase innovative designs including upgradeable external SSDs and hybrid devices. Nvidia may restart RTX 3060 production due to memory shortages, while AMD considers reintroducing AM4 products. The upcoming James Bond game lists non-existent hardware in its system requirements. XAI faces global investigations over deepfake generation, and Samsung receives a restraining order over alleged TV screenshot capture without consent.

  4. 4
    Article
    Avatar of huggingfaceHugging Face·20w

    Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

    NVIDIA introduces Nemotron Speech ASR, an open model that uses cache-aware streaming architecture to process real-time voice interactions. Unlike traditional buffered inference systems that repeatedly reprocess overlapping audio windows, this approach maintains an internal cache of encoder representations and processes each audio frame exactly once. The model achieves 3x higher efficiency, supports 560 concurrent streams on H100 GPUs, maintains stable latency under load, and delivers 24ms median time-to-final transcription. Real-world validation from Daily and Modal demonstrates zero latency drift at scale, enabling natural conversational agents with sub-900ms voice-to-voice loops.

  5. 5
    Article
    Avatar of hnHacker News·20w

    ‘We’ve Done Our Country a Great Disservice’ by Offshoring: Nvidia’s Jensen Huang Says ‘We Have to Create Prosperity’ for All, Not Just PhDs

    Nvidia CEO Jensen Huang argues that America must reverse decades of manufacturing offshoring by building AI infrastructure domestically, emphasizing that energy availability is the foundational constraint. He contends that creating prosperity for all Americans, not just highly educated workers, requires bringing manufacturing jobs back through the AI industrial revolution. Nvidia is helping build $500 billion in AI infrastructure in the U.S. and plans to leverage its position to make America the global AI manufacturing hub.