NVIDIA has collaborated with Google to deliver Gemma, a family of open models optimized for high throughput on NVIDIA GPUs. TensorRT-LLM provides kernels and optimizations that boost Gemma's performance, including FP8 quantization, the XQA attention kernel, and INT4 AWQ weight quantization.
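To make the INT4 AWQ technique mentioned above concrete, here is a minimal, hedged NumPy sketch of activation-aware 4-bit weight quantization. This is an illustration of the general idea (scale salient input channels by their activation magnitude before rounding, then fold the scale back out at dequantization), not TensorRT-LLM's actual implementation; the function name, `alpha = 0.5` exponent, and `group_size` default are assumptions for the example.

```python
import numpy as np

def int4_awq_quantize(w, act_scale, group_size=128):
    """Illustrative sketch of activation-aware INT4 weight quantization.

    w: (out_features, in_features) weight matrix.
    act_scale: per-input-channel activation magnitudes (assumed collected
    from calibration data). Channels with large activations are scaled up
    before rounding so their quantization error shrinks; the inverse scale
    is applied again at dequantization, leaving the layer output unchanged
    up to rounding error.
    """
    # Per-channel AWQ scale; alpha = 0.5 is a common choice (assumption here)
    s = act_scale ** 0.5
    s = s / s.mean()                 # keep overall weight magnitude stable
    w_scaled = w * s                 # emphasize activation-salient channels

    out, cin = w_scaled.shape
    w_groups = w_scaled.reshape(out, cin // group_size, group_size)
    # Symmetric per-group scale mapping the max magnitude to the int4 range
    q_scale = np.abs(w_groups).max(axis=-1, keepdims=True) / 7.0
    q = np.clip(np.round(w_groups / q_scale), -8, 7).astype(np.int8)

    # Dequantize: undo the group scale, then the per-channel AWQ scale
    w_hat = (q * q_scale).reshape(out, cin) / s
    return q, q_scale, s, w_hat

# Example usage with random weights and calibration statistics
rng = np.random.default_rng(0)
w = rng.normal(size=(8, 256))
act = np.abs(rng.normal(size=256)) + 0.1
q, qs, s, w_hat = int4_awq_quantize(w, act)
```

The payoff in practice is memory bandwidth: each weight occupies 4 bits plus a small per-group scale, which is what lets quantized Gemma variants fit on smaller GPUs and decode faster.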

4 min read · From developer.nvidia.com
Table of contents
- TensorRT-LLM makes Gemma models faster
- Real-time performance with over 79K tokens per second
- Get started now
