Kimi K2.5 is the newest open vision language model (VLM) from the Kimi family of models. Kimi K2.5 is a general-purpose multimodal model that excels in current…

NVIDIA DevTalk serves as a vibrant community hub where developers can engage in discussions, seek assistance, and collaborate on projects involving NVIDIA hardware and software. Developers can tap into the collective expertise of the NVIDIA developer community, sharing insights, troubleshooting issues, and exploring best practices for GPU programming and AI development. Additionally, DevTalk provides a platform for developers to showcase their projects, receive feedback, and network with peers, fostering collaboration and knowledge exchange within the NVIDIA ecosystem.

NVIDIA Developer

Kimi K2.5 is a new open-source vision language model with 1T total parameters (32.86B active) that supports text, image, and video inputs with a 262K context length. The model uses a mixture-of-experts architecture with 384 experts and achieves 3.2% parameter activation per token. Developers can access GPU-accelerated endpoints for free prototyping through build.nvidia.com, deploy using vLLM, or fine-tune with NVIDIA NeMo Framework and AutoModel for domain-specific tasks.

Build with Kimi K2.5 Multimodal VLM Using NVIDIA GPU-Accelerated Endpoints

Build with NVIDIA GPU-accelerated endpoints