NVIDIA released Nemotron ColEmbed V2, a family of late-interaction multimodal embedding models in 3B, 4B, and 8B sizes that achieve state-of-the-art performance on ViDoRe V1, V2, and V3 benchmarks for visual document retrieval. The 8B model ranks #1 on ViDoRe V3, using a ColBERT-style multi-vector architecture that enables

5m read timeFrom huggingface.co
Post cover image
Table of contents
Nemotron ColEmbed V2 Highlights (TL;DR)Models’ ArchitectureStart Building with Nemotron ColEmbed V2

Sort: