LLM2Vec is a simple approach that transforms any decoder-only LLM into a text encoder, achieving state-of-the-art performance on the Massive Text Embedding Benchmark (MTEB) in both the unsupervised and supervised categories. The method has three steps: enabling bidirectional attention, training with masked next token prediction, and applying unsupervised contrastive learning, which together yield robust text representations.
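
Two of the ingredients named above can be illustrated with a small, dependency-free sketch. This is not the LLM2Vec code: the function names (`attention_mask`, `info_nce_loss`), the toy vectors, and the temperature value are all assumptions chosen for illustration, and the loss shown is a generic contrastive objective with in-batch negatives rather than the authors' exact training recipe.

```python
import math


def attention_mask(seq_len, bidirectional):
    """Build a seq_len x seq_len 0/1 mask (hypothetical helper).

    mask[i][j] == 1 means position i may attend to position j.
    A causal (decoder-only) mask allows only j <= i; a bidirectional
    mask allows every position to see every other position.
    """
    return [
        [1 if (bidirectional or j <= i) else 0 for j in range(seq_len)]
        for i in range(seq_len)
    ]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchors, positives, temperature=0.05):
    """Contrastive loss with in-batch negatives (illustrative only).

    positives[i] is the positive for anchors[i]; every other row of
    positives acts as a negative. temperature is an assumed value.
    """
    total = 0.0
    for i, anchor in enumerate(anchors):
        logits = [cosine(anchor, p) / temperature for p in positives]
        # Numerically stable log-sum-exp over all candidates.
        peak = max(logits)
        log_denom = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_denom - logits[i]
    return total / len(anchors)


if __name__ == "__main__":
    print("causal:       ", attention_mask(3, bidirectional=False))
    print("bidirectional:", attention_mask(3, bidirectional=True))

    anchors = [[1.0, 0.0], [0.0, 1.0]]
    aligned = [[0.9, 0.1], [0.1, 0.9]]
    print("loss (aligned pairs):", info_nce_loss(anchors, aligned))
```

The mask comparison shows what "enabling bidirectional attention" changes at the attention level: the upper triangle that a causal model hides becomes visible. The loss is low when each anchor is closest to its own positive and high when the pairs are mismatched.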

5m read time From marktechpost.com