Towards Data Science is a community-powered publication that showcases work in data science, machine learning and artificial intelligence. Every day newcomers, seasoned researchers and industry practitioners publish tutorials, research notes and real-world case studies that help the field move forward.

Traditional RAG systems lose context when documents are split into chunks, which can lead to irrelevant retrievals. Contextual retrieval, introduced by Anthropic in 2024, addresses this by using an LLM to generate a short contextual description for each chunk before indexing, situating the chunk within its source document. The enriched chunk is then used for both vector (embedding) and BM25 indexing. Anthropic reports that contextual embeddings alone reduce the retrieval failure rate by 35%, and by 49% when combined with contextual BM25. Cost concerns are mitigated because the extra LLM calls happen only at ingestion time, not at query time, and prompt caching can further reduce the expense.
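To make the ingestion step concrete, here is a minimal sketch in Python, not a definitive implementation. It assumes a hypothetical `llm` callable (replaced here by an offline stub, `fake_llm`), an illustrative prompt whose wording is my own, and a small hand-written BM25 scorer standing in for a real search index. The sample document and chunks are made up for the example.

```python
import math
import re
from collections import Counter
from typing import Callable, List

# Illustrative prompt; the wording is an assumption, not a quoted template.
CONTEXT_PROMPT = (
    "<document>\n{document}\n</document>\n"
    "Here is a chunk from that document:\n<chunk>\n{chunk}\n</chunk>\n"
    "Write one or two sentences situating this chunk within the document."
)

def contextualize(document: str, chunks: List[str],
                  llm: Callable[[str], str]) -> List[str]:
    """Prepend an LLM-written context to every chunk (runs at ingestion only)."""
    enriched = []
    for chunk in chunks:
        context = llm(CONTEXT_PROMPT.format(document=document, chunk=chunk))
        enriched.append(f"{context}\n{chunk}")
    return enriched

def tokenize(text: str) -> List[str]:
    """Lowercase and split into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())

def bm25_scores(query: str, docs: List[str],
                k1: float = 1.5, b: float = 0.75) -> List[float]:
    """Score every doc against the query with the Okapi BM25 formula."""
    tokenized = [tokenize(d) for d in docs]
    n = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n
    query_terms = set(tokenize(query))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in query_terms:
            df = sum(1 for t in tokenized if term in t)
            if df == 0:
                continue  # term appears nowhere in the corpus
            idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
            norm = 1 - b + b * len(tokens) / avg_len
            score += idf * tf[term] * (k1 + 1) / (tf[term] + k1 * norm)
        scores.append(score)
    return scores

def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call so the sketch runs offline."""
    return "This chunk is from ACME Corp's Q2 2023 financial report."

# Made-up sample data for illustration.
document = ("ACME Corp Q2 2023 financial report. Revenue grew by 3% over the "
            "previous quarter. Headcount stayed flat at 1,200 employees.")
chunks = [
    "Revenue grew by 3% over the previous quarter.",
    "Headcount stayed flat at 1,200 employees.",
]

enriched = contextualize(document, chunks, fake_llm)
query = "ACME revenue growth"
raw_scores = bm25_scores(query, chunks)    # chunks indexed as-is
ctx_scores = bm25_scores(query, enriched)  # chunks indexed with context
```

In this toy run, the raw second chunk shares no terms with the query and scores zero, while its enriched version picks up a match on "ACME" from the prepended context. The same enriched strings would also be passed to an embedding model for the vector index. A real pipeline would swap `fake_llm` for an actual model call, ideally with the full document cached across the per-chunk prompts.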

Understanding Context and Contextual Retrieval in RAG