Building an Advanced RAG System With Self-Querying Retrieval

Learn how to build an advanced Retrieval Augmented Generation (RAG) system that leverages self-querying retrieval to improve search relevance. This tutorial covers extracting metadata filters from natural language queries, combining metadata filtering with vector search, and generating structured outputs using LLMs. The guide focuses on developing an investment assistant to answer financial questions using MongoDB as the vector store and LangGraph for orchestration.

#llm

#mongodb

#data-processing

#rag

#vector-search

Sep 11, 2024•30m read time•From mongodb.com

Table of contents

What is metadata? Why is it important for RAG?Extracting metadata filters from natural language Building a RAG system with self-querying retrieval Step 1: Decide what metadata to extract Step 2: Install required libraries Step 3: Set up prerequisites Step 4: Partition, chunk, and embed PDF files Step 5: Add custom metadata to the processed documents Step 6: Write the processed documents to MongoDB Step 7: Define graph state Step 8: Define graph nodes Step 9: Define conditional edges Step 10: Build the graph Step 11: Execute the graph Conclusion

Comment

Bookmark

Copy

Sort: