Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile

This video was filmed and edited by Sean Riley.

Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com

Computerphile is a YouTube channel and platform dedicated to computer science education, featuring videos on a wide range of topics, from algorithms and data structures to computer hardware and software engineering. Readers can learn about computer science concepts, programming languages, and the history of computing. With engaging videos, expert interviews, and educational content, Computerphile provides a resource for students, educators, and technology enthusiasts.

Computerphile

Vector search is a technique used in RAG systems to find relevant documents from large collections before feeding them to an LLM. Text is embedded into high-dimensional numerical vectors using a transformer-based model, and cosine similarity is used to find semantically similar passages. A practical demo shows chunking a 170-page NIST key management PDF, embedding all chunks into a ChromaDB vector database, querying with a cryptographic question, and using a Mistral 7B model to answer based only on retrieved context. The approach handles typos and paraphrasing gracefully, and when combined with a strict prompt, the model can correctly say 'I don't know' when the answer isn't in the retrieved documents.

Vector Search with LLMs- Computerphile