A conference talk covering how to secure RAG pipelines using fine-grained, relationship-based access control (ReBAC) inspired by Google Zanzibar. The speaker explains the evolution of authorization models from ACLs to RBAC to ABAC to ReBAC, then demonstrates two techniques for enforcing document-level permissions in RAG systems: pre-filter (lookup which documents a user can access before querying the vector DB) and post-filter (retrieve embeddings first, then check permissions per document). A live demo using SpiceDB, Pinecone, LangChain, and OpenAI shows both approaches in action, including revoking access in real time. The talk also covers applicability to AI agents, audit logging benefits, and production usage at scale (e.g., ChatGPT connectors handling 37 billion documents).

44m watch time

Sort: