Dave explains how retraining, RAG (retrieval augmented generation) and context documents serve to expand the functionality of existing models, both local and online.  For my book on the autism spectrum, check out: https://amzn.to/3zBinWM

Dave's Attic - Friday 4PM Podcast - https://www.youtube.com/@UCtb6a_CnmGbSns9G8W2Ny0w 

Follow me for updates!  
Twitter:   @davepl1968 davepl1968
Facebook: fb.com/davepl

Dave's Garage's resource offers insights, tutorials, and resources for technology enthusiasts and DIYers. Readers can learn about electronics, hardware hacking, and DIY projects. With tutorials, project guides, and community forums, Dave's Garage provides resources for makers and hobbyists interested in tinkering with technology.

Dave's Garage

Dave discusses how to add personal knowledge files and documents to large language models (LLMs) both locally and online. He breaks down three methods: retraining the model, using retrieval augmented generation (RAG), and adding documents to the context window. Dave demonstrates the setup with a dual Nvidia RTX 60008 machine, compares performance across different model sizes, and offers practical steps to integrate knowledge into models like Chat GPT and local Olama models running under Open Web UI.

Retraining vs RAG vs Context: Your Local Data on LLMs!