In this video, we look at a recent Ollama feature that lets you easily download any GGUF-format model from the Hugging Face Hub.

For more tutorials on using LLMs and building agents, check out my Patreon.
Patreon: https://www.patreon.com/SamWitteveen
Twitter: https://twitter.com/Sam_Witteveen

Hugging Face Docs: https://huggingface.co/docs/hub/ollama

🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes

👨‍💻Github:
https://github.com/samwit/langchain-tutorials (updated)
https://github.com/samwit/llm-tutorials


⏱️Time Stamps:
00:00 Intro
00:08 Hugging Face Community Blog
00:54 How to
03:17 How to pick the right quantization format?
05:19 Hugging Face Hub: Custom Chat Template and Parameters
05:58 Hugging Face GGUF Models

Sam Witteveen AI is a publication offering insights, tutorials, and resources for artificial intelligence (AI) enthusiasts and practitioners. Readers can learn about machine learning algorithms, deep learning frameworks, and AI applications. With tutorials, case studies, and expert interviews, Sam Witteveen AI provides guidance and expertise for building and deploying AI solutions.

Sam Witteveen

Ollama and Hugging Face have announced a collaboration that lets Ollama use the GGUF models hosted on the Hugging Face Hub, around 45,000 models in total. Users can run any of these models with the ollama run command and can choose the level of quantization, from 2-bit to 8-bit. The post also gives guidance on picking a quantization format by weighing performance against output quality. The feature makes it quick to try a wide range of models locally.
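
As a rough sketch of what the run command looks like, the Hugging Face docs describe the model reference as hf.co/{username}/{repository}, with an optional :{quantization} tag. The helper names below (hf_model_ref, ollama_run_command, approx_weight_gb) are hypothetical and written only for illustration. The repository and quantization tag are examples, so check the Hub for what a given repo actually ships. The size estimate is a back-of-the-envelope figure (parameters x bits / 8) that ignores the per-block overhead real GGUF quantization formats carry.

```python
from typing import List, Optional


def hf_model_ref(username: str, repository: str,
                 quantization: Optional[str] = None) -> str:
    """Build an Ollama model reference for a GGUF repo on the Hugging Face Hub.

    Follows the hf.co/{username}/{repository}[:{quantization}] pattern.
    """
    ref = f"hf.co/{username}/{repository}"
    if quantization:
        # The tag selects one of the quantized GGUF files in the repo.
        ref += f":{quantization}"
    return ref


def ollama_run_command(username: str, repository: str,
                       quantization: Optional[str] = None) -> List[str]:
    """Return the argv list for `ollama run` (hypothetical helper)."""
    return ["ollama", "run", hf_model_ref(username, repository, quantization)]


def approx_weight_gb(num_parameters: float, bits_per_weight: float) -> float:
    """Very rough weight footprint in GB: parameters * bits / 8 bytes.

    Assumption: ignores quantization metadata, so real files are larger.
    """
    return num_parameters * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    # Illustrative repo and tag; neither is guaranteed to exist for every model.
    cmd = ollama_run_command("bartowski", "Llama-3.2-1B-Instruct-GGUF", "Q8_0")
    print(" ".join(cmd))

    # Compare rough footprints of an 8B-parameter model across bit widths.
    for bits in (2, 4, 8):
        print(f"{bits}-bit: ~{approx_weight_gb(8e9, bits):.1f} GB")
```

To actually launch the model, pass the list to subprocess.run(cmd) on a machine with Ollama installed. Lower bit widths shrink the download and memory use at some cost in output quality, which is the trade-off the video discusses.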

Ollama + HuggingFace - 45,000 New Models

The world of open source never ceases to amaze.