In this video, I look at the DeepSeek R1 model and the technical paper behind it and how you can get the Distilled versions running with code.

Colab: https://drp.li/Z6yYS

For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: https://www.patreon.com/SamWitteveen
Twitter: https://x.com/Sam_Witteveen

🕵️ Interested in building LLM Agents? Fill out the form below
Building LLM Agents Form: https://drp.li/dIMes

👨‍💻Github:
https://github.com/samwit/llm-tutorials

⏱️Time Stamps:
00:00 Intro
02:21 Benchmarks
04:08 Using the DeepSeek V3 Base model
05:04 DeepSeek Chat Interface
08:30 DeepSeek Technical Paper
17:28 DeepSeek available in Ollama
17:59 Demo Colab

Sam Witteveen AI is a publication offering insights, tutorials, and resources for artificial intelligence (AI) enthusiasts and practitioners. Readers can learn about machine learning algorithms, deep learning frameworks, and AI applications. With tutorials, case studies, and expert interviews, Sam Witteveen AI provides  guidance and expertise for building and deploying AI solutions.

Sam Witteveen

DeepSeek released a family of models including the R1 light preview and various distilled versions. The models, which outperform several proprietary models on specific tasks, are now available with open weights and licensed for use. The post explains the process of training these models, their benchmarks compared to other models, and how to run them locally. The R1 model shows notable advancements in reinforcement learning and the way it handles tasks, making it a significant release in the AI and machine learning field.

DeepSeekR1 - Full Breakdown