DeepSeek R-1 is the most powerful open-source reasoning model that performs on par with OpenAI's o1 model.

Run the 1.58-bit Dynamic GGUF version by Unsloth.

Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

The post explains how to install and run the DeepSeek-R1 model, highlighting the importance of adding BOS and EOS tokens in interactions. It provides detailed setup instructions using commands like `apt-get update` for dependencies, downloading the model via `huggingface_hub`, and outlines how to configure GPU offloading based on available memory. Additionally, there's guidance on quantizing the model's K cache to 4bit and running the model using those configurations.

Run DeepSeek-R1 Dynamic 1.58-bit

<p>Does it have sense to run it in such a small quant?</p>


<p>i’m so glad It provides detailed setup instructions using commands like <code>apt-get update</code> for dependencies, thank you AI TL;DR</p>