SmolLM2 is a family of compact language models ranging from 135M to 1.7B parameters, designed for on-device use with versatile capabilities. The SmolLM2-1.7B-Instruct model can be used as an assistant via various tools and frameworks. Detailed instructions for pre-training, fine-tuning, and using these models are provided.
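As a sketch of the assistant usage mentioned above: Hugging Face chat models are typically driven through a chat template that wraps each message in role markers. The snippet below builds such a prompt locally so it runs without downloading weights; the ChatML-style markers and the model id `HuggingFaceTB/SmolLM2-1.7B-Instruct` are assumptions based on common Hugging Face conventions, not details stated on this page.

```python
# Sketch of prompting SmolLM2-1.7B-Instruct as an assistant. A real run would
# load the model with transformers' AutoTokenizer/AutoModelForCausalLM; here we
# only render the prompt locally so the example works offline.

def build_chatml_prompt(messages):
    """Render a list of {role, content} dicts as a ChatML-style prompt.

    NOTE: assumed template. In practice, prefer
    tokenizer.apply_chat_template(...), which uses the authoritative
    template shipped with the model.
    """
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # cue the model to respond
    return "".join(parts)

messages = [
    {"role": "user", "content": "What is the capital of France?"},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```

With `transformers` installed, the equivalent real call would be `tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)`, with the result passed to `model.generate(...)`.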

3 min read · From github.com
Table of Contents
- Usage
- Smol-tools
- Pre-training
- Fine-tuning
- Evaluation
- Synthetic data pipelines
