I have been watching Home Assistant’s progress with Assist for some time. We previously used Google Home via Nest Minis and have since switched to a fully local Assist setup backed by local first + llama.cpp (previously Ollama).…

Hacker News

A detailed account of migrating from Google Home/Nest Minis to a fully local Home Assistant voice assistant powered by llama.cpp and local LLMs. Covers hardware selection (eGPU setup with Beelink MiniPC), GPU and model comparisons, STT/TTS stack (Faster Whisper, Piper), LLM prompt engineering to fix unwanted behaviors, custom wake word training with microWakeWord, and automation-based music playback via Music Assistant. Key lessons include using higher-quantization GGUF models from HuggingFace, the critical role of prompt design, and solving edge cases like emoji in TTS output and false activations.

My Journey to a reliable and enjoyable locally hosted voice assistant