Lemonade is an open source local AI server that runs LLMs, image generation, speech, and transcription on consumer PCs using GPUs and NPUs. It features a native C++ backend weighing only 2MB, a one-minute installer, OpenAI API compatibility for drop-in integration with hundreds of apps, multi-engine support (llama.cpp, Ryzen AI SW, FastFlowLM), and cross-platform support for Windows, Linux, and macOS. Version 10.0.1 was released March 24, 2026.
Table of contents
Built by the local AI community for every PC.Works with great apps.Built for practical local AI workflows.One local service for every modality.Always improving.Sort: