Lemonade is an open source local AI server that runs LLMs, image generation, speech, and transcription on consumer PCs using GPUs and NPUs. It features a native C++ backend weighing only 2MB, a one-minute installer, OpenAI API compatibility for drop-in integration with hundreds of apps, multi-engine support (llama.cpp, Ryzen AI SW, FastFlowLM), and cross-platform support for Windows, Linux, and macOS. Version 10.0.1 was released March 24, 2026.

2m read timeFrom lemonade-server.ai
Post cover image
Table of contents
Built by the local AI community for every PC.Works with great apps.Built for practical local AI workflows.One local service for every modality.Always improving.

Sort: