KittenTTS is an ultra-lightweight open-source text-to-speech model with only 15 million parameters and a size under 25 MB. It runs on CPU with no GPU required, offers multiple voice options, and is optimized for real-time speech synthesis. The model is currently in developer preview, with a full release, a mobile SDK, and a web version planned.