KittenTTS is an ultra-lightweight open-source text-to-speech model with only 15 million parameters and under 25MB size. It runs on CPU without GPU requirements, offers multiple voice options, and is optimized for real-time speech synthesis. The model is currently in developer preview with plans for full release, mobile SDK, and web version.

1m read timeFrom github.com
Post cover image
Table of contents
✨ Features🚀 Quick Start💻 System RequirementsChecklist
7 Comments

Sort: