Daily Open Source Tools
serdarbuyukdereli's profile
Serdarcan Buyukdereli@serdarbuyukdereli•Apr 11
20.3K
Post cover image

GitHub - OpenBMB/VoxCPM: VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

From github.com•Apr 11•20m read time

VoxCPM2 is a 2B-parameter, tokenizer-free Text-to-Speech system using a diffusion autoregressive architecture. It supports 30 languages, voice design from natural-language descriptions, controllable voice cloning, and outputs 48kHz studio-quality audio. Trained on over 2 million hours of multilingual speech data, it achieves state-of-the-art or competitive results on multiple TTS benchmarks. The model is fully open-source under Apache-2.0, supports SFT and LoRA fine-tuning, and can run in real-time with RTF ~0.13 on an RTX 4090 via Nano-vLLM. A Python API, CLI, and web demo are provided for quick integration.

Smiley Face1 Award

Sort:

serdarbuyukdereli's user avatar
Serdarcan Buyukdereli
@serdarbuyukdereli
Joined Oct 20. 2023
20.3K

Senior Devops and Cloud Engineer

Would you recommend this post?

Copy link
WhatsApp
Facebook
X
New Squad
  • © 2026 Daily Dev Ltd.
  • Guidelines
  • Explore
  • Tags
  • Sources
  • Squads
  • Leaderboard