Dickson A.

Community Picks is a section on daily.dev where our community members share the most interesting and valuable content they've discovered online. From insightful articles to handy tools, every post is a gem curated by our dedicated community. To contribute to Community Picks, you need at least 250 reputation points, ensuring that only active and trusted members can share their finds.

Community Picks

MegaTTS3 by ByteDance is a lightweight and efficient text-to-speech (TTS) model with only 0.45B parameters. It supports high-quality voice cloning, bilingual (Chinese and English) speech synthesis, and accent intensity control. Users can download pre-trained models, run inference with command-line tools, and access a web UI. The project is intended for academic use, applies stringent security measures, and is licensed under Apache-2.0.

bytedance/MegaTTS3

<p>Very interesting! I’m looking at using TTS on the edge for some embedded projects with an ESP32… This might be usable for that!</p>


<p>I wouldn’t touch this with a 10-mile pole… ByteDance is the owner of TikTok and a Chinese Government op meant to steal information.</p>