Amazon Science recently published their work on Big Adaptive Streamable TTS with Emergent abilities (BASE TTS). BASE TTS supports voice-cloning and outperforms baseline TTS models when evaluated by hu

InfoQ

Amazon announces BASE TTS, a speech model with one billion parameters that supports voice-cloning and outperforms baseline TTS models. The model is trained on unlabeled speech audio scraped from the web, and its quality improves with increased data and model size.

Amazon Announces One Billion Parameter Speech Model BASE TTS