Today we're releasing our most capable and conversational voice model that can speak in 30+ languages using any voice or accent, with industry leading speed

Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

Play 3.0 mini is a new lightweight, reliable, and cost-efficient multilingual text-to-speech model that can converse in over 30 languages. It achieves a mean latency of 189 milliseconds, making it the fastest model yet, and supports text-in and audio-out streaming via HTTP REST API, websockets API, or SDKs. The model also exhibits significant improvements in audio quality, reliability, and naturalness of speech. Additionally, it features state-of-the-art voice cloning capabilities and is offered at reduced pricing for different business tiers.

Introducing Play 3.0 mini – A lightweight, reliable and cost-efficient Multilingual Text-to-Speech model

Play 3.0 mini is our fastest, most conversational speech model yet

Play 3.0 mini supports 30+ languages across any voice

Play 3.0 mini reads alphanumeric sequences more naturally

Play 3.0 mini achieves the best voice similarity for voice cloning