ElevenLabs has released Eleven v3 (alpha), a new text-to-speech model that offers enhanced expressiveness and emotional control. The model supports 70+ languages and introduces inline audio tags for controlling emotion, delivery, and direction. Key features include multi-speaker conversation generation with shared context, dynamic range control, and immersive soundscape creation. The alpha version represents a significant advancement in controllable speech synthesis technology.

1m read timeFrom elevenlabs.io
Post cover image

Sort: