Parler-TTS is a cutting-edge text-to-speech library featuring two models: Large v1 and Mini v1. Trained on 45,000 hours of audio, these models provide high-quality speech with controllable features such as gender, background noise, and pitch. Users can specify speaker characteristics and use punctuation to optimize audio output. Parler-TTS embraces open-source principles, making all its datasets, training code, and model weights publicly available to foster community innovation.
Sort: