NVIDIA NeMo recently released Parakeet-TDT, a model that offers better accuracy and greater speed in speech recognition. Parakeet-TDT uses a Token-and-Duration Transducer architecture to optimize recognition process. To use Parakeet-TDT for speech recognition, NVIDIA NeMo needs to be installed.

4m read time From developer.nvidia.com
Post cover image
Table of contents
Parakeet-TDT model overviewUnderstanding Token-and-Duration Transducer modelsHow to use Parakeet-TDTConclusion

Sort: