Whisper Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition. Whisper's performance varies widely depending on the import whisper model = whisper. The codebase is expected to be compatible with

•4m read time•From github.com
Post cover image
Table of contents
ApproachSetupAvailable models and languagesCommand-line usagePython usageMore examplesLicense
2 Comments

Sort: