Deepgram offers two Nova speech-to-text models optimized for different use cases. Nova-2 prioritizes speed and cost-effectiveness for English-heavy batch processing, while Nova-3 excels in real-time multilingual transcription with advanced features like keyterm prompting. The guide provides detailed benchmarks, cost comparisons, and implementation recommendations to help developers choose the right model based on accuracy requirements, latency constraints, language support needs, and budget considerations.
Table of contents
⏩ TL;DR – 30-Second Cheat SheetWhy Model Choice Matters in Choosing Speech-to-Text (STT) APIs for Your AppsWhich Model Is Right for You? Nova-2 or Nova-3?Nova-2: When to Use ItBenchmarks and Performance Tests That MatterProduction Checklist for Devs Shipping Deepgram’s Speech-to-Text (STT) ModelsNext Steps: When to Use Nova-2 vs Nova-3 (for devs)Sort: