Gemini 2.5 Pro significantly enhances audio transcription capabilities by generating up to 64,000 tokens, facilitating transcriptions of up to 2 hours of audio. This model efficiently handles speaker diorization, making it ideal for summarizing podcasts and conducting automated questioning over audio. Users can upload large audio files and opt for specific timestamps for transcription, making it highly versatile for lengthy audio content.
•16m watch time
Sort: