Callstack introduces on-device speech transcription for React Native apps using Apple's new SpeechAnalyzer and SpeechTranscriber APIs, introduced in iOS 26 and integrated with the AI SDK's experimental_transcribe function. The implementation runs entirely on-device with no cloud dependency, supports multiple languages via ISO 639-1 codes, and manages language model assets automatically without increasing app bundle size. Performance benchmarks show it transcribes audio over 2.2x faster than Whisper Large V3. The API accepts audio as an ArrayBuffer or a base64 string and returns the transcribed text, word-level segments with timestamps, and the total duration.
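
As a rough illustration of the input and output shapes described above, the following TypeScript sketch formats word-level segments and the total duration into printable lines. The type names, the field names (`startSecond`, `endSecond`, `durationInSeconds`), and the `stubTranscribe` helper are assumptions made for this example and are not taken from the article. In a real app the injected function would be a thin wrapper around the AI SDK's experimental_transcribe configured with Callstack's Apple provider, whose exact setup is not shown in this summary.

```typescript
// Audio input as described in the summary: raw bytes or a base64 string.
type AudioInput = ArrayBuffer | string;

// Hypothetical result shape (assumed field names): text, timed segments, duration.
interface Segment {
  text: string;
  startSecond: number;
  endSecond: number;
}

interface TranscriptionResult {
  text: string;
  segments: Segment[];
  durationInSeconds: number;
}

// Any function with this shape can be injected, so the sketch runs without the native module.
type TranscribeFn = (options: { audio: AudioInput }) => Promise<TranscriptionResult>;

// Turn each timed segment into a line such as "[0.00s - 0.50s] Hello".
function formatSegments(result: TranscriptionResult): string[] {
  return result.segments.map(
    (s) => `[${s.startSecond.toFixed(2)}s - ${s.endSecond.toFixed(2)}s] ${s.text}`
  );
}

// Transcribe the audio, then append a summary line with the total duration.
async function transcribeAndFormat(
  transcribe: TranscribeFn,
  audio: AudioInput
): Promise<string[]> {
  const result = await transcribe({ audio });
  return [
    ...formatSegments(result),
    `Total duration: ${result.durationInSeconds.toFixed(2)}s`,
  ];
}

// Stand-in transcriber with made-up data, used only to exercise the formatting.
const stubTranscribe: TranscribeFn = async () => ({
  text: "Hello world",
  segments: [
    { text: "Hello", startSecond: 0, endSecond: 0.5 },
    { text: "world", startSecond: 0.5, endSecond: 1.25 },
  ],
  durationInSeconds: 1.25,
});

transcribeAndFormat(stubTranscribe, new ArrayBuffer(8)).then((lines) =>
  console.log(lines.join("\n"))
);
```

Passing the transcriber in as a parameter keeps the formatting logic testable off-device. The stub can later be swapped for the real on-device call without touching the rest of the code.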

4m read time. From callstack.com
Table of contents
Under the Hood: Apple's SpeechAnalyzer Framework
How to Use It
Unmatched Performance
Conclusion
