IBM releases Granite 4.0 1B Speech, a compact 1-billion-parameter speech-language model for enterprise use on resource-constrained devices. It supports multilingual ASR and bidirectional speech translation across English, French, German, Spanish, Portuguese, and Japanese. Despite having half the parameters of its predecessor (granite-speech-3.3-2b), it achieves higher English transcription accuracy, faster inference via speculative decoding, and adds Japanese ASR support plus keyword list biasing for names and acronyms. The model ranked #1 on the OpenASR leaderboard and is released under Apache 2.0 with native support in transformers and vLLM.

2m read timeFrom huggingface.co
Post cover image

Sort: