AI NewsCohere launches an open-source voice model specifically for transcription
Cohere launches an open-source voice model specifically for transcription
9:25 PM IST · March 26, 2026

Enterprise AI company Cohere on Thursday launched its first voice model: Transcribe is an open-source automatic speech recognition model that can be used for tasks like note-taking and speech analysis. Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It currently supports 14 languages: English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Chinese, Japanese, Korean, Vietnamese and Arabic. Cohere says Transcribe beats models such as Zoom Scribe v1, IBM Granite 4.0 1B, ElevenLabs Scribe v2, and Qwen3-ASR-1.7B Speech onthe Hugging Face Open ASR leaderboard, achieving an average word error rate (WER) of 5.42, lower than any other model on the benchmark. The company claims Transcribe had an average win rate of 61% over other models when human evaluators assessed its transcriptions for accuracy, coherence and usability. However, the model fell behind its rivals when it had to transcribe Portuguese, German and Spanish. Cohere says Transcribe can process 525 minutes of audio in a minute, which is high for its class of model. The company is planning to integrate Transcribe into its enterprise agent orchestration platform,North, and is making the model available through itsAPIfor free. The model will also be available onModel Vault, Cohere’s managed inference platform. Speech recognition models are growing increasingly popular as demand grows for note-taking and dictation apps like Granola andWispr Flow. Earlier this year, Cohere reportedlytoldinvestors that it was generating annual recurring revenue of $240 million in 2025, and its CEO, Aidan Gomez, was cited as saying that the startupmay go public “soon”.
read more



