Canadian AI company Cohere has introduced “Transcribe,” an open-source model designed for automatic speech recognition, claiming the top position on the Hugging Face Open ASR Leaderboard. With an impressive average word error rate of just 5.42 percent, Transcribe surpasses notable competitors such as OpenAI’s Whisper Large v3, ElevenLabs Scribe v2, and Qwen3-ASR-1.7B. Cohere also highlights that Transcribe offers the highest throughput among models of similar size, further solidifying its competitive edge in the rapidly evolving AI landscape.
The performance metrics for Cohere’s Transcribe are compelling. A comparative analysis showcases its capabilities against seven other speech recognition models with over a billion parameters. Transcribe leads with a real-time factor (RTFx) of 525 and a word error rate (WER) hovering around 5.4, making it both the fastest and most accurate model available. Following closely is NVIDIA’s Canary Qwen 2.5B, which registers an RTFx of 418. Other models, including OpenAI’s Whisper Large v3 and Voxtral Realtime, lag significantly in terms of both speed and accuracy.
The model, which comprises 2 billion parameters, supports a total of 14 languages, including English, German, French, and Japanese, broadening its applicability for diverse user bases. Transcribe is available for public download under the Apache 2.0 license on Hugging Face, allowing developers and researchers to utilize its capabilities. Additionally, it can be accessed through Cohere’s API and the Model Vault platform, making it easily integrated into various applications.
Cohere’s strategy includes plans to integrate Transcribe into its AI agent platform, North, indicating a forward-thinking approach to enhancing user interaction and experience in real-time communication. This move reflects a growing trend in the AI sector, where companies are continually seeking to refine their technologies by leveraging advancements in automatic speech recognition.
The introduction of Transcribe comes at a time when the demand for efficient and accurate speech recognition technologies is surging across multiple sectors, including customer support, content creation, and accessibility services. As organizations increasingly rely on AI-driven solutions to streamline operations and improve customer engagement, Transcribe’s competitive benchmarks may offer significant advantages in these areas.
In the broader context, the development of high-performance models like Transcribe represents an important step in the ongoing evolution of AI technologies. The current landscape is characterized by rapid advancements, with various companies striving to push the boundaries of what is possible in speech recognition and other AI applications. As Cohere positions itself at the forefront of this shift, the impact of Transcribe on both the market and user experience will be closely monitored by industry analysts and competitors alike.
See also
Germany”s National Team Prepares for World Cup Qualifiers with Disco Atmosphere
95% of AI Projects Fail in Companies According to MIT
AI in Food & Beverages Market to Surge from $11.08B to $263.80B by 2032
Satya Nadella Supports OpenAI’s $100B Revenue Goal, Highlights AI Funding Needs
Wall Street Recovers from Early Loss as Nvidia Surges 1.8% Amid Market Volatility



















































