Cohere has unveiled a new language model family, Tiny Aya, designed to run on standard hardware while supporting more than 70 languages. The announcement coincided with the India AI Summit, underscoring the growing emphasis on multilingual capability in artificial intelligence. Aimed at developers and researchers, Tiny Aya promises fast, private, and culturally sensitive AI that does not require a constant internet connection.
Tiny Aya is grounded in the belief that global AI should not be dominated by English or depend solely on cloud resources. The base model has 3.35 billion parameters, small enough for everyday devices yet capable of handling a diverse array of languages, including Bengali, Hindi, Punjabi, Urdu, Gujarati, Tamil, Telugu, and Marathi. A general-purpose variant, TinyAya-Global, focuses on following user instructions across multiple languages, while regional editions, TinyAya-Earth for African languages, TinyAya-Fire for South Asian languages, and TinyAya-Water for Asia Pacific, West Asia, and Europe, aim to sharpen fluency and tone for their regions.
The regional fine-tuning approach aims to improve comprehension and idiomatic accuracy for underrepresented languages. Research efforts such as the BigScience project and Meta's NLLB have shown that tailored data significantly improves translation and reasoning on benchmarks like FLORES-200. Tiny Aya follows this trend while keeping a robust multilingual backbone for cross-lingual transfer.
Developed using a cluster of 64 Nvidia H100 GPUs, Tiny Aya is nonetheless built for efficient on-device inference. That focus opens up offline applications, such as translation services in areas with limited connectivity or local assistant tools that keep data private. A health worker in rural Maharashtra, for instance, could translate consent forms without sending patient data to the cloud, while a customer support app in Nairobi could handle code-switched queries in Swahili and English entirely on local devices.
This development aligns with the industry’s shift toward “right-sized” models, as organizations increasingly seek smaller, specialized models that fulfill latency, cost, and privacy needs. According to the GSMA’s latest Mobile Economy research, the global “usage gap” in mobile internet access underscores the importance of offline-capable AI tools that broaden access where connectivity is limited.
The regional variants of Tiny Aya strive to maintain comprehensive coverage across more than 70 languages while refining local nuances such as dialects and transliteration. This attention to detail is crucial in multilingual markets, where accuracy alone is insufficient. Developers will be keenly observing Tiny Aya’s performance on evaluations like FLORES-200 for translation and TyDi QA for cross-lingual question answering.
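For a sense of what that kind of translation evaluation looks like in practice, here is a minimal sketch that scores candidate translations against references with chrF, a character-level metric commonly reported alongside FLORES-200 results. The sentences below are placeholders, not actual FLORES-200 data or Tiny Aya outputs.

```python
# Minimal sketch: scoring translations with chrF via sacrebleu, the kind of metric
# typically reported on FLORES-200. Hypotheses and references here are placeholders.
from sacrebleu.metrics import CHRF

hypotheses = [
    "स्वास्थ्य कार्यकर्ता ने सहमति पत्र का अनुवाद किया।",   # candidate model outputs
    "कल बारिश होने की संभावना है।",
]
references = [[
    "स्वास्थ्य कार्यकर्ता ने सहमति प्रपत्र का अनुवाद किया।",  # one reference per hypothesis
    "कल वर्षा होने की संभावना है।",
]]

chrf = CHRF()
print(chrf.corpus_score(hypotheses, references))  # prints a corpus-level chrF score
```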
Cohere emphasizes a low compute footprint across its software stack, making regional fine-tuning feasible without the budgets of larger organizations. That accessibility opens doors for civic tech groups, media companies, and academic institutions to localize AI systems for their own communities.
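As a rough illustration of how lightweight such regional adaptation can be, the sketch below attaches LoRA adapters to a small base model with the peft library. The model ID and target modules are illustrative assumptions, and the announcement does not describe Cohere's actual fine-tuning recipe.

```python
# Minimal sketch of parameter-efficient regional fine-tuning with LoRA (peft).
# "cohere/tiny-aya-global" is a placeholder repository ID, not a confirmed name,
# and the target modules are a guess that should be adjusted to the architecture.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("cohere/tiny-aya-global")  # hypothetical ID

lora_config = LoraConfig(
    r=8,                                  # low-rank adapter dimension
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],  # attention projections, typical for causal LMs
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # adapters usually train a small fraction of the base weights
# The adapted model can then be trained with a standard Trainer loop on regional data.
```

Because only the adapter weights are updated, this kind of setup can run on a single modest GPU, which is what makes community-scale localization plausible.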
The Tiny Aya models are publicly available through Hugging Face and the Cohere Platform, and developers can download and run local versions via platforms such as Kaggle and Ollama. Cohere is also releasing training and evaluation datasets on Hugging Face, along with a forthcoming technical report detailing its methodology, an important step for reproducibility in AI development.
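To show the kind of local workflow this enables, here is a minimal sketch that loads a small multilingual checkpoint with the Hugging Face transformers library and runs generation entirely on the local machine. The repository ID is a placeholder, since the official Tiny Aya model names are not confirmed here, and the prompt is just an example.

```python
# Minimal sketch: run a small multilingual model locally with Hugging Face transformers.
# NOTE: "cohere/tiny-aya-global" is a placeholder repository ID, not a confirmed name.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "cohere/tiny-aya-global"  # hypothetical; replace with the actual Tiny Aya repo
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

prompt = "कृपया इस सहमति पत्र का सारांश हिंदी में दें:"  # "Please summarise this consent form in Hindi:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

After the initial download, nothing needs to leave the device, which is the pattern behind the offline and data-residency scenarios described above.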
For enterprises facing stringent data residency requirements, on-device and on-premise deployment offers a viable solution to reduce data egress and audit risks. Industries such as finance, public services, and healthcare are particularly invested in this capability. Smaller multilingual models that excel in task-following could effectively bridge gaps in performance without incurring substantial inference costs.
Growing momentum behind Tiny Aya could speed its adoption across sectors. Cohere's CEO has signalled plans to take the company public, and reports put annual recurring revenue at $240 million with roughly 50% quarter-over-quarter growth, a sign of sustained demand for developer-friendly AI.
As attention shifts to performance benchmarks and safety assessments, pressing questions remain. How will Tiny Aya models perform on low-resource languages beyond scripted prompts? Can they effectively manage dialectal code-switching in real-time conversations? The anticipated technical report, dataset releases, and early adopter case studies will provide crucial insights into these issues.
Ultimately, Tiny Aya holds the promise of putting robust, multilingual AI directly where users need it, in their preferred languages and on devices they already own. If it delivers, it could mark a meaningful shift in multilingual AI development, away from cloud dependency and toward the edge, without compromising accessibility or linguistic nuance.