Google Cloud Unveils Gemini 2.5 with Native Audio and Live Speech Translation for 70+ Languages

Google Cloud’s Gemini 2.5 introduces live speech translation for 70+ languages, and United Wholesale Mortgage credits the Gemini 2.5 Flash Native Audio model with helping generate more than 14,000 loans.

Staff

Published

14 December, 2025

Gemini 2.5, Google Cloud’s new AI model, is finding real-world use through its native audio capabilities. Early adopters across several industries report tangible benefits in areas such as mortgage processing and customer interaction. At Shopify, the model supports natural dialogue in the Sidekick feature, and feedback from the platform’s merchants has been positive. “Users often forget they’re talking to AI within a minute of using Sidekick, and in some cases have thanked the bot after a long chat,” said David Wurtz, VP of Product at Shopify.

In the financial sector, Jason Bressler, Chief Technology Officer at United Wholesale Mortgage (UWM), described the company’s integration of the Gemini 2.5 Flash Native Audio model. “By integrating the Gemini 2.5 Flash Native Audio model…we’ve significantly enhanced Mia’s capabilities since launching in May 2025,” he stated. According to Bressler, the combination has generated more than 14,000 loans for UWM’s broker partners, a figure that suggests the integration is paying off as the company works to streamline operations.

Newo.ai is also building on the model. “Working with the Gemini 2.5 Flash Native Audio model through Vertex AI allows Newo.ai AI Receptionists to achieve unmatched conversational intelligence,” said co-founder David Yang. The technology reportedly identifies speakers even in noisy environments and can switch languages mid-conversation while maintaining a natural tone.

One of the most notable features introduced with Gemini 2.5 is live speech translation, which supports two modes: continuous listening and real-time two-way conversation. In continuous listening, users hear the speech around them translated into their preferred language as they go about their surroundings. In two-way conversation, Gemini automatically switches between languages based on who is speaking, so that two people who do not share a language can talk to each other.

Gemini’s live speech translation offers several key capabilities. It covers more than 70 languages and 2,000 language pairs, which Google attributes to combining Gemini’s world knowledge with its audio features. It also applies style transfer, preserving the speaker’s intonation, pacing, and pitch so that translations sound natural.
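The relationship between the two coverage figures can be checked with simple arithmetic. The sketch below is illustrative only: the article does not say whether a “language pair” is counted once or once per translation direction, so both counts are shown, and the function names are hypothetical.

```python
from math import comb


def unordered_pairs(n: int) -> int:
    """Pairs where A<->B is counted once."""
    return comb(n, 2)


def directed_pairs(n: int) -> int:
    """Pairs where A->B and B->A are counted separately."""
    return n * (n - 1)


def min_languages(target_pairs: int, directed: bool = False) -> int:
    """Smallest number of languages that yields at least target_pairs."""
    count = directed_pairs if directed else unordered_pairs
    n = 2
    while count(n) < target_pairs:
        n += 1
    return n


if __name__ == "__main__":
    print(unordered_pairs(70))                  # 2415
    print(directed_pairs(70))                   # 4830
    print(min_languages(2000))                  # 64
    print(min_languages(2000, directed=True))   # 46
```

Under either counting convention, 70 languages allow more than 2,000 pairs, so the two figures in the announcement are consistent with each other.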

The system can also understand multiple languages at once, letting users follow a multilingual conversation in real time without adjusting settings. It detects the spoken language automatically and begins translating without the user having to identify it, which is especially convenient in settings where several languages are in use.

Noise robustness is another key attribute. The model filters out ambient sound so that users can hold clear conversations even in bustling outdoor environments, which matters for businesses and individuals who work in settings where background noise is common.

These features reflect a broader trend in AI development, as companies look to apply machine learning and natural language processing to everyday work. Google Cloud is positioning itself at the forefront of that shift with tools intended to improve both business efficiency and human interaction. As Gemini evolves, its applications are expected to expand across sectors and further reduce language barriers.
