
Gemini Embedding 2 Launches with Multimodal Capabilities, Enhancing AI Retrieval Accuracy by 40%

Google’s Gemini Embedding 2 is now generally available, accepting multimodal inputs and improving retrieval for early adopters such as Harvey, Supermemory, and Nuuly, with Supermemory reporting a 40% gain in Recall@1.

Staff

Published 1 May, 2026

Google has announced the general availability (GA) of Gemini Embedding 2 through the Gemini API and the Gemini Enterprise Agent Platform. Launched last week, the model lets developers map diverse inputs (text, images, video, audio, and documents) into a single embedding space and supports over 100 languages. This opens up applications ranging from multimodal retrieval-augmented generation (RAG) to visual search.

Gemini Embedding 2 accepts a mix of inputs in a single call: up to 8,192 text tokens, six images, 120 seconds of video, 180 seconds of audio, and six PDF pages. Because every modality is mapped into one semantic space, developers can compare and retrieve proprietary data across formats rather than handling each format separately.
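
As a rough illustration of those per-call limits, a client could check a request before sending it. The sketch below is not part of any Google SDK: `EmbedRequest` and `limit_violations` are hypothetical names, and the numbers are simply the ones quoted in this article, so they should be verified against the official documentation before being relied on.

```python
from dataclasses import dataclass

# Per-request limits as quoted in the article (not taken from the API reference).
LIMITS = {
    "text_tokens": 8192,
    "images": 6,
    "video_seconds": 120,
    "audio_seconds": 180,
    "pdf_pages": 6,
}


@dataclass
class EmbedRequest:
    """Hypothetical summary of one embedding request; not an SDK type."""
    text_tokens: int = 0
    images: int = 0
    video_seconds: float = 0.0
    audio_seconds: float = 0.0
    pdf_pages: int = 0


def limit_violations(req: EmbedRequest) -> list[str]:
    """Return one message for every field that exceeds its quoted limit."""
    problems = []
    for field, limit in LIMITS.items():
        value = getattr(req, field)
        if value > limit:
            problems.append(f"{field}: {value} exceeds limit of {limit}")
    return problems
```

An empty list means the request fits within the quoted limits; anything else names the fields to trim or split across calls.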

The model also accepts interleaved inputs, so text and images can be combined in a single request. This helps it represent complex, real-world data. For developers who need separate embeddings for distinct inputs, the Batch API will soon offer that capability on the Agent Platform.

Early adopters in several sectors are already reporting results. The legal research platform Harvey reported a 3% increase in Recall@20 on legal benchmarks after adopting the model, which translates into more accurate citations and answers for law firms. Similarly, Supermemory has built a “vector database for memory” that enables conceptual search across disjointed memos. Since integrating the model, it has seen a 40% increase in Recall@1.
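
For readers unfamiliar with the metric, Recall@k is commonly defined as the share of relevant items that appear among the top k results for a query. The article does not say exactly how Harvey or Supermemory compute their figures, so the following is only a minimal sketch of the standard definition, with hypothetical function names.

```python
def recall_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the relevant items that appear in the top-k results of one query."""
    relevant = set(relevant_ids)
    if not relevant:
        raise ValueError("relevant_ids must not be empty")
    top_k = set(ranked_ids[:k])
    return len(top_k & relevant) / len(relevant)


def mean_recall_at_k(runs, k):
    """Average Recall@k over a list of (ranked_ids, relevant_ids) pairs."""
    return sum(recall_at_k(ranked, relevant, k) for ranked, relevant in runs) / len(runs)
```

With a single relevant item per query, Recall@1 reduces to how often the correct item is ranked first.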

The model is also suited to multimodal search. Nuuly, a clothing rental company owned by URBN, uses Gemini Embedding 2 in a visual search tool that matches images taken on the warehouse floor against its catalog. The company reports that Match@20 accuracy rose from 60% to nearly 87% and that its overall product identification rate rose from 74% to over 90%.

Beyond visual search, the model can strengthen retrieval pipelines. Embeddings can be used to rerank initial search results so that users see the most relevant answers first. For instance, developers can compute a similarity score, such as cosine similarity or dot product, between each embedded search result and the embedded user query, then order the results by that score to select the best match.
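
The reranking step described above can be sketched in a few lines of plain Python. This assumes the query and candidate embeddings have already been obtained from the model; `cosine_similarity` and `rerank` are illustrative names, not functions from the Gemini API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rerank(query_vec, candidates, top_n=None):
    """Sort (doc_id, vector) pairs by similarity to the query, best first.

    Returns a list of (doc_id, score) pairs, truncated to top_n if given.
    """
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in candidates]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored if top_n is None else scored[:top_n]
```

For unit-length vectors the dot product gives the same ordering as cosine similarity, so the normalization can be skipped when embeddings are stored already normalized.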

Applications extend to clustering, classification, and anomaly detection as well. Grouping items by embedding similarity lets users spot hidden trends or outliers quickly, which is useful for tasks such as sentiment analysis. The same task prefix can be used for both the query and the document, which simplifies the embedding process.
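
One simple way to surface outliers from a set of embeddings is to compare each vector with the centroid of the group. The sketch below is only one of many possible approaches and is not described in Google's announcement; the function names and the 0.5 similarity threshold are arbitrary choices made for illustration.

```python
import math


def _normalize(vec):
    """Scale a vector to unit length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def find_outliers(items, min_similarity=0.5):
    """Return ids whose cosine similarity to the group centroid is below min_similarity.

    items maps an id to its embedding vector.
    """
    unit = {item_id: _normalize(vec) for item_id, vec in items.items()}
    center = _normalize(centroid(list(unit.values())))
    return [
        item_id
        for item_id, vec in unit.items()
        if sum(a * b for a, b in zip(vec, center)) < min_similarity
    ]
```

In practice the threshold would be tuned on real data, and a dedicated clustering algorithm would usually replace the single centroid when the data contains several distinct groups.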

Storage is another practical consideration. The embeddings can be stored in vector databases such as Agent Platform Vector Search, Pinecone, Weaviate, Qdrant, or ChromaDB. Gemini Embedding 2 is trained with Matryoshka Representation Learning (MRL), which allows the vectors to be shortened to save storage while largely preserving accuracy. The default 3072-dimensional vectors can be truncated to 1536 or 768 dimensions.
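
With MRL-style embeddings, truncation amounts to keeping the leading components of the vector. The sketch below also rescales the shortened vector to unit length, which is an assumption on our part (commonly done so that dot product and cosine similarity agree); `truncate_embedding` is a hypothetical helper, and the article does not say whether the API can return reduced dimensions directly.

```python
import math


def truncate_embedding(vec, dim):
    """Keep the first `dim` components of an embedding and rescale to unit length."""
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = list(vec[:dim])
    norm = math.sqrt(sum(x * x for x in head))
    # An all-zero prefix cannot be normalized, so it is returned as-is.
    return [x / norm for x in head] if norm else head
```

Going from 3072 to 768 dimensions cuts storage per vector to a quarter, and queries must be truncated to the same length as the stored vectors before comparison.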

With a single model covering text, images, video, audio, and documents, Gemini Embedding 2 could change how businesses and developers approach search and retrieval over complex datasets, although the gains reported so far come from a handful of early adopters.

Developers who want to try the model can access it through the Gemini API and the Agent Platform.

