Connect with us

Hi, what are you looking for?

AI Generative

Gemini Embedding 2 Launches with Multimodal Capabilities, Enhancing AI Retrieval Accuracy by 40%

Google’s Gemini Embedding 2 enhances AI retrieval accuracy by 40%, enabling multimodal inputs and boosting search precision for platforms like Harvey and Nuuly.

Google announced the General Availability (GA) of its Gemini Embedding 2 through the Gemini API and Gemini Enterprise Agent Platform. Launched last week, this sophisticated model allows developers to map diverse inputs—text, images, video, audio, and documents—into a single embedding space, supporting over 100 languages. This capability opens new avenues for applications ranging from multimodal retrieval-augmented generation (RAG) to visual search.

Gemini Embedding 2 is designed to handle an extensive variety of inputs with a single call, accommodating up to 8,192 text tokens, six images, 120 seconds of video, 180 seconds of audio, and six pages of PDFs. By integrating different modalities into one semantic space, developers can create nuanced experiences that interpret proprietary data in more meaningful ways.

One of the model’s standout features is its ability to process interleaved inputs, allowing for combinations of text and images in a single request. This enhances the model’s understanding of complex, real-world data. For developers needing separate embeddings for distinct inputs, the Batch API will soon offer that capability on the Agent Platform.

Applications of Gemini Embedding 2 are already being realized in various sectors. For instance, the legal research platform Harvey reported a 3% increase in Recall@20 precision on legal benchmarks after implementing the model, thereby providing more accurate citations and answers for law firms. Similarly, Supermemory has developed a “vector database for memory” that enables conceptual searching across disjointed memos. Since integrating the model, it has achieved a 40% increase in search Recall@1 accuracy.

The model also serves as a powerful tool for multimodal search. Nuuly, a clothing rental company owned by URBN, has utilized Gemini Embedding 2 for a visual search tool that matches images taken on the warehouse floor against their catalog. This implementation has dramatically improved their Match@20 accuracy from 60% to nearly 87% and boosted their overall product identification rate from 74% to over 90%.

In addition to visual search, the model is adept at enhancing retrieval pipelines. Embeddings can be recalibrated to rerank initial search results, ensuring that users receive the most relevant answers. For instance, developers can calculate distance metrics, such as cosine similarity or dot product scores, between embedded search results and user queries. This approach allows for a more refined selection of the best match based on contextual relevance.

Applications extend to clustering, classification, and anomaly detection as well. By creating clusters based on similarities, users can quickly identify hidden trends or outliers, making this feature ideal for sentiment analysis. The same task prefix can be used for both the query and document, which simplifies the embedding process.

Efficient storage and usage of these embeddings is another key aspect. They can be stored in vector databases such as Agent Platform Vector Search, Pinecone, Weaviate, Qdrant, or ChromaDB. The embeddings generated by Gemini Embedding 2 utilize Matryoshka Representation Learning (MRL), allowing for dimensional reduction to enhance storage efficiency without compromising accuracy. The default 3072-dimensional vectors can be truncated to dimensions of 1536 or 768 for optimal performance.

Gemini Embedding 2 marks a significant advancement in the realm of data interpretation and machine learning, promising to improve how businesses and developers approach complex datasets. As more organizations explore its capabilities, the model is poised to set new benchmarks in the fields of AI-driven search and data retrieval.

For developers eager to implement this groundbreaking model, the Gemini API and Agent Platform provide the necessary tools for diving into multimodal embeddings that enhance understanding across various industries.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Finance

AI technology is fueling a 38% surge in retirees' 401(k) portfolios while causing 16,000 job losses monthly among younger workers, highlighting stark generational disparities.

Top Stories

Google expands AI Max ads for travel brands, enhancing ad targeting with AI Overview searches and introducing personalized hotel ads and booking links.

AI Finance

Amazon and Google report record cloud growth, with AWS revenue at $37.6B and Google Cloud up 63% to $20B, while Meta and Microsoft face...

Top Stories

Meta's ad revenue surged 33% to $55B, surpassing Google's 15% growth to $77B, amid escalating AI investments that could reshape digital advertising.

AI Research

Google's TurboQuant enables AI models to use up to 6x less memory during inference, promising significant efficiency gains without sacrificing performance.

AI Generative

Google TV enhances user experience with AI-driven image and video tools, introducing the Nano Banana and Veo features on Gemini-enabled TCL TVs in the...

AI Technology

Google researchers harness AI to enhance CO2 monitoring, achieving 10-minute updates on column-averaged CO2 levels using GOES East satellite data.

Top Stories

Google TV enhances user engagement with AI-driven features, including photo search and dynamic slideshows, while introducing YouTube Shorts for personalized content.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.