Connect with us

Hi, what are you looking for?

Top Stories

Prolific Study Reveals Gemini 2.5 Pro Tops AI Chatbots, Outperforming ChatGPT in User Rankings

Prolific’s Humaine study ranks Google’s Gemini 2.5 Pro as the top AI chatbot, leaving ChatGPT-4.1 in eighth place despite its 800 million weekly users.

Since the launch of ChatGPT-3.5 by OpenAI in November 2022, generative AI has rapidly entered the mainstream, propelling the use of AI chatbots into the public consciousness. With a remarkable milestone of surpassing 100 million monthly active users within just months, ChatGPT became synonymous with AI chat technology. However, a recent study conducted by the British firm Prolific has stirred the waters by ranking ChatGPT as only the eighth best AI chatbot, trailing behind various competitors including Google’s Gemini, DeepSeek, and Mistral.

Study Context and Methodology

Prolific’s study, which introduces a new benchmark called Humaine, aims to evaluate AI chatbots based on metrics that matter most to users. Unlike previous evaluations that heavily relied on technical benchmarks, Humaine focuses on aspects such as conversational understanding, clarity of answers, adaptability in discussions, and overall trustworthiness. The study involved around 25,000 participants who compared chatbots in head-to-head matchups, assessing their performance across four main metrics:

  1. Core Task Performance & Reasoning
  2. Interaction Fluidity & Adaptiveness
  3. Communication Style & Presentation
  4. Trust, Ethics & Safety

Results of the Humaine Study

The results have placed Gemini 2.5 Pro from Google at the top, followed closely by DeepSeek v3 and Mistral Medium. The leaderboard reveals that ChatGPT-4.1, despite its popularity, only managed an eighth-place finish. The complete rankings are:

  1. Gemini 2.5 Pro (Google)
  2. DeepSeek v3 (DeepSeek)
  3. Magistral Medium (Mistral AI)
  4. Grok 4 (xAI)
  5. Grok 3 (xAI)
  6. Gemini 2.5 Flash (Google)
  7. DeepSeek R1 (DeepSeek)
  8. ChatGPT-4.1 (OpenAI)
  9. Gemma (Google)
  10. Gemini 2.0 Flash (Google)

Understanding ChatGPT’s Lower Ranking

This outcome raises the question of why ChatGPT, which boasts around 800 million active users weekly and accounts for nearly 48% of AI chatbot usage, did not perform better. The discrepancy is attributed to the fact that the study’s methodology focused on user experience rather than just performance metrics. Prolific aims to provide insights into user preferences that had been overlooked in prior studies.

Implications for AI Chatbots

The results of the Humaine study indicate a shift in user expectations. Participants valued chatbots that provide human-like conversational experiences, showing adaptability in discussions and ethical responses. Gemini 2.5 Pro not only topped the leaderboard but also demonstrated superior adaptability and communication style, highlighting the need for chatbots to engage users meaningfully.

This study is crucial as it explores the human-facing dimensions of AI, prompting developers to rethink their designs based on user feedback. While ChatGPT remains a formidable player in the market, the findings suggest that competition is intensifying and that user experience should be at the forefront of AI development.

In conclusion, while OpenAI continues to lead in usage and brand recognition, the rankings from the Humaine study reveal that the landscape of AI chatbots is evolving rapidly. Companies aiming to innovate must focus on developing chatbots that resonate with users, fostering trust and engagement.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Character.AI and Google settle lawsuits over teen safety, addressing claims of negligence in AI interactions linked to youth exploitation, with a $2.7B partnership under...

Top Stories

DeepSeek's V4 model, launching February 17, aims to surpass Claude and GPT in coding performance, leveraging a $6 million development cost and innovative mHC...

Top Stories

Nvidia, Broadcom, and Amazon are set to lead the AI market's explosive growth, with Nvidia's EPS projected to soar 45% and Broadcom's AI revenue...

AI Business

As enterprises double down on AI investments, OpenAI faces intensified competition from Google's Gemini and Microsoft's Copilot, threatening its market dominance.

Top Stories

Anthropic seeks $10 billion in funding to boost its valuation to $350 billion amid rising concerns of an AI bubble, as competition with OpenAI...

Top Stories

China's AI-driven labor market saw recruitment for high-exposure roles plummet by 30%, while Singapore pivoted to resilience with a 200% rise in demand for...

Top Stories

Meta acquires AI startup Manus for up to $3B to enhance its platforms, while OpenAI secures a $300B cloud deal with Oracle, reshaping AI...

Top Stories

DeepSeek expands its R1 paper from 22 to 86 pages, unveiling detailed training insights and benchmarks ahead of potential V4 release this Lunar New...

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.