Prolific Study Reveals Gemini 2.5 Pro Tops AI Chatbots, Outperforming ChatGPT in User Rankings

Prolific’s Humaine study ranks Google’s Gemini 2.5 Pro as the top AI chatbot, leaving ChatGPT-4.1 in eighth place despite its 800 million weekly users.

Since OpenAI launched ChatGPT, initially powered by GPT-3.5, in November 2022, generative AI has rapidly entered the mainstream and brought AI chatbots into the public consciousness. After surpassing 100 million monthly active users within months of launch, ChatGPT became synonymous with AI chat technology. However, a recent study by the British firm Prolific ranks ChatGPT only eighth among AI chatbots, behind competitors including models from Google, DeepSeek, Mistral AI, and xAI.

Study Context and Methodology

Prolific’s study, which introduces a new benchmark called Humaine, aims to evaluate AI chatbots based on metrics that matter most to users. Unlike previous evaluations that heavily relied on technical benchmarks, Humaine focuses on aspects such as conversational understanding, clarity of answers, adaptability in discussions, and overall trustworthiness. The study involved around 25,000 participants who compared chatbots in head-to-head matchups, assessing their performance across four main metrics:

  1. Core Task Performance & Reasoning
  2. Interaction Fluidity & Adaptiveness
  3. Communication Style & Presentation
  4. Trust, Ethics & Safety
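The article does not say how Humaine turns head-to-head votes into a leaderboard. As a rough illustration only, the sketch below assumes the simplest possible aggregation, ranking each model by the share of matchups it won. The function name, the (winner, loser) vote format, and the placeholder models A, B, and C are all hypothetical and are not taken from the study.

```python
from collections import defaultdict


def win_rate_leaderboard(matchups):
    """Rank models by the share of head-to-head matchups they won.

    `matchups` is a list of (winner, loser) pairs, one per vote.
    This is an assumed aggregation, not Prolific's published method.
    """
    wins = defaultdict(int)
    games = defaultdict(int)
    for winner, loser in matchups:
        wins[winner] += 1
        games[winner] += 1
        games[loser] += 1
    rates = {model: wins[model] / games[model] for model in games}
    # Sort by win rate, highest first; break ties by name for determinism.
    return sorted(rates.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical votes between placeholder models, not data from the study.
votes = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "B"), ("A", "B")]
leaderboard = win_rate_leaderboard(votes)
for model, rate in leaderboard:
    print(f"{model}: {rate:.2f}")
```

A raw win rate ignores the strength of the opponents each model faced, which is why published pairwise leaderboards often fit a statistical model instead. Scoring each of the four metrics separately would mean running the same aggregation once per metric.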

Results of the Humaine Study

The results place Google's Gemini 2.5 Pro at the top, followed by DeepSeek v3 and Mistral AI's Magistral Medium. ChatGPT-4.1, despite its popularity, managed only an eighth-place finish. The complete rankings are:

  1. Gemini 2.5 Pro (Google)
  2. DeepSeek v3 (DeepSeek)
  3. Magistral Medium (Mistral AI)
  4. Grok 4 (xAI)
  5. Grok 3 (xAI)
  6. Gemini 2.5 Flash (Google)
  7. DeepSeek R1 (DeepSeek)
  8. ChatGPT-4.1 (OpenAI)
  9. Gemma (Google)
  10. Gemini 2.0 Flash (Google)

Understanding ChatGPT’s Lower Ranking

This outcome raises the question of why ChatGPT, which has around 800 million weekly active users and accounts for nearly 48% of AI chatbot usage, did not rank higher. The discrepancy is attributed to the study's methodology, which measures how users experience a chatbot rather than its scores on technical benchmarks. Prolific aims to surface user preferences that prior studies had overlooked.

Implications for AI Chatbots

The results of the Humaine study indicate a shift in user expectations. Participants valued chatbots that provide human-like conversational experiences, showing adaptability in discussions and ethical responses. Gemini 2.5 Pro not only topped the leaderboard but also demonstrated superior adaptability and communication style, highlighting the need for chatbots to engage users meaningfully.

This study is crucial as it explores the human-facing dimensions of AI, prompting developers to rethink their designs based on user feedback. While ChatGPT remains a formidable player in the market, the findings suggest that competition is intensifying and that user experience should be at the forefront of AI development.

In conclusion, while OpenAI continues to lead in usage and brand recognition, the rankings from the Humaine study reveal that the landscape of AI chatbots is evolving rapidly. Companies aiming to innovate must focus on developing chatbots that resonate with users, fostering trust and engagement.

Written by AiPressa Staff

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.