Prolific Study Ranks ChatGPT 8th, Behind Gemini and DeepSeek Models

Prolific’s new Humaine benchmark ranks OpenAI’s ChatGPT eighth, trailing behind top competitors like Google’s Gemini and DeepSeek.

Staff

Published

23 November, 2025

Since its launch in late 2022, OpenAI’s ChatGPT has set the standard for generative AI chatbots and captured a significant share of the market. However, a recent study by British firm Prolific ranks ChatGPT eighth among leading AI models, behind competitors such as Gemini, Grok, Claude, DeepSeek, and Mistral.

Prolific developed a new benchmark, “Humaine,” to evaluate AI performance based on human interaction rather than purely technical metrics. The company criticizes current evaluation methods, asserting that they often focus on data that is more relevant to researchers than to everyday users. “Current evaluation is heavily skewed towards metrics that are meaningful to researchers but opaque to everyday users,” the company said in a blog post, highlighting a disconnect between what models are optimized for and what users experience. A related concern applies to human-preference leaderboards, which can suffer from sample bias and often favor tech-savvy audiences.

ChatGPT’s Ranking and Market Context

According to the Humaine study, the top ten AI models are as follows:

Gemini 2.5 Pro (Google)
DeepSeek v3 (DeepSeek)
Magistral Medium (Mistral)
Gemini 2.5 Flash (Google)
DeepSeek R1 (DeepSeek)
Gemini 2.0 Flash (Google)
ChatGPT

This ranking is particularly surprising for OpenAI, given that its model is now positioned behind not only Google’s Gemini models but also offerings from DeepSeek and Mistral. The study was published in September, prior to the release of Google’s Gemini 3 Pro model and xAI’s Grok 4.1 models, which may affect future rankings.

Despite ongoing advances from competitors, Gemini 2.5 Pro’s place at the top comes as little surprise: the model has consistently led various performance benchmarks since its introduction. However, OpenAI’s omission from the top five raises questions about its current standing in an increasingly competitive landscape.

Addressing Evaluation Gaps

The Prolific study highlights the need for more rigorous and relevant methods of evaluating AI models. By implementing automated quality monitoring in its Humaine benchmark, Prolific aims to ensure that the feedback and interactions genuinely reflect user preferences rather than being skewed by sample bias.

As AI technology continues to evolve, understanding how models perform under conditions that mimic real human interaction becomes essential. The insights from this study could guide developers in refining their models to better meet user expectations, bridging the gap between technical performance and user satisfaction.

In summary, while OpenAI’s ChatGPT remains a significant player in the generative AI space, its eighth-place ranking indicates a shift in the competitive landscape. As companies like Google and DeepSeek continue to innovate, evaluating AI through the lens of user interaction will likely become increasingly important for maintaining relevance in the market.

AIPRESSA.COM
