AI Finance

AI Tools Mislead Users: Which? Study Reveals Accuracy Scores Below 70% for Finance Advice

Which? study reveals AI tools like ChatGPT and Copilot score below 70% in accuracy for finance advice, raising concerns for over 25 million UK users.

Marcus Chen

Published

21 November, 2025

A recent study by Which? evaluated the performance of six popular AI tools in addressing everyday consumer queries across various domains, including personal finance, legal issues, health, consumer rights, and travel. The researchers posed 40 questions to each tool and assessed their responses based on accuracy, clarity, usefulness, relevance, and ethical responsibility, ultimately scoring them out of 100.

According to the findings, Perplexity emerged as the leading tool with a score of 71%, followed closely by Gemini’s AIO at 70% and the standalone Gemini tool at 69%. Copilot scored 68%, while ChatGPT and Meta AI scored 64% and 55%, respectively. Notably, despite being the most widely used tool, ChatGPT ranked second from the bottom.

Gaps in AI Responses

The controlled tests revealed significant shortcomings in how these AI tools managed detailed regulations. For instance, when asked about the ISA limits, both ChatGPT and Copilot confidently provided incorrect information, neglecting to mention the correct allowance of £20,000. This oversight could lead users to inadvertently breach HMRC regulations.

Travel-related inquiries also highlighted flaws. Copilot incorrectly informed testers that passengers are entitled to a full refund for canceled flights, a claim that lacks nuance. Additionally, Meta provided inaccurate details regarding compensation for flight delays, failing to explain the full rules that apply to extraordinary circumstances.

The survey further disclosed that 51% of UK adults, more than 25 million people, utilize AI for information searching. Remarkably, nearly half of these users expressed a trusting attitude towards the information provided, with the confidence level rising to 65% among frequent users. One in six individuals rely on AI for financial guidance, while one in eight consult it for legal matters and one in five for health-related issues. A third of respondents believe that the answers generated by these tools stem from reputable sources.

Risks Identified in AI Guidance

The evaluation raised concerns regarding the level of warning provided in sensitive areas like legal and financial advice. For example, when testers inquired about rights related to poor broadband speeds, both ChatGPT and Gemini AIO failed to clarify that only providers adhering to Ofcom’s voluntary guaranteed speed code allow customers to exit contracts without penalties. This misunderstanding was compounded when Gemini suggested that consumers with building disputes hold back payment from builders, a recommendation that could entangle users in further legal complications.

Financial advice also presented various risks. In response to queries about tax refunds, both ChatGPT and Perplexity provided links to premium tax refund services alongside government options, which can lead to unnecessary fees and potential fraud. Furthermore, ChatGPT incorrectly stated that travel insurance is mandatory for UK residents visiting Schengen states, which is not true.

Levent Ergin, Chief Strategist for Climate, Sustainability, and Artificial Intelligence at Informatica, remarked, “AI chatbots are only ever as good as the data and context behind them. Public models are impressive, but they’re trained on what’s broadly available, not the deeply contextual, well-governed information you need for reliable financial guidance.” He stressed the importance of ensuring that these tools draw from trusted data sources to potentially deliver accurate and personalized advice.

As more consumers turn to AI for financial recommendations, the necessity for AI tools to evolve into reliable sources of information becomes paramount. The integration of governed data from banks, brokers, and insurers could pave the way for genuinely personalized advice that reflects users’ specific circumstances.

In summary, while AI tools are becoming increasingly integral to daily life, the Which? study underscores the critical need for improvements in their accuracy and reliability. As AI continues to shape how consumers access information, ensuring its ethical and responsible application will be essential for building user trust and safeguarding against potential misguidance.

AI Regulation

OpenAI’s Tumbler Ridge Incident Sparks Calls for New AI Regulation in Canada

OpenAI's failure to alert authorities after banning a user for violent posts led to the Tumbler Ridge shooting that killed eight, prompting calls for...

Staff1 hour ago

Samsung Expands Galaxy AI with Perplexity Integration for Streamlined User Experience

Samsung enhances its Galaxy AI strategy with the introduction of Perplexity, a multi-agent platform that streamlines workflows and improves user engagement across devices.

Staff7 hours ago

AI Business

Barndoor Launches Venn.ai, Enabling Safe AI Integration with Business Apps

Barndoor.ai unveils Venn.ai, empowering businesses to seamlessly integrate AI with tools like Salesforce and Google Docs while ensuring user security and oversight.

Marcus Chen13 hours ago

AI Generative

OpenAI Reveals 20 Best Generative AI Tools of 2026 to Boost Productivity and Creativity

McKinsey reports 79% of organizations now use generative AI tools like ChatGPT and DALL·E 3 to enhance productivity and streamline content creation.

Staff16 hours ago

AI Generative

Gemini App Launches Video Templates for Streamlined Content Creation

Google's Gemini app enhances video generation with new templates, enabling users to create up to five videos daily based on subscription tiers.

Staff24 hours ago

AI Tools

Amazon Launches Open Beta for MCP Server, Enhancing AI-Driven Ad Workflows

Amazon Ads launches open beta for its MCP Server, enabling AI platforms like ChatGPT to transform natural language into actionable ad API calls, streamlining...

Staff2 days ago

AI Generative

OpenAI Faces Defamation Lawsuit Over False Claims from ChatGPT Outputs

OpenAI faces defamation lawsuits in multiple countries, as generative AI's false outputs provoke significant legal challenges and reputational risks for public figures.

Staff2 days ago

Underrated AI Tools: Gamma, Perplexity, and Runway Transform Productivity and Creativity

Gamma, Perplexity AI, and Runway are revolutionizing productivity and creativity, enabling users to create presentations, streamline research, and edit videos significantly faster and with...

Staff2 days ago

AIPRESSA.COM

AI Finance

AI Tools Mislead Users: Which? Study Reveals Accuracy Scores Below 70% for Finance Advice

Gaps in AI Responses

Risks Identified in AI Guidance

Trending

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Technology

AI Hardware Market Grows 30% in 2025, Driven by Generative AI and Edge Computing Demand

You May Also Like

AI Regulation

OpenAI’s Tumbler Ridge Incident Sparks Calls for New AI Regulation in Canada

Top Stories

Samsung Expands Galaxy AI with Perplexity Integration for Streamlined User Experience

AI Business

Barndoor Launches Venn.ai, Enabling Safe AI Integration with Business Apps

AI Generative

OpenAI Reveals 20 Best Generative AI Tools of 2026 to Boost Productivity and Creativity

AI Generative

Gemini App Launches Video Templates for Streamlined Content Creation

AI Tools

Amazon Launches Open Beta for MCP Server, Enhancing AI-Driven Ad Workflows

AI Generative

OpenAI Faces Defamation Lawsuit Over False Claims from ChatGPT Outputs

Top Stories

Underrated AI Tools: Gamma, Perplexity, and Runway Transform Productivity and Creativity