Connect with us

Hi, what are you looking for?

AI Research

AI Models in Economic Game Show Key Differences from Human Strategic Thinking

AI models, including GPT-4o and Claude Sonnet, show significant strategic differences from humans in economic games, opting for lower numbers despite shared emotional responses.

In a groundbreaking study, researchers have explored how advanced artificial intelligence (AI) models make decisions in strategic contexts traditionally reserved for human judgment. Conducted by Dmitry Dagaev, head of the Laboratory of Sports Studies at HSE University, along with colleagues from HSE University–Perm and the University of Lausanne, the research delved into the behavior of prominent AI models during the “Guess the Number” game, a modern adaptation of the Keynesian beauty contest. This experiment poses a critical question: when tasked with strategic reasoning, do AI systems think like humans?

The “Guess the Number” game requires participants to select a number between 0 and 100, with the winner being the one whose choice is closest to a designated fraction of the group’s average. Historical studies indicate that human players often deviate from optimal mathematical choices, influenced by cognitive limits and emotional factors. The researchers aimed to determine if AI would follow a similar pattern.

The team assessed five leading language models, including GPT-4o, GPT-4o Mini, Gemini-2.5-flash, Claude-Sonnet-4, and Llama-4-Maverick, across 16 scenarios inspired by classic economic experiments. Scenarios varied in parameters, such as the fraction for the winning number and how participants’ choices were aggregated—through averages, medians, or maximums. Each model acted as a single player, receiving identical instructions and repeating each scenario 50 times without learning from previous rounds, reflecting one-shot experiments typically used with human subjects.

Initial results were promising, with all 4,000 responses adhering to game rules and remaining within the 0 to 100 range. Most explanations demonstrated some level of strategic reasoning, with only 23 instances lacking it. However, when comparing AI choices to human results from previous studies by economist Rosemarie Nagel, significant differences emerged. In situations where the target was half the group average, human participants averaged about 27, while AI models consistently opted for lower values, often approaching zero, which is typically the Nash equilibrium in these scenarios.

Notably, AI behavior diverged based on game structure. For instance, in variations utilizing maximum numbers, both humans and AI tended to choose higher values, yet specific models still displayed marked differences. Claude Sonnet averaged around 35, while Llama opted for significantly lower numbers. “These results show that AI responds to changes in game structure much like people do,” Dagaev noted. However, a crucial gap was identified: in two-player game formats, none of the models recognized that choosing zero is a weakly dominant strategy, opting instead for detailed reasoning about the potential choices of others—contrasting with the formal economic training that often guides human players.

Further analysis revealed distinct behavioral patterns among the models. In pairwise comparisons, algorithms such as GPT-4o and Claude Sonnet generally produced mid-range results, while Gemini Flash fluctuated between cautious and aggressive choices. The team also examined Llama models of various sizes, from 1 billion to 405 billion parameters, discovering that smaller models tended to choose numbers closer to typical human guesses, while larger models gravitated toward theoretical predictions, selecting lower values as their complexity increased.

The researchers also investigated context sensitivity, altering prompts by changing wording or framing the game as a televised contest with emotionally charged opponents. Results indicated that both AI and human players responded similarly to emotional framing; when opponents were described as angry, both groups tended to select higher numbers. However, the overall response structure remained stable across models.

The study’s findings highlight critical insights into AI decision-making in economic contexts. While modern AI demonstrates the ability to recognize strategic settings and adjust behavior accordingly, it often behaves more “rationally” than human participants, typically opting for lower numbers. Yet, its failure to identify simple dominant strategies and its tendency to overestimate the sophistication of others mark notable limitations.

As Dagaev emphasized, “We are now at a stage where AI models are beginning to replace humans in many operations, enabling greater economic efficiency in business processes.” This research underscores the importance of understanding where AI aligns with human behavior and where it diverges, which will ultimately influence the application of these systems in markets, policy-making, and everyday life.

These insights also suggest that if AI models generally anticipate strategic behavior, they may misjudge emotional markets characterized by irrational decision-making. Conversely, their predictive alignment with comparative trends indicates potential utility in forecasting and analysis. For researchers, this study identifies areas in AI that require enhancement, particularly in recognizing straightforward strategic dominance, while for society, it provides crucial guidance on when to place trust in AI decisions versus human judgment.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Business

Red Hat advances enterprise AI with Small Language Models that achieve over 98% validity in structured tasks, prioritizing reliability and data sovereignty.

AI Research

OpenAI's o1 model achieves 81.6% diagnostic accuracy in emergency situations, surpassing human doctors and signaling a major shift in medical practice.

AI Regulation

Korea Venture Investment Corp. unveils AI-driven fund management systems by integrating Nvidia H200 GPUs to enhance efficiency and support unicorn growth.

AI Technology

Apple raises Mac mini starting price to $799 amid AI-driven inventory shortages, eliminating the $599 model in response to surging demand for advanced computing.

AI Research

IBM launches a Chicago Quantum Hub to create 750 AI jobs and expands its MIT partnership to advance quantum computing and AI integration.

AI Government

71% of Australian employees use generative AI daily, but only 36% trust its implementation, highlighting urgent calls for better policy frameworks and safeguards.

AI Regulation

The Academy of Motion Picture Arts and Sciences bars AI performances from Oscar eligibility, emphasizing human-authored content amid rising industry tensions over generative AI's...

AI Tools

Workday's stock jumps 3.73% to $126.96 amid AI product updates and earnings optimism, yet analysts cite a 49.8% undervaluation risk at $253.14.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.