Google DeepMind Launches AGI Benchmark Hackathon with $200K Prize Pool on Kaggle

Google DeepMind and Kaggle launch a $200,000 hackathon to establish new benchmarks for evaluating artificial general intelligence capabilities.

Staff

Published

25 March, 2026

Google DeepMind and Kaggle have announced a new hackathon aimed at creating benchmarks for evaluating artificial general intelligence (AGI), alongside the release of a research paper that proposes a framework for assessing AI systems against human cognitive capabilities. The initiative addresses ongoing industry debates concerning the definition and measurement of progress toward more general AI.

The hackathon, titled “Measuring Progress Toward AGI: Cognitive Abilities,” is set to run from March 17 to April 16, offering a total prize pool of $200,000 for participants who develop innovative evaluation methods. Neil Hoyne, Chief Strategist at Google, noted in a LinkedIn post that this initiative is not solely about the largest AI models; rather, it seeks to determine whether these models possess the capabilities, intuition, and focus to navigate the world similarly to humans.

Accompanying the hackathon, Google DeepMind has released a paper titled “Measuring Progress Toward AGI: A Cognitive Taxonomy,” which outlines ten cognitive abilities considered crucial for assessing general intelligence. These include perception, generation, attention, learning, memory, reasoning, metacognition, executive functions, problem-solving, and social cognition. The framework encourages testing AI systems across various tasks associated with these abilities and comparing their performance to human baselines. This approach aims to provide a more nuanced understanding of how AI systems operate across different cognitive activities, moving beyond simplistic benchmark scores.

Google DeepMind asserts that current evaluation methods often fail to differentiate between models relying on memorization and those capable of adapting to new challenges. By focusing on five key areas—learning, metacognition, attention, executive functions, and social cognition—the hackathon seeks to address these gaps. Participants are encouraged to create benchmarks utilizing Kaggle’s Community Benchmarks platform, designing tasks that test how AI systems manage new information, maintain attention, plan actions, and interpret social contexts.

The competition includes two $10,000 awards for each of the five tracks, as well as four $25,000 grand prizes for the top submissions. Judging will occur between April 17 and May 31, with results anticipated on June 1. Hoyne emphasized that participants will contribute to the development of a more truthful method for evaluating AI by creating assessments that reflect real human skills, such as adaptive learning and understanding social cues.

This initiative marks a shift in focus from assessing AI model performance to establishing robust measurement standards. As the landscape of artificial intelligence evolves, there is increasing demand for consistent evaluation criteria that can accurately reflect a system’s capabilities. Google DeepMind advocates for new evaluation methods that can highlight where AI systems demonstrate reliability and where they face limitations, particularly concerning reasoning, adaptability, and social interaction.

For developers and researchers, the hackathon positions benchmark design as an essential component of AI development, with expected outcomes aimed at enhancing future evaluation standards within the industry. The initiative holds the potential to reshape how AI systems are tested and understood, paving the way for a more comprehensive approach to measuring intelligence in artificial systems.

ETIH Innovation Awards 2026 are also now accepting entries. These awards acknowledge education technology organizations that achieve measurable impact across K–12, higher education, and lifelong learning. Open to submissions from the UK, the Americas, and international participants, the awards will assess entries based on their outcomes and real-world applications.

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

Staff3 May, 2026

AI Marketing

BusySeed Launches Rankxa to Measure Brand Visibility in AI-Generated Search Results

BusySeed unveils Rankxa, a tool tracking brand visibility across AI-generated responses, revealing 90% of brands lack meaningful presence in this new landscape.

Sofía Méndez3 May, 2026

AI Generative

Google Prepares Omni Model for Gemini Video Generation Ahead of I/O 2026

Google is set to unveil its new video-generation tool, Omni, at I/O 2026, potentially integrating Gemini's capabilities and enhancing competition against ByteDance's Seedance 2.0.

Staff2 May, 2026

AI Marketing

ACME.BOT Reveals SEO Checklists are Obsolete as AI Search Reshapes Content Visibility

ACME.BOT declares traditional SEO checklists obsolete, revealing a 27% drop in organic traffic as AI platforms disrupt content visibility.

Sofía Méndez2 May, 2026

Apple, Google, and Amazon Shine Post-Earnings as AI Demand Reshapes Tech Landscape

Apple's Q2 earnings reveal a price hike for the Mac mini to $799, fueled by AI memory demand, as Google and Amazon also report...

Staff2 May, 2026

AI Technology

Big Tech to Invest $3.7 Trillion in AI Infrastructure, Surpassing Historic Rail Expansion

Major tech giants, including Google and Amazon, are set to invest $3.7 trillion in AI infrastructure over five years, reshaping the workforce and economy.

Staff1 May, 2026

AI Generative

Gemini Embedding 2 Launches with Multimodal Capabilities, Enhancing AI Retrieval Accuracy by 40%

Google's Gemini Embedding 2 enhances AI retrieval accuracy by 40%, enabling multimodal inputs and boosting search precision for platforms like Harvey and Nuuly.

Staff1 May, 2026

Google DeepMind’s AI Co-Clinician Surpasses GPT-5.4 in Blind Doctor Tests

Google DeepMind's AI co-clinician outperformed GPT-5.4 in doctor tests, achieving 67 preferences in primary care queries and a remarkable 95% quality score in open-ended...

Staff1 May, 2026

AIPRESSA.COM

Top Stories

Google DeepMind Launches AGI Benchmark Hackathon with $200K Prize Pool on Kaggle

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

AI Marketing

BusySeed Launches Rankxa to Measure Brand Visibility in AI-Generated Search Results

AI Generative

Google Prepares Omni Model for Gemini Video Generation Ahead of I/O 2026

AI Marketing

ACME.BOT Reveals SEO Checklists are Obsolete as AI Search Reshapes Content Visibility

Top Stories

Apple, Google, and Amazon Shine Post-Earnings as AI Demand Reshapes Tech Landscape

AI Technology

Big Tech to Invest $3.7 Trillion in AI Infrastructure, Surpassing Historic Rail Expansion

AI Generative

Gemini Embedding 2 Launches with Multimodal Capabilities, Enhancing AI Retrieval Accuracy by 40%

Top Stories

Google DeepMind’s AI Co-Clinician Surpasses GPT-5.4 in Blind Doctor Tests