Connect with us

Hi, what are you looking for?

Top Stories

Google DeepMind Launches AGI Benchmark Hackathon with $200K Prize Pool on Kaggle

Google DeepMind and Kaggle launch a $200,000 hackathon to establish new benchmarks for evaluating artificial general intelligence capabilities.

Google DeepMind and Kaggle have announced a new hackathon aimed at creating benchmarks for evaluating artificial general intelligence (AGI), alongside the release of a research paper that proposes a framework for assessing AI systems against human cognitive capabilities. The initiative addresses ongoing industry debates concerning the definition and measurement of progress toward more general AI.

The hackathon, titled “Measuring Progress Toward AGI: Cognitive Abilities,” is set to run from March 17 to April 16, offering a total prize pool of $200,000 for participants who develop innovative evaluation methods. Neil Hoyne, Chief Strategist at Google, noted in a LinkedIn post that this initiative is not solely about the largest AI models; rather, it seeks to determine whether these models possess the capabilities, intuition, and focus to navigate the world similarly to humans.

Accompanying the hackathon, Google DeepMind has released a paper titled “Measuring Progress Toward AGI: A Cognitive Taxonomy,” which outlines ten cognitive abilities considered crucial for assessing general intelligence. These include perception, generation, attention, learning, memory, reasoning, metacognition, executive functions, problem-solving, and social cognition. The framework encourages testing AI systems across various tasks associated with these abilities and comparing their performance to human baselines. This approach aims to provide a more nuanced understanding of how AI systems operate across different cognitive activities, moving beyond simplistic benchmark scores.

Google DeepMind asserts that current evaluation methods often fail to differentiate between models relying on memorization and those capable of adapting to new challenges. By focusing on five key areas—learning, metacognition, attention, executive functions, and social cognition—the hackathon seeks to address these gaps. Participants are encouraged to create benchmarks utilizing Kaggle’s Community Benchmarks platform, designing tasks that test how AI systems manage new information, maintain attention, plan actions, and interpret social contexts.

The competition includes two $10,000 awards for each of the five tracks, as well as four $25,000 grand prizes for the top submissions. Judging will occur between April 17 and May 31, with results anticipated on June 1. Hoyne emphasized that participants will contribute to the development of a more truthful method for evaluating AI by creating assessments that reflect real human skills, such as adaptive learning and understanding social cues.

This initiative marks a shift in focus from assessing AI model performance to establishing robust measurement standards. As the landscape of artificial intelligence evolves, there is increasing demand for consistent evaluation criteria that can accurately reflect a system’s capabilities. Google DeepMind advocates for new evaluation methods that can highlight where AI systems demonstrate reliability and where they face limitations, particularly concerning reasoning, adaptability, and social interaction.

For developers and researchers, the hackathon positions benchmark design as an essential component of AI development, with expected outcomes aimed at enhancing future evaluation standards within the industry. The initiative holds the potential to reshape how AI systems are tested and understood, paving the way for a more comprehensive approach to measuring intelligence in artificial systems.

ETIH Innovation Awards 2026 are also now accepting entries. These awards acknowledge education technology organizations that achieve measurable impact across K–12, higher education, and lifelong learning. Open to submissions from the UK, the Americas, and international participants, the awards will assess entries based on their outcomes and real-world applications.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Government

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

AI Marketing

BusySeed unveils Rankxa, a tool tracking brand visibility across AI-generated responses, revealing 90% of brands lack meaningful presence in this new landscape.

AI Generative

Google is set to unveil its new video-generation tool, Omni, at I/O 2026, potentially integrating Gemini's capabilities and enhancing competition against ByteDance's Seedance 2.0.

AI Marketing

ACME.BOT declares traditional SEO checklists obsolete, revealing a 27% drop in organic traffic as AI platforms disrupt content visibility.

Top Stories

Apple's Q2 earnings reveal a price hike for the Mac mini to $799, fueled by AI memory demand, as Google and Amazon also report...

AI Technology

Major tech giants, including Google and Amazon, are set to invest $3.7 trillion in AI infrastructure over five years, reshaping the workforce and economy.

AI Generative

Google's Gemini Embedding 2 enhances AI retrieval accuracy by 40%, enabling multimodal inputs and boosting search precision for platforms like Harvey and Nuuly.

Top Stories

Google DeepMind's AI co-clinician outperformed GPT-5.4 in doctor tests, achieving 67 preferences in primary care queries and a remarkable 95% quality score in open-ended...

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.