Connect with us

Hi, what are you looking for?

Top Stories

Google DeepMind Launches AGI Benchmark Hackathon with $200K Prize Pool on Kaggle

Google DeepMind and Kaggle launch a $200,000 hackathon to establish new benchmarks for evaluating artificial general intelligence capabilities.

Google DeepMind and Kaggle have announced a new hackathon aimed at creating benchmarks for evaluating artificial general intelligence (AGI), alongside the release of a research paper that proposes a framework for assessing AI systems against human cognitive capabilities. The initiative addresses ongoing industry debates concerning the definition and measurement of progress toward more general AI.

The hackathon, titled “Measuring Progress Toward AGI: Cognitive Abilities,” is set to run from March 17 to April 16, offering a total prize pool of $200,000 for participants who develop innovative evaluation methods. Neil Hoyne, Chief Strategist at Google, noted in a LinkedIn post that this initiative is not solely about the largest AI models; rather, it seeks to determine whether these models possess the capabilities, intuition, and focus to navigate the world similarly to humans.

Accompanying the hackathon, Google DeepMind has released a paper titled “Measuring Progress Toward AGI: A Cognitive Taxonomy,” which outlines ten cognitive abilities considered crucial for assessing general intelligence. These include perception, generation, attention, learning, memory, reasoning, metacognition, executive functions, problem-solving, and social cognition. The framework encourages testing AI systems across various tasks associated with these abilities and comparing their performance to human baselines. This approach aims to provide a more nuanced understanding of how AI systems operate across different cognitive activities, moving beyond simplistic benchmark scores.

Google DeepMind asserts that current evaluation methods often fail to differentiate between models relying on memorization and those capable of adapting to new challenges. By focusing on five key areas—learning, metacognition, attention, executive functions, and social cognition—the hackathon seeks to address these gaps. Participants are encouraged to create benchmarks utilizing Kaggle’s Community Benchmarks platform, designing tasks that test how AI systems manage new information, maintain attention, plan actions, and interpret social contexts.

The competition includes two $10,000 awards for each of the five tracks, as well as four $25,000 grand prizes for the top submissions. Judging will occur between April 17 and May 31, with results anticipated on June 1. Hoyne emphasized that participants will contribute to the development of a more truthful method for evaluating AI by creating assessments that reflect real human skills, such as adaptive learning and understanding social cues.

This initiative marks a shift in focus from assessing AI model performance to establishing robust measurement standards. As the landscape of artificial intelligence evolves, there is increasing demand for consistent evaluation criteria that can accurately reflect a system’s capabilities. Google DeepMind advocates for new evaluation methods that can highlight where AI systems demonstrate reliability and where they face limitations, particularly concerning reasoning, adaptability, and social interaction.

For developers and researchers, the hackathon positions benchmark design as an essential component of AI development, with expected outcomes aimed at enhancing future evaluation standards within the industry. The initiative holds the potential to reshape how AI systems are tested and understood, paving the way for a more comprehensive approach to measuring intelligence in artificial systems.

ETIH Innovation Awards 2026 are also now accepting entries. These awards acknowledge education technology organizations that achieve measurable impact across K–12, higher education, and lifelong learning. Open to submissions from the UK, the Americas, and international participants, the awards will assess entries based on their outcomes and real-world applications.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

AI Technology

Alphabet's TPU technology poses a significant threat to Nvidia's GPU dominance as both companies face stock declines of 6% amid intensified AI competition.

AI Education

Rise Up CEO Arnaud Blachon announces the launch of Forge Learning, a standalone AI platform designed to transform enterprise training by leveraging real-time data...

Top Stories

Google's AI tool misleads users by altering headlines without consent, risking credibility and prompting users to consider alternatives like DuckDuckGo and Brave.

AI Research

NC State researchers introduce 'neuron freezing' to fortify LLM safety, preventing bypass of AI filters in popular systems like ChatGPT and Gemini.

AI Education

OpenAI appoints Nikita Le Messurier from Google Cloud to accelerate generative AI adoption among startups in Australia and New Zealand, enhancing its regional strategy.

AI Generative

Luma AI's new Uni-1 model outperforms competitors with top scores in image generation tests and offers competitive pricing starting at $0.09 per image.

Top Stories

Google DeepMind appoints Jasjeet Sekhon as chief strategy officer to spearhead its artificial general intelligence initiatives, emphasizing responsible development and strategic oversight.

AI Marketing

GetHookd enhances its AI ad platform with new tools for Facebook and Google campaigns, helping ecommerce marketers analyze 65 million ads and optimize strategies.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.