AI Generative

Meta’s Galactica Trained on 106B Tokens But Faced Backlash Over Fabricated Citations

Meta’s Galactica was trained on 106 billion tokens from a corpus that included 48 million papers, but it faced backlash over fabricated research and citation accuracy that ranged from 36.6% to 69.1%.

Staff

Published

12 December, 2025

Meta’s experimental AI model, Galactica, had a brief public demonstration in November 2022. Trained on 106 billion tokens from a corpus that included approximately 48 million scientific papers, the model was designed to advance scientific reasoning. It was withdrawn just three days later amid widespread criticism that it generated misleading scientific content and fabricated citations. The episode has since fueled discussion about the intersection of artificial intelligence and scientific integrity.

The model was built on a curated dataset that Meta AI described as “curated humanity’s scientific knowledge,” setting it apart from models trained on typical web-scraped data. The dataset spanned research papers, textbooks, and lecture notes, and included over 360 million in-context citations. Galactica was trained for 4.25 epochs using a decoder-only Transformer architecture. The flagship variant had 120 billion parameters and required 128 NVIDIA A100 nodes for training.
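For a sense of scale, the corpus size and epoch count reported above imply roughly 450 billion tokens seen during training. The following is a back-of-the-envelope sketch of that arithmetic, assuming each epoch is one full pass over the 106-billion-token corpus. The variable names are illustrative and do not come from Meta.

```python
# Back-of-the-envelope estimate, not a figure taken from Meta's documentation.
# Assumption: each epoch is one full pass over the whole corpus.
CORPUS_TOKENS = 106e9  # corpus size reported in the article
EPOCHS = 4.25          # training epochs reported in the article

tokens_seen = CORPUS_TOKENS * EPOCHS
print(f"Tokens seen in training: {tokens_seen / 1e9:.1f} billion")
```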

Galactica posted notable results on scientific benchmarks, scoring 68.2% on LaTeX equation generation against GPT-3’s 49%. The model also outperformed larger counterparts on specific tasks; for instance, the 30 billion parameter variant surpassed the PaLM 540 billion model in mathematical reasoning, scoring 20.4% on the MATH benchmark compared to PaLM’s 8.8%. Other achievements included state-of-the-art results on PubMedQA and MedMCQA.

However, the accuracy of Galactica’s citation predictions varied widely, ranging from 36.6% to 69.1%, and its predictions skewed toward highly cited works. Critics highlighted instances where the model generated citations for non-existent papers attributed to real researchers, including fabricated works credited to researchers at Meta’s Reality Labs and Google AI. Michael Black, Director of the Max Planck Institute for Intelligent Systems, voiced concerns that such outputs could lead to what he termed “deep scientific fakes.”
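To make the metric concrete, a citation accuracy figure of this kind can be read as the share of predicted references that match the expected one. The sketch below illustrates that idea along with a simple check that flags predictions missing from a trusted index. It is not Meta's evaluation code: the function names, the exact-match rule, and the placeholder titles are all assumptions made for illustration.

```python
def _normalize(title):
    """Lowercase and trim a title so trivial formatting differences do not count as misses."""
    return title.strip().lower()


def citation_accuracy(predicted, expected):
    """Fraction of predicted citations that match the expected citation (hypothetical metric)."""
    if len(predicted) != len(expected):
        raise ValueError("predicted and expected must have the same length")
    if not predicted:
        return 0.0
    hits = sum(_normalize(p) == _normalize(e) for p, e in zip(predicted, expected))
    return hits / len(predicted)


def flag_unverified(predicted, known_titles):
    """Return predicted citations that do not appear in a trusted index of real titles."""
    known = {_normalize(t) for t in known_titles}
    return [p for p in predicted if _normalize(p) not in known]


# Placeholder titles only; these are not real papers.
predicted = ["Paper A", "Paper B", "Paper X"]
expected = ["Paper A", "Paper B", "Paper C"]
index = ["Paper A", "Paper B", "Paper C"]

print(f"Accuracy: {citation_accuracy(predicted, expected):.1%}")
print("Not found in index:", flag_unverified(predicted, index))
```

A check like `flag_unverified` only catches citations absent from the index; it would not catch a real paper cited for a claim it does not support.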

Galactica’s public demo, which began on November 15, 2022, was one of the shortest in the AI industry. It ended on November 18, roughly two weeks before the launch of OpenAI’s ChatGPT. The rapid withdrawal reflected not only the internal challenges Meta faced but also the heightened scrutiny surrounding AI-generated scientific outputs. The backlash provided lessons that influenced Meta’s subsequent AI initiatives.

In the aftermath of Galactica, Meta changed its approach, launching the Llama model family in February 2023 with form-based access for researchers rather than a public demo. Joelle Pineau, VP of Research at Meta AI, acknowledged that insights gained from Galactica were integral to the development of future models. Yann LeCun also cited the Galactica incident when discussing the access restrictions for Llama, underscoring the need for responsible deployment of AI technologies.

The questions Galactica raised extend beyond Meta as adoption of large language models (LLMs) accelerates. The global LLM market reached $2.08 billion in 2024 and is projected to grow to $15.64 billion by 2029, a compound annual growth rate of 49.6%. A study conducted in 2024 found that 80.9% of researchers reported using LLMs in at least one research area, pointing to rapid AI adoption in academia.
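The growth rate follows from the two market figures. As a quick check, the sketch below applies the standard compound annual growth rate formula, (end / start)^(1 / years) - 1, to the numbers quoted above, assuming a five-year span from 2024 to 2029. The function name is illustrative.

```python
def cagr(start_value, end_value, years):
    """Compound annual growth rate as a fraction (0.10 means 10% per year)."""
    if start_value <= 0 or years <= 0:
        raise ValueError("start_value and years must be positive")
    return (end_value / start_value) ** (1 / years) - 1


# Market figures quoted in the article, in billions of US dollars.
# Assumption: the projection covers the five years from 2024 to 2029.
rate = cagr(start_value=2.08, end_value=15.64, years=2029 - 2024)
print(f"Implied CAGR: {rate:.1%}")
```

The result comes out near 49.7%, in line with the cited 49.6%; the small gap is consistent with rounding in the underlying market estimates.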

While Galactica faced challenges during its brief debut, the model’s development has left a lasting imprint on the landscape of scientific AI. The lessons learned from its rollout are likely to shape how future AI models are developed, accessed, and utilized. As the demand for reliable AI in research continues to grow, the industry must address the delicate balance between innovation and accuracy, ensuring that technology serves to enhance, rather than undermine, the pursuit of knowledge.


