
Twelve AI Firms Release Updated Safety Policies Amid Growing Risk Concerns

Twelve leading AI firms, including OpenAI and Google DeepMind, unveil updated safety policies to mitigate risks from advanced models, reflecting a commitment to accountability.

A coalition of developers specializing in large foundation models has begun implementing corporate protocols to evaluate and mitigate risks associated with their artificial intelligence (AI) technologies. As of September 2023, several key AI companies have voluntarily published these protocols aimed at addressing severe risks posed by their models. This initiative gained momentum at the AI Seoul Summit in May 2024, where sixteen companies committed to the Frontier AI Safety Commitments, with an additional four companies joining since then. Currently, twelve organizations, including Anthropic, OpenAI, Google DeepMind, Meta, and Microsoft, have made their frontier AI safety policies public.

The initial report released in August 2024 focused on the commonalities found in the safety policies of Anthropic, OpenAI, and Google DeepMind. By March 2025, as the number of available policies increased to twelve, the document was updated to incorporate new insights and developments. The latest version, published in December 2025, references updates in some developers’ safety policies, along with relevant guidelines from the EU AI Act and California’s Senate Bill 53.

Each policy examined in the reports defines capability thresholds: levels of model capability that signal heightened risk, such as the capacity to facilitate biological weapons development, cyberattacks, or autonomous replication. The developers commit to assessing whether their models are approaching thresholds that could lead to severe or catastrophic outcomes. When a model nears such a threshold, the policies call for strengthened model weight security and deployment mitigations.

In response to these risks, developers have pledged to secure model weights against theft by sophisticated adversaries and to implement safety measures that minimize the risk of misuse. The policies also include provisions to halt development and deployment should mitigation efforts prove inadequate. To ensure effective risk management, evaluations are designed to thoroughly assess model capabilities before deployment, during training, and after deployment. The policies further emphasize exploring accountability mechanisms, including potential oversight by third parties or advisory boards, which would monitor policy implementation and assist with evaluations.

As developers continue to refine their evaluation processes and deepen their understanding of AI-related risks, the policies are expected to be updated over time. This ongoing evolution reflects a heightened awareness within the industry of the potential consequences of advanced AI technologies and the necessity for stringent safety measures. With the rapid advancement of AI capabilities, the commitment to accountability and risk mitigation promises to shape the future landscape of responsible AI deployment.

