
OpenAI Launches GPT-5.2 with 40.3% Math Problem Accuracy and New Discovery

OpenAI launches GPT-5.2, which achieves a record 40.3% accuracy on complex math problems and has supported new research in statistical learning theory.

OpenAI Group PBC has launched its latest large language model, GPT-5.2. Released today, the model is available in three variants: Instant, Thinking, and Pro. The company claims that the Thinking and Pro versions outperform competing models in several domains, particularly on mathematical tasks.

On the FrontierMath benchmark, which consists of challenging college-level math problems, the mid-range GPT-5.2 Thinking version solved 40.3% of the problems correctly, a record success rate. The model also earned a perfect score on a qualifying examination for the International Mathematical Olympiad.

The Pro version of GPT-5.2 assisted researchers with a discovery in statistical learning theory. According to OpenAI, it resolved a simplified form of an open problem that had been presented at a 2019 math conference, and it did so without human guidance on how to approach the task.

Compared with its predecessor, GPT-5.1, the new model is better at interpreting the charts commonly found in scientific literature. On the CharXiv Reasoning benchmark, the Thinking version correctly interpreted 88.7% of charts, an improvement of more than 8 percentage points over GPT-5.1 Thinking.

GPT-5.2's visual reasoning extends to other applications. In internal evaluations, OpenAI staff used the model to identify key components in a low-resolution image of a motherboard. It has also proven capable of analyzing business intelligence dashboards and product diagrams.

Front-end development tasks, such as building user interfaces and other visual application components, also benefit from GPT-5.2’s improvements, and the model is adept at creating three-dimensional assets, including simulations. In other programming tasks, it scored 55.6% on SWE-Bench Pro, a benchmark of complex coding challenges across multiple programming languages, and 80% on the Python-specific SWE-bench Verified.

OpenAI began rolling out GPT-5.2 to ChatGPT users today and has made the model available to developers through its application programming interface. The entry-level model is priced at $1.75 per million input tokens and $14 per million output tokens. The Pro version costs $21 per million input tokens and $168 per million output tokens. OpenAI also indicated that developers could cut costs by up to 90% with a caching feature that reuses previously processed input, reducing the need for repeated computation.
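To make the per-million-token pricing above concrete, the following is a minimal Python sketch that estimates the cost of a request from its token counts. The prices are the ones quoted in this article; the tier labels `"base"` and `"pro"` and the function name `estimate_cost` are hypothetical names chosen for illustration, not OpenAI API identifiers, and the sketch ignores any caching discount.

```python
from decimal import Decimal

# Per-million-token prices in USD, as quoted in the article.
# The tier keys "base" and "pro" are illustrative labels, not API model names.
PRICES = {
    "base": {"input": Decimal("1.75"), "output": Decimal("14")},
    "pro": {"input": Decimal("21"), "output": Decimal("168")},
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(tier: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for one request, with no caching discount."""
    price = PRICES[tier]
    total = (
        Decimal(input_tokens) * price["input"]
        + Decimal(output_tokens) * price["output"]
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example workload: 200,000 input tokens and 50,000 output tokens.
    for tier in PRICES:
        print(tier, estimate_cost(tier, 200_000, 50_000))
```

For a workload of 200,000 input tokens and 50,000 output tokens, this works out to $1.05 at the entry-level rates and $12.60 at the Pro rates. At the quoted prices, Pro costs twelve times as much as the entry-level model for both input and output tokens.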

The launch of GPT-5.2 strengthens OpenAI's position in mathematical reasoning and programming tasks. As the company continues to expand the capabilities of its language models, the release could change how businesses and researchers approach problem-solving in complex fields.

Written by AiPressa Staff

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.


© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.