AI Generative

OpenAI Launches GPT-5.4 Mini and Nano Models for High-Volume AI Tasks at Lower Costs

OpenAI introduces GPT-5.4 mini and nano models, achieving over 2x speed improvements at costs as low as $0.20 per million tokens for efficient high-volume AI tasks

Staff

Published

2 hours ago

OpenAI has unveiled two new AI models, GPT-5.4 mini and GPT-5.4 nano, designed to enhance efficiency for high-volume tasks where speed and cost are critical. These models aim to incorporate several capabilities of the larger GPT-5.4 system into more agile formats, thereby improving response times significantly. The introduction of these models aligns with use cases where latency is crucial, including coding assistance, automated subagents, and real-time image processing applications.

The GPT-5.4 mini is positioned as a successor to its predecessor, GPT-5 mini, offering advancements in coding, reasoning, multimodal understanding, and tool utilization. OpenAI claims that this model operates over twice as fast as the earlier version and performs comparably to the larger GPT-5.4 model on specific benchmarks such as SWE-Bench Pro and OSWorld-Verified.

In contrast, GPT-5.4 nano is the smallest and most cost-effective variant in the new series. It is specifically tailored for lighter tasks including classification, data extraction, and ranking, while also supporting coding functions. According to OpenAI, this model represents a notable improvement over GPT-5 nano, emphasizing efficiency without compromising essential capabilities.

Both models are engineered for environments that demand quick and dependable outputs rather than maximum scale. Use cases include coding tools that require rapid responsiveness, systems interpreting screenshots, and applications reliant on real-time image analysis. OpenAI indicates that smaller models such as these can strike a better balance between performance and speed compared to their larger counterparts.

In coding workflows, OpenAI points out that both GPT-5.4 mini and nano are well-suited for tasks that benefit from swift iterations, including targeted edits, debugging, and navigating extensive codebases. Notably, the GPT-5.4 mini is reported to exceed the performance of GPT-5 mini while maintaining similar processing speeds, bringing it closer to the capabilities of the larger GPT-5.4 on various tests.

OpenAI has also underscored the significance of smaller models in multi-model systems. In setups like Codex, larger models may handle strategic planning and decision-making, while smaller models, such as GPT-5.4 mini, execute narrower tasks in parallel, such as searching codebases or processing documents. This design approach enables developers to allocate workloads more effectively, resulting in enhanced system performance.

Performance benchmarks reveal that the GPT-5.4 mini excels in multimodal tasks related to computer use, such as interpreting complex user interface screenshots. On the OSWorld-Verified benchmark, the model reportedly approaches the performance levels of GPT-5.4 while surpassing GPT-5 mini.

Developers can access the GPT-5.4 mini through OpenAI’s API, as well as within Codex and ChatGPT. In the API context, it supports both text and image inputs, tool utilization, function calling, web and file searching, and computer-based interactions, boasting a context window of 400,000 tokens. Pricing for this model is set at $0.75 per million input tokens and $4.50 per million output tokens.

Within the Codex environment, the GPT-5.4 mini is available across its app, command-line interface, IDE extension, and web interface. OpenAI states that it consumes approximately 30% of the GPT-5.4 usage quota, thereby allowing developers to tackle simpler tasks at a lower cost. Codex systems can also assign less complex assignments to GPT-5.4 mini while reserving more challenging tasks for larger models.

In ChatGPT, the GPT-5.4 mini is accessible to Free and Go users through the “Thinking” feature and serves as a fallback for GPT-5.4 Thinking in other tiers when usage limits are reached.

The GPT-5.4 nano model is currently only available via the API, with pricing set at $0.20 per million input tokens and $1.25 per million output tokens. As the landscape of AI continues to evolve, these new models from OpenAI are expected to play a critical role in enhancing efficiency and performance in various applications, ultimately shaping the future of AI-driven technologies.

Microsoft Mulls Legal Action Over $50B Amazon-OpenAI Cloud Deal Amid Rising AI Competition

Microsoft considers legal action over Amazon's $50 billion cloud deal with OpenAI, raising stakes in the fierce AI competition and cloud dominance battle.

Staff2 hours ago

Microsoft Considers Legal Action Against Amazon and OpenAI Over $50B AI Deal

Microsoft considers legal action against Amazon and OpenAI over a $50 billion deal that threatens its Azure exclusivity with OpenAI's Frontier product.

Staff4 hours ago

AI Generative

OpenAI Unveils GPT-5.4 Mini and Nano for Faster, Cost-Effective AI Workflows

OpenAI launches GPT-5.4 mini and nano, enhancing performance by over 100% at $0.75 and $0.20 per million tokens, revolutionizing cost-effective AI workflows.

Staff6 hours ago

AI Generative

OpenAI Launches GPT-5.4 Mini and Nano for Free Users with 2x Speed Boost

OpenAI launches GPT-5.4 Mini and Nano models, delivering over 2x speed increase for free users, enhancing coding and multimodal processing efficiency.

Staff10 hours ago

OpenAI Secures Amazon AWS Deal for Classified Work, Leaving Anthropic Behind

Tesla plans a $35B-$45B investment in its Terafab project to produce 200M chips annually, aiming to lead in autonomous tech and robotics.

Staff18 hours ago

AI Generative

OpenAI’s Generative AI Users Face Potential Copyright Liability Amid Legal Uncertainties

Generative AI users, including those leveraging OpenAI's ChatGPT, risk copyright liability as courts explore the legal implications of AI-generated content.

Staff22 hours ago

Anthropic Hires Explosives Expert to Enhance AI Safety Against Weapon Misuse

Anthropic hires a chemicals and explosives policy manager to bolster safety protocols for its Claude AI amid rising concerns over AI's role in weapon...

Staff23 hours ago

AI Technology

Interview Kickstart Launches Advanced Generative AI Course to Equip Engineers with Key Skills

Interview Kickstart unveils its Advanced Generative AI Course to meet surging demand, equipping engineers with hands-on skills for AI-driven applications.

Staff2 days ago

AIPRESSA.COM

AI Generative

OpenAI Launches GPT-5.4 Mini and Nano Models for High-Volume AI Tasks at Lower Costs

Trending

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Technology

AI Hardware Market Grows 30% in 2025, Driven by Generative AI and Edge Computing Demand

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

You May Also Like

Top Stories

Microsoft Mulls Legal Action Over $50B Amazon-OpenAI Cloud Deal Amid Rising AI Competition

Top Stories

Microsoft Considers Legal Action Against Amazon and OpenAI Over $50B AI Deal

AI Generative

OpenAI Unveils GPT-5.4 Mini and Nano for Faster, Cost-Effective AI Workflows

AI Generative

OpenAI Launches GPT-5.4 Mini and Nano for Free Users with 2x Speed Boost

Top Stories

OpenAI Secures Amazon AWS Deal for Classified Work, Leaving Anthropic Behind

AI Generative

OpenAI’s Generative AI Users Face Potential Copyright Liability Amid Legal Uncertainties

Top Stories

Anthropic Hires Explosives Expert to Enhance AI Safety Against Weapon Misuse

AI Technology

Interview Kickstart Launches Advanced Generative AI Course to Equip Engineers with Key Skills