AI Generative

OpenAI Launches GPT-5.4 Mini and Nano Models for High-Volume AI Tasks at Lower Costs

OpenAI introduces GPT-5.4 mini and nano models, achieving over 2x speed improvements at costs as low as $0.20 per million tokens for efficient high-volume AI tasks

Staff

Published

18 March, 2026

OpenAI has unveiled two new AI models, GPT-5.4 mini and GPT-5.4 nano, designed to enhance efficiency for high-volume tasks where speed and cost are critical. These models aim to incorporate several capabilities of the larger GPT-5.4 system into more agile formats, thereby improving response times significantly. The introduction of these models aligns with use cases where latency is crucial, including coding assistance, automated subagents, and real-time image processing applications.

The GPT-5.4 mini is positioned as a successor to its predecessor, GPT-5 mini, offering advancements in coding, reasoning, multimodal understanding, and tool utilization. OpenAI claims that this model operates over twice as fast as the earlier version and performs comparably to the larger GPT-5.4 model on specific benchmarks such as SWE-Bench Pro and OSWorld-Verified.

In contrast, GPT-5.4 nano is the smallest and most cost-effective variant in the new series. It is specifically tailored for lighter tasks including classification, data extraction, and ranking, while also supporting coding functions. According to OpenAI, this model represents a notable improvement over GPT-5 nano, emphasizing efficiency without compromising essential capabilities.

Both models are engineered for environments that demand quick and dependable outputs rather than maximum scale. Use cases include coding tools that require rapid responsiveness, systems interpreting screenshots, and applications reliant on real-time image analysis. OpenAI indicates that smaller models such as these can strike a better balance between performance and speed compared to their larger counterparts.

In coding workflows, OpenAI points out that both GPT-5.4 mini and nano are well-suited for tasks that benefit from swift iterations, including targeted edits, debugging, and navigating extensive codebases. Notably, the GPT-5.4 mini is reported to exceed the performance of GPT-5 mini while maintaining similar processing speeds, bringing it closer to the capabilities of the larger GPT-5.4 on various tests.

OpenAI has also underscored the significance of smaller models in multi-model systems. In setups like Codex, larger models may handle strategic planning and decision-making, while smaller models, such as GPT-5.4 mini, execute narrower tasks in parallel, such as searching codebases or processing documents. This design approach enables developers to allocate workloads more effectively, resulting in enhanced system performance.

Performance benchmarks reveal that the GPT-5.4 mini excels in multimodal tasks related to computer use, such as interpreting complex user interface screenshots. On the OSWorld-Verified benchmark, the model reportedly approaches the performance levels of GPT-5.4 while surpassing GPT-5 mini.

Developers can access the GPT-5.4 mini through OpenAI’s API, as well as within Codex and ChatGPT. In the API context, it supports both text and image inputs, tool utilization, function calling, web and file searching, and computer-based interactions, boasting a context window of 400,000 tokens. Pricing for this model is set at $0.75 per million input tokens and $4.50 per million output tokens.

Within the Codex environment, the GPT-5.4 mini is available across its app, command-line interface, IDE extension, and web interface. OpenAI states that it consumes approximately 30% of the GPT-5.4 usage quota, thereby allowing developers to tackle simpler tasks at a lower cost. Codex systems can also assign less complex assignments to GPT-5.4 mini while reserving more challenging tasks for larger models.

In ChatGPT, the GPT-5.4 mini is accessible to Free and Go users through the “Thinking” feature and serves as a fallback for GPT-5.4 Thinking in other tiers when usage limits are reached.

The GPT-5.4 nano model is currently only available via the API, with pricing set at $0.20 per million input tokens and $1.25 per million output tokens. As the landscape of AI continues to evolve, these new models from OpenAI are expected to play a critical role in enhancing efficiency and performance in various applications, ultimately shaping the future of AI-driven technologies.

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

Staff3 May, 2026

AI Research

OpenAI’s AI Model Achieves 81.6% Diagnostic Accuracy, Surpassing Human Doctors in ER Tests

OpenAI's o1 model achieves 81.6% diagnostic accuracy in emergency situations, surpassing human doctors and signaling a major shift in medical practice.

Staff3 May, 2026

AI Generative

OpenAI Launches GPT Image 2, Surpassing Google Nano Banana 2 in Key Categories

OpenAI unveils GPT Image 2, achieving a record 242-point lead over competitors, transforming the AI image generation landscape with native reasoning capabilities.

Staff2 May, 2026

AI Technology

Apple Faces Mac Mini and Studio Shortage as OpenClaw Drives AI Demand Surge

Apple CEO Tim Cook warns of several-month supply shortages for the Mac mini and Mac Studio as demand surges, pushing Mac revenue to $8.4...

Staff2 May, 2026

DeepSeek Launches V4 Open-Source Model, Underpricing GPT-5.5 and Claude Opus 4.7

DeepSeek's V4 open-source model undercuts GPT-5.5 and Claude Opus 4.7 with costs of $1.74 per million tokens, promising a disruptive shift in AI pricing...

Staff2 May, 2026

AI Generative

OpenAI’s ChatGPT Images 2.0 Surges in India, Sees Mixed Global Response with 11% App Growth

OpenAI's ChatGPT Images 2.0 sees 5 million downloads in India within a week, driving an 11% global app growth amid varied international adoption trends

Staff1 May, 2026

AI Cybersecurity

OpenAI’s GPT-5.5 Matches Claude Mythos in Cyberattack Efficiency, Solves Puzzles in 10 Minutes

OpenAI's GPT-5.5 autonomously executed complex cyberattacks with a 71.4% pass rate, raising alarms as U.K. officials unveil £90M to enhance cyber resilience.

Rachel Torres1 May, 2026

AI Generative

OpenAI Tests GPT 5.6 in Codex Update to Enhance AI Coding and Cybersecurity Features

OpenAI tests GPT 5.6 in Codex, aiming to enhance AI-driven coding efficiency and cybersecurity, potentially reshaping the developer landscape.

Staff1 May, 2026

AIPRESSA.COM

AI Generative

OpenAI Launches GPT-5.4 Mini and Nano Models for High-Volume AI Tasks at Lower Costs

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

AI Research

OpenAI’s AI Model Achieves 81.6% Diagnostic Accuracy, Surpassing Human Doctors in ER Tests

AI Generative

OpenAI Launches GPT Image 2, Surpassing Google Nano Banana 2 in Key Categories

AI Technology

Apple Faces Mac Mini and Studio Shortage as OpenClaw Drives AI Demand Surge

Top Stories

DeepSeek Launches V4 Open-Source Model, Underpricing GPT-5.5 and Claude Opus 4.7

AI Generative

OpenAI’s ChatGPT Images 2.0 Surges in India, Sees Mixed Global Response with 11% App Growth

AI Cybersecurity

OpenAI’s GPT-5.5 Matches Claude Mythos in Cyberattack Efficiency, Solves Puzzles in 10 Minutes

AI Generative

OpenAI Tests GPT 5.6 in Codex Update to Enhance AI Coding and Cybersecurity Features