Connect with us

Hi, what are you looking for?

AI Generative

OpenAI Launches GPT-5.4 Mini and Nano Models for High-Volume AI Tasks at Lower Costs

OpenAI introduces GPT-5.4 mini and nano models, achieving over 2x speed improvements at costs as low as $0.20 per million tokens for efficient high-volume AI tasks

OpenAI has unveiled two new AI models, GPT-5.4 mini and GPT-5.4 nano, designed to enhance efficiency for high-volume tasks where speed and cost are critical. These models aim to incorporate several capabilities of the larger GPT-5.4 system into more agile formats, thereby improving response times significantly. The introduction of these models aligns with use cases where latency is crucial, including coding assistance, automated subagents, and real-time image processing applications.

The GPT-5.4 mini is positioned as a successor to its predecessor, GPT-5 mini, offering advancements in coding, reasoning, multimodal understanding, and tool utilization. OpenAI claims that this model operates over twice as fast as the earlier version and performs comparably to the larger GPT-5.4 model on specific benchmarks such as SWE-Bench Pro and OSWorld-Verified.

In contrast, GPT-5.4 nano is the smallest and most cost-effective variant in the new series. It is specifically tailored for lighter tasks including classification, data extraction, and ranking, while also supporting coding functions. According to OpenAI, this model represents a notable improvement over GPT-5 nano, emphasizing efficiency without compromising essential capabilities.

Both models are engineered for environments that demand quick and dependable outputs rather than maximum scale. Use cases include coding tools that require rapid responsiveness, systems interpreting screenshots, and applications reliant on real-time image analysis. OpenAI indicates that smaller models such as these can strike a better balance between performance and speed compared to their larger counterparts.

In coding workflows, OpenAI points out that both GPT-5.4 mini and nano are well-suited for tasks that benefit from swift iterations, including targeted edits, debugging, and navigating extensive codebases. Notably, the GPT-5.4 mini is reported to exceed the performance of GPT-5 mini while maintaining similar processing speeds, bringing it closer to the capabilities of the larger GPT-5.4 on various tests.

OpenAI has also underscored the significance of smaller models in multi-model systems. In setups like Codex, larger models may handle strategic planning and decision-making, while smaller models, such as GPT-5.4 mini, execute narrower tasks in parallel, such as searching codebases or processing documents. This design approach enables developers to allocate workloads more effectively, resulting in enhanced system performance.

Performance benchmarks reveal that the GPT-5.4 mini excels in multimodal tasks related to computer use, such as interpreting complex user interface screenshots. On the OSWorld-Verified benchmark, the model reportedly approaches the performance levels of GPT-5.4 while surpassing GPT-5 mini.

Developers can access the GPT-5.4 mini through OpenAI’s API, as well as within Codex and ChatGPT. In the API context, it supports both text and image inputs, tool utilization, function calling, web and file searching, and computer-based interactions, boasting a context window of 400,000 tokens. Pricing for this model is set at $0.75 per million input tokens and $4.50 per million output tokens.

Within the Codex environment, the GPT-5.4 mini is available across its app, command-line interface, IDE extension, and web interface. OpenAI states that it consumes approximately 30% of the GPT-5.4 usage quota, thereby allowing developers to tackle simpler tasks at a lower cost. Codex systems can also assign less complex assignments to GPT-5.4 mini while reserving more challenging tasks for larger models.

In ChatGPT, the GPT-5.4 mini is accessible to Free and Go users through the “Thinking” feature and serves as a fallback for GPT-5.4 Thinking in other tiers when usage limits are reached.

The GPT-5.4 nano model is currently only available via the API, with pricing set at $0.20 per million input tokens and $1.25 per million output tokens. As the landscape of AI continues to evolve, these new models from OpenAI are expected to play a critical role in enhancing efficiency and performance in various applications, ultimately shaping the future of AI-driven technologies.

See also
Staff
Written By

The AiPressa Staff team brings you comprehensive coverage of the artificial intelligence industry, including breaking news, research developments, business trends, and policy updates. Our mission is to keep you informed about the rapidly evolving world of AI technology.

You May Also Like

Top Stories

Microsoft considers legal action over Amazon's $50 billion cloud deal with OpenAI, raising stakes in the fierce AI competition and cloud dominance battle.

Top Stories

Microsoft considers legal action against Amazon and OpenAI over a $50 billion deal that threatens its Azure exclusivity with OpenAI's Frontier product.

AI Generative

OpenAI launches GPT-5.4 mini and nano, enhancing performance by over 100% at $0.75 and $0.20 per million tokens, revolutionizing cost-effective AI workflows.

AI Generative

OpenAI launches GPT-5.4 Mini and Nano models, delivering over 2x speed increase for free users, enhancing coding and multimodal processing efficiency.

Top Stories

Tesla plans a $35B-$45B investment in its Terafab project to produce 200M chips annually, aiming to lead in autonomous tech and robotics.

AI Generative

Generative AI users, including those leveraging OpenAI's ChatGPT, risk copyright liability as courts explore the legal implications of AI-generated content.

Top Stories

Anthropic hires a chemicals and explosives policy manager to bolster safety protocols for its Claude AI amid rising concerns over AI's role in weapon...

AI Technology

Interview Kickstart unveils its Advanced Generative AI Course to meet surging demand, equipping engineers with hands-on skills for AI-driven applications.

© 2025 AIPressa · Part of Buzzora Media · All rights reserved. This website provides general news and educational content for informational purposes only. While we strive for accuracy, we do not guarantee the completeness or reliability of the information presented. The content should not be considered professional advice of any kind. Readers are encouraged to verify facts and consult appropriate experts when needed. We are not responsible for any loss or inconvenience resulting from the use of information on this site. Some images used on this website are generated with artificial intelligence and are illustrative in nature. They may not accurately represent the products, people, or events described in the articles.