AI Generative

Microsoft Open Sources Phi-4-Reasoning-Vision-15B Model for Efficient Multimodal Tasks

Microsoft open-sources the Phi-4-reasoning-vision-15B model, featuring 15 billion parameters for high-performance multi-modal tasks at a fraction of typical costs.

Staff

Published

13 April, 2026

Microsoft open-sources the Phi-4-reasoning-vision-15B model, featuring 15 billion parameters for high-performance multi-modal tasks at a fraction of typical costs.

Microsoft has officially open-sourced its latest multi-modal reasoning model, Phi-4-reasoning-vision-15B. With a parameter scale of 15 billion, this model strikes a balance between high performance and low cost while maintaining a lightweight design, making it a viable option for complex visual tasks in resource-constrained environments.

In contrast to prevailing industry models that typically rely on trillions of tokens for training, Phi-4-reasoning-vision was trained using only 200 billion multi-modal tokens. The development team focused on data quality, employing techniques such as deep cleaning of open-source data, the generation of targeted synthetic data, and a meticulous domain data ratio. This included an increase in math data to enhance its capabilities in scientific reasoning and screen positioning tasks.

A standout feature of this model is its innovative hybrid reasoning path design. For simpler tasks like image description and optical character recognition (OCR), the model defaults to a direct answer mode, effectively minimizing latency. In contrast, for more complex reasoning tasks that involve mathematical formulas and scientific charts, it automatically engages a structured chain-of-thought (CoT) path to ensure answer accuracy. Users also have the option to manually switch between these two modes using specific guiding words, allowing for adaptability in various scenarios.

Another notable aspect is the integration of the SigLIP-2 dynamic resolution encoder, which enhances the model’s perception capabilities when dealing with small elements in high-resolution screenshots. This makes Phi-4-reasoning-vision an excellent choice for developing computer operation assistants (CUA), capable of accurately identifying and interacting with buttons and input fields on both web and mobile interfaces.

Currently, the Phi-4-reasoning-vision-15B model is available on multiple open-source platforms. Microsoft aims to demonstrate that in the multi-modal AI field, the concepts of “smaller and faster” can coexist with “stronger,” thereby promoting the growth of spatial intelligence and real-time interaction technologies. As AI continues to evolve, the implications of such advancements could significantly influence the development of user-friendly interfaces and smart assistants, potentially reshaping the landscape of how individuals interact with technology.

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

Anthropic's Mythos exposes thousands of critical vulnerabilities in major systems, prompting $100M in defensive action from tech giants and U.S. banks.

Rachel Torres3 May, 2026

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

US Department of Defense partners with tech giants including SpaceX and OpenAI to launch an "AI-first" initiative aimed at enhancing military decision-making efficiency.

Staff3 May, 2026

AI Business

Iren’s 1.6GW Oklahoma Site Boosts AI Potential, But Nebius Secures $27B in New Deals

Iren's new 1.6GW site in Oklahoma enhances its AI data center capacity, while Nebius secures $27B in deals, raising stakes in the competitive neocloud...

Marcus Chen2 May, 2026

Apple, Google, and Amazon Shine Post-Earnings as AI Demand Reshapes Tech Landscape

Apple's Q2 earnings reveal a price hike for the Mac mini to $799, fueled by AI memory demand, as Google and Amazon also report...

Staff2 May, 2026

AI Technology

Vertiv Reports 83% Earnings Growth Amid $15B AI Data Center Demand Surge

Vertiv reports an 83% earnings growth, driven by a $15 billion project backlog fueled by soaring demand for AI data center infrastructure.

Staff2 May, 2026

AI Government

Nearly All States Pilot AI, Yet Only 7 Have Established Evaluation Mechanisms

Only seven states have implemented effective evaluation mechanisms for AI, despite nearly all initiating pilot projects, highlighting a critical gap in public sector accountability.

Staff1 May, 2026

AI Technology

Big Tech to Invest $3.7 Trillion in AI Infrastructure, Surpassing Historic Rail Expansion

Major tech giants, including Google and Amazon, are set to invest $3.7 trillion in AI infrastructure over five years, reshaping the workforce and economy.

Staff1 May, 2026

AI Cybersecurity

Australia Post Partners with Alpha Level to Enhance Cybersecurity with AI Machine Learning

Australia Post partners with Alpha Level to enhance cybersecurity, utilizing machine learning to analyze 4 billion monthly data points for improved threat detection.

Rachel Torres1 May, 2026

AIPRESSA.COM

AI Generative

Microsoft Open Sources Phi-4-Reasoning-Vision-15B Model for Efficient Multimodal Tasks

Trending

Top Stories

Albania Appoints AI Bot Minister Diella Amid Corruption Concerns and EU Membership Goals

AI Government

BigBear.ai Launches Biometric Platform at O’Hare, Acquires Generative AI Ask Sage for $250M

AI Cybersecurity

Endpoint Security Market to Reach $23.9B by 2030 with 7.2% CAGR Amid Rising Cyber Threats

AI Business

Enterprise Architecture Shifts to Strategic Enabler in AI-Driven Business Models

AI Research

Amazon Awards 63 Research Grants to 41 Universities Across 8 Countries for AI Innovation

You May Also Like

AI Cybersecurity

Anthropic’s Mythos Reveals Thousands of Vulnerabilities, Banks Prepare for AI Cyberattacks

AI Government

US Defense Partners with Anthropic, OpenAI, and Tech Giants for AI-First Military Initiative

AI Business

Iren’s 1.6GW Oklahoma Site Boosts AI Potential, But Nebius Secures $27B in New Deals

Top Stories

Apple, Google, and Amazon Shine Post-Earnings as AI Demand Reshapes Tech Landscape

AI Technology

Vertiv Reports 83% Earnings Growth Amid $15B AI Data Center Demand Surge

AI Government

Nearly All States Pilot AI, Yet Only 7 Have Established Evaluation Mechanisms

AI Technology

Big Tech to Invest $3.7 Trillion in AI Infrastructure, Surpassing Historic Rail Expansion

AI Cybersecurity

Australia Post Partners with Alpha Level to Enhance Cybersecurity with AI Machine Learning