Skip to main content

Phi-4: Microsoft's Small Language Models with Advanced Reasoning

ยท 2 min read
Alexander Carrington
COO of Neuronic AI
Phi-4

๐Ÿง  Microsoft Launches Phi-4 Reasoning Modelsโ€‹

Microsoft has released Phi-4-reasoning, Phi-4-reasoning-plus, and Phi-4-mini-reasoning โ€” marking a new era for small language models. These 14B and 3.8B parameter models achieve performance comparable to much larger models through advanced reinforcement learning and inference-time scaling.

๐Ÿš€ Key Features of Phi-4

  • Advanced Reasoning
    โ†’ Inference-time scaling for complex multi-step problems
    โ†’ Chain-of-thought reasoning with detailed explanations
    โ†’ Trained on high-quality reasoning demonstrations

  • Exceptional Efficiency
    โ†’ Phi-4-reasoning: 14B parameters rivaling much larger models
    โ†’ Phi-4-mini-reasoning: 3.8B parameters for edge deployment
    โ†’ Lower computational costs and faster inference

  • Superior Performance
    โ†’ Better than OpenAI o1-mini and DeepSeek-R1-Distill-Llama-70B
    โ†’ AIME 2025: 78% (Phi-4-reasoning-plus) vs DeepSeek-R1's performance
    โ†’ Outperforms models 5x larger on reasoning benchmarks

  • Open Weight Models
    โ†’ MIT license for maximum deployment flexibility
    โ†’ Available on Azure AI Foundry and Hugging Face
    โ†’ No licensing fees for commercial use

  • Multiple Variants
    โ†’ Phi-4-reasoning: Base 14B reasoning model
    โ†’ Phi-4-reasoning-plus: Enhanced with RL for higher accuracy
    โ†’ Phi-4-mini-reasoning: Compact 3.8B model for resource-constrained environments

๐Ÿ“ก Performance & Benchmarks

  • Mathematical Excellence
    โ†’ AIME 2024: 75.3% (reasoning), 81.3% (reasoning-plus)
    โ†’ GPQA Diamond: 65.8% (reasoning), 68.9% (reasoning-plus)
    โ†’ OmniMath: 76.6% (reasoning), 81.9% (reasoning-plus)

  • Coding Capabilities
    โ†’ HumanEvalPlus: 92.9% (reasoning), 92.3% (reasoning-plus)
    โ†’ Strong performance on programming tasks
    โ†’ Effective code generation and debugging

  • General Reasoning
    โ†’ Excellent instruction following and alignment
    โ†’ Strong performance across diverse problem types
    โ†’ Effective multi-step problem decomposition

๐Ÿ› ๏ธ Available now โ€” plug in, route smart, and start building see it on our ๐Ÿ‘‰ Dashboard

Learn more: Microsoft Azure Phi-4 Blog