MiniMax / minimax-m2

MiniMax M2: High-efficiency 230B parameter model with 200K context, optimized for coding and agentic workflows. Cloud-optimized with superior intelligence and agentic performance.

Models

| Model | Size | Context Length | Input Modalities | Activated Parameters |
| --- | --- | --- | --- | --- |
| minimax-m2 | 230B | 200K | Text | 10B |

MiniMax M2: High-Efficiency Coding and Agentic Model

MiniMax M2 is a high-efficiency large language model engineered specifically for coding and agentic workflows. With 230 billion total parameters (10 billion activated), M2 delivers exceptional performance in software development tasks while maintaining high efficiency, low latency, and cost-effective deployment.

Key Features

  • Superior Intelligence: Ranks #1 among open-source models globally on Artificial Analysis composite intelligence benchmarks
  • Advanced Coding: Engineered for end-to-end developer workflows with multi-file editing and test-validated repairs
  • Agentic Performance: Excels at planning and executing complex, long-horizon toolchains across shell, browser, and code runners
  • Efficient Design: 10B activated parameters from 230B total for optimal performance-to-cost ratio
  • Long Context: 200K token context window for comprehensive codebase understanding
  • Recovery Capabilities: Graceful recovery from flaky steps in complex workflows
  • Evidence Traceability: Maintains clear evidence chains for agentic decision making
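
As a rough illustration of what the 200K-token window means for codebase work, the sketch below estimates whether a set of source files fits in one prompt. The 4-characters-per-token ratio is a common rule of thumb and an assumption here, not M2's actual tokenizer. The helper names and the output reserve are hypothetical, and 200K is treated as 200,000 tokens.

```python
from typing import Dict

CONTEXT_WINDOW = 200_000   # tokens, from this model card (200K)
CHARS_PER_TOKEN = 4        # rough heuristic (assumption), not M2's tokenizer


def estimate_tokens(text: str) -> int:
    """Crude token estimate: character count divided by the heuristic, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(files: Dict[str, str], reserve_for_output: int = 8_000) -> bool:
    """Return True if the estimated prompt size still leaves room for the reply."""
    used = sum(estimate_tokens(src) for src in files.values())
    return used + reserve_for_output <= CONTEXT_WINDOW


# Example: two in-memory "files" keyed by path.
small_repo = {"app.py": "x" * 400_000}      # estimated 100,000 tokens
large_repo = {"bundle.js": "x" * 800_000}   # estimated 200,000 tokens
print(fits_in_context(small_repo), fits_in_context(large_repo))
```

For real budgeting, replace `estimate_tokens` with a count from the tokenizer your provider exposes.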

Model Variants

| Name | Size | Context | Input Modalities | Activated Parameters | Description |
| --- | --- | --- | --- | --- | --- |
| minimax-m2 | 230B | 200K | Text | 10B | Cloud-optimized high-efficiency model |

Technical Capabilities

Coding Excellence

MiniMax M2 delivers exceptional performance across the software development lifecycle:

  • Multi-File Editing: Comprehensive codebase modifications and refactoring
  • Coding-Run-Fix Loops: End-to-end development workflows with execution and debugging
  • Test-Validated Repairs: Automated testing and validation of code changes
  • Language Support: Strong performance across multiple programming languages
  • IDE Integration: Optimized for terminal, IDE, and CI/CD workflows
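
The code-run-fix loop described above can be sketched as a small harness around the model. This is a minimal illustration, not MiniMax's implementation: `propose_fix` and `run_tests` are hypothetical callables standing in for a model call and a test runner, and the stubs at the bottom exist only to make the example runnable.

```python
from typing import Callable, Optional, Tuple


def code_run_fix(
    propose_fix: Callable[[str, str], str],        # hypothetical: (source, test_log) -> new source
    run_tests: Callable[[str], Tuple[bool, str]],  # hypothetical: source -> (passed, log)
    source: str,
    max_rounds: int = 3,
) -> Optional[str]:
    """Return source that passes the tests, or None if the repair budget runs out."""
    for _ in range(max_rounds):
        passed, log = run_tests(source)
        if passed:
            return source
        # Feed the failing log back so the next proposal is test-validated.
        source = propose_fix(source, log)
    passed, _ = run_tests(source)
    return source if passed else None


# Stubs for illustration only: a "test suite" and a "model" that fixes one bug.
def fake_tests(src: str) -> Tuple[bool, str]:
    ok = "return a + b" in src
    return ok, "" if ok else "AssertionError: add(2, 3) != 5"


def fake_model(src: str, log: str) -> str:
    return src.replace("a - b", "a + b")


fixed = code_run_fix(fake_model, fake_tests, "def add(a, b):\n    return a - b\n")
print(fixed)
```

Capping the number of rounds keeps a model that cannot repair the code from looping indefinitely.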

Agentic Intelligence

  • Complex Toolchains: Planning and execution across shell, browser, retrieval, and code runners
  • Long-Horizon Tasks: Effective handling of multi-step, complex workflows
  • Web Browsing: Advanced web exploration and information retrieval
  • Recovery Mechanisms: Graceful handling of failures and flaky steps
  • Evidence Tracking: Maintains traceable evidence chains for decision making
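
One way an orchestration layer might pair recovery from flaky steps with evidence tracking is to retry each tool call and record every attempt. The sketch below assumes a generic harness. The function name, the log record fields, and the backoff policy are all illustrative choices and do not describe M2's internal behavior.

```python
import time
from typing import Any, Callable, Dict, List


def run_step_with_retry(
    name: str,
    step: Callable[[], Any],
    evidence: List[Dict[str, Any]],
    max_attempts: int = 3,
    base_delay: float = 0.5,
) -> Any:
    """Run one tool step, retrying on failure and logging every attempt."""
    for attempt in range(1, max_attempts + 1):
        try:
            result = step()
        except Exception as exc:
            evidence.append({"step": name, "attempt": attempt, "ok": False, "detail": repr(exc)})
            if attempt == max_attempts:
                raise  # budget exhausted: surface the last error to the caller
            time.sleep(base_delay * 2 ** (attempt - 1))  # exponential backoff
        else:
            evidence.append({"step": name, "attempt": attempt, "ok": True, "detail": repr(result)})
            return result


# Illustration: a step that fails twice, then succeeds.
calls = {"n": 0}


def flaky_fetch() -> str:
    calls["n"] += 1
    if calls["n"] < 3:
        raise TimeoutError("page did not load")
    return "<html>ok</html>"


log: List[Dict[str, Any]] = []
page = run_step_with_retry("browser.open", flaky_fetch, log, base_delay=0.0)
for entry in log:
    print(entry)
```

The `evidence` list doubles as an audit trail: each decision downstream can cite the step and attempt that produced its input.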

Efficiency Optimization

  • Parameter Efficiency: 10B activated parameters from 230B total
  • Latency Optimization: Low-latency performance for interactive applications
  • Cost Efficiency: High throughput for batched sampling and deployment
  • Deployment Flexibility: Optimized for cloud and edge deployment scenarios
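
The parameter-efficiency figure can be made concrete with a little arithmetic. The sketch below uses only the two numbers stated in this document (230B total, 10B activated); the function name is illustrative. Reading the ratio as a proxy for per-token compute assumes, as is typical for sparsely activated models, that compute tracks the activated parameter count.

```python
# Figures taken from this model card.
TOTAL_PARAMS = 230e9    # 230B total parameters
ACTIVE_PARAMS = 10e9    # 10B parameters activated


def activation_ratio(active: float, total: float) -> float:
    """Fraction of the model's parameters that are activated."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


ratio = activation_ratio(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Activated fraction: {ratio:.1%}")  # about 4.3% of the weights
```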

Benchmark Performance

Coding & Agentic Benchmarks

The table below compares MiniMax M2 with leading proprietary and open models on coding and agentic evaluations (higher is better):

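| Benchmark | MiniMax-M2 | Claude Sonnet 4 | Claude Sonnet 4.5 | Gemini 2.5 Pro | GPT-5 (thinking) | GLM-4.6 | DeepSeek-V3.2 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| SWE-bench Verified | 69.4 | 72.7* | 77.2* | 63.8* | 74.9* | 68* | 67.8* |
| Multi-SWE-Bench | 36.2 | 35.7* | 44.3 | / | / | 30 | 30.6 |
| SWE-bench Multilingual | 56.5 | 56.9* | 68 | / | / | 53.8 | 57.9* |
| Terminal-Bench | 46.3 | 36.4* | 50* | 25.3* | 43.8* | 40.5* | 37.7* |
| ArtifactsBench | 66.8 | 57.3* | 61.5 | 57.7* | 73* | 59.8 | 55.8 |
| BrowseComp | 44 | 12.2 | 19.6 | 9.9 | 54.9* | 45.1* | 40.1* |
| BrowseComp-zh | 48.5 | 29.1 | 40.8 | 32.2 | 65 | 49.5 | 47.9* |
| GAIA (text only) | 75.7 | 68.3 | 71.2 | 60.2 | 76.4 | 71.9 | 63.5 |
| xbench-DeepSearch | 72 | 64.6 | 66 | 56 | 77.8 | 70 | 71 |
| τ²-Bench | 77.2 | 65.5* | 84.7* | 59.2 | 80.1* | 75.9* | 66.7 |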
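
To read the comparison more systematically, the sketch below computes where MiniMax-M2 places on each benchmark among the models with a reported score. The numbers are copied from the table above, with `None` standing in for the "/" cells. The ranking rule (one plus the number of strictly higher scores) is a choice made for this illustration.

```python
from typing import Dict, List, Optional

MODELS = ["MiniMax-M2", "Claude Sonnet 4", "Claude Sonnet 4.5", "Gemini 2.5 Pro",
          "GPT-5 (thinking)", "GLM-4.6", "DeepSeek-V3.2"]

# One row per benchmark, in MODELS order; None marks a "/" cell.
SCORES: Dict[str, List[Optional[float]]] = {
    "SWE-bench Verified":     [69.4, 72.7, 77.2, 63.8, 74.9, 68, 67.8],
    "Multi-SWE-Bench":        [36.2, 35.7, 44.3, None, None, 30, 30.6],
    "SWE-bench Multilingual": [56.5, 56.9, 68, None, None, 53.8, 57.9],
    "Terminal-Bench":         [46.3, 36.4, 50, 25.3, 43.8, 40.5, 37.7],
    "ArtifactsBench":         [66.8, 57.3, 61.5, 57.7, 73, 59.8, 55.8],
    "BrowseComp":             [44, 12.2, 19.6, 9.9, 54.9, 45.1, 40.1],
    "BrowseComp-zh":          [48.5, 29.1, 40.8, 32.2, 65, 49.5, 47.9],
    "GAIA (text only)":       [75.7, 68.3, 71.2, 60.2, 76.4, 71.9, 63.5],
    "xbench-DeepSearch":      [72, 64.6, 66, 56, 77.8, 70, 71],
    "τ²-Bench":               [77.2, 65.5, 84.7, 59.2, 80.1, 75.9, 66.7],
}


def rank_of(model: str, row: List[Optional[float]]) -> int:
    """Rank among reported scores: 1 + number of models scoring strictly higher."""
    own = row[MODELS.index(model)]
    if own is None:
        raise ValueError(f"no score reported for {model}")
    return 1 + sum(1 for s in row if s is not None and s > own)


ranks = {bench: rank_of("MiniMax-M2", row) for bench, row in SCORES.items()}
for bench, rank in ranks.items():
    print(f"{bench:24s} rank {rank}")
```

Because the table mixes scores from official reports (asterisked) with independently measured ones, the ranks are indicative rather than a controlled comparison.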

Intelligence Benchmarks

Artificial Analysis composite intelligence scores across math, science, instruction following, coding, and agentic tool use:

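| Metric (AA) | MiniMax-M2 | Claude Sonnet 4 | Claude Sonnet 4.5 | Gemini 2.5 Pro | GPT-5 (thinking) | GLM-4.6 | DeepSeek-V3.2 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| AIME25 | 78 | 74 | 88 | 88 | 94 | 86 | 88 |
| MMLU-Pro | 82 | 84 | 88 | 86 | 87 | 83 | 85 |
| GPQA-Diamond | 78 | 78 | 83 | 84 | 85 | 78 | 80 |
| HLE (w/o tools) | 12.5 | 9.6 | 17.3 | 21.1 | 26.5 | 13.3 | 13.8 |
| LiveCodeBench (LCB) | 83 | 66 | 71 | 80 | 85 | 70 | 79 |
| SciCode | 36 | 40 | 45 | 43 | 43 | 38 | 38 |
| IFBench | 72 | 55 | 57 | 49 | 73 | 43 | 54 |
| AA Intelligence | 61 | 57 | 63 | 60 | 69 | 56 | 57 |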

Note: Data points marked with an asterisk (*) are taken from official model reports or blogs. All other metrics follow Artificial Analysis evaluation methodologies.

Use Cases

Software Development

  • End-to-End Coding: Complete software development lifecycle support
  • Code Refactoring: Intelligent codebase modernization and optimization
  • Test-Driven Development: Automated test generation and validation
  • Debugging: Advanced bug detection and repair workflows
  • Multi-Language Support: Consistent performance across programming languages

Agentic Workflows

  • Enterprise Automation: Complex business process automation
  • Research Assistance: Advanced information retrieval and synthesis
  • Web Exploration: Intelligent web browsing and data collection
  • Tool Integration: Seamless integration with development and productivity tools
  • Long-Horizon Tasks: Complex, multi-step workflow management

Development Operations

  • CI/CD Optimization: Continuous integration and deployment automation
  • Infrastructure as Code: Automated infrastructure provisioning and management
  • DevOps Automation: End-to-end development operations support
  • Monitoring and Alerting: Intelligent system monitoring and incident response

Research and Innovation

  • Scientific Computing: Advanced algorithm development and implementation
  • Data Analysis: Comprehensive data processing and analysis
  • Hypothesis Testing: Automated research workflows and validation
  • Literature Review: Intelligent research paper analysis and synthesis

Getting Started

The MiniMax M2 cloud model is available through various API providers. For more information: