How does MiMo-7B-RL perform specifically in mathematical reasoning tasks?

AIME 2024: 68.21 TP3T Pass@1 (modeled first round answers correct)
AIME 2025: 55.41 TP3T Pass@1
MATH-500: : 95.8% Pass@1

2025-08-23

1.6 K

Mathematical Reasoning Performance Evaluation

MiMo-7B-RL has shown excellent performance on several international math competition datasets:

These results suggest that the model is capable of:

best practice::

set up temperature=0.6 Balancing Quality and Diversity of Answers
Problem descriptions should be as clear and complete as possible, and complex problems can be entered in segments
Suitable for AMC/AIME and other competition training, college math teaching support and other scenarios

Tests have shown its performance to be comparable to larger commercial models such as OpenAI o1-mini.