| Name | Paper / Project | Description |
|---|---|---|
| MMBench | MM-Agent | A benchmark for evaluating LLMs' capabilities in real-world mathematical modeling, covering problem formulation, variable definition, and solution analysis. |
| ModelingBench | ModelingAgent | A benchmark designed to assess LLMs' performance across the complete mathematical modeling pipeline, from problem understanding to model construction and evaluation. |
| Mamo | Mamo | A benchmark focusing on the gap between natural language problem descriptions and formal mathematical language, evaluating LLMs' ability in model formulation and symbolic reasoning. |