| Name | Paper / Project | Description |
|---|---|---|
| MMBench | MM-Agent | A benchmark for evaluating LLMs' capabilities in real-world mathematical modeling, covering problem formulation, variable definition, and solution analysis. |
| ModelingBench | ModelingAgent | A benchmark designed to assess LLMs' performance across the complete mathematical modeling pipeline, from problem understanding to model construction and evaluation. |
| Mamo | Mamo | A benchmark focusing on the gap between natural language problem descriptions and formal mathematical language, evaluating LLMs' ability in model formulation and symbolic reasoning. |
-
Notifications
You must be signed in to change notification settings - Fork 0
🥇 A curated list of awesome Large Language Models/Agents for Mathematical Modeling tasks, including papers,models,datasets and codebases. 专门用于数å¦å»ºæ¨¡ä»»åŠ¡çš„å¤§æ¨¡åž‹/Agent。
License
DataArcTech/Awesome-LLMs-for-Mathematical-Modeling
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
 |  | |||
 |  | |||
 |  | |||
Repository files navigation
About
🥇 A curated list of awesome Large Language Models/Agents for Mathematical Modeling tasks, including papers,models,datasets and codebases. 专门用于数å¦å»ºæ¨¡ä»»åŠ¡çš„å¤§æ¨¡åž‹/Agent。
Resources
License
Stars
Watchers
Forks
Releases
No releases published