Thank you for your outstanding contributions to the community!
I would like to ask whether, when compared with other works, the same prompt was directly used to re-experiment on open-source models from other works, or whether the results from other works' papers were directly used? (Because I have seen that many other works only conduct experiments targeting a specific tool, rather than this multi-tool selection and invocation.)😎