You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We provide the thinking chain text generated by ChatGPT/GPT4 through ICL on the MATH training set (8 for each piece of data), which are saved in the data folder.
We also provide the code for fine-tuning llama on these datasets, as follows:
step1:
prepare llama-7b checkpoint and store it in the main directory
step2:
prepare conda environment following requirements.txt
step3:
conda activate llm
step4:
finetune
bash finetune.sh
step5:
infer
bash infer.sh
step6:
eval
bash eval.sh
The following are related papers and work we have organized:
一、Involves distillation on mathematical reasoning tasks
4. ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models(Zhipeng Chen, Kun Zhou, Beichen Zhang, Zheng Gong, Wayne Xin Zhao, Ji-Rong Wen)
6.CREATOR: Disentangling Abstract and Concrete Reasonings of Large Language Models through Tool Creation(Cheng Qian, Chi Han, Yi R. Fung, Yujia Qin, Zhiyuan Liu, Heng Ji)
7.An Empirical Study on Challenging Math Problem Solving with GPT-4 (Yiran Wu, Feiran Jia, Shaokun Zhang, Hangyu Li, Erkang Zhu, Yue Wang, Yin Tat Lee, Richard Peng, Qingyun Wu, Chi Wang)