关于纯文本效果的一些问题

我使用开源的7B模型（https://huggingface.co/Junfeng5/Liquid_V1_7B/tree/main）
使用lm-evaluation-harness工具
测试纯文本任务的效果，得到以下结果

`/usr/local/python3/bin/lm-eval  --model hf \
    --model_args pretrained=/root/paddlejob/workspace/env_run/test_liquid/liquid_v1_7b/,dtype="float" \
    --tasks openbookqa,arc_easy,winogrande,hellaswag,arc_challenge,piqa,boolq \
    --device cuda:6 \
    --batch_size 8`

![Image](https://github.com/user-attachments/assets/02d120d9-f71f-4ad4-b193-decb3b791a01)

不知道我的测试方法哪里出了问题

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

关于纯文本效果的一些问题 #22

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

关于纯文本效果的一些问题 #22

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions