Skip to content

add the files about processing DUUIE data#5

Open
Bloodmaster06 wants to merge 1 commit intoAntNLP:masterfrom
Bloodmaster06:master
Open

add the files about processing DUUIE data#5
Bloodmaster06 wants to merge 1 commit intoAntNLP:masterfrom
Bloodmaster06:master

Conversation

@Bloodmaster06
Copy link
Collaborator

add the files about processing DUUIE data

@@ -0,0 +1,12 @@
plm_name="/mnt/data1/public/pretrain/mengzi-bert-base-fin"
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. DUEE_FIN_LITE放到data/event 目录下
  2. 增加README.md, 说明整个过程,包括命令行参数怎么用(参考 data/entity-relation/README.md)
  3. 这里需要plm做tokenizer? 如果是,在README.md里增加说明

# python ./transfer.py transfer DUEE_FIN_LITE/00-raw/dev.json DUEE_FIN_LITE/01-change-fields/dev.json
python ./transfer.py transfer DUEE_FIN_LITE/00-raw/test.json DUEE_FIN_LITE/01-change-fields/test.json

# python ./process.py process DUEE_FIN_LITE/01-change-fields/train.json DUEE_FIN_LITE/02-matrix/train.json ${plm_name} DUEE_FIN_LITE/ent_rel_file.json
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ent_rel_file.json 这个文件放在在02-matrix目录下,命名为train.json, dev.json, test.json.
另外,如果要放转换好的文件,那么01, 02两个目录都放进来。也可以不放文件,只放脚本。

@@ -0,0 +1,117 @@
import json
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这里能否直接用entity-relation/process.py?或者import过来?

sublist_1 = full_list[:offset]
sublist_2 = full_list[offset:]
return sublist_1,sublist_2
def transfer(input, output):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

函数太长。
每个部分在做什么不清晰

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants