You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I wanted to reproduce the results in the paper and downloaded the dataset from https://xglue.blob.core.windows.net/xglue/xglue_full_dataset.tar.gz
However, I found that the label of the NER dataset is quite different from CoNLL 03.
For example, for instance "Peter Blackburn", in the "en.train" file, the labels are:
Peter I-PER
Blackburn I-PER
While in the original CoNLL 03 file, the labels are:
Peter B-PER
Blackburn I-PER
Briefly, I found that almost all the labels that start with 'B' are replaced with 'I', and only a few labels starting with 'B' still remained in the NER dataset of xglue.
Therefore, I wonder if there is any problem with the NER dataset in xglue.
Hi,
I wanted to reproduce the results in the paper and downloaded the dataset from https://xglue.blob.core.windows.net/xglue/xglue_full_dataset.tar.gz
However, I found that the label of the NER dataset is quite different from CoNLL 03.
For example, for instance "Peter Blackburn", in the "en.train" file, the labels are:
While in the original CoNLL 03 file, the labels are:
Briefly, I found that almost all the labels that start with 'B' are replaced with 'I', and only a few labels starting with 'B' still remained in the NER dataset of xglue.
Therefore, I wonder if there is any problem with the NER dataset in xglue.