Dear author, hello! While trying to improve the model code, I attempted to replace BERT with ALBERT.
I changed
BERT_MODEL = 'bert-base-uncased'
tokenizer = BertTokenizer.from_pretrained(BERT_MODEL, do_lower_case=True)
to
tokenizer = BertTokenizer.from_pretrained("./albert_base")
BERT_MODEL = BertModel.from_pretrained("./albert_base")
and then got the following error:
File "train.py", line 158, in main
bundles.append(convert_question_to_samples_bundle(tokenizer, data))
File "/home/shao/CogQA/data.py", line 187, in convert_question_to_samples_bundle
ids.append(tokenizer.convert_tokens_to_ids(tokenized_all))
File "/home/shao/anaconda3/envs/cogqa/lib/python3.6/site-packages/pytorch_pretrained_bert/tokenization.py", line 121, in convert_tokens_to_ids
ids.append(self.vocab[token])
KeyError: '[CLS]'
What could be causing this during data loading? I look forward to your reply!
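
One check that might help narrow this down, as a rough sketch only: the traceback ends in a plain dictionary lookup (`self.vocab[token]`), so the loaded vocabulary apparently has no entry whose key is exactly `[CLS]`. The code below assumes the tokenizer builds its vocabulary from a one-token-per-line text file, and it uses made-up file names and toy vocab contents; `load_line_vocab`, `missing_special_tokens`, and `write_lines` are hypothetical helpers, not part of `pytorch_pretrained_bert`. It shows how a vocab file in a `token<TAB>score` layout (an assumption about what `./albert_base` might contain, not something confirmed here) would leave `[CLS]` missing.

```python
import collections
import os
import tempfile

# Special tokens a BERT-style pipeline looks up by name (standard BERT set).
SPECIAL_TOKENS = ("[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]")


def load_line_vocab(vocab_path):
    """Read a text vocab file, mapping each stripped line to its line index."""
    vocab = collections.OrderedDict()
    with open(vocab_path, "r", encoding="utf-8") as reader:
        for index, line in enumerate(reader):
            vocab[line.strip()] = index
    return vocab


def missing_special_tokens(vocab, required=SPECIAL_TOKENS):
    """Return the required tokens that are not exact keys of the vocab."""
    return [token for token in required if token not in vocab]


def write_lines(directory, name, lines):
    """Write toy vocab lines to a file and return its path (demo helper)."""
    path = os.path.join(directory, name)
    with open(path, "w", encoding="utf-8") as writer:
        writer.write("\n".join(lines) + "\n")
    return path


with tempfile.TemporaryDirectory() as tmp:
    # Toy file in the one-token-per-line layout.
    bert_like = write_lines(
        tmp, "bert_like.txt",
        ["[PAD]", "[UNK]", "[CLS]", "[SEP]", "[MASK]", "the"],
    )
    # Toy file in a hypothetical "token<TAB>score" layout.
    tabbed = write_lines(
        tmp, "tabbed.txt",
        ["<pad>\t0", "<unk>\t0", "[CLS]\t0", "[SEP]\t0", "[MASK]\t0", "the\t-3.2"],
    )
    print(missing_special_tokens(load_line_vocab(bert_like)))
    # prints []
    print(missing_special_tokens(load_line_vocab(tabbed)))
    # prints ['[PAD]', '[UNK]', '[CLS]', '[SEP]', '[MASK]']
```

In the second case each key becomes the whole line (for example `[CLS]` followed by a tab and a score), so a lookup of `[CLS]` alone raises `KeyError`. Running the same check against the real vocab file in `./albert_base` would show whether this is what happens here or whether the cause lies elsewhere.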