Training multilingual NER model

I’m trying to train a multilingual NER model with my own data. I took the data for ner_ontonotes_bert_mult and added some of my own data (English texts). When I trained the model and tested it on Czech texts, it failed: it was much worse than the pretrained ner_ontonotes_bert_mult. I’m not sure how you trained the model. The dataset downloaded for ner_ontonotes_bert_mult seems to contain only English texts, but the documentation says that the model was trained on OntoNotes data and evaluated on Russian data. Does that mean that the training data was the OntoNotes data (English texts only) and the validation data was the Russian texts? Or what data was in the training and validation sets, please?

I also see the parameter freeze_embeddings in BertSequenceTagger. Should I set it to True if I don’t want to retrain BERT, e.g. if my training data is English only and I want to keep the model multilingual?

Thank you


ner_ontonotes_bert_mult was trained on English-only texts from the OntoNotes dataset. The validation set used during training was also from OntoNotes. The zero-shot transfer performance on Russian was measured after full training on OntoNotes.
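For comparison, you can load the released model and check its zero-shot behavior on your Czech texts directly. A sketch using the DeepPavlov CLI (the -d flag downloads the pretrained weights; exact output format depends on your DeepPavlov version):

```shell
# Install the config's requirements, download the pretrained model,
# then interact with it from the command line (one sentence per prompt).
python -m deeppavlov install ner_ontonotes_bert_mult
python -m deeppavlov interact ner_ontonotes_bert_mult -d
```

If the pretrained model handles your Czech sentences well here but your retrained model does not, the regression comes from your training setup rather than from the inference pipeline.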

The freeze_embeddings parameter can be used to keep the embeddings matrix of the BERT model frozen during training (all other BERT parameters are still trained). It might help with zero-shot transfer from one language to another.
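In the JSON config this would look roughly like the following. A sketch of the relevant component entry only, inside "chainer" → "pipe"; all other keys of the real ner_ontonotes_bert_mult config stay unchanged, and the registered class name is an assumption based on the component you mention:

```json
{
  "class_name": "bert_sequence_tagger",
  "freeze_embeddings": true
}
```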

Some questions that might help solve your problem:
- Could you provide more details on how you define the failure?
- Did you train the model from scratch? Did you use only your own data, or did you add your data to the OntoNotes dataset?
- Could you re-train ner_ontonotes_bert_mult from scratch to check that it matches the performance reported on the NER documentation page, to rule out a reproducibility problem?
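For the last check, retraining can be started from the CLI. A sketch, assuming the default hyperparameters of the config (this needs a GPU and takes a while; you may also need to remove previously downloaded model checkpoints so that training really starts from scratch rather than from the released weights):

```shell
# -d downloads the data and files listed in the config's metadata
# before training begins.
python -m deeppavlov train ner_ontonotes_bert_mult -d
```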