I have trained a model from scratch on my custom dataset. Now I want to add some more tags, and I don’t want to build a new model from scratch. Is there any way to retrain my previous model with the new dataset?
Since the number of tags has changed (increased), you will have to re-train the model from scratch (from pre-trained BERT weights). You can mix your previous dataset with the new one and merge their sets of labels.
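A minimal sketch of the merging step, assuming a hypothetical toy format where each dataset is a list of (tokens, tags) pairs (the real dataset format depends on your reader/iterator):

```python
# hypothetical toy format: each dataset is a list of (tokens, tags) pairs
old_data = [(["John", "lives", "here"], ["B-PER", "O", "O"])]
new_data = [(["Acme", "Corp"], ["B-ORG", "I-ORG"])]

# combined training data: simply concatenate the two datasets
combined = old_data + new_data

# merged label set: the union of tags seen in both datasets
labels = sorted({tag for _, tags in combined for tag in tags})
print(labels)  # ['B-ORG', 'B-PER', 'I-ORG', 'O']
```

The merged label list then defines the output size of the classification head when you re-train.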
Suppose my previous model is trained on 10K unique tags and I just want to add 1K new tags. Do I have to train it again with all 10K + 1K tags? Is there any way to train it with only the 1K new tags?
It is possible in theory, but it is not supported by DeepPavlov. You could reset the classification head, initialize it randomly, and then start training on the 1K new tags. This approach requires some manipulation of the model checkpoint and is not straightforward.
Can you give me an idea of how to do it? @yurakuratov
OK, the model consists of its body (transformer layers) and a classification head.
I need to note that this will break the prediction power of your model for all of your previous 10K tags. It will re-use some knowledge from the body, but will completely forget how to predict the old tags.
Here is an example of how to do something like this:
```python
from deeppavlov import build_model, configs

model = build_model(configs.ner.ner_ontonotes_bert_torch, download=True)

# take the pytorch part of the model
pt_model = model.pipe.model

# pt_model has a classifier head
pt_model.classifier
# which is Linear(in_features=768, out_features=YOUR_10K_TAGS, bias=True)
```
So, you can set a random classification head:

```python
pt_model.classifier = torch.nn.Linear(...)
```

and save `pt_model` separately. Then you can use this model as initialization for your training.
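To make the head-reset step concrete, here is a self-contained sketch. The `TokenClassifier` class below is a hypothetical stand-in for the DeepPavlov pytorch model (in practice you would use `pt_model = model.pipe.model` as above), and `NEW_NUM_TAGS` and the checkpoint filename are assumptions:

```python
import torch

# hypothetical stand-in for the DeepPavlov pytorch model;
# in practice: pt_model = model.pipe.model
class TokenClassifier(torch.nn.Module):
    def __init__(self, hidden_size=768, num_tags=10000):
        super().__init__()
        self.classifier = torch.nn.Linear(hidden_size, num_tags)

pt_model = TokenClassifier()

NEW_NUM_TAGS = 1000  # assumed size of the new tag set
hidden = pt_model.classifier.in_features  # 768 for BERT-base

# reset: replace the old 10K-way head with a randomly initialized 1K-way one
pt_model.classifier = torch.nn.Linear(hidden, NEW_NUM_TAGS)

# save the modified checkpoint to use as initialization for training
torch.save(pt_model.state_dict(), "model_with_new_head.pth")
```

The body weights are kept as-is; only the final linear layer is re-initialized, which is why the model forgets the old tags but can still re-use the body's representations.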
But I would definitely suggest training the model on the combined dataset.