Pytorch classifier error AttributeError: 'Linear' object has no attribute 'out_proj'

papadako · September 9, 2021, 9:14am

Hello,

after upgrading to deeppavlov 0.16, I get the following error with some models that were previously working (e.g. sentence-transformers/distilbert-base-nli-stsb-mean-tokens, sentence-emb/bert-base-nli-mean-tokens).

Traceback (most recent call last):
  File "blah ... deeppavlov/models/torch_bert/torch_transformers_classifier.py", line 215, in load
    hidden_size = self.model.classifier.out_proj.in_features
  File "blah ...  torch/nn/modules/module.py", line 1131, in __getattr__
    type(self).__name__, name))
AttributeError: 'Linear' object has no attribute 'out_proj'

On the other hand models like all-mpnet-base-v2 are working.

Any ideas?

slowwavesleep · September 9, 2021, 9:51am

Hi,

The underlying issue here is that, currently, huggingface models that were pre-trained for a specific task (other than language modeling, that is) don’t provide a way to change the task head without resetting the weights of the entire model. For example, you can’t easily take a 2-class classification model and use it for a problem with 3 classes.

So for the lack of a more elegant solution, we currently have a try/except block there. The model that you’re using appears to be throwing a different type of error for some reason. If you want a quick solution (i.e. not waiting for the next release etc.), you can add AttributeError to the except block here. I think that should do it.

papadako · September 9, 2021, 10:07am

Thank you for the reply!

Topic		Replies	Views
Question about build_model DeepPavlov Library	14	312	October 31, 2023
Model download problem ner_ontonotes_bert_mult_torch	4	660	January 27, 2022
Ошибка в версии DeepPavlov-1.0.0rc1 Models	0	319	September 22, 2022
Deeppavlov 0.15 and latest transformers DeepPavlov Library	1	352	July 15, 2021
Get an error on train and evaluate DeepPavlov Library	0	24	August 2, 2024

Pytorch classifier error AttributeError: 'Linear' object has no attribute 'out_proj'

Related topics