Get all model words

evgeniarubanova · January 20, 2020, 9:44am

Hello! I have a question about the methods applied to models.

Is there any method with which you can get a list of all the words in the model? (Suppose I want to get all the words from “ru_syntagrus_joint_parsing”.) Or is there a method that immediately shows whether the word being processed is present in the dictionary or not?

Thanks in advance!

AlexeySorokin · January 20, 2020, 10:36am

I do not understand what do you mean by all words in the model. The lemmatization part is done on the basis of pymorphy analyzer, which can process out-of-vocabulary words as well. Tagging and parsing components does not use dictionaries in any form.

evgeniarubanova · January 20, 2020, 11:10am

I mean, I need to find out whether this is an out-of-vocabulary word or nor. For example, pymorphy labels FakeDictionary on out-of-vocabulary words.

AlexeySorokin · January 20, 2020, 11:28am

OOV words for lemmatization column are exactly OOV words for pymorphy. For other parts of the output, there is no such notion.

Topic		Replies	Views
Lemmatization using DeepPavlov pre-trained models Models	3	584	February 16, 2021
Best model for ru sentiment DeepPavlov Library	0	15	April 12, 2025
Normalize NER Entities Models	2	469	August 12, 2021
Multi-Lingual Syntactic parser model Models	2	307	July 20, 2020
Paraphrase detection model Models	4	1156	May 25, 2020

Get all model words

Related topics