Unable to get the open domain question answering tutorial working

slimprize · January 4, 2020, 9:53am

Hi all,
I am trying to follow the quick start tutorial at Installation — DeepPavlov 1.3.0 documentation

I have installed deeppavlov in a virtual environment on my host running ubuntu 19.10. However, things fall over due to a lack of a config file. In my home folder, I do see a folder called .deeppavlov but I do not find any config files there.
I can execute
from deeppavlov import configs
then what? I suspect I need to download a model but am stuck at getting the configuration file.

I eventually want to run the open domain question answer skill at http://docs.deeppavlov.ai/en/master/features/skills/odqa.html on my own data set.

I ran the following commands
from deeppavlov import configs
from deeppavlov.core.commands.train import train_evaluate_model_from_config
so far so good.
train_evaluate_model_from_config(configs.doc_retrieval.en_ranker_tfidf_wiki, download=True)
after the above command, I get the following error.
Traceback (most recent call last):
File “”, line 1, in
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/core/commands/train.py”, line 92, in tra
in_evaluate_model_from_config
data = read_data_by_config(config)
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/core/commands/train.py”, line 58, in rea
d_data_by_config
return reader.read(data_path, **reader_config)
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/dataset_readers/odqa_reader.py”, line 81
, in read
self._build_db(save_path, dataset_format, expand_path(data_path))
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/dataset_readers/odqa_reader.py”, line 14
5, in _build_db
files = [f for f in self.iter_files(data_path)]
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/dataset_readers/odqa_reader.py”, line 14
5, in
files = [f for f in self.iter_files(data_path)]
File “/home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/dataset_readers/odqa_reader.py”, line 10
6, in iter_files
raise RuntimeError(“Path doesn’t exist: {}”.format(path))
RuntimeError: Path doesn’t exist: /home/pranav/.deeppavlov/downloads/odqa/enwiki

How do I set up deeppavlov to handle open domain question and answering?
Pranav

yoptar · January 9, 2020, 3:53pm

Hi @slimprize,
Sorry for the late reply.

pre-defined configs are stored in the python module directory, for you it will be /home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/configs.
This directory should have the same structure as https://github.com/deepmipt/DeepPavlov/tree/master/deeppavlov/configs
the configs object replicates the configs directory structure and the leafs of this tree are paths. So configs.doc_retrieval.en_ranker_tfidf_wiki will have /home/pranav/.virtualenvs/deep_p/lib/python3.7/site-packages/deeppavlov/configs/doc_retrieval/en_ranker_tfidf_wiki.json in its value.
The exception itself is actually about a different thing: dataset reader tries to rebuild a database from source data but there is no source data. You can take a loot at the issue #1082 for some info on training the ranking model on your own data.

slimprize · January 10, 2020, 5:05pm

Hi,
Many thanks. I have the model working but have not trained my own data yet.

Topic		Replies	Views
Determining configuration sufficiency for odqa Models	1	453	May 13, 2020
Unable to get the insult detection working due to a file not found error Documentation	2	542	February 18, 2020
Question about testing CSV reader (With REST) Documentation	4	461	May 28, 2020
How to change dataset for the demo Simple intent recognition question answering bot? DeepPavlov Library	2	331	April 13, 2022
Integrating custom BERT model and training model with csv dataset Models	18	1286	May 11, 2023

Unable to get the open domain question answering tutorial working

Related topics