I want to create my own dataset to train the ODQA model. So I want to know the format of the dataset.