Update README.md
- Fix path of tokenizer
- Clarify that the model is not trained on the evaluation set
Committed by Julien Chaumond
Parent: e57533cca5
Commit: 6a13448ad2
@@ -65,7 +65,7 @@ Citation:
 
 </details>
 
-I used `Data augmentation techniques` to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got:
+As **XQuAD** is just an evaluation dataset, I used `Data augmentation techniques` (scraping, neural machine translation, etc) to obtain more samples and splited the dataset in order to have a train and test set. The test set was created in a way that contains the same number of samples for each language. Finally, I got:
 
 | Dataset | # samples |
 | ----------- | --------- |
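The README text in this hunk describes a test set holding the same number of samples for each language. The commit does not show how that split was produced, so the following is only a minimal sketch of one way to do it. The `lang` field, the function name `split_balanced`, and the `test_per_lang` and `seed` parameters are all assumptions made for illustration, not the author's actual code.

```python
import random
from collections import defaultdict


def split_balanced(samples, test_per_lang, seed=0):
    """Split samples into (train, test) with test_per_lang test items per language.

    `samples` is assumed to be a list of dicts with a "lang" key
    (a hypothetical schema, not taken from the repository).
    """
    # Group the samples by language.
    by_lang = defaultdict(list)
    for sample in samples:
        by_lang[sample["lang"]].append(sample)

    rng = random.Random(seed)  # seeded so the split is reproducible
    train, test = [], []
    for lang in sorted(by_lang):
        items = by_lang[lang]
        if len(items) < test_per_lang:
            raise ValueError(f"not enough samples for language {lang!r}")
        rng.shuffle(items)
        # The first test_per_lang shuffled items go to test, the rest to train.
        test.extend(items[:test_per_lang])
        train.extend(items[test_per_lang:])
    return train, test
```

Every language contributes exactly `test_per_lang` items to the test set, while the train set keeps whatever remains, so it can stay unbalanced across languages.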
@@ -101,7 +101,7 @@ from transformers import pipeline
 qa_pipeline = pipeline(
     "question-answering",
     model="mrm8488/bert-multi-uncased-finetuned-xquadv1",
-    tokenizer="bert-multi-uncased-finetuned-xquadv1"
+    tokenizer="mrm8488/bert-multi-uncased-finetuned-xquadv1"
 )
 
 
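The tokenizer fix in this hunk gives `tokenizer` the same namespaced repository id as `model`. A minimal sketch of one way to keep the two from drifting apart is to derive both from a single constant. It assumes the `transformers` package is installed, and the names `MODEL_ID`, `qa_pipeline_kwargs`, and `build_qa_pipeline` are hypothetical helpers that do not appear in the README.

```python
MODEL_ID = "mrm8488/bert-multi-uncased-finetuned-xquadv1"


def qa_pipeline_kwargs(model_id: str = MODEL_ID) -> dict:
    """Return pipeline arguments where model and tokenizer share one id."""
    return {
        "task": "question-answering",
        "model": model_id,
        "tokenizer": model_id,
    }


def build_qa_pipeline(model_id: str = MODEL_ID):
    """Create the question-answering pipeline (downloads weights on first use)."""
    # Imported lazily so the helper above stays usable without transformers.
    from transformers import pipeline

    return pipeline(**qa_pipeline_kwargs(model_id))
```

Calling `build_qa_pipeline()` should be equivalent to the corrected snippet in the diff, with the id written only once.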