Commit aa4b522 by Trent Oh
Parent(s): ac6dc66
Update README.md
README.md
CHANGED
@@ -70,4 +70,9 @@ MSMARCO, NQ & other question-answer datasets were also used. Sampling ratio for
 
 | Dataset | Paper | Number of training tuples |
 |:--------------------------------------------------------:|:----------------------------------------:|:--------------------------:|
-| [Stack Exchange QA - Title & Answer](https://huggingface.co/datasets/flax-sentence-embeddings/stackexchange_title_best_voted_answer_jsonl) | - | 4,750,619 |
+| [Stack Exchange QA - Title & Answer](https://huggingface.co/datasets/flax-sentence-embeddings/stackexchange_title_best_voted_answer_jsonl) | - | 4,750,619 |
+| [MS MARCO](https://microsoft.github.io/msmarco/) | [paper](https://doi.org/10.1145/3404835.3462804) | 9,144,553 |
+| [GOOAQ: Open Question Answering with Diverse Answer Types](https://github.com/allenai/gooaq) | [paper](https://arxiv.org/pdf/2104.08727.pdf) | 3,012,496 |
+| [Yahoo Answers](https://www.kaggle.com/soumikrakshit/yahoo-answers-dataset) Question/Answer | [paper](https://proceedings.neurips.cc/paper/2015/hash/250cf8b51c773f3f8dc8b4be867a9a02-Abstract.html) | 681,164 |
+| SearchQA | - | 582,261 |
+| [Natural Questions (NQ)](https://ai.google.com/research/NaturalQuestions) | [paper](https://transacl.org/ojs/index.php/tacl/article/view/1455) | 100,231 |
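
The tuple counts in the rows of this hunk can be totaled with a short script. This is a minimal sketch, assuming the counts are exactly as listed in the diff and that no other rows exist outside the hunk. The names `TRAINING_TUPLES` and `dataset_shares` are hypothetical, and the proportional share it prints is plain arithmetic on the table, not necessarily the sampling ratio the README refers to.

```python
# Training-tuple counts copied from the table rows visible in this diff.
TRAINING_TUPLES = {
    "Stack Exchange QA - Title & Answer": 4_750_619,
    "MS MARCO": 9_144_553,
    "GOOAQ": 3_012_496,
    "Yahoo Answers": 681_164,
    "SearchQA": 582_261,
    "Natural Questions (NQ)": 100_231,
}


def dataset_shares(counts):
    """Return each dataset's fraction of the summed tuple count.

    Assumption: shares are simply proportional to size; the README's
    actual sampling ratio is not shown in this hunk.
    """
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


total_tuples = sum(TRAINING_TUPLES.values())
shares = dataset_shares(TRAINING_TUPLES)

if __name__ == "__main__":
    print(f"total: {total_tuples:,}")
    # List datasets from largest to smallest share.
    for name, share in sorted(shares.items(), key=lambda kv: -kv[1]):
        print(f"{name}: {share:.1%}")
```

Keeping the counts in one dictionary means a row added to the table later only needs one new entry for the total and shares to update.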