bobox's Collections
Dataset • updated about 1 month ago
epfl-llm/guidelines • Viewer • Updated Mar 7, 2024 • 38k • 1.51k • 141
Locutusque/UltraTextbooks-2.0 • Viewer • Updated Mar 7, 2024 • 3.22M • 260 • 51
BMRetriever/biomed_retrieval_dataset • Viewer • Updated Apr 25, 2024 • 1.43M • 39 • 9
MedRAG/pubmed • Viewer • Updated Feb 27, 2024 • 2.21M • 4.88k • 94
Hack90/libre_chem_textbooks • Viewer • Updated Apr 13, 2023 • 3.74k • 68 • 18
open-phi/textbooks • Viewer • Updated Oct 8, 2023 • 1.8k • 403 • 91
open-phi/textbooks_grounded • Viewer • Updated Oct 17, 2023 • 85 • 32 • 4
avsolatorio/medi-data-mteb_avs_triplets • Viewer • Updated Apr 16, 2024 • 1.82M • 88 • 4
avsolatorio/GIST-large-Embedding-v0 • Sentence Similarity • 0.3B • Updated Feb 28, 2024 • 9.74k • 18
esnli/esnli • Updated Jan 18, 2024 • 4.2k • 22
kenhktsui/open-toolformer-retrieval • Viewer • Updated Jul 27, 2023 • 13.3k • 29 • 8
tals/vitaminc • Viewer • Updated Jul 1, 2022 • 489k • 566 • 8
nyu-mll/multi_nli • Viewer • Updated Jan 4, 2024 • 412k • 10.3k • 106
nyu-mll/glue • Viewer • Updated Jan 30, 2024 • 1.49M • 383k • 462
dmis-lab/MedLFQA • Viewer • Updated Sep 9, 2024 • 4.95k • 152 • 16
allenai/scitail • Viewer • Updated Jan 4, 2024 • 107k • 18.5k • 5
mteb/nfcorpus • Viewer • Updated May 4, 2025 • 141k • 16.2k • 3
sentence-transformers/xsum • Viewer • Updated Apr 30, 2024 • 227k • 181
sentence-transformers/paq • Viewer • Updated May 1, 2024 • 64.4M • 549 • 2
qiaojin/PubMedQA • Viewer • Updated Mar 6, 2024 • 274k • 27.5k • 284
tommasobonomo/sem_augmented_fever_nli • Viewer • Updated Jul 12, 2024 • 55.7k • 23 • 1
allenai/openbookqa • Viewer • Updated Jan 4, 2024 • 11.9k • 90.8k • 120
allenai/qasc • Viewer • Updated Jan 4, 2024 • 9.98k • 6.6k • 23
allenai/sciq • Viewer • Updated Jan 4, 2024 • 13.7k • 35.2k • 130
HuggingFaceFW/fineweb-edu • Viewer • Updated Jul 11, 2025 • 3.5B • 273k • 887
uonlp/CulturaX • Viewer • Updated Dec 16, 2024 • 7.18B • 32.6k • 564
HuggingFaceFW/fineweb-edu-llama3-annotations • Viewer • Updated Jun 3, 2024 • 467k • 240 • 46
humarin/chatgpt-paraphrases • Viewer • Updated Apr 5, 2023 • 419k • 264 • 59
GAIR/lima • Viewer • Updated Jun 8, 2023 • 1.33k • 916 • 452
allenai/tulu-v2-sft-mixture • Viewer • Updated May 24, 2024 • 326k • 891 • 134
google-research-datasets/xsum_factuality • Updated Jan 18, 2024 • 125 • 6
google-research-datasets/disfl_qa • Viewer • Updated Aug 8, 2024 • 11.8k • 164 • 6
sentence-transformers/s2orc • Viewer • Updated May 6, 2024 • 132M • 5.63k • 13
Anthropic/persuasion • Viewer • Updated Apr 9, 2024 • 3.94k • 457 • 197
JeanKaddour/minipile • Viewer • Updated Jun 20, 2023 • 1.01M • 1.81k • 135
microsoft/msr_text_compression • Updated Jan 18, 2024 • 135 • 10
HuggingFaceTB/cosmopedia • Viewer • Updated Aug 12, 2024 • 31.1M • 48.2k • 649
armanc/scientific_papers • Updated Jan 18, 2024 • 9.36k • 173
google-research-datasets/paws • Viewer • Updated Jan 4, 2024 • 751k • 53.9k • 36
google-research-datasets/wiki_split • Updated Jan 18, 2024 • 3.78k • 4
Shitao/bge-m3-data • Viewer • Updated Apr 26, 2024 • 172k • 290 • 47
sentence-transformers/msmarco-bm25 • Viewer • Updated May 15, 2024 • 93.6M • 2.14k • 4
kyunghyuncho/search_qa • Updated Jun 16, 2023 • 362 • 21
pints-ai/Expository-Prose-V1 • Viewer • Updated Aug 12, 2024 • 6.67M • 151 • 19
marksverdhei/wordnet-definitions-en-2021 • Viewer • Updated May 23, 2025 • 43.8k • 138 • 11