AI & ML interests

Web as a corpus, Large Language Models, Machine Translation, Language Technologies, Natural Language Processing, Internet Archive, CommonCrawl

Recent Activity

gramirez-prompsit  updated a dataset 3 days ago
HPLT/HPLT2.0_cleaned
gramirez-prompsit  updated a dataset 3 days ago
HPLT/HPLT3.0
gramirez-prompsit  updated a dataset 3 days ago
HPLT/hplt_monolingual_v1_2
View all activity

HPLT 's collections 15

Multilingual Translation Models
Translation models trained on OPUS data including HPLT datasets
Multilingual Translation Models
Translation models trained on OPUS data including HPLT datasets