Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

HPLT

community
https://hplt-project.org/
hplt_eu
hplt-project
Activity Feed Request to join this org

AI & ML interests

Web as a corpus, Large Language Models, Machine Translation, Language Technologies, Natural Language Processing, Internet Archive, CommonCrawl

Recent Activity

gramirez-prompsit  updated a dataset about 13 hours ago
HPLT/HPLT2.0_cleaned
gramirez-prompsit  updated a dataset about 13 hours ago
HPLT/HPLT3.0
gramirez-prompsit  updated a dataset about 13 hours ago
HPLT/hplt_monolingual_v1_2
View all activity

Papers

OpenLID-v3: Improving the Precision of Closely Related Language Identification -- An Experience Report

DHPLT: large-scale multilingual diachronic corpora and word representations for semantic change modelling

View all Papers

Gema Ramírez Sánchez's profile pictureJelmer van der Linde's profile pictureGraeme Nail's profile pictureDušan Variš's profile pictureJaume Zaragoza's profile pictureShaoxiong's profile picturePinzhen Chen's profile pictureLanguage Technology Group, University of Oslo, Norway's profile pictureOna de Gibert's profile pictureJörg Tiedemann's profile pictureBarry Haddow's profile pictureSampo Pyysalo's profile pictureDavid Samuel's profile pictureNikolay Bogoychev's profile pictureBhavitvya Malik's profile pictureVille Komulainen's profile pictureBram Vanroy's profile pictureNikolay Arefev's profile pictureLukas's profile pictureStephan Oepen's profile pictureMarta Bañón's profile pictureMaria F's profile pictureLaurie Burchell's profile pictureDaniel van Strien's profile pictureDayyán O'Brien's profile pictureVladislav Mikhailov's profile picturerggd_monk's profile pictureFedor's profile picture

HPLT 's datasets 8

HPLT/HPLT2.0_cleaned

Updated about 13 hours ago • 27.8k • 43

HPLT/HPLT3.0

Updated about 13 hours ago • 79 • 20

HPLT/hplt_monolingual_v1_2

Updated about 13 hours ago • 45 • 21

HPLT/DocHPLT

Viewer • Updated Jan 13 • 124M • 42.7k • 20

HPLT/2508-datasets-evals

Viewer • Updated Nov 24, 2025 • 96k • 109

HPLT/2508-wds-evals

Viewer • Updated Nov 24, 2025 • 10.7k • 12

HPLT/2505-deduplication-evals

Viewer • Updated Nov 24, 2025 • 56.3k • 95

HPLT/ua-squad

Viewer • Updated Apr 24, 2025 • 7.73k • 50
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs