dataset-lang
HuggingFaceFW/fineweb • Viewer • Updated Jul 11, 2025 • 52.5B • 186k • 2.6k
google/smol • Viewer • Updated Oct 31, 2025 • 798k • 3.03k • 81

Language Models
deepseek-ai/DeepSeek-R1 • Text Generation • 685B • Updated Mar 27, 2025 • 412k • 12.9k
facebook/opt-125m • Text Generation • Updated Sep 15, 2023 • 3.34M • 230
meta-llama/Llama-3.3-70B-Instruct • Text Generation • 71B • Updated Dec 21, 2024 • 298k • 2.62k
meta-llama/Llama-3.1-8B-Instruct • Text Generation • 8B • Updated Sep 25, 2024 • 12.8M • 5.24k

dataset-math-reasoning
bethgelab/CuratedThoughts • Viewer • Updated Feb 26, 2025 • 222k • 277 • 44
open-r1/OpenR1-Math-220k • Viewer • Updated Feb 18, 2025 • 450k • 12.1k • 692
facebook/natural_reasoning • Viewer • Updated Feb 21, 2025 • 1.15M • 1.38k • 546
open-thoughts/OpenThoughts-114k • Viewer • Updated Aug 31, 2025 • 228k • 98.9k • 784

old-language-models
openai-community/gpt2 • Text Generation • 0.1B • Updated Feb 19, 2024 • 6.5M • 3.08k

Code Language Models
Models generating code or performing code completion
refactai/Refact-1_6B-fim • Text Generation • 2B • Updated Nov 9, 2023 • 160k • 141
bigcode/starcoder2-3b • Text Generation • 3B • Updated Mar 4, 2024 • 133k • 213
Kwaipilot/KwaiCoder-DS-V2-Lite-Base • Text Generation • 16B • Updated Jan 6, 2025 • 47 • 6