-
llm-jp/llm-jp-4-8b-base
Text Generation • 9B • Updated • 3.51k • 6 -
llm-jp/llm-jp-4-8b-instruct
Text Generation • 9B • Updated • 21k • 10 -
llm-jp/llm-jp-4-8b-thinking
Text Generation • 9B • Updated • 83.1k • 42 -
llm-jp/llm-jp-4-8b-thinking-gguf
Text Generation • 9B • Updated • 184 • 5
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
HakushoBench: A Japanese Chart and Table VQA Benchmark from Governmental White Papers
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models
-
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models
Paper • 2510.22276 • Published • 3 -
llm-jp/WAON-Bench
Viewer • Updated • 1.87k • 210 • 2 -
llm-jp/waon-siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 872 • 1 -
llm-jp/WAON
Updated • 110 • 8
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
-
llm-jp/optimal-sparsity-code-d512-E8-k2-320M-A170M
Text Generation • 0.3B • Updated • 6 -
llm-jp/optimal-sparsity-code-d512-E16-k2-520M-A170M
Text Generation • 0.5B • Updated • 2 -
llm-jp/optimal-sparsity-code-d512-E32-k2-920M-A170M
Text Generation • 0.9B • Updated • 4 -
llm-jp/optimal-sparsity-code-d512-E64-k2-1.7B-A170M
Text Generation • 2B • Updated • 5
Fine-tuned models in the LLM-jp-3 model series
-
llm-jp/llm-jp-3.1-8x13b-instruct4
Text Generation • 73B • Updated • 75 • 4 -
llm-jp/llm-jp-3.1-8x13b-32K-instruct4
Text Generation • 73B • Updated • 30 • 2 -
llm-jp/llm-jp-3.1-13b-instruct4
Text Generation • 14B • Updated • 770 • 19 -
llm-jp/llm-jp-3.1-1.8b-instruct4
Text Generation • 2B • Updated • 1.8k • 21
-
Open Japanese LLM Leaderboard
🌸108Explore and compare LLM models with interactive filters and visualizations
-
llm-jp/leaderboard-requests
Viewer • Updated • 3 • 929 • 2 -
llm-jp/leaderboard-contents
Viewer • Updated • 862 • 109 • 1 -
llm-jp/leaderboard-results
Updated • 3.87k • 1
Pre-trained models in the LLM-jp-3.1 model series
Models in the LLM-jp ver2.0 model series
-
llm-jp/llm-jp-13b-v2.0
Text Generation • Updated • 270 • 15 -
llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 3 -
llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 4 • 1 -
llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 11 • 3
Models in the LLM-jp ver1.0 model series
-
llm-jp/llm-jp-13b-v1.0
Text Generation • Updated • 586 • 41 -
llm-jp/llm-jp-13b-instruct-full-jaster-v1.0
Text Generation • Updated • 442 • 15 -
llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0
Text Generation • Updated • 459 • 8 -
llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0
Text Generation • Updated • 442 • 4
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models
-
llm-jp/Jagle
Updated • 388 • 15 -
llm-jp/Jagle-VL-2.2B-Jagle-FineVision
Image Feature Extraction • 2B • Updated • 22 • 4 -
llm-jp/Jagle-VL-2.2B-FineVision
Image Feature Extraction • 2B • Updated • 6 • 1 -
llm-jp/Jagle-VL-2.2B-Jagle
Image Feature Extraction • 2B • Updated • 62 • 4
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
-
llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M
Text Generation • 0.3B • Updated • 5 -
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M
Text Generation • 0.5B • Updated • 6 -
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M
Text Generation • 0.9B • Updated • 5 -
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M
Text Generation • 2B • Updated • 4
Fine-tuned models in the LLM-jp-3 model series
-
llm-jp/llm-jp-3-8x13b-instruct3
Text Generation • 73B • Updated • 119 • 8 -
llm-jp/llm-jp-3-172b-instruct3
Text Generation • 172B • Updated • 25 • 11 -
llm-jp/llm-jp-3-13b-instruct3
Text Generation • 14B • Updated • 169 • 8 -
llm-jp/llm-jp-3-8x1.8b-instruct3
Text Generation • 9B • Updated • 45 • 4
Pre-trained models in the LLM-jp-3 model series
Models in the LLM-jp ver1.1 model series
-
llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1
Text Generation • Updated • 1 -
llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1
Text Generation • 13B • Updated • 20 • 2 -
llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1
Text Generation • Updated • 1
-
llm-jp/llm-jp-4-8b-base
Text Generation • 9B • Updated • 3.51k • 6 -
llm-jp/llm-jp-4-8b-instruct
Text Generation • 9B • Updated • 21k • 10 -
llm-jp/llm-jp-4-8b-thinking
Text Generation • 9B • Updated • 83.1k • 42 -
llm-jp/llm-jp-4-8b-thinking-gguf
Text Generation • 9B • Updated • 184 • 5
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision–Language Models
-
llm-jp/Jagle
Updated • 388 • 15 -
llm-jp/Jagle-VL-2.2B-Jagle-FineVision
Image Feature Extraction • 2B • Updated • 22 • 4 -
llm-jp/Jagle-VL-2.2B-FineVision
Image Feature Extraction • 2B • Updated • 6 • 1 -
llm-jp/Jagle-VL-2.2B-Jagle
Image Feature Extraction • 2B • Updated • 62 • 4
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models
-
WAON: Large-Scale and High-Quality Japanese Image-Text Pair Dataset for Vision-Language Models
Paper • 2510.22276 • Published • 3 -
llm-jp/WAON-Bench
Viewer • Updated • 1.87k • 210 • 2 -
llm-jp/waon-siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 872 • 1 -
llm-jp/WAON
Updated • 110 • 8
Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
-
llm-jp/optimal-sparsity-code-d512-E8-k2-320M-A170M
Text Generation • 0.3B • Updated • 6 -
llm-jp/optimal-sparsity-code-d512-E16-k2-520M-A170M
Text Generation • 0.5B • Updated • 2 -
llm-jp/optimal-sparsity-code-d512-E32-k2-920M-A170M
Text Generation • 0.9B • Updated • 4 -
llm-jp/optimal-sparsity-code-d512-E64-k2-1.7B-A170M
Text Generation • 2B • Updated • 5
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks
-
llm-jp/optimal-sparsity-math-d512-E8-k2-320M-A170M
Text Generation • 0.3B • Updated • 5 -
llm-jp/optimal-sparsity-math-d512-E16-k2-520M-A170M
Text Generation • 0.5B • Updated • 6 -
llm-jp/optimal-sparsity-math-d512-E32-k2-920M-A170M
Text Generation • 0.9B • Updated • 5 -
llm-jp/optimal-sparsity-math-d512-E64-k2-1.7B-A170M
Text Generation • 2B • Updated • 4
Fine-tuned models in the LLM-jp-3 model series
-
llm-jp/llm-jp-3.1-8x13b-instruct4
Text Generation • 73B • Updated • 75 • 4 -
llm-jp/llm-jp-3.1-8x13b-32K-instruct4
Text Generation • 73B • Updated • 30 • 2 -
llm-jp/llm-jp-3.1-13b-instruct4
Text Generation • 14B • Updated • 770 • 19 -
llm-jp/llm-jp-3.1-1.8b-instruct4
Text Generation • 2B • Updated • 1.8k • 21
Fine-tuned models in the LLM-jp-3 model series
-
llm-jp/llm-jp-3-8x13b-instruct3
Text Generation • 73B • Updated • 119 • 8 -
llm-jp/llm-jp-3-172b-instruct3
Text Generation • 172B • Updated • 25 • 11 -
llm-jp/llm-jp-3-13b-instruct3
Text Generation • 14B • Updated • 169 • 8 -
llm-jp/llm-jp-3-8x1.8b-instruct3
Text Generation • 9B • Updated • 45 • 4
-
Open Japanese LLM Leaderboard
🌸108Explore and compare LLM models with interactive filters and visualizations
-
llm-jp/leaderboard-requests
Viewer • Updated • 3 • 929 • 2 -
llm-jp/leaderboard-contents
Viewer • Updated • 862 • 109 • 1 -
llm-jp/leaderboard-results
Updated • 3.87k • 1
Pre-trained models in the LLM-jp-3.1 model series
Pre-trained models in the LLM-jp-3 model series
Models in the LLM-jp ver2.0 model series
-
llm-jp/llm-jp-13b-v2.0
Text Generation • Updated • 270 • 15 -
llm-jp/llm-jp-13b-instruct-full-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 3 -
llm-jp/llm-jp-13b-instruct-full-ac_001-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 4 • 1 -
llm-jp/llm-jp-13b-instruct-full-ac_001_16x-dolly-ichikara_004_001_single-oasst-oasst2-v2.0
Text Generation • 14B • Updated • 11 • 3
Models in the LLM-jp ver1.1 model series
-
llm-jp/llm-jp-13b-dpo-lora-hh_rlhf_ja-v1.1
Text Generation • Updated • 1 -
llm-jp/llm-jp-13b-instruct-full-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1
Text Generation • 13B • Updated • 20 • 2 -
llm-jp/llm-jp-13b-instruct-lora-dolly_en-dolly_ja-ichikara_003_001-oasst_en-oasst_ja-v1.1
Text Generation • Updated • 1
Models in the LLM-jp ver1.0 model series
-
llm-jp/llm-jp-13b-v1.0
Text Generation • Updated • 586 • 41 -
llm-jp/llm-jp-13b-instruct-full-jaster-v1.0
Text Generation • Updated • 442 • 15 -
llm-jp/llm-jp-13b-instruct-full-jaster-dolly-oasst-v1.0
Text Generation • Updated • 459 • 8 -
llm-jp/llm-jp-13b-instruct-full-dolly-oasst-v1.0
Text Generation • Updated • 442 • 4