shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-packing Text Generation • 266k • Updated Nov 16, 2025 • 4
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-packing Text Generation • 266k • Updated Nov 16, 2025 • 3
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-packing Text Generation • 266k • Updated Nov 16, 2025 • 4
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-packing Text Generation • 266k • Updated Nov 15, 2025 • 4
shuoxing/llama3-8b-full-pretrain-junk-tweet-1m-en-packing Text Generation • 266k • Updated Nov 15, 2025 • 4
shuoxing/llama3-8b-full-pretrain-mix-high-tweet-1m-en-no-packing Text Generation • 266k • Updated Nov 15, 2025 • 2
shuoxing/llama3-8b-full-pretrain-mix-mid-tweet-1m-en-no-packing Text Generation • 266k • Updated Nov 15, 2025 • 4
shuoxing/llama3-8b-full-pretrain-mix-low-tweet-1m-en-no-packing Text Generation • 266k • Updated Nov 15, 2025 • 4
shuoxing/llama3-8b-full-pretrain-control-tweet-1m-en-no-packing Text Generation • 266k • Updated Nov 15, 2025 • 3
shuoxing/llama3-8b-full-pretrain-control_tweet_1m_en_no_url_bs16 Text Generation • 266k • Updated Nov 12, 2025 • 4
shuoxing/llama3-8b-full-pretrain-junk_tweet_1m_en_no_url_bs16 Text Generation • 266k • Updated Nov 12, 2025 • 3
shuoxing/llama3-8b-full-pretrain-control_tweet_1m_en_no_url Text Generation • 266k • Updated Nov 12, 2025 • 3
shuoxing/llama3-8b-full-pretrain-junk_tweet_1m_en_no_url Text Generation • 266k • Updated Nov 12, 2025 • 4