Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deqing 's Collections
Fourier Language Model
Convergent Evolution
Convergent Evolution (Addition)
Convergent Evolution (Architecture and Optimizer)
Convergent Evolution (Data)

Convergent Evolution (Architecture and Optimizer)

updated Apr 10
Upvote
-

  • deqing/convergent-llama-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 91

  • deqing/convergent-gdn-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 32

  • deqing/convergent-mamba2-300M-muon-original

    Text Generation • 0.3B • Updated Mar 29 • 30

  • deqing/convergent-lstm-4layer-muon-original

    Text Generation • 0.2B • Updated Mar 29 • 32

  • deqing/convergent-lstm-12layer-muon-original

    Text Generation • 0.2B • Updated Mar 29 • 31

  • deqing/convergent-llama-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 77

  • deqing/convergent-gdn-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 33

  • deqing/convergent-mamba2-300M-adamw-original

    Text Generation • 0.3B • Updated Mar 29 • 69
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs