Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Inference Optimization

community
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

mgoin  updated a model about 3 hours ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4
mgoin  new activity about 3 hours ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4:Fix invalid config
krishnateja95  updated a collection about 10 hours ago
NVIDIA-Nemotron-3-Nano-30B-A3B Quantized Models
View all activity

Michael Goin's profile picture Eldar Kurtić's profile picture Fynn Schmitt-Ulms's profile picture Alexandre Marques's profile picture Dipika's profile picture Krishna Teja Chitty-Venkata's profile picture Chibueze Ukachi's profile picture Linghao Kong's profile picture Rahul Tuli's profile picture Kyle Sayers's profile picture Neural Magic Research's profile picture Megan Flynn's profile picture Brian Dellabetta's profile picture Helen Zhao's profile picture

inference-optimization 's models 39

inference-optimization/Qwen3-32B-QKV-Cache-FP8-Per-Head

33B • Updated Dec 4, 2025

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

33B • Updated Dec 4, 2025

inference-optimization/Qwen3-32B-FP8-dynamic-QKV-Cache-FP8-Per-Head

33B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

71B • Updated Dec 4, 2025 • 1

inference-optimization/Llama-3.3-70B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Head

71B • Updated Dec 4, 2025

inference-optimization/Llama-3.1-8B-Instruct-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025

inference-optimization/Llama-3.1-8B-Instruct-FP8-dynamic-QKV-Cache-FP8-Per-Tensor

8B • Updated Dec 4, 2025
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs