Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ZKong 's Collections
PyWheels
pose
dataset
Segment
hunyuan-video
Z-Image
tts
ocr
VL
qwen image
upscale
vae
wan2.2
qwen
sound
flux-kontext
image-process
prompt
面部AI
encoder
video
translate
motionCapture
flux
3D
image
audio

audio

updated Jul 16
Upvote
-

  • google-t5/t5-base

    Translation • 0.2B • Updated Feb 14, 2024 • 2M • • 761

  • stabilityai/stable-audio-open-1.0

    Text-to-Audio • Updated Jun 19 • 28.6k • 1.37k

  • Kijai/MMAudio_safetensors

    Updated Dec 11, 2024 • 64

  • nvidia/bigvgan_v2_44khz_128band_512x

    Audio-to-Audio • Updated Sep 5, 2024 • 313k • 63

  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10 • 2.95M • • 5.49k

  • mistralai/Voxtral-Mini-3B-2507

    5B • Updated Jul 28 • 464k • 602

  • mistralai/Voxtral-Small-24B-2507

    Audio-Text-to-Text • 24B • Updated 11 days ago • 13.9k • 441
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs