Let's Go!!!
Morwic Sound
Morwic
AI & ML interests
None yet
Recent Activity
replied to prithivMLmods's post 28 days ago
One speech model with seven voices, streamlined with multimodal capabilities for vision tasks. Performs vision (image-text) to audio inference with Qwen2.5-VL + VibeVoice-Realtime-0.5B. Vision to VibeVoice (EN) - The demo is live. 🗣️🔥
🤗 Vision-to-VibeVoice-en [Demo]: https://huggingface.co/spaces/prithivMLmods/Vision-to-VibeVoice-en
✨ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ Speech [VibeVoice-Realtime-0.5B]: https://huggingface.co/microsoft/VibeVoice-Realtime-0.5B
✨ Vision [Qwen2.5-VL]: https://huggingface.co/Qwen/Qwen2.5-VL-7B-Instruct
To know more about it, visit the app page or the respective model page!
reacted to prithivMLmods's post with 🤗 28 days ago
liked a Space about 1 month ago
Tongyi-MAI/Z-Image-Turbo
Organizations
None yet