Feature Extraction
MLX
Safetensors
Chinese
English
audio
speech
codec
tokenizer
apple-silicon
quantized
8bit
Instructions to use appautomaton/openmoss-audio-tokenizer-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use appautomaton/openmoss-audio-tokenizer-mlx with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir openmoss-audio-tokenizer-mlx appautomaton/openmoss-audio-tokenizer-mlx
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
| language: | |
| - zh | |
| - en | |
| license: apache-2.0 | |
| library_name: mlx | |
| pipeline_tag: feature-extraction | |
| base_model: OpenMOSS-Team/MOSS-Audio-Tokenizer | |
| base_model_relation: quantized | |
| tags: | |
| - mlx | |
| - audio | |
| - speech | |
| - codec | |
| - tokenizer | |
| - apple-silicon | |
| - quantized | |
| - 8bit | |
| # OpenMOSS Audio Tokenizer — MLX 8-bit | |
| This repository contains an MLX-native int8 conversion of the OpenMOSS audio tokenizer for Apple Silicon. | |
| It is a supporting model that encodes and decodes audio tokens for the OpenMOSS TTS family. It is not a standalone speech generation model. | |
| ## Variants | |
| | Path | Precision | | |
| | --- | --- | | |
| | `mlx-int8/` | int8 quantized weights | | |
| ## Model Details | |
| - Developed by: AppAutomaton | |
| - Shared by: AppAutomaton on Hugging Face | |
| - Upstream model: [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer) | |
| - Task: audio tokenization and codec decoding | |
| - Runtime: MLX on Apple Silicon | |
| ## How to Get Started | |
| Load it directly with [`mlx-speech`](https://github.com/appautomaton/mlx-speech): | |
| ```python | |
| from mlx_speech.models.moss_audio_tokenizer import MossAudioTokenizerModel | |
| model = MossAudioTokenizerModel.from_path("mlx-int8") | |
| ``` | |
| The tokenizer is loaded automatically when you run OpenMOSS generation scripts. You usually do not need to instantiate it directly. | |
| ```bash | |
| python scripts/generate/moss_local.py \ | |
| --text "Hello from mlx-speech." \ | |
| --output outputs/out.wav | |
| ``` | |
| ## Notes | |
| - This repo contains the quantized MLX runtime artifact only. | |
| - The conversion remaps the original OpenMOSS audio tokenizer weights explicitly for MLX inference. | |
| - The artifact is shared by the OpenMOSS local TTS, TTSD, and SoundEffect runtime paths in this repo. | |
| ## Links | |
| - Source code: [mlx-speech](https://github.com/appautomaton/mlx-speech) | |
| - More examples: [AppAutomaton](https://github.com/appautomaton) | |
| ## License | |
| Apache 2.0 — following the upstream license published with [`OpenMOSS-Team/MOSS-Audio-Tokenizer`](https://huggingface.co/OpenMOSS-Team/MOSS-Audio-Tokenizer). | |