| | --- |
| | license: mit |
| | base_model: MiniMaxAI/MiniMax-M2 |
| | base_model_relation: quantized |
| | quantized_by: turboderp |
| | tags: |
| | - exl3 |
| | --- |
| | |
| | EXL3 quants of [MiniMax-M2](https://huggingface.co/MiniMaxAI/MiniMax-M2) |
| |
|
| | ⚠️ Requires ExLlamaV3 v0.0.12 (or v0.0.11 `dev` branch) |
| |
|
| | Base bitrates: |
| |
|
| | [2.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.0bpw) |
| | [3.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.0bpw) |
| | [4.00 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.0bpw) |
| |
|
| | Optimized: |
| |
|
| | [2.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.04bpw) |
| | [2.27 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/2.27bpw) |
| | [3.04 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.04bpw) |
| | [3.50 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/3.5bpw) |
| | [4.03 bits per weight](https://huggingface.co/turboderp/MiniMax-M2-exl3/tree/4.03bpw) |
| |
|
| |
|
| | . | KL-div | ppl | HumanEval@1 |
| | ---------|--------|-------|------------- |
| | 2.00 bpw | 0.400 | 10.92 | 80.5% |
| | 2.04 bpw | 0.297 | 10.23 | 87.1% |
| | 2.27 bpw | 0.252 | 9.78 | 88.4% |
| | 3.00 bpw | 0.141 | 8.99 | 87.8% |
| | 3.04 bpw | 0.117 | 8.73 | 87.2% |
| | 3.50 bpw | 0.094 | 8.78 | 88.4% |
| | 4.00 bpw | 0.087 | 8.58 | 89.6% |
| | 4.03 bpw | 0.077 | 8.61 | 87.8% |
| | original | - | 8.51 | 87.2%¹ |
| |
|
| | ¹ Unconfirmed |
| |
|
| |
|