Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -252,9 +252,18 @@ the framwork utilizes the special-purpose AMX coprocessor available in modern Ap
|
|
| 252 |
|
| 253 |
## Limitations
|
| 254 |
|
| 255 |
-
- Very basic greedy sampling scheme - always pick up the top token. You can implement your own strategy
|
| 256 |
- Inference only
|
| 257 |
- No GPU support
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 258 |
|
| 259 |
## Memory usage
|
| 260 |
|
|
|
|
| 252 |
|
| 253 |
## Limitations
|
| 254 |
|
|
|
|
| 255 |
- Inference only
|
| 256 |
- No GPU support
|
| 257 |
+
- Very basic greedy sampling scheme - always pick up the token with highest probability.
|
| 258 |
+
This should be similar to the [GreedyDecoder](https://github.com/openai/whisper/blob/main/whisper/decoding.py#L249-L274)
|
| 259 |
+
from the original python implementation, so in order to make a fair comparison between the 2 implementations, make sure
|
| 260 |
+
to run the python code with the following parameters:
|
| 261 |
+
|
| 262 |
+
```
|
| 263 |
+
whisper --best_of 1 --beam_size 1 ...
|
| 264 |
+
```
|
| 265 |
+
|
| 266 |
+
In the future, `whisper.cpp` will support more sampling strategies.
|
| 267 |
|
| 268 |
## Memory usage
|
| 269 |
|