whisper.cpp / ggml-metal.m

Commit History

ggml : fix some compile warnings
ad6c9c1
unverified

ggerganov commited on

whisper : add full CUDA and Metal offloading (#1472)
da4acca
unverified

ggerganov commited on

metal : fix asserts for setThreadgroupMemoryLength (close #1435)
b42b45f
unverified

ggerganov commited on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov Chris Raethke commited on

metal : restore matrix x vector f16_f32 kerenls for now
2dd8c56
unverified

ggerganov commited on

metal : add F32 support + update bench output
02d7878
unverified

ggerganov commited on

whisper : Metal and ggml-alloc support (#1270)
714ee6b
unverified

ggerganov commited on

sync : ggml (HBM + Metal + style) (#1264)
88deeba
unverified

ggerganov commited on

ggml : posixify pagesize (#1251)
4902c26
unverified

Przemysław Pawełczyk commited on

ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247)
8bb66c1
unverified

ggerganov commited on

ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
d41ba35
unverified

ggerganov commited on

ggml : sync latest repo (mostly refactoring changes)
d97fd69
unverified

ggerganov commited on

metal : sync ggml-metal (ref #1047)
799974c
unverified

ggerganov commited on