whisper.cpp / ggml-metal.m

Commit History

metal : use autoreleasepool to avoid memory leaks (llama/5437)
c276f12
unverified

irbull committed on

metal : option to embed MSL source into compiled binary (#1842)
a46b62a
unverified

Didzis Gosko committed on

metal : add im2col F32 dst support (llama/5132)
26aec77
unverified

ggerganov committed on

SOTA 3-bit quants (llama/5196)
4649943
unverified

Kawrakow ikawrakow committed on

ggml : add max buffer sizes to opencl and metal backends (llama/5181)
3d354d0
unverified

slaren committed on

metal : free metal objects (llama/5161)
ea7167a
unverified

Paul Tsochantaris committed on

ci : fix yolo URLs + fix metal capture (ggml/712)
588f789
unverified

ggerganov committed on

metal : add debug capture backend function (ggml/694)
ece88c3
unverified

Jack Mousseau ggerganov committed on

metal : remove unused `n_buffers` and `buffers` (llama/5129)
a3e87d3
unverified

Paul Tsochantaris committed on

metal : show compile log messages
ae08f31
unverified

ggerganov committed on

metal : disable support for MUL_MAT F32 x F16
7fbc01f
unverified

ggerganov committed on

ggml : sync ggml-metal.m
b4085c3
unverified

ggerganov committed on

metal : create autorelease pool during library build (llama/4970)
9027276
unverified

ggerganov committed on

metal : log `recommendedMaxWorkingSetSize` on iOS 16+ (llama/4936)
e2cc0e5
unverified

azarovalex ggerganov committed on

ggml : introduce GGML_CALL function annotation (llama/4850)
7815f68
unverified

jartine committed on

metal : correctly set SIMD support flags on iOS (llama/4923)
1cf2fa9
unverified

azarovalex committed on

metal : remove old API (llama/4919)
d6abb6a
unverified

ggerganov committed on

metal : disable log for loaded kernels (llama/4794)
2305485
unverified

ggerganov committed on

metal : refactor kernel loading code (llama/4794)
53e6bf8
unverified

ggerganov committed on

ggml : SOTA 2-bit quants (add IQ2_XS) (llama/4856)
5e827d5
unverified

Kawrakow ikawrakow committed on

metal : put encoder debug group behind a define (llama/4873)
6e822b8
unverified

Paul Tsochantaris committed on

metal : fix deprecation warning (ggml/690)
b1e29bc
unverified

ggerganov committed on

metal : wrap each operation in debug group (ggml/690)
b5e360f
unverified

Jack Mousseau committed on

SOTA 2-bit quants (llama/4773)
75de5bf
unverified

Kawrakow ikawrakow committed on

metal : switch back to default.metallib (ggml/681)
b945a8f
unverified

ggerganov committed on

ggml : add error handling to graph_compute (#1714)
92f24ee
unverified

finnvoorhees committed on

metal : add kernel_get_rows_i32
459dd87

ggerganov committed on

metal : optimize ggml_mul_mat_id (faster Mixtral PP) (llama/4725)
8bc6274

ggerganov committed on

metal : enable shader debugging (cmake option) (llama/4705)
7dd37dc

ggerganov committed on

sync : ggml (ggml_scale, ggml_row_size, etc.) (#1677)
aa86ade
unverified

ggerganov committed on

sync : ggml (Metal fixes, new ops, tests) (#1633)
a0d4b48
unverified

ggerganov committed on

metal : fix `ggml_metal_log` vargs (#1606)
b3cea90
unverified

finnvoorhees committed on

metal : fix soft_max kernel src1 argument (#1602)
5692844
unverified

ggerganov committed on

sync : ggml (new ops, new backend, etc) (#1602)
895e87a
unverified

ggerganov committed on

metal : add backend function to check device family support (#1547)
c95e649
unverified

ggerganov committed on

metal : fix build (#1544)
02dbf1a
unverified

sandrohanea committed on

whisper : make large version explicit + fix data size units (#1493)
03a3210
unverified

ggerganov committed on

ggml : fix some compile warnings
ad6c9c1
unverified

ggerganov committed on

whisper : add full CUDA and Metal offloading (#1472)
da4acca
unverified

ggerganov committed on

metal : fix asserts for setThreadgroupMemoryLength (close #1435)
b42b45f
unverified

ggerganov committed on

sync : ggml (backend v2, k-quants, CUDA opts, Metal opts, etc.) (#1422)
7006035
unverified

ggerganov Chris Raethke committed on

metal : restore matrix x vector f16_f32 kernels for now
2dd8c56
unverified

ggerganov committed on

metal : add F32 support + update bench output
02d7878
unverified

ggerganov committed on

whisper : Metal and ggml-alloc support (#1270)
714ee6b
unverified

ggerganov committed on

sync : ggml (HBM + Metal + style) (#1264)
88deeba
unverified

ggerganov committed on

ggml : posixify pagesize (#1251)
4902c26
unverified

Przemysław Pawełczyk committed on

ggml : sync latest llama.cpp (view_src + alloc improvements) (#1247)
8bb66c1
unverified

ggerganov committed on

ggml : sync (ggml-alloc, GPU, eps, etc.) (#1220)
d41ba35
unverified

ggerganov committed on