whisper.cpp / ggml / src / ggml-opencl

Commit History

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)
d0a458b

lhez committed on

opencl: add `backend_synchronize` (llama/13939)
a9ce9a8

lhez committed on

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)
5ff8785

rmatif committed on

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787)
1ab0f23

lhez committed on

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790)
4473109

lhez committed on

opencl: Add support for multiple devices (llama/12622)
b6cddb5

Henry Linjamäki committed on

opencl: fix couple crashes (llama/12795)
2eea73d

Henry Linjamäki committed on

opencl: remove unnecessary assert for `add` (llama/13257)
a245fbf

lhez committed on

opencl : remove obsolete files (skip) (ggml/1200)
adc6542

ggerganov committed on

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886)
291a5b7

lhez, Shangqing Gu committed on

opencl: fix incorrect local_size index in profiling log (llama/12868)
8f5d919

kimminsu committed on

opencl: better identify Adreno GPU (llama/12760)
5560cd6

lhez committed on

opencl: use `max_alloc_size` in backend ctx instead of querying again (llama/12705)
3847456

lhez committed on

opencl : fix memory allocation size (llama/12649)
b00a8a9

Sparkleholic committed on

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600)
3261fcd

lhez committed on

opencl: simplify kernel embedding logic in cmakefile (llama/12503)
5f131ac

lhez, Max Krasnyansky committed on

opencl: improve profiling (llama/12442)
4abe3ae

lhez committed on

opencl: use OpenCL C standard supported by the device (llama/12221)
57028a7

Henry Linjamäki committed on

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)
94449e3

lhez committed on

opencl : fix buffer alignment (llama/12197)
7d25156

linehill committed on

opencl : fix `ulong` kernel args were set from `int` variables (llama/12174)
67ffff0

linehill committed on

opencl : fix profile-related errors (llama/12095)
e11a847

simon886212, ubuntu committed on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini, slaren committed on

opencl: fix for small models (llama/11950)
4532dc6

lhez, Shawn Gu, Skyler Szot committed on

opencl: Fix rope and softmax (llama/11833)
bf3b6f8

lhez committed on

ggml : add opencl backend (skip) (llama/10693)
226358f

lhez, Skyler Szot, Shangqing Gu, Alexander Angus, Hongqiang Wang, Max Krasnyansky committed on