Commits · natasa365/whisper.cpp

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)

d0a458b

lhez commited on Jun 10

opencl: add `backend_synchronize` (llama/13939)

a9ce9a8

lhez commited on Jun 2

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)

5ff8785

rmatif commited on Jun 2

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787)

1ab0f23

lhez commited on May 27

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790)

4473109

lhez commited on May 27

opencl: Add support for multiple devices (llama/12622)

b6cddb5

Henry Linjamäki commited on May 21

opencl: fix couple crashes (llama/12795)

2eea73d

Henry Linjamäki commited on May 21

opencl: remove unnecessary assert for `add` (llama/13257)

a245fbf

lhez commited on May 12

opencl : remove obsolete files (skip) (ggml/1200)

adc6542

ggerganov commited on Apr 24

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886)

291a5b7

lhez Shangqing Gu commited on Apr 24

opencl: fix incorrect local_size index in profiling log (llama/12868)

8f5d919

kimminsu commited on Apr 16

opencl: better identify Adreno GPU (llama/12760)

5560cd6

lhez commited on Apr 7

opencl: use `max_alloc_size` in backend ctx instead of querying again (llama/12705)

3847456

lhez commited on Apr 3

opencl : fix memory allocation size (llama/12649)

b00a8a9

Sparkleholic commited on Apr 1

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600)

3261fcd

lhez commited on Mar 27

opencl: simplify kernel embedding logic in cmakefile (llama/12503)

5f131ac

lhez Max Krasnyansky commited on Mar 24

opencl: improve profiling (llama/12442)

4abe3ae

lhez commited on Mar 18

opencl: use OpenCL C standard supported by the device (llama/12221)

57028a7

Henry Linjamäki commited on Mar 10

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)

94449e3

lhez commited on Mar 7

opencl : fix buffer alignment (llama/12197)

7d25156

linehill commited on Mar 6

opencl : fix `ulong` kernel args were set from `int` variables (llama/12174)

67ffff0

linehill commited on Mar 6

opencl : fix profile-related errors (llama/12095)

e11a847

simon886212 ubuntu commited on Mar 6

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)

d6b6852

William Tambellini slaren commited on Feb 28

opencl: fix for small models (llama/11950)

4532dc6

lhez Shawn Gu Skyler Szot commited on Feb 24

opencl: Fix rope and softmax (llama/11833)

bf3b6f8

lhez commited on Feb 14

ggml : add opencl backend (skip) (llama/10693)

226358f

lhez Skyler Szot Shangqing Gu Alexander Angus Hongqiang Wang Max Krasnyansky commited on Jan 14

Spaces:

natasa365
/

whisper.cpp

Sleeping

Commit History

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)

d0a458b

opencl: add `backend_synchronize` (llama/13939)

a9ce9a8

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)

5ff8785

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787)

1ab0f23

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790)

4473109

opencl: Add support for multiple devices (llama/12622)

b6cddb5

opencl: fix couple crashes (llama/12795)

2eea73d

opencl: remove unnecessary assert for `add` (llama/13257)

a245fbf

opencl : remove obsolete files (skip) (ggml/1200)

adc6542

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886)

291a5b7

opencl: fix incorrect local_size index in profiling log (llama/12868)

8f5d919

opencl: better identify Adreno GPU (llama/12760)

5560cd6

opencl: use `max_alloc_size` in backend ctx instead of querying again (llama/12705)

3847456

opencl : fix memory allocation size (llama/12649)

b00a8a9

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600)

3261fcd

opencl: simplify kernel embedding logic in cmakefile (llama/12503)

5f131ac

opencl: improve profiling (llama/12442)

4abe3ae

opencl: use OpenCL C standard supported by the device (llama/12221)

57028a7

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)

94449e3

opencl : fix buffer alignment (llama/12197)

7d25156

opencl : fix `ulong` kernel args were set from `int` variables (llama/12174)

67ffff0

opencl : fix profile-related errors (llama/12095)

e11a847

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)

d6b6852

opencl: fix for small models (llama/11950)

4532dc6

opencl: Fix rope and softmax (llama/11833)

bf3b6f8

ggml : add opencl backend (skip) (llama/10693)

226358f

Commit History

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003) d0a458b

opencl: add `backend_synchronize` (llama/13939) a9ce9a8

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840) 5ff8785

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787) 1ab0f23

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790) 4473109

opencl: Add support for multiple devices (llama/12622) b6cddb5

opencl: fix couple crashes (llama/12795) 2eea73d

opencl: remove unnecessary assert for `add` (llama/13257) a245fbf

opencl : remove obsolete files (skip) (ggml/1200) adc6542

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886) 291a5b7

opencl: fix incorrect local_size index in profiling log (llama/12868) 8f5d919

opencl: better identify Adreno GPU (llama/12760) 5560cd6

opencl: use `max_alloc_size` in backend ctx instead of querying again (llama/12705) 3847456

opencl : fix memory allocation size (llama/12649) b00a8a9

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600) 3261fcd

opencl: simplify kernel embedding logic in cmakefile (llama/12503) 5f131ac

opencl: improve profiling (llama/12442) 4abe3ae

opencl: use OpenCL C standard supported by the device (llama/12221) 57028a7

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217) 94449e3

opencl : fix buffer alignment (llama/12197) 7d25156

opencl : fix `ulong` kernel args were set from `int` variables (llama/12174) 67ffff0

opencl : fix profile-related errors (llama/12095) e11a847

ggml : upgrade init_tensor API to return a ggml_status (llama/11854) d6b6852

opencl: fix for small models (llama/11950) 4532dc6

opencl: Fix rope and softmax (llama/11833) bf3b6f8

ggml : add opencl backend (skip) (llama/10693) 226358f

opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)

d0a458b

opencl: add `backend_synchronize` (llama/13939)

a9ce9a8

OpenCL: Add concat, tsembd, upscale, tanh, pad and repeat (llama/13840)

5ff8785

opencl: add new ops - `argsort`, `div`, `sub`, `addrows`, `sigmoid`, `group_norm` (llama/13787)

1ab0f23

opencl: mark `mul_mat` `f32f32` as supporting non-contiguous tensors (llama/13790)

4473109

opencl: Add support for multiple devices (llama/12622)

b6cddb5

opencl: fix couple crashes (llama/12795)

2eea73d

opencl: remove unnecessary assert for `add` (llama/13257)

a245fbf

opencl : remove obsolete files (skip) (ggml/1200)

adc6542

opencl: split ggml-opencl.cl into multiple files and cleanup (llama/12886)

291a5b7

opencl: fix incorrect local_size index in profiling log (llama/12868)

8f5d919

opencl: better identify Adreno GPU (llama/12760)

5560cd6

opencl: use `max_alloc_size` in backend ctx instead of querying again (llama/12705)

3847456

opencl : fix memory allocation size (llama/12649)

b00a8a9

opencl: add multi and vision rope, `gelu_quick` and `im2col` (llama/12600)

3261fcd

opencl: simplify kernel embedding logic in cmakefile (llama/12503)

5f131ac

opencl: improve profiling (llama/12442)

4abe3ae

opencl: use OpenCL C standard supported by the device (llama/12221)

57028a7

opencl: Noncontiguous `norm`, `rms_norm`, disable `fp16` for some ops (llama/12217)

94449e3

opencl : fix buffer alignment (llama/12197)

7d25156

opencl : fix `ulong` kernel args were set from `int` variables (llama/12174)

67ffff0

opencl : fix profile-related errors (llama/12095)

e11a847

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)

d6b6852

opencl: fix for small models (llama/11950)

4532dc6

opencl: Fix rope and softmax (llama/11833)

bf3b6f8

ggml : add opencl backend (skip) (llama/10693)

226358f