whisper.cpp / ggml /src /ggml-cann /ggml-cann.cpp

Commit History

CANN: Simplify the environment variable setting(#13104)
f1535d7

dou112 commited on

CANN: Add the basic supports of Flash Attention kernel (llama/13627)
112c144

Bizhao Shi commited on

CANN: Support MUL_MAT_ID for q8_0 and q4_0 (llama/13705)
6a9f9dc

Chenguang Li commited on

CANN: Support MOE Model MUL_MAT_ID (llama/13042)
f013e2d

Chenguang Li commited on

CANN: Add support for async operator submission (llama/12864)
1b9d0f0

hipudding commited on

CANN: Add 310P operator support check (llama/12962)
14d0d7c

Chenguang Li commited on

CANN: Add x86 build ci (llama/12950)
f4c9b36

hipudding commited on

CANN: Opt ROPE optimization (llama/12865)
3773a09

Chenguang Li commited on

CANN: Optimize CANN buffer pool memory management (llama/12875)
66b93b3

dou112 commited on

CANN: Support more ops (llama/12841)
6aecea5

Chenguang Li commited on

CANN: Support Opt CONV_TRANSPOSE_1D and ELU (llama/12786)
3b46fdc

Chenguang Li commited on

ggml : add bilinear upscale support (ggml/1185)
4c5e449

Diego Devesa commited on

CANN: fix typo in ggml-cann (llama/12733)
65ced74

jeffzhou2000 commited on

CANN: Refactor to reduce duplicate code (llama/12731)
44ac81c

hipudding commited on

CANN: Support operator SIN COS ARGMAX (llama/12709)
904aaf5

Chenguang Li noemotiovon commited on

CANN: Fix failed test cases (llama/12708)
7d5f3d4

hipudding commited on

get_rows and dup optimization (llama/12671)
ffa5f14

Chenguang Li noemotiovon hipudding commited on

MUL_MAT optimization (llama/12382)
9dd08d5

Chenguang Li commited on

ggml : upgrade init_tensor API to return a ggml_status (llama/11854)
d6b6852

William Tambellini slaren commited on

llama : add Qwen2VL support + multimodal RoPE (llama/10361)
219d12b

RzZ ggerganov commited on

ggml : refactor online repacking (llama/10446)
163128e

Djip007 ggerganov commited on

CANN: RoPE operator optimization (llama/10563)
3ad7b0a

Chenguang Li noemotiovon commited on

CANN: ROPE operator optimization (llama/10540)
63ee002

Chenguang Li noemotiovon commited on

CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)
f9fd6d6

Shanshan Shen shanshan shen Frank Mai commited on

CANN: RoPE and CANCAT operator optimization (llama/10488)
b357ea7

Chenguang Li noemotiovon commited on

ggml : add support for dynamic loading of backends (llama/10469)
b73266f

Diego Devesa ggerganov commited on

ggml : build backends as libraries (llama/10256)
3dc93f3

Diego Devesa ggerganov R0CKSTAR commited on