Commits · Xenobd/whisper.cpp

build : fix aarch64 (#0)

55befbb

ggerganov committed on Aug 8, 2024

talk-llama : sync llama.cpp

a40d0a7

ggerganov committed on Aug 8, 2024

sync : ggml

96e8b15

ggerganov committed on Aug 8, 2024

ggml-backend : fix async copy from CPU (llama/8897)

050174c

slaren committed on Aug 7, 2024

Updated SYCL device filtering (llama/8901)

64976cd

Ouadie EL FAROUKI committed on Aug 7, 2024

CUDA/HIP: fix tests/test-backend-ops (llama/8896)

f14c1ad

JohannesGaessler committed on Aug 7, 2024

CUDA: fix padding logic for FP16/FP32 (llama/8884)

643bcdb

JohannesGaessler committed on Aug 6, 2024

ggml : add epsilon as a parameter for group_norm (llama/8818)

d003891

mollysama committed on Aug 6, 2024

ggml : fix overflows in elu function (llama/8866)

a12468a

Justine Tunney committed on Aug 5, 2024

ggml : reading the runtime sve config of the cpu (llama/8709)

c26339f

jdomke domke committed on Aug 3, 2024

Fix conversion of unnormalized BF16->BF16 weights (llama/7843)

8b10f59

Sigbjørn Skjæret

compilade committed on Aug 2, 2024

Fixing wrong VDR iq4nl value (llama/8812)

30eb7bc

Ouadie EL FAROUKI committed on Aug 2, 2024

ggml-cuda: Adding support for unified memory (llama/8035)

686bb18

matteogeniaccio matteo serva

JohannesGaessler committed on Aug 1, 2024

Build: Only include execinfo.h on linux systems that support it (llama/8783)

0019ddb

Alex O'Connell committed on Aug 1, 2024

cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (llama/8800)

73e80d1

slaren committed on Aug 1, 2024

added android implementation of ggml_print_backtrace_symbols (llama/8751)

314d58a

l3utterfly slaren committed on Jul 30, 2024

cann: update cmake (llama/8765)

345a58d

wangshuai09 committed on Jul 30, 2024

Add `TIMESTEP_EMBEDDING` OP (llama/8707)

52eea23

zhentaoyu committed on Jul 30, 2024

ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (llama/8748)

6989631

carterli carter.li committed on Jul 29, 2024

cuda : organize vendor-specific headers into vendors directory (llama/8746)

ec2f307

R0CKSTAR committed on Jul 29, 2024

add conv support (llama/8688)

f0d6f5c

hengyu committed on Jul 29, 2024

feat: Support Moore Threads GPU (llama/8383)

a35db11

yeahdongcn committed on Jul 27, 2024

ggml : ignore more msvc warnings (ggml/906)

1b11fde

stanimirovb committed on Aug 7, 2024

metal : fix struct name (ggml/912)

14cf8db

ggerganov committed on Aug 7, 2024

metal : add abort callback (ggml/905)

b822172

conradev committed on Aug 7, 2024

vulkan : implement Stable Diffusion operators (ggml/904)

124c156

OccamRazor committed on Aug 4, 2024

ggml : move c parameter comment to ggml_rope_ext (ggml/901)

6d34596

danbev committed on Jul 29, 2024

ggml : resolve sync conflicst (ggml/0)

82658f5

ggerganov committed on Jul 27, 2024

common : handle new quant types (ggml/0)

53bb541

ggerganov committed on Jul 27, 2024

ggml : add ggml-aarch64 (ggml/0)

0062819

Dibakar Gope committed on Jul 27, 2024

ggml : reduce hash table reset cost (llama/8698)

9808fbf

slaren committed on Jul 27, 2024

ggml: handle ggml_init failure to fix NULL pointer deref (llama/8692)

dc51517

DavidKorczynski committed on Jul 25, 2024

fix multi-gpu issue on sycl (llama/8554)

94a6436

Chen Xi

hengyu committed on Jul 25, 2024

ggml : add and use ggml_cpu_has_llamafile() (llama/8664)

efcca56

ggerganov committed on Jul 25, 2024

Re-add erroneously removed -fsycl from GGML_EXTRA_LIBS (llama/8667)

6e12dfd

Joe Todd committed on Jul 24, 2024

sycl : Add support for non-release DPC++ & oneMKL (llama/8644)

2a5814c

Joe Todd committed on Jul 23, 2024

Vulkan IQ4_NL Support (llama/8613)

899145d

OccamRazor committed on Jul 23, 2024

Allow all RDNA2 archs to use sdot4 intrinsic (llama/8629)

1d65fea

Jeroen Mostert committed on Jul 23, 2024

fix scratch size of softmax (llama/8642)

6519fd2

KevinLy committed on Jul 23, 2024

ggml: fix compile error for RISC-V (llama/8623)

4eec44b

Mark Zhuang committed on Jul 22, 2024

CUDA: MMQ code deduplication + iquant support (llama/8495)

6d14124

JohannesGaessler committed on Jul 20, 2024

gguf : handle null name during init (llama/8587)

2f95156

ggerganov committed on Jul 20, 2024

ggml : fix quant dot product with odd number of blocks (llama/8549)

0083f96

slaren

ggerganov committed on Jul 19, 2024

ggml : add friendlier error message to fopen errors (llama/8575)

ab5b4e0

HanClinto committed on Jul 19, 2024

CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572)

afc137c

JohannesGaessler committed on Jul 18, 2024

cmake : install all ggml public headers (llama/8480)

73a16f3

65a 65a committed on Jul 18, 2024

Add Ascend NPU backend (llama/6035)

3175a17

hipudding

wangshuai09 committed on Jul 17, 2024

make/cmake: add missing force MMQ/cuBLAS for HIP (llama/8515)

5096c91

JohannesGaessler committed on Jul 16, 2024

Refactor lora adapter support (llama/8332)

76bcfc6

Xuan Son Nguyen slaren

compilade committed on Jul 15, 2024

add concat through dim 1/2 (llama/8483)

acf23d9

hengyu committed on Jul 15, 2024
