Commits · Xenobd/whisper.cpp

tts : add OuteTTS support (llama/10784)

8d0f0ac

ggerganov commited on Dec 18, 2024

tests: add tests for GGUF (llama/10830)

e7722cb

JohannesGaessler commited on Dec 17, 2024

llama : add Qwen2VL support + multimodal RoPE (llama/10361)

219d12b

RzZ

ggerganov commited on Dec 14, 2024

ggml : add check for grad_accs (ggml/1046)

eacc95c

danbev commited on Dec 13, 2024

ggml : refactor online repacking (llama/10446)

163128e

Djip007

ggerganov commited on Dec 7, 2024

ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034)

154bbc0

PABannier commited on Dec 3, 2024

ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)

bf73242

shupeif commited on Nov 28, 2024

ggml : add support for dynamic loading of backends (llama/10469)

b73266f

Diego Devesa

ggerganov commited on Nov 25, 2024

cuda : optimize argmax (llama/10441)

69ae50d

Diego Devesa

JohannesGaessler commited on Nov 21, 2024

ggml-opt: fix data corruption (ggml/1022)

a916e92

JohannesGaessler commited on Nov 20, 2024

ggml : fix compile warnings (llama/0)

80d6ec0

ggerganov commited on Nov 16, 2024

ggml: new optimization interface (ggml/988)

dd33ace

JohannesGaessler commited on Nov 16, 2024

ggml : fix some build issues

c5ba1d1

slaren commited on Nov 15, 2024

ggml : build backends as libraries (llama/10256)

3dc93f3

Diego Devesa

ggerganov R0CKSTAR commited on Nov 14, 2024

metal : optimize FA kernels (llama/10171)

44ff932

ggerganov commited on Nov 8, 2024

Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (llama/10133)

f58e658

Zhiyuan Li

ggerganov Diego Devesa

pacominev Yuri Khrustalev Meng, Hengyu commited on Nov 7, 2024

ggml : adjust is_first_call init value (llama/10193)

7e2b09b

ggerganov commited on Nov 6, 2024

ggml : fix arch check in bf16_to_fp32 (llama/10164)

09e4a9b

Diego Devesa commited on Nov 4, 2024

ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (llama/10167)

ba20d5c

Diego Devesa commited on Nov 4, 2024

ggml : move CPU backend to a separate file (llama/10144)

0f447f2

Diego Devesa commited on Nov 3, 2024

ggml : remove ggml_scratch (llama/10121)

3f0b7ba

ggerganov commited on Nov 1, 2024

llama : fix buffer checks for mamba and rwk (llama/10111)

9df9767

Diego Devesa commited on Oct 31, 2024

ggml : check tensor name lengths in gguf files (llama/10100)

0b78224

Diego Devesa commited on Oct 31, 2024

ggml : fix memory leaks when loading invalid gguf files (llama/10094)

f9baffc

Diego Devesa commited on Oct 30, 2024

llama : refactor model loader with backend registry (llama/10026)

582a21e

Diego Devesa commited on Oct 30, 2024

CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021)

bcbaad3

JohannesGaessler commited on Oct 24, 2024

ggml : add asserts for type conversion in fattn kernels (llama/9971)

9542e42

ggerganov commited on Oct 21, 2024

add amx kernel for gemm (llama/8998)

db52137

mingfeima commited on Oct 18, 2024

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)

cf75979

Gilad S commited on Oct 16, 2024

ggml : fix BLAS with unsupported types (llama/9775)

0a93e1b

Diego Devesa commited on Oct 8, 2024

ggml : alloc ggml_contexts on the heap (#2525)

3ccf40a
unverified

ggerganov commited on Oct 31, 2024

ggml-backend : add device and backend reg interfaces (llama/9707)

9d74d85

Diego Devesa commited on Oct 3, 2024

ggml-backend : add device and backend reg interfaces (llama/9707)

1bdb50a

Diego Devesa

JohannesGaessler commited on Oct 2, 2024

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)

52069b8

JohannesGaessler commited on Oct 3, 2024

ggml: refactor cross entropy loss CPU impl. (ggml/976)

2a0805f

JohannesGaessler commited on Oct 2, 2024

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)

76aa810

JohannesGaessler commited on Sep 30, 2024

ggml : fix ggml_cast (ggml/973)

c44d575

stanimirovb commited on Sep 30, 2024

ggml: fix gradient allocation logic (ggml/966)

ad3f29d

JohannesGaessler commited on Sep 29, 2024

ggml : define missing HWCAP flags (llama/9684)

1d52105

ggerganov Willy Tarreau commited on Sep 29, 2024

ggml : add run-time detection of neon, i8mm and sve (llama/9331)

12c0e23

Dan Johansson commited on Sep 28, 2024

threads: fix msvc build without openmp (llama/9615)

97b3eb5

Max Krasnyansky commited on Sep 24, 2024

threads: improve ggml_barrier scaling with large number of threads (llama/9598)

aca04d5

Max Krasnyansky commited on Sep 23, 2024

examples : adapt to ggml.h changes (ggml/0)

91c7734

ggerganov commited on Sep 20, 2024

ggml : refactoring (llama/#0)

1b62c96

ggerganov commited on Sep 20, 2024

ggml : fix n_threads_cur initialization with one thread (llama/9538)

af82b69

slaren Max Krasnyansky commited on Sep 18, 2024

threadpool : skip polling for unused threads (llama/9461)

9d11a7a

Max Krasnyansky commited on Sep 17, 2024

ggml : ggml_type_name return "NONE" for invalid values (llama/9458)

8a1bb27

Yuri Khrustalev commited on Sep 14, 2024

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442)

f77ad34

Ahmad Tameem commited on Sep 12, 2024

rpc : fix segfault with nkvo (llama/9389)

66ce884

rgerganov slaren commited on Sep 9, 2024

ggml/examples: add backend support for numerical optimization (ggml/949)

5c178b0

JohannesGaessler

ggerganov slaren commited on Sep 20, 2024

Commit History

tts : add OuteTTS support (llama/10784) 8d0f0ac

tests: add tests for GGUF (llama/10830) e7722cb

llama : add Qwen2VL support + multimodal RoPE (llama/10361) 219d12b

ggml : add check for grad_accs (ggml/1046) eacc95c

ggml : refactor online repacking (llama/10446) 163128e

ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034) 154bbc0

ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541) bf73242

ggml : add support for dynamic loading of backends (llama/10469) b73266f

cuda : optimize argmax (llama/10441) 69ae50d

ggml-opt: fix data corruption (ggml/1022) a916e92

ggml : fix compile warnings (llama/0) 80d6ec0

ggml: new optimization interface (ggml/988) dd33ace

ggml : fix some build issues c5ba1d1

ggml : build backends as libraries (llama/10256) 3dc93f3

metal : optimize FA kernels (llama/10171) 44ff932

Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (llama/10133) f58e658

ggml : adjust is_first_call init value (llama/10193) 7e2b09b

ggml : fix arch check in bf16_to_fp32 (llama/10164) 09e4a9b

ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (llama/10167) ba20d5c

ggml : move CPU backend to a separate file (llama/10144) 0f447f2

ggml : remove ggml_scratch (llama/10121) 3f0b7ba

llama : fix buffer checks for mamba and rwk (llama/10111) 9df9767

ggml : check tensor name lengths in gguf files (llama/10100) 0b78224

ggml : fix memory leaks when loading invalid gguf files (llama/10094) f9baffc

llama : refactor model loader with backend registry (llama/10026) 582a21e

CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021) bcbaad3

ggml : add asserts for type conversion in fattn kernels (llama/9971) 9542e42

add amx kernel for gemm (llama/8998) db52137

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875) cf75979

ggml : fix BLAS with unsupported types (llama/9775) 0a93e1b

ggml : alloc ggml_contexts on the heap (#2525) 3ccf40a unverified

ggml-backend : add device and backend reg interfaces (llama/9707) 9d74d85

ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8

ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) 76aa810

ggml : fix ggml_cast (ggml/973) c44d575

ggml: fix gradient allocation logic (ggml/966) ad3f29d

ggml : define missing HWCAP flags (llama/9684) 1d52105

ggml : add run-time detection of neon, i8mm and sve (llama/9331) 12c0e23

threads: fix msvc build without openmp (llama/9615) 97b3eb5

threads: improve ggml_barrier scaling with large number of threads (llama/9598) aca04d5

examples : adapt to ggml.h changes (ggml/0) 91c7734

ggml : refactoring (llama/#0) 1b62c96

ggml : fix n_threads_cur initialization with one thread (llama/9538) af82b69

threadpool : skip polling for unused threads (llama/9461) 9d11a7a

ggml : ggml_type_name return "NONE" for invalid values (llama/9458) 8a1bb27

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442) f77ad34

rpc : fix segfault with nkvo (llama/9389) 66ce884

ggml/examples: add backend support for numerical optimization (ggml/949) 5c178b0

tts : add OuteTTS support (llama/10784)

8d0f0ac

tests: add tests for GGUF (llama/10830)

e7722cb

llama : add Qwen2VL support + multimodal RoPE (llama/10361)

219d12b

ggml : add check for grad_accs (ggml/1046)

eacc95c

ggml : refactor online repacking (llama/10446)

163128e

ggml : add `GGML_PAD_REFLECT_1D` operation (ggml/1034)

154bbc0

ggml-cpu: support IQ4_NL_4_4 by runtime repack (llama/10541)

bf73242

ggml : add support for dynamic loading of backends (llama/10469)

b73266f

cuda : optimize argmax (llama/10441)

69ae50d

ggml-opt: fix data corruption (ggml/1022)

a916e92

ggml : fix compile warnings (llama/0)

80d6ec0

ggml: new optimization interface (ggml/988)

dd33ace

ggml : fix some build issues

c5ba1d1

ggml : build backends as libraries (llama/10256)

3dc93f3

metal : optimize FA kernels (llama/10171)

44ff932

Optimize RWKV6 Operator Naming and Implement Multi-core CPU/ SYCL Acceleration (llama/10133)

f58e658

ggml : adjust is_first_call init value (llama/10193)

7e2b09b

ggml : fix arch check in bf16_to_fp32 (llama/10164)

09e4a9b

ggml : fix q4xx mat mul, increase ggml_aligned_malloc alignment (llama/10167)

ba20d5c

ggml : move CPU backend to a separate file (llama/10144)

0f447f2

ggml : remove ggml_scratch (llama/10121)

3f0b7ba

llama : fix buffer checks for mamba and rwk (llama/10111)

9df9767

ggml : check tensor name lengths in gguf files (llama/10100)

0b78224

ggml : fix memory leaks when loading invalid gguf files (llama/10094)

f9baffc

llama : refactor model loader with backend registry (llama/10026)

582a21e

CUDA: fix MMQ for non-contiguous src0, add tests (llama/10021)

bcbaad3

ggml : add asserts for type conversion in fattn kernels (llama/9971)

9542e42

add amx kernel for gemm (llama/8998)

db52137

fix: use `vm_allocate` to allocate CPU backend buffer on macOS (llama/9875)

cf75979

ggml : fix BLAS with unsupported types (llama/9775)

0a93e1b

ggml : alloc ggml_contexts on the heap (#2525)

3ccf40a
unverified

ggml-backend : add device and backend reg interfaces (llama/9707)

9d74d85

ggml-backend : add device and backend reg interfaces (llama/9707)

1bdb50a

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)

52069b8

ggml: refactor cross entropy loss CPU impl. (ggml/976)

2a0805f

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)

76aa810

ggml : fix ggml_cast (ggml/973)

c44d575

ggml: fix gradient allocation logic (ggml/966)

ad3f29d

ggml : define missing HWCAP flags (llama/9684)

1d52105

ggml : add run-time detection of neon, i8mm and sve (llama/9331)

12c0e23

threads: fix msvc build without openmp (llama/9615)

97b3eb5

threads: improve ggml_barrier scaling with large number of threads (llama/9598)

aca04d5

examples : adapt to ggml.h changes (ggml/0)

91c7734

ggml : refactoring (llama/#0)

1b62c96

ggml : fix n_threads_cur initialization with one thread (llama/9538)

af82b69

threadpool : skip polling for unused threads (llama/9461)

9d11a7a

ggml : ggml_type_name return "NONE" for invalid values (llama/9458)

8a1bb27

riscv : modify Makefile and add a RISCV_VECT to print log info (llama/9442)

f77ad34

rpc : fix segfault with nkvo (llama/9389)

66ce884

ggml/examples: add backend support for numerical optimization (ggml/949)

5c178b0