Commits · Xenobd/whisper.cpp

GGUF: C++ refactor, backend support, misc fixes (llama/11030)

21c5b64

JohannesGaessler committed on Jan 7, 2025

ggml-backend : only offload from host buffers (fix) (llama/11124)

9ac3c7e

Diego Devesa committed on Jan 7, 2025

ggml-backend : only offload from host buffers (llama/11120)

1ca87a8

Diego Devesa committed on Jan 7, 2025

rpc : code cleanup (llama/11107)

a0fb22d

rgerganov committed on Jan 7, 2025

SYCL: Use get_multi_ptr instead of deprecated get_pointer in wkv6 (llama/11087)

4ed93cc

qnixsynapse committed on Jan 7, 2025

CUDA: add BF16 support (llama/11093)

961ef57

JohannesGaessler committed on Jan 6, 2025

Vulkan: Add device-specific blacklist for coopmat for the AMD proprietary driver (llama/11074)

4d90c3d

OccamRazor committed on Jan 4, 2025

Support for models with non-512-aligned tensors over RPC. (llama/11047)

895a3a2

Billy462, Diego Devesa committed on Jan 4, 2025

fix: Vulkan shader gen binary path (llama/11037)

7008fb8

Gilad S. committed on Jan 4, 2025

ggml : allow loading backend with env variable (ggml/1059)

48aa6d0

rgerganov committed on Jan 5, 2025

scripts : sync opencl, gguf

f751550
unverified

ggerganov committed on Jan 14, 2025

whisper : fix gpu device selection (#2728)

87b427e
unverified

ggerganov committed on Jan 13, 2025

server : fix build (#2718)

7925ae3
unverified

ggerganov committed on Jan 13, 2025

talk-llama : sync llama.cpp (#2709)

b462700
unverified

ggerganov committed on Jan 13, 2025

server : generate unique tmp filenames (#2718)

89d94b1
unverified

NETZkultur GmbH committed on Jan 13, 2025

whisper : add whisper_full_get_segment_no_speech_prob_from_state (#2716)

cb32a92
unverified

Sandro Hanea committed on Jan 9, 2025

readme : add docker instructions (#2711)

28257a6
unverified

jayant-yadav committed on Jan 7, 2025

docs: Fix main -> whisper-cli in download scripts (#2707)

4abfe5a
unverified

Adam Jones committed on Jan 6, 2025

release : v1.7.4

c775ca4
unverified

ggerganov committed on Jan 6, 2025

ci : cont

6331634
unverified

ggerganov committed on Jan 6, 2025

ci : fix ubuntu runner names

9a3c061
unverified

ggerganov committed on Jan 6, 2025

cli : fix segfault on missing argument (#2700)

245a91f
unverified

Yusuf Redžić committed on Jan 4, 2025

ci : fix arm builds

31f91d9

ggerganov committed on Jan 3, 2025

sync : ggml

0211dda

ggerganov committed on Jan 3, 2025

ggml : do not install metal source when embed library (ggml/1054)

9615cf2

ggerganov committed on Jan 3, 2025

metal : avoid uint (llama/11019)

b788516

ggerganov committed on Jan 3, 2025

ggml : fixes for AVXVNNI instruction set with MSVC and Clang (llama/11027)

d13ac16

Srihari-mcw, slaren committed on Dec 31, 2024

vulkan: optimize mul_mat for small values of N (llama/10991)

5fc8eea

jeffbolznv committed on Dec 30, 2024

vulkan: im2col and matmul optimizations for stable diffusion (llama/10942)

beef268

jeffbolznv committed on Dec 29, 2024

vulkan: Use push constant offset to handle misaligned descriptors (llama/10987)

04e729a

jeffbolznv committed on Dec 29, 2024

vulkan: multi-row k quants (llama/10846)

3bf5be1

Eve committed on Dec 26, 2024

examples, ggml : fix GCC compiler warnings (llama/10983)

d7cf559

Peter committed on Dec 26, 2024

ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714)

b284406

Djip007 committed on Dec 24, 2024

ggml : use wstring for backend search paths (llama/10960)

656e8b1

Diego Devesa committed on Dec 24, 2024

ggml : fix arm enabled features check (llama/10961)

06cddad

Diego Devesa committed on Dec 24, 2024

ggml : fix const usage in SSE path (llama/10962)

38e6172

Diego Devesa committed on Dec 23, 2024

ggml : fix run-time on FreeBSD in get_executable_path() (llama/10948)

83b02bc

yuri@FreeBSD committed on Dec 23, 2024

vulkan: build fixes for 32b (llama/10927)

f1e76ce

jeffbolznv committed on Dec 22, 2024

vulkan: optimize coopmat2 dequant functions (llama/10855)

5e70c43

jeffbolznv committed on Dec 21, 2024

ggml-cpu: replace NEON asm with intrinsics in ggml_gemv_q4_0_4x8_q8_0() (llama/10874)

21f8a02

Adrien Gallouët committed on Dec 20, 2024

SYCL: Migrate away from deprecated ggml_tensor->backend (llama/10840)

a67a8ec

qnixsynapse committed on Dec 20, 2024

ggml : add test for SVE and disable when it fails (llama/10906)

c90c972

Diego Devesa committed on Dec 20, 2024

ggml: fix arm build with gcc (llama/10895)

43d87cd

Adrien Gallouët committed on Dec 19, 2024

ggml : fix arm build (llama/10890)

e58e7a9

Diego Devesa, Adrien Gallouët committed on Dec 18, 2024

tts : add OuteTTS support (llama/10784)

8d0f0ac

ggerganov committed on Dec 18, 2024

tests: add tests for GGUF (llama/10830)

e7722cb

JohannesGaessler committed on Dec 17, 2024

ggml : improve inputs log sched_print_assignments (ggml/1053)

4427ede

danbev committed on Dec 19, 2024

readme : fix real-time audio input example build instructions (#2692)

43720b1
unverified

Samuel Durante committed on Jan 2, 2025

objc : rename ggml-cpu-aarch64.c to .cpp (#2687)

e2f64bf
unverified

ego-ml committed on Jan 2, 2025

docs : replace Core ML with OpenVINO (#2686)

db94c1c
unverified

Konosuke Sakai committed on Jan 2, 2025
