Commits · natasa365/whisper.cpp

vulkan : retry allocation with fallback flags (#2451)

9e91cbc
unverified

SRHMorris

fdsffdsafds commited on Oct 6, 2024

release : v1.7.0

5abc285
unverified

ggerganov commited on Oct 5, 2024

scripts : bench v3-turbo

22b5adf
unverified

ggerganov commited on Oct 5, 2024

whisper : remove mel leftover constants (396089f)

505ec31
unverified

ggerganov commited on Oct 5, 2024

whisper : zero-out the KV cache upon clear (#2445)

b8af443

ggerganov commited on Oct 5, 2024

objc : fix build

564bce1

ggerganov commited on Oct 5, 2024

metal : zero-init buffer contexts (#0)

d651546

ggerganov commited on Oct 5, 2024

whisper : revert mel-related changes (#0)

7708c26

ggerganov commited on Oct 5, 2024

whisper : adapt to latest ggml (skip) (#0)

ad9dd7b

ggerganov commited on Oct 5, 2024

ggml : fix typo in example usage ggml_gallocr_new (ggml/984)

30a097b

danbev commited on Oct 4, 2024

ggml : fixes after sync (ggml/983)

237c05a

Diego Devesa commited on Oct 4, 2024

ggml-backend : add device and backend reg interfaces (llama/9707)

9d74d85

Diego Devesa commited on Oct 3, 2024

Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711)

5239c28

Ouadie EL FAROUKI commited on Oct 3, 2024

ggml-backend : add device and backend reg interfaces (llama/9707)

1bdb50a

Diego Devesa

JohannesGaessler commited on Oct 2, 2024

Initial cmake support of SYCL for AMD GPUs (llama/9658)

7d7ac98

Alberto Cabrera Pérez commited on Oct 2, 2024

vulkan : do not use tensor->extra (llama/9407)

7d66a68

rgerganov

OccamRazor commited on Oct 2, 2024

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)

52069b8

JohannesGaessler commited on Oct 3, 2024

ggml: refactor cross entropy loss CPU impl. (ggml/976)

2a0805f

JohannesGaessler commited on Oct 2, 2024

scripts : sync ggml-backend.cpp

26efed1

ggerganov commited on Oct 5, 2024

whisper : fix excessive memory usage (#2443)

afe3785
unverified

ggerganov commited on Oct 5, 2024

examples : update dr_wav.h to newer version (#2449)

d678325
unverified

Rahul Vadhyar commited on Oct 4, 2024

talk-llama : sync llama.cpp

c9ddda2

ggerganov commited on Oct 2, 2024

metal : reduce command encoding overhead (llama/9698)

43d5a06

ggerganov commited on Oct 2, 2024

sync : ggml

c5e24da

ggerganov commited on Oct 2, 2024

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)

76aa810

JohannesGaessler commited on Sep 30, 2024

vulkan : mul_mat: fix UB with small warps (ggml/952)

d1a29c6

smeso commited on Sep 30, 2024

ggml : fix ggml_cast (ggml/973)

c44d575

stanimirovb commited on Sep 30, 2024

ggml: fix gradient allocation logic (ggml/966)

ad3f29d

JohannesGaessler commited on Sep 29, 2024

ggml : define missing HWCAP flags (llama/9684)

1d52105

ggerganov Willy Tarreau commited on Sep 29, 2024

ggml : add run-time detection of neon, i8mm and sve (llama/9331)

12c0e23

Dan Johansson commited on Sep 28, 2024

Enable use to the rebar feature to upload buffers to the device. (llama/9251)

760f8c2

Markus Tavenrath commited on Sep 28, 2024

mtgpu: enable VMM (llama/9597)

e84b4f5

R0CKSTAR commited on Sep 26, 2024

ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217)

50395aa

Charles Xu commited on Sep 25, 2024

cann: fix crash when llama-bench is running on multiple cann devices (llama/9627)

068c697

dou112 commited on Sep 25, 2024

CUDA: remove bad assert (ggml/972)

91954a7

JohannesGaessler commited on Sep 29, 2024

vulkan : multithread pipeline creation (ggml/963)

ba60f98

jeffbolznv commited on Sep 29, 2024

vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961)

85e2387

jeffbolznv commited on Sep 27, 2024

vulkan : argsort barriers must be under uniform control flow (ggml/951)

b2602d7

smeso commited on Sep 26, 2024

ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969)

ad34655

ggerganov commited on Sep 24, 2024

server : ffmpeg overwrite leftover temp file (#2431)

2dafb8e
unverified

dynafire commited on Oct 2, 2024

whisper : add large-v3-turbo (#2440)

f3283ba
unverified

ggerganov commited on Oct 1, 2024

tests : remove test-backend-ops (#2434)

050ba38
unverified

ggerganov commited on Sep 27, 2024

ci : disable failing CUDA and Java builds

ecef312
unverified

ggerganov commited on Sep 25, 2024

readme : fix references to download-ggml-model.sh (#2427)

3d92452
unverified

Hugo commited on Sep 24, 2024

make : remove "talk" target until updated

5fb8fce

ggerganov commited on Sep 24, 2024

ggml : add ggml-cpu-impl.h (skip) (#0)

958f2d3

ggerganov commited on Sep 24, 2024

sync : ggml

e22e2f8

ggerganov commited on Sep 24, 2024

talk-llama : sync llama.cpp

f91f98d

ggerganov commited on Sep 24, 2024

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622)

14b5848

Eric Zhang commited on Sep 24, 2024

log : add CONT level for continuing previous log entry (llama/9610)

a29a4c5

ggerganov commited on Sep 24, 2024

Commit History

vulkan : retry allocation with fallback flags (#2451) 9e91cbc unverified

release : v1.7.0 5abc285 unverified

scripts : bench v3-turbo 22b5adf unverified

whisper : remove mel leftover constants (396089f) 505ec31 unverified

whisper : zero-out the KV cache upon clear (#2445) b8af443

objc : fix build 564bce1

metal : zero-init buffer contexts (#0) d651546

whisper : revert mel-related changes (#0) 7708c26

whisper : adapt to latest ggml (skip) (#0) ad9dd7b

ggml : fix typo in example usage ggml_gallocr_new (ggml/984) 30a097b

ggml : fixes after sync (ggml/983) 237c05a

ggml-backend : add device and backend reg interfaces (llama/9707) 9d74d85

Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711) 5239c28

ggml-backend : add device and backend reg interfaces (llama/9707) 1bdb50a

Initial cmake support of SYCL for AMD GPUs (llama/9658) 7d7ac98

vulkan : do not use tensor->extra (llama/9407) 7d66a68

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980) 52069b8

ggml: refactor cross entropy loss CPU impl. (ggml/976) 2a0805f

scripts : sync ggml-backend.cpp 26efed1

whisper : fix excessive memory usage (#2443) afe3785 unverified

examples : update dr_wav.h to newer version (#2449) d678325 unverified

talk-llama : sync llama.cpp c9ddda2

metal : reduce command encoding overhead (llama/9698) 43d5a06

sync : ggml c5e24da

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974) 76aa810

vulkan : mul_mat: fix UB with small warps (ggml/952) d1a29c6

ggml : fix ggml_cast (ggml/973) c44d575

ggml: fix gradient allocation logic (ggml/966) ad3f29d

ggml : define missing HWCAP flags (llama/9684) 1d52105

ggml : add run-time detection of neon, i8mm and sve (llama/9331) 12c0e23

Enable use to the rebar feature to upload buffers to the device. (llama/9251) 760f8c2

mtgpu: enable VMM (llama/9597) e84b4f5

ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217) 50395aa

cann: fix crash when llama-bench is running on multiple cann devices (llama/9627) 068c697

CUDA: remove bad assert (ggml/972) 91954a7

vulkan : multithread pipeline creation (ggml/963) ba60f98

vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961) 85e2387

vulkan : argsort barriers must be under uniform control flow (ggml/951) b2602d7

ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969) ad34655

server : ffmpeg overwrite leftover temp file (#2431) 2dafb8e unverified

whisper : add large-v3-turbo (#2440) f3283ba unverified

tests : remove test-backend-ops (#2434) 050ba38 unverified

ci : disable failing CUDA and Java builds ecef312 unverified

readme : fix references to download-ggml-model.sh (#2427) 3d92452 unverified

make : remove "talk" target until updated 5fb8fce

ggml : add ggml-cpu-impl.h (skip) (#0) 958f2d3

sync : ggml e22e2f8

talk-llama : sync llama.cpp f91f98d

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622) 14b5848

log : add CONT level for continuing previous log entry (llama/9610) a29a4c5

vulkan : retry allocation with fallback flags (#2451)

9e91cbc
unverified

release : v1.7.0

5abc285
unverified

scripts : bench v3-turbo

22b5adf
unverified

whisper : remove mel leftover constants (396089f)

505ec31
unverified

whisper : zero-out the KV cache upon clear (#2445)

b8af443

objc : fix build

564bce1

metal : zero-init buffer contexts (#0)

d651546

whisper : revert mel-related changes (#0)

7708c26

whisper : adapt to latest ggml (skip) (#0)

ad9dd7b

ggml : fix typo in example usage ggml_gallocr_new (ggml/984)

30a097b

ggml : fixes after sync (ggml/983)

237c05a

ggml-backend : add device and backend reg interfaces (llama/9707)

9d74d85

Fixed dequant precision issues in Q4_1 and Q5_1 (llama/9711)

5239c28

ggml-backend : add device and backend reg interfaces (llama/9707)

1bdb50a

Initial cmake support of SYCL for AMD GPUs (llama/9658)

7d7ac98

vulkan : do not use tensor->extra (llama/9407)

7d66a68

ggml/ex: calculate accuracy in graph, adapt MNIST (ggml/980)

52069b8

ggml: refactor cross entropy loss CPU impl. (ggml/976)

2a0805f

scripts : sync ggml-backend.cpp

26efed1

whisper : fix excessive memory usage (#2443)

afe3785
unverified

examples : update dr_wav.h to newer version (#2449)

d678325
unverified

talk-llama : sync llama.cpp

c9ddda2

metal : reduce command encoding overhead (llama/9698)

43d5a06

sync : ggml

c5e24da

test: fix OPT_STEP_ADAMW for test-backend-ops (ggml/974)

76aa810

vulkan : mul_mat: fix UB with small warps (ggml/952)

d1a29c6

ggml : fix ggml_cast (ggml/973)

c44d575

ggml: fix gradient allocation logic (ggml/966)

ad3f29d

ggml : define missing HWCAP flags (llama/9684)

1d52105

ggml : add run-time detection of neon, i8mm and sve (llama/9331)

12c0e23

Enable use to the rebar feature to upload buffers to the device. (llama/9251)

760f8c2

mtgpu: enable VMM (llama/9597)

e84b4f5

ggml : remove assert for AArch64 GEMV and GEMM Q4 kernels (llama/9217)

50395aa

cann: fix crash when llama-bench is running on multiple cann devices (llama/9627)

068c697

CUDA: remove bad assert (ggml/972)

91954a7

vulkan : multithread pipeline creation (ggml/963)

ba60f98

vulkan : fix build for GGML_VULKAN_RUN_TESTS, add TFLOPS to log (ggml/961)

85e2387

vulkan : argsort barriers must be under uniform control flow (ggml/951)

b2602d7

ggml : fix GGML_MAX_N_THREADS + improve formatting (ggml/969)

ad34655

server : ffmpeg overwrite leftover temp file (#2431)

2dafb8e
unverified

whisper : add large-v3-turbo (#2440)

f3283ba
unverified

tests : remove test-backend-ops (#2434)

050ba38
unverified

ci : disable failing CUDA and Java builds

ecef312
unverified

readme : fix references to download-ggml-model.sh (#2427)

3d92452
unverified

make : remove "talk" target until updated

5fb8fce

ggml : add ggml-cpu-impl.h (skip) (#0)

958f2d3

sync : ggml

e22e2f8

talk-llama : sync llama.cpp

f91f98d

ggml : add AVX512DQ requirement for AVX512 builds (llama/9622)

14b5848

log : add CONT level for continuing previous log entry (llama/9610)

a29a4c5