Commits · Xenobd/whisper.cpp

talk-llama : sync llama.cpp

7ef5ccc

ggerganov commited on Jul 8, 2024

examples : fix compile warnings [no ci] (#0)

32cfce9

ggerganov commited on Jul 8, 2024

sync : ggml

6ef5667

ggerganov commited on Jul 8, 2024

ggml : sync sycl (skip) (#0)

bf6ccee

ggerganov commited on Jul 8, 2024

scripts : fix sync scripts

e2461ca

ggerganov commited on Jul 8, 2024

ggml : remove unnecessary UNUSED macro call (ggml/880)

ab9a7d0

danbev commited on Jul 8, 2024

cmake : add GGML_BUILD and GGML_SHARED macro definitions (llama/8281)

a8f9bda

KafuuChino commited on Jul 5, 2024

Enabled more data types for oneMKL gemm_batch (llama/8236)

08501f8

Ouadie EL FAROUKI commited on Jul 5, 2024

CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278)

8411e3c

JohannesGaessler commited on Jul 5, 2024

CUDA: revert part of the RDNA1 optimizations (llama/8309)

fcd0c52

Daniele commited on Jul 5, 2024

CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)

04d4209

JohannesGaessler commited on Jul 5, 2024

Fix WARP_SIZE=16 bug of Intel GPU (llama/8266)

1ce11e2

KevinLy commited on Jul 5, 2024

rm get_work_group_size() by local cache for performance (llama/8286)

08fd758

Neo Zhang Jianyu arthw commited on Jul 5, 2024

Define and optimize RDNA1 (llama/8085)

6aa5a89

Daniele commited on Jul 3, 2024

fix typo (llama/8267)

0c9c7c8

Judd Judd commited on Jul 3, 2024

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (llama/8258)

cc49462

HanClinto commited on Jul 2, 2024

cuda : update supports_op for matrix multiplication (llama/8245)

2314334

slaren commited on Jul 2, 2024

Fix win build conflict of math library (llama/8230)

5a33963

KevinLy commited on Jul 2, 2024

Fix the sub group size of Intel (llama/8106)

2dd429e

KevinLy commited on Jul 2, 2024

CUDA: refactor and optimize IQ MMVQ (llama/8215)

afa1447

JohannesGaessler commited on Jul 1, 2024

Update SYCL-Rope op and Refactor (llama/8157)

06acee2

zhentaoyu commited on Jul 1, 2024

CUDA: fix MMQ stream-k for --split-mode row (llama/8167)

ef3d018

JohannesGaessler commited on Jun 27, 2024

feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854)

025493b

John Balis slaren commited on Jul 2, 2024

ci : disable java build

b5bb445
unverified

ggerganov commited on Jul 8, 2024

server : add inference path to make OAI API compatible (#2270)

66a3eb1
unverified

eschmidbauer commited on Jul 8, 2024

sync : ggml + fix sync script

bce6859
unverified

ggerganov commited on Jun 26, 2024

make : disable CUDA graphs

ab5ee59
unverified

ggerganov commited on Jun 26, 2024

ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (llama/8140)

e83fdad
unverified

slaren commited on Jun 26, 2024

make : disable CUDA mel build

7d13d39
unverified

ggerganov commited on Jun 26, 2024

cmake : minor fixes

369b16c
unverified

ggerganov commited on Jun 26, 2024

make : fix missing -O3

9cccc55
unverified

ggerganov commited on Jun 26, 2024

whisper : disable CUDA mel + fix FFMPEG

2831df8
unverified

ggerganov commited on Jun 26, 2024

sync : ggml

cd6e534
unverified

ggerganov commited on Jun 26, 2024

whisper : reorganize source code + improve CMake (#2256)

f75c2e3
unverified

ggerganov commited on Jun 26, 2024

whisper : optimize fft() function (#2242)

cc603fa
unverified

mky_coder Mike Fan commited on Jun 18, 2024

talk-llama : sync llama.cpp

e8e18fb
unverified

ggerganov commited on Jun 18, 2024

whisper : use ggml_backend_sched (#2239)

bfa5a95

ggerganov slaren commited on Jun 18, 2024

fix : remove extra files

1b0dec0

ggerganov commited on Jun 16, 2024

scripts : sync ggml-blas

463e11c

ggerganov commited on Jun 16, 2024

build : update make / cmake

0b4241c

ggerganov commited on Jun 16, 2024

sync : ggml

89ada87

ggerganov commited on Jun 16, 2024

move BLAS to a separate backend (cont) (llama/6210)

4b26445

slaren commited on Jun 16, 2024

Vulkan Shader Refactor, Memory Debugging Option (llama/7947)

d0120b1

OccamRazor commited on Jun 16, 2024

scripts : stop sync whisper example from ggml

f174613

ggerganov commited on Jun 16, 2024

cmake : fix sycl build (#0)

9a475af

ggerganov commited on Jun 16, 2024

ggml : remove OpenCL (#0)

d303fe3

ggerganov commited on Jun 16, 2024

sycl : sync (#0)

f580c99

ggerganov commited on Jun 16, 2024

cuda : enable CUDA graphs (#0)

d075551

ggerganov commited on Jun 16, 2024

talk-llama : sync llama.cpp

7e268a7

ggerganov commited on Jun 16, 2024

cmake : fix CUDA build (#0)

ddc04a3

ggerganov commited on Jun 16, 2024

Commit History

talk-llama : sync llama.cpp 7ef5ccc

examples : fix compile warnings [no ci] (#0) 32cfce9

sync : ggml 6ef5667

ggml : sync sycl (skip) (#0) bf6ccee

scripts : fix sync scripts e2461ca

ggml : remove unnecessary UNUSED macro call (ggml/880) ab9a7d0

cmake : add GGML_BUILD and GGML_SHARED macro definitions (llama/8281) a8f9bda

Enabled more data types for oneMKL gemm_batch (llama/8236) 08501f8

CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278) 8411e3c

CUDA: revert part of the RDNA1 optimizations (llama/8309) fcd0c52

CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311) 04d4209

Fix WARP_SIZE=16 bug of Intel GPU (llama/8266) 1ce11e2

rm get_work_group_size() by local cache for performance (llama/8286) 08fd758

Define and optimize RDNA1 (llama/8085) 6aa5a89

fix typo (llama/8267) 0c9c7c8

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (llama/8258) cc49462

cuda : update supports_op for matrix multiplication (llama/8245) 2314334

Fix win build conflict of math library (llama/8230) 5a33963

Fix the sub group size of Intel (llama/8106) 2dd429e

CUDA: refactor and optimize IQ MMVQ (llama/8215) afa1447

Update SYCL-Rope op and Refactor (llama/8157) 06acee2

CUDA: fix MMQ stream-k for --split-mode row (llama/8167) ef3d018

feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854) 025493b

ci : disable java build b5bb445 unverified

server : add inference path to make OAI API compatible (#2270) 66a3eb1 unverified

sync : ggml + fix sync script bce6859 unverified

make : disable CUDA graphs ab5ee59 unverified

ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (llama/8140) e83fdad unverified

make : disable CUDA mel build 7d13d39 unverified

cmake : minor fixes 369b16c unverified

make : fix missing -O3 9cccc55 unverified

whisper : disable CUDA mel + fix FFMPEG 2831df8 unverified

sync : ggml cd6e534 unverified

whisper : reorganize source code + improve CMake (#2256) f75c2e3 unverified

whisper : optimize fft() function (#2242) cc603fa unverified

talk-llama : sync llama.cpp e8e18fb unverified

whisper : use ggml_backend_sched (#2239) bfa5a95

fix : remove extra files 1b0dec0

scripts : sync ggml-blas 463e11c

build : update make / cmake 0b4241c

sync : ggml 89ada87

move BLAS to a separate backend (cont) (llama/6210) 4b26445

Vulkan Shader Refactor, Memory Debugging Option (llama/7947) d0120b1

scripts : stop sync whisper example from ggml f174613

cmake : fix sycl build (#0) 9a475af

ggml : remove OpenCL (#0) d303fe3

sycl : sync (#0) f580c99

cuda : enable CUDA graphs (#0) d075551

talk-llama : sync llama.cpp 7e268a7

cmake : fix CUDA build (#0) ddc04a3

talk-llama : sync llama.cpp

7ef5ccc

examples : fix compile warnings [no ci] (#0)

32cfce9

sync : ggml

6ef5667

ggml : sync sycl (skip) (#0)

bf6ccee

scripts : fix sync scripts

e2461ca

ggml : remove unnecessary UNUSED macro call (ggml/880)

ab9a7d0

cmake : add GGML_BUILD and GGML_SHARED macro definitions (llama/8281)

a8f9bda

Enabled more data types for oneMKL gemm_batch (llama/8236)

08501f8

CUDA: MMQ support for iq4_nl, iq4_xs (llama/8278)

8411e3c

CUDA: revert part of the RDNA1 optimizations (llama/8309)

fcd0c52

CUDA: fix MMQ stream-k rounding if ne00 % 128 != 0 (llama/8311)

04d4209

Fix WARP_SIZE=16 bug of Intel GPU (llama/8266)

1ce11e2

rm get_work_group_size() by local cache for performance (llama/8286)

08fd758

Define and optimize RDNA1 (llama/8085)

6aa5a89

fix typo (llama/8267)

0c9c7c8

Removes multiple newlines at the end of files that is breaking the editorconfig step of CI. (llama/8258)

cc49462

cuda : update supports_op for matrix multiplication (llama/8245)

2314334

Fix win build conflict of math library (llama/8230)

5a33963

Fix the sub group size of Intel (llama/8106)

2dd429e

CUDA: refactor and optimize IQ MMVQ (llama/8215)

afa1447

Update SYCL-Rope op and Refactor (llama/8157)

06acee2

CUDA: fix MMQ stream-k for --split-mode row (llama/8167)

ef3d018

feat: cuda implementation for `ggml_conv_transpose_1d` (ggml/854)

025493b

ci : disable java build

b5bb445
unverified

server : add inference path to make OAI API compatible (#2270)

66a3eb1
unverified

sync : ggml + fix sync script

bce6859
unverified

make : disable CUDA graphs

ab5ee59
unverified

ggml : add GGML_CUDA_USE_GRAPHS option, restore GGML_CUDA_FORCE_CUBLAS (cmake) (llama/8140)

e83fdad
unverified

make : disable CUDA mel build

7d13d39
unverified

cmake : minor fixes

369b16c
unverified

make : fix missing -O3

9cccc55
unverified

whisper : disable CUDA mel + fix FFMPEG

2831df8
unverified

sync : ggml

cd6e534
unverified

whisper : reorganize source code + improve CMake (#2256)

f75c2e3
unverified

whisper : optimize fft() function (#2242)

cc603fa
unverified

talk-llama : sync llama.cpp

e8e18fb
unverified

whisper : use ggml_backend_sched (#2239)

bfa5a95

fix : remove extra files

1b0dec0

scripts : sync ggml-blas

463e11c

build : update make / cmake

0b4241c

sync : ggml

89ada87

move BLAS to a separate backend (cont) (llama/6210)

4b26445

Vulkan Shader Refactor, Memory Debugging Option (llama/7947)

d0120b1

scripts : stop sync whisper example from ggml

f174613

cmake : fix sycl build (#0)

9a475af

ggml : remove OpenCL (#0)

d303fe3

sycl : sync (#0)

f580c99

cuda : enable CUDA graphs (#0)

d075551

talk-llama : sync llama.cpp

7e268a7

cmake : fix CUDA build (#0)

ddc04a3