Commit History

whisper.android.java : fix returns in JNI (#1929)
c1b258d
unverified

zhouwg committed on

cmake : add library versioning (#1352)
3ab7ee7
unverified

kennethge ggerganov committed on

readme : recommend MacOS Sonoma for Core ML (#1917)
c4e849b
unverified

Gavin Cai committed on

talk-llama : sync llama.cpp
06c222c
unverified

ggerganov committed on

sync : ggml
b85f30e
unverified

ggerganov committed on

sync : llama.cpp (ggml/0)
8ea3a45
unverified

ggerganov committed on

ggml : make i-quants work with super-blocks of 64 (CPU,Metal) (llama/5760)
9a07f42
unverified

Kawrakow ikawrakow committed on

Attempt to fix android build (llama/5752)
e720b3b
unverified

Kawrakow ikawrakow committed on

IQ4_XS: a 4.25 bpw quantization (llama/5747)
0ee1bfb
unverified

Kawrakow ikawrakow committed on

cuda : replace remaining shfl_xor with calls to warp_reduce functions (llama/5744)
753b30d
unverified

Engininja2 committed on

ggml-quants : fix avx2 iq1_s vec_dot when compiled with gcc (llama/5742)
72e8610
unverified

Engininja2 committed on

Adding IQ2_S and IQ2_M to complete coverage of the 2-3 bit quantization range (llama/5721)
2b9bb9e
unverified

Kawrakow ikawrakow ggerganov committed on

CUDA: fix DEBUG_CUDA_MALLOC (llama/5729)
f18f386
unverified

JohannesGaessler committed on

Add support for soft_max ALiBi (llama/5639)
86d6a5e
unverified

AidanBeltonS Abhilash Majumder committed on

ggml-quants : provide ggml_vqtbl1q_u8 for 64bit compatibility (llama/5711)
430efc6
unverified

Crad committed on

add google magika inference example (ggml/748)
10ac4bb
unverified

slaren committed on

stream.wasm : fix invalid memory access when no segments (#1902)
3273767
unverified

Andrew S committed on

talk-llama : sync llama.cpp
b92d757
unverified

ggerganov committed on

sync : ggml
3eb6cbf
unverified

ggerganov committed on

sync : llama.cpp (ggml/0)
6c535a8
unverified

ggerganov committed on

code : normalize enum names (llama/5697)
93e0830
unverified

ggerganov committed on

IQ3_S: a much better alternative to Q3_K (llama/5676)
32589c9
unverified

Kawrakow ikawrakow committed on

Introduce backend GUIDs (ggml/743)
a7eb9f6
unverified

UEXTM.com slaren committed on

talk, talk-llama : pass text_to_speak as a file (#1865)
3fd8b4d
unverified

Tamotsu Takahashi committed on

whisper : add SYCL support (#1863)
9a168fc
unverified

Abhilash Majumder committed on

talk-llama : sync llama.cpp
53d0282
unverified

ggerganov committed on

sync : ggml
4f680fe
unverified

ggerganov committed on

ggml : always define ggml_fp16_t as uint16_t (llama/5666)
bc567d3
unverified

ggerganov committed on

ci : fix whitespace
d75e158
unverified

ggerganov committed on

ggml : 32-bit arm compat (#1891)
7551157
unverified

ggerganov committed on

sync : ggml
096caf3
unverified

ggerganov committed on

sync : llama.cpp (ggml/0)
f8e8d34
unverified

ggerganov committed on

conext add name (llama/5624)
3c39d4b
unverified

hengyu committed on

Update ggml_sycl_op_mul_mat_vec_q (llama/5502)
963ffd5
unverified

AidanBeltonS Abhilash Majumder committed on

Refactor validation and enumeration platform checks into functions to clean up ggml_vk_instance_init()
8637c17
unverified

OccamRazor committed on

Add check for VK_KHR_portability_enumeration for MoltenVK support
85caa3f
unverified

OccamRazor committed on

Add preprocessor checks for Apple devices.
b8e3b87
unverified

dokterbob committed on

Resolve ErrorIncompatibleDriver with Vulkan on MacOS.
0bc3433
unverified

dokterbob committed on

Allow for Vulkan build with Accelerate.
7d255ac
unverified

dokterbob committed on

cuda : ignore peer access already enabled errors (llama/5597)
a817d85
unverified

slaren committed on

ggml : compute forward no longer pass src tensors (ggml/729)
4e31c82
unverified

Siddharth Ramakrishnan siddharthvader committed on

ggml : fix conv_2d batch mode (ggml/737)
99ece5c
unverified

bssrdf bssrdf committed on

openvino : fix convert-whisper-to-openvino.py (#1890)
dfd53cc
unverified

st-gr Stefan Grundmann committed on

main : fix file existence check in main.cpp (#1889)
9162df9
unverified

Theldus committed on

talk-llama : sync llama.cpp
542accf
unverified

ggerganov committed on

make : fix CUBLAS link with WSL (#1878)
b3c9e81
unverified

LBlue committed on

sync : ggml
cb5b2be
unverified

ggerganov committed on

ggml : resolve merge conflicts (ggml/0)
7ee6ffa
unverified

ggerganov committed on

common : add IQ1_S (ggml/0)
39c054e
unverified

ggerganov committed on

ci : enable -Werror for CUDA builds (llama/5579)
df03a10
unverified

ggerganov committed on