Commit History

docs : add cmake "-j" flag in README.md (#3284)
5fe3033
unverified

toboil-features commited on

ci : add support for tag-based releases (#3287)
f21cf37
unverified

danbev commited on

release : v1.7.6
5cade6e
unverified

ggerganov commited on

bench : update benches
d4f72cd
unverified

ggerganov commited on

bench : print system info before ctx check
835c3e8
unverified

ggerganov commited on

stream : add nullptr check of whisper_context (#3283)
9f0c009
unverified

danbev commited on

ci : enable main-cuda build (#3282)
9bee7f3
unverified

danbev commited on

bindings.java : update java example (#3281)
f001158
unverified

Joas Dev commited on

coreml : backport CoreML features to macos < 14 (#3255)
dc0917f
unverified

glaszig commited on

ci : reduce musa image size (#3277)
a45c78b
unverified

danbev commited on

whisper : add .gitignore entries for OpenVINO support (#3276)
ca0545e
unverified

Yukimasa Funaoka commited on

command: output commands to text file (#3273)
a482bd7
unverified

Aaron Ang commited on

ci : add apt-get clean to musa Dockerfile (#3275)
32a61ec
unverified

danbev commited on

ruby : specify Apple frameworks explicitly on build (#3270)
728defc
unverified

KitaitiMakoto commited on

talk-llama : sync llama.cpp
ade9bc3

ggerganov commited on

sync : ggml
48a7292

ggerganov commited on

CUDA: add conv_2d_transpose (llama/14287)
a728b83

am17an commited on

sycl: add usage of enqueue_functions extension (llama/14244)
2e59a96

Nicolò Scipione commited on

Implement GGML_CPU_ALL_VARIANTS for PowerPC (llama/14286)
0bcd751

Christian Kastner Diego Devesa commited on

cuda : synchronize graph capture and cublas handle destruction (llama/14288)
39c4fa5

Diego Devesa commited on

ggml : fix repack work size for mul_mat_id (llama/14292)
4b0d2de

ggerganov commited on

ggml: Update KleidiAI to v1.9.0 (llama/14277)
90ccf35

Charles Xu commited on

CUDA: add conv_2d_dw (llama/14265)
5cca3ec

am17an commited on

ggml-cpu : remove unnecesary arm feature detection (llama/14281)
62cf694

Diego Devesa commited on

build : suppress gcc15 compile warnings (llama/14261)
0454008

fanyang commited on

sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215)
feee739

Anton Mitkov commited on

llamafile : support s390x SIMD instruction set (llama/14273)
26bafb6

taronaeo commited on

Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (llama/14249)
08debcd

OccamRazor commited on

metal : add mean kernel (llama/14267)
a726ecc

ggerganov commited on

ggml-cpu: reduce asm calls for hsum (llama/14037)
17c0dfa

taronaeo commited on

ggml-cpu: fix uncaught underscore terminators (llama/14023)
c005248

taronaeo commited on

ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (llama/14258)
9d1d21b

Charles Xu commited on

Add `ggml_roll` (ggml/1274)
71923e5

Acly commited on

android : update CMakeLists.txt to use FetchContent for ggml (#3268)
e5d47d0
unverified

danbev commited on

cmake : fix android build (#3265)
e70bf99
unverified

ggerganov danbev commited on

examples : add stereo to mono conversion in read_audio_data (#3266)
5451562
unverified

danbev commited on

talk-llama : sync llama.cpp
fc04dc0

ggerganov commited on

sync : ggml
23e1986

ggerganov commited on

cmake: remove shader-gen step-targets from ggml-vulkan (llama/14226)
b7a7257

bandoti commited on

ggml-cpu : remove the weak alias trick (llama/14221)
a1bcb29

xctan commited on

musa: fix build warning (unused variable) (llama/14231)
165c242

yeahdongcn commited on

llama : add thread safety test (llama/14035)
acc9311

Diego Devesa ggerganov commited on

cmake: clean up external project logic for vulkan-shaders-gen (llama/14179)
bc8b1f7

bandoti commited on

HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)
f95736f

uvos commited on

ggml: Add Android support for GGML_CPU_ALL_VARIANTS (llama/14206)
7ddd89c

Charles Xu commited on

vulkan: mutex around vkQueueSubmit (llama/14127)
ef3a7d0

jeffbolznv commited on

ggml-cpu : rework weak alias on apple targets (llama/14146)
de5e986

xctan commited on

CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (llama/14196)
adf6b4b

uvos commited on

HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (llama/14183)
c3467c7

uvos commited on

sycl: Adding additional cpy dbg print output (llama/14034)
6799437

Anton Mitkov commited on