Commit History
whisper : add .gitignore entries for OpenVINO support (#3276)
ca0545e
Yukimasa Funaoka
command: output commands to text file (#3273)
a482bd7
Aaron Ang
ci : add apt-get clean to musa Dockerfile (#3275)
32a61ec
ruby : specify Apple frameworks explicitly on build (#3270)
728defc
talk-llama : sync llama.cpp
ade9bc3
sync : ggml
48a7292
CUDA: add conv_2d_transpose (llama/14287)
a728b83
sycl: add usage of enqueue_functions extension (llama/14244)
2e59a96
Nicolò Scipione
Implement GGML_CPU_ALL_VARIANTS for PowerPC (llama/14286)
0bcd751
Christian Kastner
Diego Devesa
cuda : synchronize graph capture and cublas handle destruction (llama/14288)
39c4fa5
Diego Devesa
ggml : fix repack work size for mul_mat_id (llama/14292)
4b0d2de
ggml: Update KleidiAI to v1.9.0 (llama/14277)
90ccf35
Charles Xu
CUDA: add conv_2d_dw (llama/14265)
5cca3ec
ggml-cpu : remove unnecesary arm feature detection (llama/14281)
62cf694
Diego Devesa
build : suppress gcc15 compile warnings (llama/14261)
0454008
fanyang
sycl: Cleanup codepaths in Get Rows in sycl backend (llama/14215)
feee739
Anton Mitkov
llamafile : support s390x SIMD instruction set (llama/14273)
26bafb6
Vulkan: Set device max size for host memory to avoid OOM warning and fallback to CPU buffer (llama/14249)
08debcd
metal : add mean kernel (llama/14267)
a726ecc
ggml-cpu: reduce asm calls for hsum (llama/14037)
17c0dfa
ggml-cpu: fix uncaught underscore terminators (llama/14023)
c005248
ggml: Add Apple support for GGML_CPU_ALL_VARIANTS (llama/14258)
9d1d21b
Charles Xu
Add `ggml_roll` (ggml/1274)
71923e5
android : update CMakeLists.txt to use FetchContent for ggml (#3268)
e5d47d0
examples : add stereo to mono conversion in read_audio_data (#3266)
5451562
talk-llama : sync llama.cpp
fc04dc0
sync : ggml
23e1986
cmake: remove shader-gen step-targets from ggml-vulkan (llama/14226)
b7a7257
bandoti
ggml-cpu : remove the weak alias trick (llama/14221)
a1bcb29
xctan
musa: fix build warning (unused variable) (llama/14231)
165c242
llama : add thread safety test (llama/14035)
acc9311
cmake: clean up external project logic for vulkan-shaders-gen (llama/14179)
bc8b1f7
bandoti
HIP: disable rocwmma on gfx12 by default until rocm 7.0 (llama/14202)
f95736f
uvos
ggml: Add Android support for GGML_CPU_ALL_VARIANTS (llama/14206)
7ddd89c
Charles Xu
vulkan: mutex around vkQueueSubmit (llama/14127)
ef3a7d0
ggml-cpu : rework weak alias on apple targets (llama/14146)
de5e986
xctan
CUDA/HIP: fix ssm_scan on devices where warp size is not 32 (llama/14196)
adf6b4b
uvos
HIP: Replace usage of depricated preprocessor macro __AMDGCN_WAVEFRONT_SIZE__ (llama/14183)
c3467c7
uvos
sycl: Adding additional cpy dbg print output (llama/14034)
6799437
Anton Mitkov
SYCL: Bump oneMath commit (llama/14152)
4d12916
Ewan Crawford
sycl: Remove not needed copy f16->f32 for dnnl mul mat (llama/14125)
eed049f
Anton Mitkov
Implement GGML_CPU_ALL_VARIANTS for ARM (llama/14080)
c9cec9d
Christian Kastner
vulkan: Better thread-safety for command pools/buffers (llama/14116)
fdc26e7
vulkan: Track descriptor pools/sets per-context (llama/14109)
855a3bf
opencl: add `mul_mv_id_q4_0_f32_8x_flat` (llama/14003)
d0a458b
lhez