Commit History

build : fix aarch64 (#0)
55befbb

ggerganov commited on

talk-llama : sync llama.cpp
a40d0a7

ggerganov commited on

sync : ggml
96e8b15

ggerganov commited on

ggml-backend : fix async copy from CPU (llama/8897)
050174c

slaren commited on

Updated SYCL device filtering (llama/8901)
64976cd

Ouadie EL FAROUKI commited on

CUDA/HIP: fix tests/test-backend-ops (llama/8896)
f14c1ad

JohannesGaessler commited on

CUDA: fix padding logic for FP16/FP32 (llama/8884)
643bcdb

JohannesGaessler commited on

ggml : add epsilon as a parameter for group_norm (llama/8818)
d003891

mollysama commited on

ggml : fix overflows in elu function (llama/8866)
a12468a

Justine Tunney commited on

ggml : reading the runtime sve config of the cpu (llama/8709)
c26339f

jdomke domke commited on

Fix conversion of unnormalized BF16->BF16 weights (llama/7843)
8b10f59

Sigbjørn Skjæret compilade commited on

Fixing wrong VDR iq4nl value (llama/8812)
30eb7bc

Ouadie EL FAROUKI commited on

Build: Only include execinfo.h on linux systems that support it (llama/8783)
0019ddb

Alex O'Connell commited on

cuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X (llama/8800)
73e80d1

slaren commited on

added android implementation of ggml_print_backtrace_symbols (llama/8751)
314d58a

l3utterfly slaren commited on

cann: update cmake (llama/8765)
345a58d

wangshuai09 commited on

Add `TIMESTEP_EMBEDDING` OP (llama/8707)
52eea23

zhentaoyu commited on

ggml: bugfix: fix the inactive elements is agnostic for risc-v vector (llama/8748)
6989631

carterli carter.li commited on

cuda : organize vendor-specific headers into vendors directory (llama/8746)
ec2f307

R0CKSTAR commited on

add conv support (llama/8688)
f0d6f5c

hengyu commited on

feat: Support Moore Threads GPU (llama/8383)
a35db11

yeahdongcn commited on

ggml : ignore more msvc warnings (ggml/906)
1b11fde

stanimirovb commited on

metal : fix struct name (ggml/912)
14cf8db

ggerganov commited on

metal : add abort callback (ggml/905)
b822172

conradev commited on

vulkan : implement Stable Diffusion operators (ggml/904)
124c156

OccamRazor commited on

ggml : move c parameter comment to ggml_rope_ext (ggml/901)
6d34596

danbev commited on

ggml : resolve sync conflicst (ggml/0)
82658f5

ggerganov commited on

common : handle new quant types (ggml/0)
53bb541

ggerganov commited on

ggml : add ggml-aarch64 (ggml/0)
0062819

Dibakar Gope commited on

ggml : reduce hash table reset cost (llama/8698)
9808fbf

slaren commited on

ggml: handle ggml_init failure to fix NULL pointer deref (llama/8692)
dc51517

DavidKorczynski commited on

fix multi-gpu issue on sycl (llama/8554)
94a6436

Chen Xi hengyu commited on

ggml : add and use ggml_cpu_has_llamafile() (llama/8664)
efcca56

ggerganov commited on

Re-add erroneously removed -fsycl from GGML_EXTRA_LIBS (llama/8667)
6e12dfd

Joe Todd commited on

sycl : Add support for non-release DPC++ & oneMKL (llama/8644)
2a5814c

Joe Todd commited on

Vulkan IQ4_NL Support (llama/8613)
899145d

OccamRazor commited on

Allow all RDNA2 archs to use sdot4 intrinsic (llama/8629)
1d65fea

Jeroen Mostert commited on

fix scratch size of softmax (llama/8642)
6519fd2

KevinLy commited on

ggml: fix compile error for RISC-V (llama/8623)
4eec44b

Mark Zhuang commited on

CUDA: MMQ code deduplication + iquant support (llama/8495)
6d14124

JohannesGaessler commited on

gguf : handle null name during init (llama/8587)
2f95156

ggerganov commited on

ggml : fix quant dot product with odd number of blocks (llama/8549)
0083f96

slaren ggerganov commited on

ggml : add friendlier error message to fopen errors (llama/8575)
ab5b4e0

HanClinto commited on

CUDA: fix partial offloading for ne0 % 256 != 0 (llama/8572)
afc137c

JohannesGaessler commited on

cmake : install all ggml public headers (llama/8480)
73a16f3

65a 65a commited on

make/cmake: add missing force MMQ/cuBLAS for HIP (llama/8515)
5096c91

JohannesGaessler commited on

Refactor lora adapter support (llama/8332)
76bcfc6

Xuan Son Nguyen slaren compilade commited on

add concat through dim 1/2 (llama/8483)
acf23d9

hengyu commited on