Releases · ggml-org/llama.cpp

06 Jan 08:43

github-actions

b7642

e75ee11

b7642 Latest

Latest

ggml : fix avx512bf16 build (#18623)

include immintrin.h when required
remove unused m512bh

Signed-off-by: Adrien Gallouët [email protected]

macOS/iOS:

Linux:

Windows:

openEuler:

Assets 22

cudart-llama-bin-win-cuda-12.4-x64.zip

sha256:8c79a9b226de4b3cacfd1f83d24f962d0773be79f1e7b75c6af4ded7e32ae1d6

373 MB 2026-01-06T08:43:04Z
cudart-llama-bin-win-cuda-13.1-x64.zip

sha256:f96935e7e385e3b2d0189239077c10fe8fd7e95690fea4afec455b1b6c7e3f18

384 MB 2026-01-06T08:43:13Z
llama-b7642-bin-310p-openEuler-aarch64.tar.gz

sha256:867cee0fa02110ae76124ce478acd9eca1cfd5c2037d6c768dede1ced318c234

42.1 MB 2026-01-06T08:43:23Z
llama-b7642-bin-310p-openEuler-x86.tar.gz

sha256:6ecc20249644a07ba632ae9886ad5483cc37047c138e6aef97ced4d092cb751a

46.3 MB 2026-01-06T08:43:25Z
llama-b7642-bin-910b-openEuler-aarch64.tar.gz

sha256:f0a31fc9e64ae2440ea036c2deb056ddde16d8ecde403f861ced5830584c0618

42.1 MB 2026-01-06T08:43:27Z
llama-b7642-bin-910b-openEuler-x86.tar.gz

sha256:f075f68b18b3ed4c2833c7a40194ba8d2fd61918f855f9657d72bbaf207f5eae

46.3 MB 2026-01-06T08:43:29Z
llama-b7642-bin-macos-arm64.tar.gz

sha256:4f8bf979b006994b9f6b6af68172219abfc55ca0fe2aebfd9ab291fd4bcf44f1

16 MB 2026-01-06T08:43:31Z
llama-b7642-bin-macos-x64.tar.gz

sha256:2ca56194be738958b4bc9cc71d3eddbe5b2ee9e6a73742b7a0ed11d7bb97d8b5

41.3 MB 2026-01-06T08:43:32Z
llama-b7642-bin-ubuntu-s390x.tar.gz

sha256:d4363855697b4fbc2a626ac377a1810291e41c8b0c7f6e2ff9ff8475b58e2d6a

21.4 MB 2026-01-06T08:43:33Z
llama-b7642-bin-ubuntu-vulkan-x64.tar.gz

sha256:bb83b22ce2daa832f4d4dd6ff8529878711fd88edd2fcf60f4015b6e014d3306

36.6 MB 2026-01-06T08:43:35Z
Source code (zip)

2026-01-06T06:54:10Z
Source code (tar.gz)

2026-01-06T06:54:10Z

06 Jan 00:16

github-actions

b7640

e443fbc

b7640

ggml webgpu: add CEIL operation support (#18605)

ggml-webgpu: add CEIL operation support

Add support for the CEIL unary operation in the WebGPU backend:
- Add CEIL_FUNC shader template in unary_op.wgsl
- Add 4 shader variants (f32, f16, inplace versions)
- Initialize CEIL pipelines in ggml-webgpu.cpp
- Register CEIL in supports_op function