llama.cpp b9653 Released: Broader Vulkan Support for CONCAT Operations and Multi-Platform Binaries
Summary
The b9653 release of llama.cpp extends the Vulkan backend to support additional CONCAT tensor operation types, which improves compatibility for models that rely on them. It also ships pre-built binaries for macOS (Apple Silicon and Intel), Linux (multiple GPU backends, including Vulkan, ROCm, OpenVINO, and SYCL), Android, Windows (CUDA 12/13, Vulkan, SYCL, and HIP), and openEuler. The release was published automatically on June 15, 2026.
Key takeaways
The Vulkan backend now supports more CONCAT operation types, which may allow models that depend on those operations to run on Vulkan where they previously could not.
Pre-built binaries are available for a wide range of platforms and GPU backends, including CUDA, Vulkan, ROCm, OpenVINO, SYCL, and HIP.
b9653 is an automated build produced by CI/CD workflows, not a manually curated release.
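For readers unfamiliar with the operation, CONCAT joins two tensors along one dimension while every other dimension must match. The following is a minimal pure-Python sketch of that idea only. It is not llama.cpp or Vulkan shader code, the `concat` helper is hypothetical, and it uses ordinary nested-list indexing rather than ggml's own dimension numbering.

```python
def concat(a, b, dim):
    """Concatenate two nested-list tensors along dimension `dim`.

    Hypothetical illustration helper: dimension 0 is the outermost list.
    """
    if dim == 0:
        # Joining along the outermost dimension is plain list concatenation.
        return a + b
    if len(a) != len(b):
        raise ValueError("shapes must match on every dimension except `dim`")
    # Otherwise descend one level and join the matching sub-tensors.
    return [concat(x, y, dim - 1) for x, y in zip(a, b)]


a = [[1, 2], [3, 4]]
b = [[5, 6], [7, 8]]

rows = concat(a, b, 0)  # shape (2, 2) + (2, 2) -> (4, 2)
cols = concat(a, b, 1)  # shape (2, 2) + (2, 2) -> (2, 4)
```

A backend has to provide a kernel for each combination of operation and tensor type it wants to accelerate, so widening the set of supported CONCAT variants reduces the cases where a model cannot run the operation on that backend.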