开源项目来源: GITHUB2026年6月16日重要度: 2/5

llama.cpp 发布 b9659 修复 mtmd 中 n_tokens 计数错误 (#24656)

英文摘要

The llama.cpp project released tag b9659, which includes a bug fix for the mtmd component that was miscounting n_tokens (PR #24656). This release also provides pre-built binaries for a wide range of platforms, including macOS (ARM64, Intel), Linux (x64, ARM64, s390x with Vulkan, ROCm, OpenVINO, SYCL), Android (ARM64), and Windows (x64, ARM64 with CUDA 12/13, Vulkan, SYCL, HIP). Notably, the macOS Apple Silicon build with KleidiAI enabled is marked as disabled, while the iOS XCFramework artifact is available.

中文摘要

llama.cpp 发布标签 b9659，包含对 mtmd 组件中 n_tokens 计数错误的修复（PR #24656）。该版本同时提供了面向多平台的预编译二进制文件，涵盖 macOS（ARM64、Intel）、Linux（x64、ARM64、s390x，支持 Vulkan、ROCm、OpenVINO、SYCL）、Android（ARM64）和 Windows（x64、ARM64，支持 CUDA 12/13、Vulkan、SYCL、HIP）。其中，启用 KleidiAI 的 macOS Apple Silicon 构建被标记为禁用，而 iOS XCFramework 产物可用。

关键要点

Fixes a miscounting of n_tokens in the mtmd component of llama.cpp (PR #24656).
修复 llama.cpp 中 mtmd 组件对 n_tokens 的计数错误（PR #24656）。
Release b9659 includes the fix and provides pre-built binaries for macOS, Linux, Android, and Windows, covering CPU, GPU, and various accelerator backends.
b9659 版本包含此修复，并提供 macOS、Linux、Android 和 Windows 的预构建二进制文件，涵盖 CPU、GPU 及多种加速器后端。
The macOS Apple Silicon build with KleidiAI support is disabled in this release.
此次发布中，启用 KleidiAI 的 macOS Apple Silicon 构建处于禁用状态。

打开原文