Orthrus Diffusion Head Models for Qwen 3.5/3.6 and Gemma 4 Announced with Open-Source Training Code
English summary
The developer behind Orthrus diffusion head architectures has finalized testing and is preparing to release model checkpoints for Qwen 3.5, Qwen 3.6, and Gemma 4 base language models. The release will include complete end-to-end training and evaluation code, fully open-sourcing the pipeline. Updates to the repository are expected very shortly, according to a Reddit announcement. A Hugging Face page for Orthrus-Qwen3-8B is already live, with additional models imminent. Community members note that llama.cpp inference support is not yet available.
Chinese summary
Orthrus扩散头项目已完成测试,准备发布适配Qwen 3.5、Qwen 3.6和Gemma 4基础语言模型的检查点。开发者通过Reddit宣布,将同时开源完整的端到端训练和评估代码,仓库更新即将推送。Hugging Face上已有Orthrus-Qwen3-8B页面,更多模型即将到来。社区指出目前尚无llama.cpp推理支持。
Key points
Orthrus diffusion head models are being released for Qwen 3.5, Qwen 3.6, and Gemma 4 LLM families.
Orthrus扩散头模型将适配Qwen 3.5、Qwen 3.6和Gemma 4系列大语言模型。
Complete end-to-end training and evaluation code will be open-sourced alongside the checkpoints.
完整的端到端训练和评估代码将与模型检查点一起开源。
Repository updates and model uploads are imminent, following a public announcement.
根据公开声明,仓库更新和模型上传即将进行。
llama.cpp inference support is currently lacking, according to community discussion.
社区讨论指出,目前尚无llama.cpp推理支持。