A developer released claude_converter, an open-source tool that converts Claude Code session .jsonl files into the messages format accepted by fine-tuning frameworks like TRL/SFTTrainer, Axolotl, and LLaMA-Factory (ShareGPT format). It includes a clean_messages() helper to strip tool-use blocks and an inspect_session() function for token counts and breakdowns. The tool has zero dependencies and can be installed via `uv pip install claude-converter`. Users are advised to filter sessions to only those where the final assistant turn solved the problem before training.
A Reddit user from r/LocalLLaMA noted that the community's visible fine-tuning activity on consumer-grade hardware has seemingly dropped off, particularly after the release of capable generalist models like Llama-3-8B. They speculate that improved base models may reduce the need for custom fine-tunes, as prompt tweaking often suffices. The user expressed nostalgia for the earlier era when home-brewed models trained with tools like Unsloth or MLX were frequently shared, and asked whether others are still fine-tuning locally. They also inquired about subreddits more dedicated to local model training.
The developer behind Orthrus diffusion head architectures has finalized testing and is preparing to release model checkpoints for Qwen 3.5, Qwen 3.6, and Gemma 4 base language models. The release will include complete end-to-end training and evaluation code, fully open-sourcing the pipeline. Updates to the repository are expected very shortly, according to a Reddit announcement. A Hugging Face page for Orthrus-Qwen3-8B is already live, with additional models imminent. Community members note that llama.cpp inference support is not yet available.