AI signal, minus the noise.

Events

Jul 6–11
ICML 2026 (Seoul)AISoon
Jul 10
China June CPI / PPICNSoon
Jul 14
US June CPIUS
Jul 15
China Q2 GDP & June activityCN
Jul 28–29
Fed FOMC MeetingUS
Jul 30
US Q2 GDP (advance)US
Jul 31
China July PMICN
Sep 15–16
Fed FOMC (dot plot)US

Date

A Hands-On Tutorial for Building a Local Coding Agent Stack with Qwen3.6 and Open-Weight Models

Sebastian Raschka provides a detailed guide on setting up a fully local coding agent environment. The tutorial uses Ollama to serve open-weight LLMs such as Qwen3.6 35B-A3B and Cohere North Mini Code, connecting them to agent harnesses like Qwen-Code, Codex, and Claude Code. Performance testing shows both Qwen3.6 and North Mini Code generate ~30–40 tokens per second on a Mac Mini or DGX Spark and solve 4–5 out of 5 tasks on a custom agent problem pack. The article also includes an audit checklist for agent codebases, noting that Claude Code consumes substantially more input tokens than Codex for comparable task outcomes. Setup instructions cover modeling serving, harness configuration, and an SSH tunnel for offloading model execution to a dedicated machine.