The article claims Anthropic reduced Claude Code's prompt costs by deleting 80% of prompts using Fable 5, illustrating a broader trend of cost-cutting in AI. No further details, data, or verification are provided in this feed; the full article is only accessible via an external link.
The b9864 release of llama.cpp patches the server’s SSE streaming to ping silent connections every second and only disconnect after 3 seconds of inactivity, so healthy WebUI connections are not dropped during long prefill operations. A new per-request field `sse_ping_interval` is introduced into the request schema; its global default remains 30 seconds to keep API clients unchanged, while the built-in WebUI sends a value of 1 to implement its own 3-second visibility contract. The field is now a typed parameter with hard limits, seeded from the CLI default, and benefits from automatic schema type and range validation. Prebuilt binaries for macOS, Linux, Windows, Android, and iOS are included.
Meta may monetize AI infrastructure by selling external access to older compute and models, with Deutsche Bank estimating $9-30 billion added revenue by 2027. Apple released iOS 27 beta with Apple Intelligence and new AI development frameworks. GitHub Copilot integrated the open-source model Kimi K2.7 from Moonshot AI, its first open-source model integration. Alibaba Cloud announced its AI-native database service Databridge Agent will begin commercial billing on August 1, 2026. Hanwha Group plans to invest 55 trillion won by 2040 in aerospace and AI. xTool launched O1 UV printer with AI tools.
Anthropic has entered early-stage development of its own custom AI inference chips optimized for Claude, seeking to reduce dependency on AWS Trainium and Google TPUs. The company is in discussions with Samsung for manufacturing using its 2nm GAA process and advanced packaging, and also exploring partnerships with Microsoft's Maia and UK startup Fractile. This move follows the hiring of Clive Chan, a key engineer from OpenAI's Jalapeño chip project, and comes after a $6.5B funding round that propelled its valuation to $965B. Anthropic aims to define a chip solely for inference, stripping away unnecessary components for cost and efficiency gains. The development marks a strategic shift from participant to leader in AI hardware, as it faces surging compute demands that even global chip capacity cannot meet.
Meituan announced LongCat-2.0, a 1.6-trillion-parameter Mixture-of-Experts model with 48 billion activated parameters and up to 1 million token context. The model was trained entirely on a 50,000-card cluster of domestic AI accelerator chips using a proprietary distributed communication protocol, rather than NVIDIA's NCCL. It scores 59.5 on SWE-bench Pro, slightly above GPT-5.5's 58.6. On Hugging Face, the model carries an MIT License but weights are marked 'coming soon'; only inference framework and Infra code have been released. LongCat ran anonymously as 'Owl Alpha' on OpenRouter, achieving top-3 monthly call volume with pricing of $0.30 per million tokens and free credits. The model is vertically optimized for Meituan's local services like food delivery and store operations. Despite the engineering achievement, chip vendor, total training cost, wall-clock time, and training data composition remain undisclosed, limiting independent verification and reproducibility.
China’s AI sector is pivoting from free expansion to monetization. DeepSeek completed a record ¥500 billion ($74 billion) funding round with founder Liang Wenfeng personally contributing ¥200 billion, followed by Tencent (¥100 billion), CATL (¥50 billion), and others; external funds go to a Liang-controlled limited partnership with no voting rights and a 5-year lock-up, lifting its post-money valuation above $50 billion. ByteDance’s Doubao rolled out a subscription based on its latest Doubao 2.1 models: Standard at ¥68/month, Enhanced at ¥200/month (¥2,048/year), and Professional at ¥500/month (¥5,088/year), targeting professional document processing, data analysis, and enterprise API use. Moonshot AI’s Kimi earlier introduced ¥49/¥99 monthly subscriptions and raised nearly $6 billion in half a year, its valuation soaring from $4.3 billion to $30 billion. The funding logic has shifted from parameter and user growth to monthly revenue, cost amortization, and paid conversion, forcing AI companies to demonstrate clear monetization paths.