An independent AI consultant developed a three-tier system for a 400-employee precision parts manufacturer near Osaka to address the impending retirement of two veteran inspectors with 99.7% accuracy. First, 60 hours of inspectors' verbal explanations were recorded and transcribed using LLMs to build a retrievable knowledge base of tacit diagnostic logic. A RAG system enables new inspectors to query similar cases, while a vision AI model classifies defects with root cause suggestions. After five months, A/B testing showed AI-assisted new inspectors reached 99.2% accuracy, up from 96%, and the knowledge base was adopted by production engineers. The project reframed the initial request from a simple AI visual inspection to a knowledge preservation system.
SocialSource: V2EXImportance: 2/5
Livid demonstrated that by creating a custom node like /go/wunder on V2EX and posting detailed product feature descriptions there, V2EX Chat can answer product-related questions using those posts as context. The example uses a specific edge.v2ex.com chat session link to show the AI replying solely based on node content. This effectively turns V2EX Chat into a retrieval-augmented knowledge base, enabling product Q&A without building a separate chatbot.
SocialSource: XImportance: 2/5
Pinecone and Pulumi are co-hosting an evening of talks on June 18 at 5 PM in NYC. The event will cover the infrastructure behind vector search and retrieval-augmented generation (RAG), infrastructure as code (IaC) practices, and a demo of an AI running coach Slack bot that incorporates real-world data into model context. The program includes demos, a Q&A session, and a hangout.
SocialSource: REDDIT ARTIFICIALImportance: 3/5
Reddit user KobyStam built the open-source tool 'The AI Counsel,' packaging Andrej Karpathy's LLM Council concept into a configurable Docker container. It offers two deliberation modes: a Council mode with individual replies, anonymous peer reviews, and a chairman synthesis for factual questions; and an Advisors mode where multiple personas debate a query across configurable rounds for decisions and tradeoffs. The tool includes a built-in MCP server for agent integration, supports local Ollama models and cloud providers like OpenAI, Anthropic, Mistral, and DeepSeek, and embeds web search via DuckDuckGo, Serper, Brave, and TinyFish with Jina AI for full article retrieval. Everything from system prompts to temperatures is configurable, and the project is entirely free and open-source on GitHub.
Unlike most AI agents that reset every session, Jenova AI agents persist user context, with the longest session spanning 16 million tokens. All session data is retrievable in under 10 milliseconds via Pinecone vector retrieval. This persistent knowledge layer enabled the company to reach over $1M in annual recurring revenue, 200,000+ users, and 10x revenue growth in 5 months, almost entirely through organic growth. Founder Boris Wang stated that Pinecone's knowledge layer is the foundation determining user retention, calling it the product's moat.
A well-known game company in Beijing (near Anzhenmen) has posted urgent openings for three AI engineering roles: AI Development Senior Engineer (30k-60k RMB), Agent Development Engineer (30k-50k RMB), and Algorithm Engineer (45k-65k RMB). The positions involve building AI-powered workflows and agent systems for game AB testing and commercial optimization, integrating large language models, and deploying RAG pipelines with LangChain. Key responsibilities include automating experiment design, optimizing ad bidding and recommendation algorithms, and supporting cross-team data engineering. Required skills span Python, PyTorch, LangChain, distributed systems, and familiarity with both closed-source and open-source LLMs (GPT-4o, Llama 3, Qwen, Mistral). The company offers standard benefits including weekends off, social insurance, free meals, and housing allowance.