Sakana AI's Fugu collapses a multi-agent orchestration system into one OpenAI-compatible endpoint. The idea is genuinely interesting. The benchmark and export-control claims need a second look.
Read more →
Real footgun stories and the deterministic hooks that would've prevented them. From $30k API key leaks to nuked home directories.
Read more →
The official Claude Code plugin that lets agents work autonomously for hours. When to use it, when not to, and the philosophy behind letting AI fail repeatedly until it succeeds.
Read more →
A community file distilling Karpathy's coding-agent observations into four principles hit 60K stars. I opened my own CLAUDE.md to compare. I'd independently written two of them. The two I hadn't are the ones that matter most.
Read more →
From 20 lines of shell to production apps. Anthropic renamed the Claude Code SDK to the Agent SDK because deep research is now a first-class use case.
Read more →
Claude Code can now watch your PRs in the cloud, fix CI failures, and address reviewer comments while you're away. It's the logical next step after auto mode, and it raises the same trust questions, harder.
Read more →
Task management designed for AI coding agents. CLI-first, git-native sync, and Model Context Protocol integration.
Read more →
The bottleneck isn't AI capability; it's that developers lack design vocabulary. Impeccable bridges the gap, and the Tessl benchmarks prove it: a 1.59x improvement over baseline.
Read more →
Skills are auto-invoked by Claude's judgment. For engineering workflows that need predictability, slash commands give you explicit control.
Read more →
Boris Cherny shared his workflow for the tool he built. The setup is surprisingly vanilla. The philosophy is worth studying.
Read more →