ai
How We Broke Top AI Agent Benchmarks: And What Comes Next
https://rdi.berkeley.edu/blog/trustworthy-benchmarks-cont/
Added 2 months ago
Claude Mythos Preview \ red.anthropic.com
https://red.anthropic.com/2026/mythos-preview/
Added 2 months ago
System Card: Claude Mythos Preview [pdf]
https://www-cdn.anthropic.com/53566bf5440a10affd749724787c8913a2ae0841.pdf
Added 2 months ago
METATRON - Open-Source AI Penetration Testing Assistant Brings Local LLM Analysis to Linux
https://cybersecuritynews.com/metatron-ai-penetration-testing/
Added 2 months ago
I Quit. The Clankers Won
https://dbushell.com/2026/04/01/i-quit-the-clankers-won/
Added 2 months ago
Copilot edited an ad into my PR
https://notes.zachmanson.com/copilot-edited-an-ad-into-my-pr/
Added 2 months ago
The Claude Code Source Leak: fake tools, frustration regexes, undercover mode
https://alex000kim.com/posts/2026-03-31-claude-code-source-leak/
Added 2 months ago
Claude Code's source code has been leaked via a map file in their NPM registry
https://twitter.com/Fried_rice/status/2038894956459290963
Added 2 months ago
GitHub - awslabs/mcp: Official MCP Servers for AWS · GitHub
https://github.com/awslabs/mcp
Added 2 months ago
GitHub - gsd-build/get-shit-done: A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES. · GitHub
https://github.com/gsd-build/get-shit-done
Added 2 months ago
Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer
https://georgelarson.me/writing/2026-03-23-nullclaw-doorman/
Added 2 months ago
Push events into a running session with channels
https://code.claude.com/docs/en/channels
Added 3 months ago
A sufficiently detailed spec is code
https://haskellforall.com/2026/03/a-sufficiently-detailed-spec-is-code
Added 3 months ago
Warranty Void If Regenerated
https://nearzero.software/p/warranty-void-if-regenerated
Added 3 months ago
Grace Hopper's Revenge
https://www.thefuriousopposites.com/p/grace-hoppers-revenge
Added 3 months ago
Leanstral: Open-source agent for trustworthy coding and formal proof engineering
https://mistral.ai/news/leanstral
Added 3 months ago
What is agentic engineering?
https://simonwillison.net/guides/agentic-engineering-patterns/what-is-agentic-engineering/
Added 3 months ago
GitHub - mistralai/mistral-vibe: Minimal CLI coding agent by Mistral · GitHub
https://github.com/mistralai/mistral-vibe
Added 3 months ago
GitHub - github/copilot-cli: GitHub Copilot CLI brings the power of Copilot coding agent directly to your terminal. · GitHub
https://github.com/github/copilot-cli
Added 3 months ago
1M context is now generally available for Opus 4.6 and Sonnet 4.6
https://claude.com/blog/1m-context-ga
Added 3 months ago