#ai-agents

22 signals tagged with this topic.

AI Agent Memory: Agentic RAG That Survives Restarts

Agentic RAG explains how an agent retrieves external knowledge mid-task. It doesn't explain what happens when the session ends and the next one starts from zero. That's an AI agent memory problem, and it's a different problem.

July 10, 20268m read

Engineering

The AI Agent Tech Stack Behind 325 Agents in Production

A Reddit thread ranking #1 for the exact question — what tools to use to build AI agents — is mostly vendor noise. Here's the actual AI agent tech stack running 325 agents in production, layer by layer.

July 10, 202611m read

Engineering★ Featured

AI Agents in Production: The Operator's Handbook

Everyone teaches building agents. Almost nobody teaches running ai agents in production — the operational discipline that separates a demo from a fleet that survives contact with reality.

July 10, 202617m read

Engineering

Claude Code Skills: 6 Stacks That Turn One Tool Into a Fleet

Claude Code skills look like small individual commands until you stack them — then quality-check, verify, and five others become the pipeline that gates every merge across a 325-agent fleet.

July 10, 20268m read

How to Start Building AI Agents: From Zero to First Agent

If you searched how to start building AI agents and got a slide deck instead of a starting point, this is the practitioner's version — the order to learn things in, and the one small agent worth shipping first.

July 10, 20269m read

No-Code AI Agents: What They're Good For and Where They Break

No-code AI agents get a non-engineer from idea to working automation in an afternoon. Here's exactly where that holds up, and the specific signal that tells you it's time to graduate to code.

July 10, 20268m read

Engineering

Running AI Agents in Production: Why Building Was the Easy Part

A demo proves an agent can work once. Running ai agents in production means proving it works the 10,000th time, unattended, with money or reputation on the line. Those are different disciplines.

July 10, 202610m read

Engineering

Why AI Agents Fail in Production: 7 Failure Modes From a Live Fleet

Why AI agents fail in production rarely makes it into the guides — they say 'log every decision path' and stop there. Here are 7 real failure modes from running 325 agents, and the exact signal that catches each one.

July 10, 202610m read

AI◆ Deep Dive

Claude Fable 5: We No Longer Verify the Work — We Verify the Direction

Thariq Shihipar from the Claude Code team lays out the 3 workflow changes Fable 5 made real — and the paradigm shift underneath all of them: we no longer verify the work, we verify the direction.

June 10, 20266m read

I Built an Agentic Business. These Are the Four Principles It Runs On.

Jensen Huang named four principles for running a company on AI agents. I didn't just agree with them — I built a company that runs on all four. Here's the structure, and where running it corrected the fourth.

June 8, 20267m read

Engineering

Judgment Debt: The Hidden Cost of Agentic AI

AI coding agents don't just autocomplete — they plan, delegate, and decide. Most engineers haven't noticed the threshold they already crossed.

May 9, 20267m read

Engineering

Frustration Is the Raw Material: The Only Retro Discipline That Matters

Every rule worth keeping came from something going wrong. The durable value of a retro isn't its narratives — it's its imperatives. If your post-mortem doesn't produce rules for next time, you shipped stories.

April 20, 20267m read

Engineering

Grep the Consumers Before Writing the Producer

I specified a dataclass field name in a dispatch prompt. The agent built to spec, then stopped and flagged that the consuming interface expected a different name. The drift was on me, and it only took one grep to prevent.

April 20, 20266m read

Engineering

Parallel AI Agents Need Isolation. I Learned This the Hard Way.

Four agents writing code in the same git checkout. Ten stashes and 45 minutes of recovery later, the rule wasn't the lesson — the announcement that enforces it was.

April 20, 20267m read

Engineering

'Spec Merged' Does Not Mean 'Implementation Queued'

An engineer agent dispatched to wire a module discovered the module didn't exist — only an empty __init__.py. The spec had merged two days earlier. Nobody had queued the build.

April 20, 20266m read

Engineering

The Killed List: Why Aggressive Scope Cuts Are a Scheduling Primitive

A rebuild's timeline is set by what you refuse to rebuild. Three days to ship a greenfield system worked because the cuts were in the requirements document before anyone felt the pressure to reverse them.

April 20, 20266m read

Engineering

Trust Forward: When Agent Rigor Compounds Across Dispatches

An agent caught a latent bug in legacy code the orchestrator's prompt didn't flag. That single act earned weight on their next flag — and that weighted flag caught two more bugs before they shipped. Trust compounds through a chain, not just a single delivery.

April 20, 20267m read

Why AI Agents Approve Their Own Bad Work (And How to Fix It)

Anthropic just admitted that Claude approves its own mediocre output. Their fix — borrowed from GANs — separates the agent doing work from the agent judging it. Here's how adversarial evaluation changes everything for agent systems.

March 26, 20268m read

Engineering

Thoth: How I Built an Automated Documentation System That Caught Up 455 PRs in One Night

47 repos. 455 merged PRs. 24 knowledge base docs generated automatically. Documentation doesn't drift when a god of knowledge is watching.

February 28, 20268m read

Engineering

Three PRs. One Morning. The Parallel Agent Pattern That Changes How You Ship

The bottleneck in AI-assisted development isn't writing code faster — it's thinking sequentially when the work isn't. Here's how dispatching three agents simultaneously collapsed three review cycles into one.

February 26, 20267m read

Engineering

Mission Control: The Command Center I Built for Managing an Autonomous AI Empire

49 services. 7 agents running 24/7. 54 monitors. One dashboard. Here's how I built the cognitive hub that makes running an autonomous AI ecosystem survivable.

February 25, 20269m read

Engineering

Engineering Leadership in the Agentic Era: What Changes When AI Writes AND Executes

The shift from AI-as-assistant to AI-as-agent isn't just a capability upgrade. It's a fundamental reorganization of how engineering teams are structured, how work flows, and what the engineering manager's job actually is.

February 23, 20267m read