Debug Investigate

Scientific method debugging with a persistent eliminated-hypotheses log. Prevents the #1 AI debugging failure: re-testing disproven theories across context resets.

How It Works

Debug Investigate · Workflow

Hypotheses confirmed or killed — eliminated ones never come back.

TRIGGER: Why is this failing?
A bug or failure to root-cause; --continue resumes

STEP 1: Init persistent debug file
.debug/{slug}.md: immutable symptoms, append-only Eliminated log

STEP 2: Gather evidence first
Full error · codebase search · git log · tests, all before any hypothesis

STEP 3: Form falsifiable hypotheses
Specific + testable; anti-bias check (confirmation · anchoring · sunk cost)

GATE: Test one hypothesis at a time
One experiment each; never re-test the Eliminated section
↻ fail → Eliminated → append with evidence, form next hypothesis
confirmed → STEP 4

STEP 4: Fix, verify, post-mortem
Targeted test + full suite, mark resolved, write lessons.md entry

OUTPUT: Resolved + elimination log
Root cause recorded; dead ends survive context resets

Invocation Triggers

/debug-investigate · debug · investigate · root cause · why is this failing

Use Cases

Investigate a production bug with a structured methodology
Resume a debugging session across context resets without repeating dead ends
Debug a failing CI test with systematic hypothesis elimination

The Problem

The #1 AI debugging failure isn't wrong hypotheses — it's re-testing the ones you already eliminated. Context resets, long sessions, multiple agents: without a log, you circle the same dead ends indefinitely. The scientific method works. The problem is that nothing enforces it. This skill creates a persistent debug file that makes eliminated hypotheses append-only and session-proof.

What It Does

1. Initialize the debug file
Creates `.debug/{slug}.md` in the project root with immutable Symptoms, a Current Focus field (overwritten before every action), an append-only Eliminated section, and an Evidence Log. If `--continue` is passed, reads the existing file instead.

2. Gather evidence before forming hypotheses
Reads the full error, searches the codebase for the exact message, reads the full function (not just the line), checks git log for recent changes, runs tests. All findings go into the Evidence Log before any hypothesis is formed.

3. Form falsifiable hypotheses
Each hypothesis must be specific and falsifiable: "the auth token expires because the refresh logic uses < instead of <= for the expiry check", not "something is wrong with auth". Anti-bias checklist enforced: confirmation bias, anchoring, sunk cost, recency.

4. Test one hypothesis at a time
Updates Current Focus before every action. Runs one experiment per hypothesis. CONFIRMED → fix. ELIMINATED → appended to the Eliminated section with evidence reference, then next hypothesis. Never re-tests anything in Eliminated.

5. Fix, verify, post-mortem
Implements the fix, runs the failing test, runs the full suite. Updates the debug file to resolved. Writes a lessons.md entry with root cause, detection latency, and alerting gap. Checks if a regression test is needed.
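
To make the file mechanics concrete, here is a minimal Python sketch of steps 1 and 4. It is not the skill's actual implementation: the section headings, the `Status` line, and the function names `init_debug_file` and `eliminate` are all hypothetical, and the real SKILL.md may lay the file out differently. The sketch places Eliminated last so that a plain file append can never rewrite an earlier entry.

```python
from pathlib import Path

# Hypothetical layout; the real SKILL.md may use different headings.
TEMPLATE = """# Debug: {slug}

Status: investigating

## Symptoms
{symptoms}

## Current Focus
(none yet)

## Evidence Log

## Eliminated
"""


def init_debug_file(root: Path, slug: str, symptoms: str, resume: bool = False) -> Path:
    """Create .debug/{slug}.md, or return the existing file when resuming."""
    path = root / ".debug" / f"{slug}.md"
    if resume:
        # Mirrors --continue: read the existing file, never recreate it.
        if not path.exists():
            raise FileNotFoundError(f"nothing to resume at {path}")
        return path
    if path.exists():
        raise FileExistsError(f"{path} exists; resume instead of overwriting")
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(TEMPLATE.format(slug=slug, symptoms=symptoms), encoding="utf-8")
    return path


def eliminate(path: Path, hypothesis: str, evidence_ref: str) -> None:
    """Append a disproven hypothesis without touching earlier content."""
    # Eliminated is the last section in this layout, so opening the file in
    # append mode adds the entry to that section and leaves the rest intact.
    with path.open("a", encoding="utf-8") as f:
        f.write(f"- {hypothesis} (evidence: {evidence_ref})\n")
```

Refusing to overwrite an existing file is a deliberate choice in this sketch: starting over by accident would discard exactly the dead ends the log exists to preserve.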

What You Get / What It Doesn't Do

What you get

A `.debug/{slug}.md` file that survives context resets and agent handoffs
Append-only Eliminated section — dead-end hypotheses never get re-tested
Evidence Log with timestamped action/result pairs
A post-mortem entry in lessons.md with root cause and alerting gap
Regression test recommendation if the bug can recur
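
As an illustration of what a timestamped action/result pair could look like, the following Python sketch formats one entry and inserts it at the end of a named section. The entry format, the `## Evidence Log` heading, and the helper names `evidence_entry` and `append_to_section` are assumptions made for this example; the skill's real format is not specified here.

```python
from datetime import datetime, timezone
from typing import Optional


def evidence_entry(action: str, result: str, now: Optional[datetime] = None) -> str:
    """Format one action/result pair with a UTC timestamp (hypothetical format)."""
    # `now` is expected to be timezone-aware; it defaults to the current UTC time.
    moment = (now or datetime.now(timezone.utc)).astimezone(timezone.utc)
    ts = moment.strftime("%Y-%m-%dT%H:%M:%SZ")
    return f"- [{ts}] action: {action} | result: {result}"


def append_to_section(text: str, heading: str, entry: str) -> str:
    """Insert `entry` as the last line of the section that starts at `heading`."""
    lines = text.splitlines()
    start = lines.index(heading)  # raises ValueError if the section is missing
    end = start + 1
    # Walk forward to the next "## " heading or the end of the file.
    while end < len(lines) and not lines[end].startswith("## "):
        end += 1
    # Step back over trailing blank lines so the entry sits with its siblings.
    insert_at = end
    while insert_at > start + 1 and lines[insert_at - 1].strip() == "":
        insert_at -= 1
    lines.insert(insert_at, entry)
    return "\n".join(lines) + "\n"
```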

What it doesn't do

Guarantee the bug is found — it structures the search, not the outcome
Replace logs, metrics, or observability tooling
Work without symptoms — you need a reproducible failure or error message to start
Auto-fix — it diagnoses and fixes after confirmation, not speculatively

Tips

Always use --continue

Start a new session on the same bug with `/debug-investigate --continue`. It reads the Eliminated section first — that's the whole point. Starting fresh means repeating dead ends.
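
The resume step can be pictured with a short Python sketch that reads the Eliminated entries back out of the file and checks a candidate hypothesis against them. It assumes a `## Eliminated` heading followed by `- ` bullets, which is a hypothetical layout, and the substring match in `already_eliminated` is deliberately naive; in practice the agent reads the section and judges overlap itself.

```python
def read_eliminated(text: str) -> list[str]:
    """Return the bullet entries under the '## Eliminated' heading."""
    entries = []
    in_section = False
    for line in text.splitlines():
        if line.startswith("## "):
            # Any heading starts a new section; only Eliminated is collected.
            in_section = line.strip() == "## Eliminated"
            continue
        if in_section and line.startswith("- "):
            entries.append(line[2:].strip())
    return entries


def already_eliminated(hypothesis: str, eliminated: list[str]) -> bool:
    """Naive check: case- and whitespace-insensitive substring match."""
    needle = " ".join(hypothesis.lower().split())
    return any(needle in " ".join(e.lower().split()) for e in eliminated)
```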

Binary search the pipeline

If the bug is in a multi-step pipeline, test the midpoint. Healthy → bug is downstream. Unhealthy → upstream. Repeat until isolated. Faster than sequential hypothesis testing.
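
A minimal Python sketch of that bisection follows. It assumes the pipeline is linear and that a fault persists once introduced, so every stage after the faulty one also looks unhealthy. The name `first_bad_stage` and the `healthy_after` callback are hypothetical; the callback stands in for whatever check you run on a stage's output.

```python
from typing import Callable, Optional, Sequence


def first_bad_stage(
    stages: Sequence[str],
    healthy_after: Callable[[str], bool],
) -> Optional[str]:
    """Return the first stage whose output is unhealthy, or None if all pass.

    Assumes that once a stage's output is unhealthy, every later stage's
    output is unhealthy too.
    """
    lo, hi = 0, len(stages)  # the first bad stage, if any, is in [lo, hi)
    while lo < hi:
        mid = (lo + hi) // 2
        if healthy_after(stages[mid]):
            lo = mid + 1  # healthy at the midpoint: the bug is downstream
        else:
            hi = mid  # unhealthy: the bug is at the midpoint or upstream
    return stages[lo] if lo < len(stages) else None
```

For n stages this needs about log2(n) checks, compared with up to n when walking the stages in order.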

Rubber duck the Current Focus

When stuck, write out your Current Focus as a full narrative sentence. "I believe X is happening because Y, and I'm about to test Z." The act of writing it usually surfaces the gap.

Get the Skill

The full SKILL.md — copy it into ~/.claude/skills/ and trigger it by name.

Commonly Used With

DevOps & Ops · Incident Response
Structured production incident response: triage severity, contain the blast, create P0 ticket, gather evidence, run investigation, and generate post-mortem.

Build & Deploy · Auto Fix
Classifies runtime failures, locates root cause in the codebase, implements a targeted fix, runs tests, and redeploys. Full autonomous loop. No manual steps.

Intelligence · Learn
Captures what you learned from a session into lessons.md before the context window closes. Formats it as a structured entry (mistake, root cause, rule) so it compounds across sessions.
