Expert subagents
Narrow, role-scoped specialists instead of one do-everything prompt. The infra agent thinks like an ops engineer; the security auditor thinks like an adversary. Each is good at its lane and stays in it.
A personal team of AI experts for your homelab, code, and ideas
Before your AI walks you through building it, here's the shape of the thing: what it is, why it's built this way, and how the pieces fit. This is the what & why — not the how. Read it like a map.
The elevator pitch — one idea, two states.
One general AI chat that forgets, doesn't know your rules, and answers from a blank slate every time.
A versioned, personal team of specialized AI agents that share memory, follow your rules, and get sharper over time.
Not a chatbot — a small, governed, self-improving AI org that's yours.
Versioned and backed up from day one — you can read it, diff it, and roll it back.
Built on familiar primitives — subagents, slash commands, hooks. No new app.
Nothing here is prescribed — your AI interviews you and tailors every specific.
Five ideas hold the whole thing up. Everything else is detail.
Narrow, role-scoped specialists instead of one do-everything prompt. The infra agent thinks like an ops engineer; the security auditor thinks like an adversary. Each is good at its lane and stays in it.
Semantic (facts that are true now) + episodic (a decision journal) + procedural (skills & checklists) — with time-awareness so it doesn't silently go stale and start lying to you.
Always-on rules: no silent behaviour changes, strict secret hygiene, and human approval for anything risky.
Agents flag friction → you run a periodic review → they get coached and improve. Compounding, human-triggered, no always-on daemon.
Agents look at reality — your repos, optionally live systems — instead of guessing. You choose the blast radius.
Who's on the team — and what each one is trusted to touch.
Knows your homelab. Starts read-only; you decide if it earns write access.
Writes and refactors code in your repos, following your conventions.
Designs and builds interfaces with a committed point of view, not templates.
Hunts for risks and leaks. Reports them — never edits anything itself.
Argues the other side and pokes holes in plans before you commit to them.
Helps you turn raw ideas into shaped specs. Thinks with you, doesn't ship.
Turns audit findings into an ordered fix plan. Plans the work; doesn't do it.
Runs the review and coaches the roster. Proposes changes — you approve them.
› runs the self-improving loopRead-only by default; write is a privilege — and the agent that decides what to change never applies it unattended. The reviewer is never the applier.
The heart of it. A closed cycle with one explicit human-approval gate.
While you work, agents flag what got in the way — a missing fact, a gotcha, a correction you had to make.
Those notes accrue into a local, private decision journal. It's read only at review time, so it never bloats your day-to-day startup.
/eng-reviewPeriodically — and only when you choose, never on a timer — you kick off the review.
It reads the journal, finds patterns, and proposes concrete improvements to memory, skills, and agent instructions.
Nothing changes until you say yes. Then the agents improve — and the loop returns to Capture, a little sharper.
It compounds — but only if you actually run the review. A start-of-session nudge reminds you when retros are piling up.
Three layers, each with a different job. Together they make it remember.
Your current reality — hosts, conventions, the lay of the land. The fast-loading top layer the agents lean on every session.
The decision & activity log. Read only at review, so it can grow rich without ever bloating startup.
The "how we do X here" layer — distilled playbooks the agents reach for when a familiar task comes around again.
It stores the non-derivable — not what's already in your code.
Memory captures gotchas and decisions: the things you'd otherwise re-explain every time. It deliberately doesn't duplicate what an agent can just read from your repo.
Each layer knows roughly how fresh it is — because staleness is silent. Old facts get flagged and pruned at review, so the system never quietly lies to you.
You don't build it all at once. You grow it as real friction tells you to.
It asks about your domains, stack, conventions, guardrails, and risk posture — so everything that follows fits you, not a template.
A tidy structure, committed and backed up from day one — so your knowledge is never sitting in a single fragile local folder.
Not the whole roster. The two or three roles that match what you do today, so the thing is useful in week one.
Once you have agents worth coaching, wire up the journal and /eng-review.
Now the system starts getting sharper on its own cadence.
Add domains and agents when you actually hit a wall — never speculatively. The journal will show you where the friction is.
Build the smallest useful thing, use it, and let friction guide what's next.
This page is the what & why. This guide ships with a companion setup doc for your Claude Code — a single Markdown file, agentic-setup-bootstrap.md, that it reads and executes to scaffold the repo, interview you, and stand up your first agents.
~/).
Read agentic-setup-bootstrap.md and set up my agentic environment.
Keep these on the left. Watch for the ones on the right.