Claude Code Starter

Turn Claude Code from a smart assistant into a full engineering team. This repo gives you 35 specialized AI agents, 48 skills, and 21 commands that handle everything from deep debugging to production deployment — out of the box.

Why This Repo Is Powerful

Out of the box, Claude Code is a single AI that responds to what you type. With this starter kit installed, it becomes a team of specialists:

/carmack — A debugging agent modeled after John Carmack. It doesn't ask you for logs — it builds reproduction harnesses, attaches debuggers, instruments code, and finds root causes autonomously. It writes the fix AND the regression test.
/ship — A deployment agent that runs 12+ quality gates before your code hits production: lint, tests, security audit, bundle verification, health checks, and rollback preparation. It catches the bugs you'd find at 2am.
/debug — A 5-phase systematic debugging agent that refuses to guess. It gathers evidence, forms hypotheses, tests them, and only implements fixes after proving the root cause.
/ghidra — A reverse-engineering skill for compiled binaries (.exe, .dylib, .so, .ipa, .apk, firmware). Wraps NSA's Ghidra headless mode with ready-to-run Jython scripts (decompile_all.py, dump_strings_imports.py) and a built-in router that sends you to the right tool when "the thing" is actually a browser extension, source map, or pyc file instead of a binary.
35 agents total — Code reviewers (Rails, Python, TypeScript), security auditors, architecture strategists, performance oracles, design iterators, and more. Each one is a specialist that knows its domain deeply.
Safety hooks — Pre-commit checks that prevent you from pushing secrets, destructive bash commands that require confirmation, and automatic task tracking across sessions.
Session persistence — Debug investigations survive across conversations. Start debugging today, resume tomorrow with full context.
OpenClaw pairing — The starter kit is designed to pair with OpenClaw, an unattended-agent runtime. Same skills, same agents, same carmack / ship / debug workflows — but firing on a cron with results delivered to Telegram, Slack, or your inbox. See the OpenClaw section below.

This isn't a collection of prompts. It's a battle-tested engineering workflow that catches real production bugs, prevents real security issues, and ships real code safely.

Getting Started (First-Time Setup)

You don't need to know the terminal. Just follow these 3 steps:

Step 1: Open Claude Code

Open Claude Code in your terminal, VS Code, or the desktop app. If you don't have it yet, install it from claude.ai/code.

Step 2: Tell Claude to Install It

Paste this into Claude Code:

Clone https://github.com/barkleesanders/claude-code-starter.git and run the install script to set up all the agents, skills, commands, and CLI tools

Claude will clone the repo and run ./install.sh. The script:

Copies all agents/skills/commands to ~/.claude/
Bootstraps Homebrew if missing (prompts once for your password)
Installs every CLI tool the skills depend on (gh, node, osgrep, agent-browser, wrangler, etc.)
Asks once about merging CLAUDE.md and settings.json — choose option 1 or 3.

Or run it directly in your terminal:

git clone https://github.com/barkleesanders/claude-code-starter.git && cd claude-code-starter && ./install.sh

Step 3: Verify It Works

After installation, restart Claude Code and type:

/carmack verify that the claude-code-starter installation is working correctly — check that the agents, skills, and commands are all loaded

Carmack will scan your ~/.claude/ directory and confirm everything is installed. If anything is missing, it will fix it for you.

That's it. You now have 35 agents, 52 skills, and 26 commands ready to use, plus an idempotent CLI installer (install-tools.sh) that brings in bd, gh, node, osgrep, agent-browser, wrangler, vercel, rclone, ffmpeg, and more. See Installation below for details.

Your First Real Command

Navigate to any project and try:

/carmack what are the top 3 bugs or issues in this codebase?

Or if you're ready to deploy:

/ship

What's Included

Core Agents (35)

Agent	Purpose	Invoke
`systematic-debugging`	5-phase bug investigation with evidence checkpoints	`/debug [issue]`
`carmack-mode-engineer`	Deep debugging with repro harnesses & debugger	`/carmack [issue]`
`ship-working-code`	Safe deployment with quality gates	`/ship`
`git-safety`	Scan for secrets, clean git history, prevent leaks	`/git-safety`
`git-preflight`	Pre-flight checks before git commands	`/git-preflight`
`code-reviewer`	AI-powered code review	`/code-review`
+ 29 more specialized agents	See `agents/` directory

Skills (45+)

Skill	Description
`/debug`	Launch systematic-debugging for bug reports, test failures
`/carmack`	Launch carmack-mode for race conditions, perf issues
`/ship`	Deploy with lint, test, security checks
`/safety-audit`	Tiered production safety checks
`/git-safety`	Scan for secrets, clean history
`/git-preflight`	Pre-flight git checks
`/ralph`	Autonomous feature implementation
`/prd`	Product requirements document generation
`/brainstorming`	AI-assisted brainstorming
`/token-usage`	Analyze token usage across all projects and sessions
+ many more	See `skills/` directory

Commands (21)

Slash commands for common workflows. See commands/ directory.

Hooks

SessionStart - Initialize task tracking, run config backup
PreToolUse - Bash command safety checks
SubagentStop - Check for saved debug sessions

Agent Details

systematic-debugging

Enforces a disciplined 5-phase workflow:

Root Cause Investigation - Gather evidence, no assumptions
Pattern Analysis - Find similar issues, related code
Hypothesis & Testing - Form and test theories
Implementation & Verification - Fix with approval checkpoint
Session Persistence - Save state for resumability

Best for: Bug reports, test failures, "it worked before" issues

carmack-mode-engineer

Empirical debugging in the style of John Carmack:

Builds reproduction harnesses in tools/repro/
Attaches debuggers autonomously (lldb)
Instruments code without asking for logs
Takes complete ownership of investigation
Creates minimal, surgical fixes

Best for: Race conditions, memory leaks, performance issues, intermittent failures

ship-working-code

Production deployment with comprehensive quality gates:

Repository verification & merge conflict resolution
Intelligent lint fixing (automatic + AI-powered)
Security audits & test execution
Multi-platform deployment with health checks
Rollback capability

Best for: When you're ready to deploy

Safety Audit Tiers

/safety-audit           # Tier 1 (default) - Critical checks
/safety-audit tier2     # Tier 2 - Investigation
/safety-audit tier3     # Tier 3 - Deep analysis
/safety-audit full      # All tiers

Tier	Agent	Checks
1	ship-working-code	Silent failures, security, tests
2	systematic-debugging	Blind spots, test quality, rate limits
3	carmack-mode-engineer	Code archaeology, critical paths

Token Usage Tracking

Track how many tokens your Claude Code sessions consume across all projects:

/token-usage              # All time usage
/token-usage 7            # Last 7 days
/token-usage 2025-01-15   # Since specific date

Or run the script directly:

python3 ~/.claude/skills/token-usage/token-usage.py
SINCE_DAYS=7 python3 ~/.claude/skills/token-usage/token-usage.py

Generates a report at ~/.claude/usage/token_report.md with:

Total tokens across all projects and sessions
Per-project breakdown (input, cache, output)
Top 25 costliest sessions with first prompt context
Subagent token consumption analysis

All data stays local — nothing is sent anywhere.

Self-Improvement System

Claude Code gets smarter every session through five feedback loops that capture mistakes and surface them as reviewable patterns on the next /carmack invocation.

What gets captured automatically

Hook	Captures	Log file
`PostToolUse`	Tool errors (any `is_error:true`)	`~/.claude/tool-errors.jsonl`
`UserPromptSubmit`	Correction ("don't do X") and success ("perfect") phrases	`~/.claude/feedback-pending.jsonl`
`PostToolUse Skill\|Agent`	Skill and subagent invocations + outcome	`~/.claude/skill-usage.jsonl`

What gets surfaced

On every /carmack invocation, a single scanner (~/.claude/skills/carmack/tools/auto-improve.sh) runs all five checks and prints a compact summary:

Novel tool errors → tool-errors-pending.md — Claude classifies each into shared/tool-error-recovery.md
User behavioral feedback → feedback-pending.md — Claude saves as memory entries (feedback type)
Skill usage report — flags skills with >30% error rate (weekly, needs 5+ invocations)
Skill drift — validates commands/paths in every skill.md still resolve (weekly, allowlist-backed)
Memory search — memory-search.sh <topic> greps both MEMORY.md and bd memories before starting work

Token & efficiency gains

Measured against a real VPS long-session workload before/after today's changes:

Metric	Before	After	Gain
Context ceiling (featherless GLM-4.7)	32,768	48,000	+46%
Tokens sent to model at turn 5	~44K (near ceiling, compaction deferred)	46,752 with 231,863 of history compressed in	5x effective context
LCM mode default	`deferred` (no-op boundary writes)	`inline` (real summarization before next prompt)	compaction actually shrinks
Tool-error recovery catalog	9 patterns	20 patterns	+11 from scanning 1,141 transcripts (3,256 errors classified)
Blind-spot reference	—	10 patterns	covers schema-validate-before-restart, self-upgrade traps, adjacent-system breakage
Stop-hook checks	3 (tasks, errors, request coverage)	5 (adds real-traffic verification, fix-all-issues rule)	catches silent infra "done"

Per-session savings on long debugging workloads: typical 50-turn session now stays under the 48K ceiling indefinitely; previously would crash at turn 2-3 with "context exceeded." For shorter sessions, the tool-error catalog saves roughly 2–5K tokens per known failure pattern (no re-diagnosis needed).

Safe JSON config mutation

New helper replaces hand-rolled python-heredoc patches:

~/.claude/skills/carmack/tools/json-patch.sh <config> '<jq-expr>' \
  --validate-cmd "<cmd>" --restart <systemd-service>

Auto-backup + jq patch + JSON validate + optional validate-cmd + optional systemd restart + auto-rollback on any failure.

Usage

# See what's accumulated since last review
~/.claude/skills/carmack/tools/auto-improve.sh

# After classifying pending items into the catalog, archive the logs
~/.claude/skills/carmack/tools/auto-improve.sh --clear

# Skill-specific
~/.claude/skills/carmack/tools/skill-drift-check.sh
~/.claude/skills/carmack/tools/memory-search.sh openclaw

The system is agent-native: every learning loop compounds into Claude's next session without manual curation.

Debug Session Persistence

Investigations are saved to tools/debug-sessions/{issue-name}/:

tools/debug-sessions/
  auth-timeout/
    auth-timeout-state.md       # Current phase, findings
    auth-timeout-evidence.md    # Logs, traces, data
    auth-timeout-hypothesis.md  # Tested theories

Resume any investigation: "Resume the auth-timeout investigation"

Installation

One-line install (recommended)

git clone https://github.com/barkleesanders/claude-code-starter.git && cd claude-code-starter && ./install.sh

That single command:

Copies 35 agents, 51 skills, 26 commands to ~/.claude/
Installs/merges CLAUDE.md and settings.json
Bootstraps Homebrew if missing (you'll be prompted for your macOS password once)
Installs all CLI tools in dependency order: bd, jq, node, gh, ripgrep, osgrep, qmd, rclone, ffmpeg, agent-browser, wrangler, vercel, eas-cli, plus claude-hud (cloned from https://github.com/barkleesanders/claude-hud and built — provides the rich statusLine HUD)
Adds Homebrew to your shell rc (zsh/bash) for future sessions
Auto-inits .beads/ in the current repo if it's a git repo

Skip the CLI tool install with ./install.sh --no-tools.

The script is idempotent — safe to re-run. It detects what's already installed and only runs the missing pieces.

CLI Tools Installer

The agents and skills reference ~14 external CLIs. install-tools.sh installs them all by tier:

./install-tools.sh              # Install everything (default)
./install-tools.sh --check      # Report what's missing, install nothing
./install-tools.sh --core       # Just the mandatory ones (bd, gh, jq, node)
./install-tools.sh --search     # Code/doc search (ripgrep)
./install-tools.sh --browser    # Browser automation (agent-browser)
./install-tools.sh --deploy     # Cloud clients (wrangler, vercel, rclone)
./install-tools.sh --media      # Media tools (ffmpeg)
./install-tools.sh --hud        # claude-hud (statusLine HUD)
./install-tools.sh --optional   # eas-cli, codex, asc

Idempotent — safe to re-run. Detects what's already installed via command -v and only runs the missing installs.

What gets installed

Tier	Tool	Source	Used by
core	`bd` (beads)	`brew install steveyegge/tap/beads`	Mandatory task tracking (CLAUDE.md rule)
core	`gh`	`brew install gh`	GitHub API rate-limit rule
core	`jq`	`brew install jq`	JSON in many skill scripts
core	`node`	`brew install node`	Provides `npm` for the rest
search	`rg` (ripgrep)	`brew install ripgrep`	Fast code search
search	`osgrep`	`npm i -g osgrep`	Open Source Semantic Search (Ryandonofrio3/osgrep) — AST-aware code search
search	`qmd`	`npm i -g @tobilu/qmd`	Local doc/notes semantic search (tobi/qmd)
hud	`claude-hud`	`git clone barkleesanders/claude-hud && npm i && npm run build`	Rich statusLine HUD (used by `settings.json` `"statusLine"`)
browser	`agent-browser`	`npm i -g agent-browser`	Headless browser skill
deploy	`wrangler`	`npm i -g wrangler`	Cloudflare Workers deploys
deploy	`vercel`	`npm i -g vercel`	Vercel deploys
deploy	`rclone`	`brew install rclone`	Cloud storage skill
media	`ffmpeg`	`brew install ffmpeg`	avatar-video, video skills
optional	`eas-cli`	`npm i -g eas-cli`	Expo builds
optional	`codex`	per `skills/codex/SKILL.md`	codex-chat / codex-rescue skills
optional	`asc`	per `skills/ios-ship/SKILL.md`	App Store Connect (iOS shipping)

Homebrew bootstrap: macOS users without brew will see one-liner instructions on first run. Install Homebrew once interactively, then re-run ./install-tools.sh.

Manual

Copy to your Claude config directory:

# Agents
cp agents/*.md ~/.claude/agents/

# Skills
cp -r skills/* ~/.claude/skills/

# Commands (top-level + workflows subdirectory)
mkdir -p ~/.claude/commands
cp commands/*.md ~/.claude/commands/
cp -r commands/workflows ~/.claude/commands/

# CLAUDE.md (merge with existing or replace)
cp CLAUDE.md ~/.claude/CLAUDE.md

# Settings (review and merge hooks/permissions into your settings.json)
cat settings.json

# CLI tools (do this last)
./install-tools.sh

File Structure

claude-code-starter/
├── README.md
├── install.sh                       # Installs agents/skills/commands + offers CLI tools
├── install-tools.sh                 # Installs the ~14 external CLIs (idempotent, tiered)
├── settings.json                    # Claude Code settings (permissions, hooks, plugins)
├── CLAUDE.md                        # Quick reference config
├── agents/                          # 35 specialized agent definitions
│   ├── systematic-debugging.md
│   ├── carmack-mode-engineer.md
│   ├── ship-working-code.md
│   └── ...
├── skills/                          # 51 skill definitions
│   ├── debug/skill.md
│   ├── carmack/skill.md
│   ├── ship/skill.md
│   └── ...
├── commands/                        # 26 slash commands (incl. workflows/)
│   ├── browser.md
│   ├── changelog.md
│   └── workflows/                   # work, plan, review, brainstorm, compound
├── beads-formulas/                  # Workflow templates (records-request, ship-deploy, …)
├── scripts/
│   └── backup-claude-config.sh      # Auto-backup script
└── hooks-examples/
    └── settings-hooks.json          # Example hook configurations

OpenClaw — Unattended Agent Jobs

The starter kit makes Claude Code interactive powerful. OpenClaw makes it unattended powerful. Together they cover both halves of the agent-work problem: in-session work at your keyboard, and out-of-session work running on a schedule.

What OpenClaw is

OpenClaw is a runtime for scheduled Claude agents. You install it on any always-on host (a Mac mini, a small Linux box, a NUC, a VPS) and it lets you create cron jobs that invoke Claude with a prompt — same carmack, ship, debug skills you use interactively, but firing on a schedule and delivering results to Telegram, Slack, Discord, or email.

Think of it as the difference between bash -c "..." and a crontab line: OpenClaw is crontab for Claude.

Why pair OpenClaw with this starter kit

Both layers share the same ~/.claude/ directory. Skills you write once work in both contexts:

You write /carmack once — it runs both when you type /carmack interactively and when an OpenClaw cron fires openclaw cron add --message "/carmack check the build" --cron '0 9 * * *'.
Hooks fire in both contexts — the same pre-bash-block-deploy hook that protects your interactive session also protects unattended runs.
Memory and beads are shared — what OpenClaw learns at 3am is in your context when you sit down at 9am.

This is why the starter kit's carmack skill ships with tools/openclaw-remote-doctor.sh — when something breaks on the unattended side, Claude on your laptop can SSH into the OpenClaw host, run openclaw doctor, scan recent tool-usage, and surface the failures locally without you needing to leave your editor.

Two-rule operating model

The starter kit's CLAUDE.md codifies two rules for picking the right scheduler:

openclaw cron is ONLY for LLM work. Anything that needs a model — agent turns, browsing, summarization, code review on a PR queue — goes in OpenClaw.
Pure shell work goes in systemd timers (or launchd on macOS). git pull, rsync, curl, systemctl restart — burning agent tokens on deterministic shell is pure waste.

This split is what makes the combined setup sustainable cost-wise: agent tokens for things only an agent can do, shell timers for everything else.

Native-first rule (also codified in `CLAUDE.md`)

Before writing custom scripts/crons/systemd units that wrap OpenClaw, check openclaw --help, openclaw <area> --help, and --dry-run. The native CLI covers more than people assume — adding --best-effort-deliver to a cron with a Telegram channel beats writing a shell wrapper to swallow delivery errors. The full rule is in feedback_openclaw_native_first.md when you run with the starter kit's auto-memory enabled.

Getting started

Install OpenClaw on your always-on host: see docs.openclaw.ai.
Make sure ~/.claude/ on that host has the starter kit installed (run the same install.sh there).
Override the host in skills/carmack/tools/openclaw-remote-doctor.sh — replace <YOUR_VPS_IP> with your host, or set OPENCLAW_HOST=user@your-host:port in the environment.

Create your first scheduled agent:

openclaw cron add --agent main --cron '0 9 * * 1-5' \
  --message "/carmack scan ~/code/myproject for the top 3 bugs and post a summary" \
  --channel telegram --best-effort-deliver

That's it — you now have a Claude agent that wakes up every weekday at 9am, runs /carmack on a project, and posts the result to Telegram.

Reference implementation: see hermes-setup — my Mac mini deployment of NousResearch/hermes-agent, running this exact pattern. launchd for deterministic shell, OpenClaw for scheduled Claude agents, claude-code-starter for the shared ~/.claude/ skills.

Privacy & PII Boundaries

This is a public repo. The author's working ~/.claude/ directory is private and contains personal data (real names, case files, medical records, financial data, project specifics). The two are kept separate by an explicit firewall:

Stays private (never pushed here)	Lives in this repo
`~/.claude/projects/<id>/memory/*.md` — auto-memory with names, cases, addresses, phones	The CLAUDE.md rules about how memory works, but never memory content itself
Skill files that reference specific people, properties, court cases, medical providers	Skill files that document a workflow with placeholders (`<YOUR_VPS_IP>`, `<YOUR_PROJECT>`, etc.)
Real OpenClaw cron jobs, hostnames, Tailscale IPs, Telegram chat IDs	The patterns for how to create such jobs
`~/.beads/` issue databases with case identifiers	The `bd` CLI and the `beads-formulas/` templates
Anything matching `barklee`, real email addresses, `@aivaclaims`, real phone numbers, real addresses, real case numbers	Scrubbed equivalents and `<PLACEHOLDER>` strings

How sync works in practice: before pushing any change to this repo, the author runs a PII grep (grep -rniE "barklee|sanders|@gmail|@aivaclaims|/Users/[a-z]+|<actual-phone>|<actual-address>") over the staged files. Anything that matches gets sanitized (replaced with <PLACEHOLDER>) or extracted into a separate private skill that stays in ~/.claude/skills/ but is never committed here.

The Ghidra skill in this repo is an example: it ships with two ready-to-run Jython scripts and a complete workflow guide — but the binaries the author actually reverse-engineers, the project paths they use, and the findings they produce never leave their laptop. The skill is reusable; the work product is private.

If you fork this repo and add your own skills: do the same. Any skill that names a real person, property, case, or proprietary system should live in a separate private repo or a non-tracked subdirectory — never here. The starter kit is the scaffolding, not the content.

Customization

Adding New Agents

Create ~/.claude/agents/your-agent.md:

# Your Agent Name

Your agent's system prompt and instructions here.

## When to Use
- Scenario 1
- Scenario 2

## Workflow
1. Step 1
2. Step 2

Adding New Skills

Create ~/.claude/skills/your-skill/skill.md:

---
name: your-skill
description: "What this skill does"
---

# /your-skill - Skill Title

Instructions for when this skill is invoked.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
agents		agents
beads-formulas		beads-formulas
commands		commands
hooks-examples		hooks-examples
scripts		scripts
skills		skills
.gitattributes		.gitattributes
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
install-tools.sh		install-tools.sh
install.sh		install.sh
settings.json		settings.json

Folders and files

Latest commit

History

Repository files navigation

Claude Code Starter

Why This Repo Is Powerful

Getting Started (First-Time Setup)

Step 1: Open Claude Code

Step 2: Tell Claude to Install It

Step 3: Verify It Works

Your First Real Command

What's Included

Core Agents (35)

Skills (45+)

Commands (21)

Hooks

Agent Details

systematic-debugging

carmack-mode-engineer

ship-working-code

Safety Audit Tiers

Token Usage Tracking

Self-Improvement System

What gets captured automatically

What gets surfaced

Token & efficiency gains

Safe JSON config mutation

Usage

Debug Session Persistence

Installation

One-line install (recommended)

CLI Tools Installer

What gets installed

Manual

File Structure

OpenClaw — Unattended Agent Jobs

What OpenClaw is

Why pair OpenClaw with this starter kit

Two-rule operating model

Native-first rule (also codified in CLAUDE.md)

Getting started

Privacy & PII Boundaries

Customization

Adding New Agents

Adding New Skills

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Native-first rule (also codified in `CLAUDE.md`)

Packages