How to Use Smith

Smith is a task-tree runner where the LLM is the runtime. You write markdown files, organize them into folders, and Smith executes them as programs.

Setup

1. Build

go build -o smith ./cmd/smith

2. Configure your API key

Launch the interactive TUI:

smith

Or use the command-line config:

smith config

Or set the environment variable:

export SMITH_ANTHROPIC_API_KEY="sk-ant-..."

3. Initialize libraries

smith init

This extracts the built-in planner and shipped lib tools to ~/.smith/lib/.

If you are working on Smith itself and changing files under internal/lib/builtin/ or the shipped native tool catalog, rebuild and refresh the extracted lib before testing smith plan or any lib-backed app:

go build -o smith ./cmd/smith
./smith init

smith init preserves user-modified files in ~/.smith/lib/. If you need the embedded copies to win, run smith lib update --force, or test in a clean temporary home directory.

Creating a task tree manually

A task tree is a directory. The minimum is a task.md and an agent.md:

my-project/
  task.md       # what to do (the prompt)
  agent.md      # which model to use

task.md — the instruction:

Summarize the key arguments in the provided text.
Keep it under 200 words.

agent.md — the model config:

model: anthropic/claude-sonnet-4-6
temperature: 0.2

Validate and run

smith validate my-project    # check the tree is valid
smith run my-project         # execute it

Output lands in my-project/output/result.md.

Adding subtasks

Subtasks go in a subtasks/ directory. Numeric prefixes control execution order:

my-project/
  task.md
  agent.md
  subtasks/
    01-gather/
      task.md
    02-analyze/
      task.md
    03-write/
      task.md

Tasks with numeric prefixes run sequentially (01 before 02 before 03). Each subtask's output is available to the next via depends_on.

Explicit dependencies

If a subtask needs another subtask's output, declare it in frontmatter:

---
depends_on:
  - 01-gather
---

Analyze the information gathered in the previous step.

The dependency's output is injected into the prompt automatically.

Structured output (JSON)

Tasks can produce JSON instead of markdown. Add output.type: json to frontmatter and provide a schema.md:

task.md:

---
output:
  type: json
---

Extract the top 5 key points from the provided text.

schema.md:

{
  "type": "object",
  "required": ["points"],
  "properties": {
    "points": {
      "type": "array",
      "items": { "type": "string" }
    }
  }
}

The output goes to output/result.json and is validated against the schema.

Optional sidecar files

File	Purpose
`agent.md`	Model, persona, temperature, max_tokens
`tools.md`	Tool access (one per line: `- project.read`)
`schema.md`	JSON Schema for `output.type: json` tasks
`context/static/`	Reference files injected into the prompt

Agent config inherits from parent to child — subtasks without their own agent.md use the parent's.

Constraints

Add behavioral guardrails in frontmatter:

---
constraints:
  - Keep responses under 500 words
  - Use only peer-reviewed sources
  - Write in British English
---

Shell tasks

Tasks can run shell scripts instead of LLM prompts. Set model: shell in agent.md:

agent.md:

model: shell

task.md:

curl -s https://api.example.com/data | jq '.results'

Shell tasks receive sibling outputs via $SMITH_DEP_* environment variables and run input via $SMITH_INPUT_*.

Static context

Put reference documents in context/static/:

my-project/
  task.md
  agent.md
  context/
    static/
      company-style-guide.md
      product-requirements.md

These files are injected into the task's prompt as labeled sections.

Using `smith plan` (v2)

Instead of writing task trees by hand, let Smith design them for you:

smith plan ./my-project "research climate change impacts and write a summary report"

This runs the built-in planner, which:

Examines the target directory
Designs a task tree to achieve your goal
Creates a proposal in .smith/proposals/

Review the proposal

cat my-project/.smith/proposals/<id>/summary.md

Browse the staged files in my-project/.smith/proposals/<id>/files/ to see exactly what will be created.

Apply the proposal

smith apply my-project/.smith/proposals/<id>

This materializes the task tree and runs validation.

Run it

smith run my-project

Full lifecycle

smith plan ./demo "create a research pipeline about renewable energy"
smith apply ./demo/.smith/proposals/<id>
smith validate ./demo
smith run ./demo

Plan options

smith plan ./demo "goal" --model anthropic/claude-sonnet-4-6   # override planner model
smith plan ./demo "goal" --max-cost-usd 1.00                    # set cost ceiling

Planning a web-backed app

The shipped web.* tools are especially useful for research and briefing apps. A good pattern is to describe the exact web behavior you want and use explicit dates instead of relative phrases like "yesterday".

Example:

./smith plan ./demo-news "Create a web-backed news briefing app that accepts user inputs for a target location and briefing goal. Use explicit dates rather than relative phrases like 'yesterday'. Prefer the shipped web.lookup and web.fetch_markdown tools instead of creating new web tools. The output should be a concise markdown briefing with sections for world news, sports, finance, and local or regional updates." --max-cost-usd 0.75

Suggested follow-through:

cat ./demo-news/.smith/proposals/<id>/summary.md
./smith apply ./demo-news/.smith/proposals/<id>
./smith validate ./demo-news

If you are testing planner or lib changes from source, rebuild and run ./smith init first so the planner and shipped web tools in ~/.smith/lib/ match your current checkout.

See examples for checked-in example apps and repo hygiene notes for .smith/ runtime data.

Caching

Smith caches task outputs. If inputs haven't changed, re-running skips cached tasks:

smith run my-project          # runs everything
smith run my-project          # skips cached tasks
smith run my-project --no-cache  # force re-run everything

To mark a task as never cacheable:

---
cache: never
---

Piping (stdin/stdout)

Smith works as a Unix filter:

cat document.txt | smith run ./summarizer > summary.md

Piped stdin becomes a run input entry
Stdout receives the root task's canonical output
Status messages go to stderr

Run input

Pass data to a task tree at invocation time:

smith run ./analyzer --input topic="quantum computing" --input format="bullet points"

Run input appears in the root task's prompt as labeled sections.

Module references

Reuse task trees from ~/.smith/lib/ via module.yaml:

# module.yaml
source: summarize

This imports the summarize library module. Override its config with local sidecar files alongside module.yaml.

Using the TUI

Run smith with no arguments to launch the interactive terminal dashboard:

smith

The TUI has four tabs (navigate with Tab/Shift+Tab):

Config — set API keys, Ollama endpoint, probe providers for live model discovery, and configure default planner/task models
Projects — browse known projects, view run history, remove stale entries
Packages — browse installed lib modules and tool definitions with provenance indicators (embedded, user-modified, user-added)
Search — search file names and content across all known projects (press Enter to focus the search input, Esc to unfocus)

Press q to quit. Ctrl+C always quits, even during editing.

CLI reference

Command	Description
`smith`	Launch TUI (TTY) or print help (non-TTY)
`smith tui`	Launch TUI explicitly
`smith run <path>`	Execute a task tree
`smith plan <path> <goal>`	Generate a proposal from a goal
`smith apply <proposal-path>`	Materialize a proposal
`smith validate <path>`	Check a task tree is valid
`smith status <path>`	Report per-task execution status
`smith init`	Extract built-in libraries
`smith lib update`	Update libraries from binary
`smith config`	Configure API credentials

smith run flags

Flag	Description
`--dry`	Show execution plan without running
`--no-cache`	Force all tasks to execute
`--input key=value`	Pass run input (repeatable)
`--scope key=value`	Set tool scope (repeatable)

smith plan flags

Flag	Description
`--model <model>`	Override planner model
`--max-cost-usd <amount>`	Cost ceiling for planning

smith apply flags

Flag	Description
`--dry`	Show what would be applied
`--force`	Skip conflict detection

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to Use Smith

Setup

1. Build

2. Configure your API key

3. Initialize libraries

Creating a task tree manually

Validate and run

Adding subtasks

Explicit dependencies

Structured output (JSON)

Optional sidecar files

Constraints

Shell tasks

Static context

Using `smith plan` (v2)

Review the proposal

Apply the proposal

Run it

Full lifecycle

Plan options

Planning a web-backed app

Caching

Piping (stdin/stdout)

Run input

Module references

Using the TUI

CLI reference

smith run flags

smith plan flags

smith apply flags

FilesExpand file tree

howto.md

Latest commit

History

howto.md

File metadata and controls

How to Use Smith

Setup

1. Build

2. Configure your API key

3. Initialize libraries

Creating a task tree manually

Validate and run

Adding subtasks

Explicit dependencies

Structured output (JSON)

Optional sidecar files

Constraints

Shell tasks

Static context

Using smith plan (v2)

Review the proposal

Apply the proposal

Run it

Full lifecycle

Plan options

Planning a web-backed app

Caching

Piping (stdin/stdout)

Run input

Module references

Using the TUI

CLI reference

smith run flags

smith plan flags

smith apply flags

Using `smith plan` (v2)