docs_ci

CI tooling that uses a small LLM (Claude Haiku 4.5 by default) to judge natural-language criteria against a project's documentation. Fills the gap classical linters like Vale and markdownlint leave open: semantic rules, written as prose.

v0, and my first public project. The code is rough and the design is exploratory — see ROADMAP.md for direction and AGENTS.md for project conventions and architecture invariants.

Quick start

pip install -e '.[dev]'
export ANTHROPIC_API_KEY=...
docs-ci check ./docs --rules ./examples/rules.example.yaml

A rules file looks like:

rules:
  - id: has-title
    severity: error
    criterion: |
      The file must begin with a level-1 Markdown heading that names what
      the page is about.

  - id: no-todos
    severity: warning
    criterion: |
      The file must not contain "TODO", "FIXME", or "XXX" markers in its
      prose. Occurrences inside fenced code blocks are acceptable.

Each (file, criterion) pair becomes one independent LLM call. Results are reported per file:

docs/api/users.md
  ✗ has-title (error) — file starts with prose; the heading is on line 4
  ✓ no-todos

1 error, 0 warnings across 1 file
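
The fan-out and the summary line above can be sketched in a few lines of Python. This is only an illustration, not the project's code: `plan_calls`, `summarize`, and the verdict dict shape are hypothetical names chosen for the example.

```python
from itertools import product


def plan_calls(files, rules):
    """Return one (file, rule_id) pair per independent judgment call."""
    return [(path, rule["id"]) for path, rule in product(files, rules)]


def summarize(verdicts):
    """Build the closing summary line.

    Each verdict is a hypothetical dict with the keys
    "file", "rule", "severity", and "passed".
    """
    failed = [v for v in verdicts if not v["passed"]]
    errors = sum(v["severity"] == "error" for v in failed)
    warnings = sum(v["severity"] == "warning" for v in failed)
    files = len({v["file"] for v in verdicts})

    def plural(n, word):
        return f"{n} {word}" + ("" if n == 1 else "s")

    return (
        f"{plural(errors, 'error')}, {plural(warnings, 'warning')} "
        f"across {plural(files, 'file')}"
    )
```

With two rules and one file, `plan_calls` yields two pairs, so the number of LLM calls on a cold run grows as files × rules.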

CLI

docs-ci check PATH --rules RULES.yaml \
  [--provider anthropic|openrouter|nvidia] \
  [--model MODEL] [--fail-on error|warning] [--format text|github] \
  [--no-cache] [--cache-path PATH] \
  [--retries N] [--retry-delay-seconds SECONDS] \
  [--retry-max-delay-seconds SECONDS] \
  [--debug-model-output] \
  [--changed-only] [--base-ref REF]

| Flag | Default | Notes |
| --- | --- | --- |
| `PATH` | required | Docs directory to scan (markdown only in v0). |
| `--rules` | required | Path to rules YAML. |
| `--provider` | `anthropic` | LLM provider. See Providers below. |
| `--model` | provider default | Model ID. Defaults vary per provider. |
| `--fail-on` | `error` | Exit 1 on failures at or above this severity. |
| `--format` | `text` | Output format. `github` emits Actions annotations. See GitHub Actions below. |
| `--no-cache` | off | Disable the persistent verdict cache for this run. |
| `--cache-path` | `.docs-ci/cache.json` | Where the verdict cache lives. See Verdict cache below. |
| `--retries` | `0` | Retry each transient provider/model failure this many extra times. |
| `--retry-delay-seconds` | `2` | Initial delay before retrying a failed verdict call. |
| `--retry-max-delay-seconds` | `30` | Maximum delay between verdict call retries. |
| `--debug-model-output` | off | Print truncated raw model output when a provider response cannot be parsed. |
| `--changed-only` | off | Only judge files that changed since `--base-ref`. See Diff mode below. |
| `--base-ref` | auto-detected | Git ref to diff against in `--changed-only` mode. |

Exit codes: 0 (no rule failed at or above the --fail-on severity), 1 (at least one failure at or above --fail-on), 2 (config / CLI error).
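
As a rough sketch of how the --fail-on threshold maps failures to an exit code, assuming only the two severities shown in this README (`SEVERITY_RANK` and `exit_code` are hypothetical names, not the project's API):

```python
# Higher rank means more severe; only the two severities the README shows.
SEVERITY_RANK = {"warning": 1, "error": 2}


def exit_code(failed_severities, fail_on="error"):
    """Return 1 if any failing verdict is at or above `fail_on`, else 0."""
    threshold = SEVERITY_RANK[fail_on]
    hit = any(SEVERITY_RANK[s] >= threshold for s in failed_severities)
    return 1 if hit else 0
```

Under this reading, a run whose only failures are warnings exits 0 with the default `--fail-on error` and exits 1 with `--fail-on warning`.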

Retries happen per (file, rule) call, after cache lookup and before writing a fresh verdict to the cache. They are only used for transient provider/model failures such as HTTP 429/5xx, connection errors, timeouts, and missing structured tool-call responses. Config/auth errors are not retried.
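
The retry flags suggest a capped backoff. The sketch below assumes the delay doubles on each attempt, which this README does not state; `TransientError`, `backoff_delays`, and `call_with_retries` are hypothetical names used only for illustration.

```python
import time


class TransientError(Exception):
    """Stand-in for HTTP 429/5xx, timeouts, or a missing tool call."""


def backoff_delays(retries, initial=2.0, maximum=30.0):
    """Delays before each retry: doubling (an assumption), capped at `maximum`."""
    return [min(initial * 2 ** i, maximum) for i in range(retries)]


def call_with_retries(call, retries=0, initial=2.0, maximum=30.0, sleep=time.sleep):
    """Run `call`, retrying only on TransientError, up to `retries` extra times."""
    delays = backoff_delays(retries, initial, maximum)
    for attempt in range(retries + 1):
        try:
            return call()
        except TransientError:
            if attempt == retries:
                raise  # retries exhausted: surface the last failure
            sleep(delays[attempt])
```

Any other exception type, such as a config or auth error, propagates on the first attempt because only `TransientError` is caught.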

--debug-model-output is meant for provider/model debugging. It may print snippets of model output derived from your docs into CI logs, so leave it off unless you are diagnosing malformed or missing tool calls.

Providers

| `--provider` | Endpoint | API key env var | Default model |
| --- | --- | --- | --- |
| `anthropic` | api.anthropic.com (native) | `ANTHROPIC_API_KEY` | `claude-haiku-4-5` |
| `openrouter` | openrouter.ai (OpenAI-compatible) | `OPENROUTER_API_KEY` | `poolside/laguna-xs.2:free` |
| `nvidia` | integrate.api.nvidia.com (OpenAI-compatible) | `NVIDIA_API_KEY` | `meta/llama-3.1-70b-instruct` |

Anthropic prompt caching is applied when calling the Anthropic provider directly, and is forwarded as best-effort when routing through OpenRouter to an anthropic/* model. Other provider+model combinations send no cache hints.
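
For illustration, the provider table and the caching rule above could be expressed as data plus two small functions. The env var names and default models come from the table; `PROVIDERS`, `resolve`, and `sends_cache_hints` are hypothetical names, and the real code may be organized differently.

```python
import os

# provider -> (API key env var, default model), copied from the table above.
PROVIDERS = {
    "anthropic": ("ANTHROPIC_API_KEY", "claude-haiku-4-5"),
    "openrouter": ("OPENROUTER_API_KEY", "poolside/laguna-xs.2:free"),
    "nvidia": ("NVIDIA_API_KEY", "meta/llama-3.1-70b-instruct"),
}


def resolve(provider="anthropic", model=None, env=os.environ):
    """Return (api_key, model), falling back to the provider's default model."""
    key_var, default_model = PROVIDERS[provider]
    api_key = env.get(key_var)
    if not api_key:
        raise RuntimeError(f"{key_var} is not set")
    return api_key, model or default_model


def sends_cache_hints(provider, model):
    """True for Anthropic direct, or OpenRouter routed to an anthropic/* model."""
    if provider == "anthropic":
        return True
    return provider == "openrouter" and model.startswith("anthropic/")
```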

Setup

1. Pick a provider from the table above.
2. Generate an API key from that provider's dashboard.
3. Copy .env.example to .env (or export the variable directly) and fill in the matching *_API_KEY.
4. Run docs-ci check ./docs --rules ./examples/rules.example.yaml --provider <name>. Add --model <id> to override the per-provider default.

Examples:

# Anthropic, default model
export ANTHROPIC_API_KEY=sk-ant-...
docs-ci check ./docs --rules ./examples/rules.example.yaml

# OpenRouter, routing to Anthropic Haiku (free tier on some accounts)
export OPENROUTER_API_KEY=sk-or-...
docs-ci check ./docs --rules ./examples/rules.example.yaml \
  --provider openrouter --model anthropic/claude-haiku-4-5

# OpenRouter free router, useful for low-cost dogfooding
docs-ci check ./docs --rules ./examples/rules.example.yaml \
  --provider openrouter --model openrouter/free --retries 3

# NVIDIA build.nvidia.com (free credits / free models on some accounts)
export NVIDIA_API_KEY=nvapi-...
docs-ci check ./docs --rules ./examples/rules.example.yaml \
  --provider nvidia --model meta/llama-3.1-70b-instruct

Tip: OpenRouter and NVIDIA both occasionally offer free access to specific models — handy for trying docs-ci on a real docs set without spending anything. Whatever model you pick must support tool / function calling; docs-ci forces a structured submit_verdict call and will error out otherwise.

Verdict cache

docs-ci keeps a persistent JSON cache at .docs-ci/cache.json keyed on (file_content, criterion, prompt_fingerprint, provider, model). On later runs, any unchanged (file, rule) pair is served from the cache without an LLM call. For a single rule, a typical incremental CI run on a 100-file repo where one file changed drops from ~100 calls to ~1.

The cache invalidates automatically when:

a file's content changes;
a rule's criterion text changes (renaming a rule's id alone does not invalidate);
the provider, model, or internal prompt/tool schema changes.
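
One way to picture these invalidation rules is a content-addressed key. The sketch below hashes the five key components with SHA-256; the hashing scheme and the `cache_key` helper are assumptions for illustration, not the project's actual on-disk format.

```python
import hashlib
import json


def cache_key(file_content, criterion, prompt_fingerprint, provider, model):
    """Derive a stable key from the five components the README lists.

    The rule id is deliberately not an input, so renaming a rule
    leaves the key (and the cached verdict) untouched.
    """
    payload = json.dumps(
        [file_content, criterion, prompt_fingerprint, provider, model],
        ensure_ascii=False,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()
```

Changing any one component produces a different key, so the old entry is simply never looked up again rather than being explicitly deleted.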

Add .docs-ci/ to your .gitignore — the cache is local. In CI, persist it across runs with the standard cache action, e.g. for GitHub Actions:

- uses: actions/cache@v4
  with:
    path: .docs-ci
    key: docs-ci-${{ hashFiles('**/*.md', 'rules.yaml') }}
    restore-keys: docs-ci-

--no-cache disables it entirely for one run; --cache-path relocates the file.

Diff mode

--changed-only skips files that haven't changed since a base git ref. It composes with the verdict cache: the cache makes unchanged files free to re-judge, while diff mode skips them entirely without even reading them. That is useful on huge repos where the per-file cache lookup itself adds up.

# Local: diff against the auto-detected default branch
docs-ci check ./docs --rules ./rules.yaml --changed-only

# CI: diff against the PR's base ref
docs-ci check ./docs --rules ./rules.yaml \
  --changed-only --base-ref origin/main

Behavior notes:

The default base ref is auto-detected via git symbolic-ref refs/remotes/origin/HEAD, falling back to origin/main. Override with --base-ref REF (e.g. origin/master, origin/develop).
Tracked-only — untracked .md files (new files not yet git add-ed) are not included. Run without --changed-only while drafting new docs locally.
If the rules YAML itself has changed since the base ref, docs-ci warns on stderr and continues anyway — re-run without --changed-only for a full check after touching rules.
Requires a git working tree; exits with code 2 if invoked outside one.
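
The base-ref detection described above can be approximated as follows. The `git symbolic-ref` call and the `origin/main` fallback come from this README; the `git diff --name-only` invocation and all function names are assumptions, since the exact commands docs-ci runs are not shown here.

```python
import subprocess


def base_ref_from_symbolic_ref(output, fallback="origin/main"):
    """Turn 'refs/remotes/origin/main' into 'origin/main', else the fallback."""
    prefix = "refs/remotes/"
    ref = output.strip()
    return ref[len(prefix):] if ref.startswith(prefix) else fallback


def detect_base_ref(cwd="."):
    """Ask git where origin/HEAD points; fall back when it is unset."""
    result = subprocess.run(
        ["git", "symbolic-ref", "refs/remotes/origin/HEAD"],
        cwd=cwd, capture_output=True, text=True,
    )
    stdout = result.stdout if result.returncode == 0 else ""
    return base_ref_from_symbolic_ref(stdout)


def changed_markdown(base_ref, cwd="."):
    """List tracked .md files that differ from `base_ref` (assumed invocation)."""
    result = subprocess.run(
        ["git", "diff", "--name-only", base_ref, "--", "*.md"],
        cwd=cwd, capture_output=True, text=True, check=True,
    )
    return [line for line in result.stdout.splitlines() if line]
```

Because `git diff` only reports tracked paths, this sketch shares the tracked-only limitation noted above.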

GitHub Actions

docs-ci ships a thin composite action (action.yml) so consumers can reference it with uses: directly instead of writing the install / setup-python / invoke chain by hand. Linux runners only in v0: no Windows / macOS coverage yet.

Public-repo caveat. docs-ci feeds contributor-supplied markdown to an LLM call signed with your API key. On a public repo: an adversarial PR can burn through provider budget with oversized files (set a spend cap on the key), and a verdict can be flipped via prompt injection in the markdown (treat green as advisory, not a security signal). Don't use pull_request_target to expose the key to fork PRs — it's a known footgun. See PRs from forks below.

Setup on another repo

Add an API key as a repository secret. In the target repo: Settings → Secrets and variables → Actions → New repository secret. Name it after the provider you'll use — ANTHROPIC_API_KEY, OPENROUTER_API_KEY, or NVIDIA_API_KEY (see Providers above). The secret is encrypted, never visible after creation, and auto-redacted from workflow logs.
Commit a rules file. Anywhere in the tree; rules.yaml at the root or .docs-ci/rules.yaml are common choices. See examples/rules.example.yaml for the format.
Commit the workflow file below as .github/workflows/docs-ci.yml.

Example: PR-triggered

name: docs-ci

on:
  pull_request:
    paths:
      - 'docs/**'
      - 'rules.yaml'
      - '.github/workflows/docs-ci.yml'

jobs:
  check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0   # changed-only diffs against the base ref, needs history

      - uses: actions/cache@v4
        with:
          path: .docs-ci
          key: docs-ci-${{ hashFiles('**/*.md', 'rules.yaml') }}
          restore-keys: docs-ci-

      - uses: AvantBras/docs_ci@v1
        with:
          path: ./docs
          rules: ./rules.yaml
          changed-only: true
          base-ref: origin/${{ github.base_ref }}
          api-key: ${{ secrets.ANTHROPIC_API_KEY }}

Notes on the choices:

fetch-depth: 0 is required for changed-only: true; the default shallow checkout breaks git diff.
paths: filter keeps the job from running on PRs that don't touch docs. Adjust to match where your rules and docs actually live.
The actions/cache@v4 step is optional but worthwhile — verdicts persist across runs (see Verdict cache above). The action does not bundle this step itself; cache keys are project-specific.
Pin the action ref (@v1, or a commit SHA) for reproducible runs.

Example: scheduled / nightly run

Drop changed-only and trigger on schedule:. The verdict cache makes a full re-scan cheap — only files that actually changed since the last run hit the LLM.

on:
  schedule:
    - cron: '0 6 * * *'   # 06:00 UTC daily
  workflow_dispatch:        # also run on demand from the Actions tab

jobs:
  check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/cache@v4
        with:
          path: .docs-ci
          key: docs-ci-${{ hashFiles('**/*.md', 'rules.yaml') }}
          restore-keys: docs-ci-
      - uses: AvantBras/docs_ci@v1
        with:
          path: ./docs
          rules: ./rules.yaml
          api-key: ${{ secrets.ANTHROPIC_API_KEY }}

Action inputs

| Input | Default | Notes |
| --- | --- | --- |
| `path` | required | Docs directory to scan (markdown only). |
| `rules` | required | Path to rules YAML. |
| `provider` | `anthropic` | LLM provider: `anthropic`, `openrouter`, or `nvidia`. |
| `model` | (provider default) | Model ID. Falls back to the provider's default if empty. |
| `fail-on` | `error` | Exit 1 on failures at or above this severity. |
| `format` | `github` | Output format. Note: CLI defaults to `text`; the action defaults to `github` since CI is its only surface. |
| `cache` | `true` | Persistent verdict cache. Set `false` to disable. |
| `cache-path` | `.docs-ci/cache.json` | Path to the verdict cache JSON. |
| `retries` | `0` | Retry each transient provider/model failure this many extra times. |
| `retry-delay-seconds` | `2` | Initial delay before retrying a failed verdict call. |
| `retry-max-delay-seconds` | `30` | Maximum delay between verdict call retries. |
| `debug-model-output` | `false` | Print truncated raw model output when a provider response cannot be parsed. Use only for debugging. |
| `changed-only` | `false` | Only judge files changed since `base-ref`. Requires `fetch-depth: 0`. |
| `base-ref` | (auto-detected) | Git ref to diff against in changed-only mode. |
| `api-key` | (env fallback) | Provider API key. Falls back to `*_API_KEY` runner env vars if empty. |
| `python-version` | `3.11` | Python version to install. |

PRs from forks

By default, secrets are not exposed to workflows triggered by pull_request events from forks — so docs-ci will fail for external contributors with a missing-API-key error. Two ways out:

Skip on forks (recommended). Gate the run on if: github.event.pull_request.head.repo.full_name == github.repository. Maintainer PRs are checked; fork PRs are silently no-op'd. Trade-off: external contributors don't see docs-ci feedback until a maintainer pushes their branch.
pull_request_target trigger. Has secret access, but checks out the base repo by default; you'd need to explicitly check out the PR head, at which point a malicious PR can exfiltrate the secret. Only safe behind an approval gate (e.g. require a pull_request_review). See GitHub's pwn-request advisory before going this route.

For most repos, skip-on-fork is the right default.

Annotation output reference

--format github emits GitHub Actions workflow commands so failing rules surface as inline annotations on the PR diff and as entries in the run's Checks panel. Output looks like:

::error file=docs/api.md,line=1,title=docs-ci/has-title::file starts with prose; the heading is on line 4
::warning file=docs/intro.md,line=1,title=docs-ci/no-todos::found "TODO" in prose
1 error, 1 warning across 2 files

Each failing verdict becomes one annotation; passing verdicts are silent.
Annotation title= is namespaced as docs-ci/<rule_id> so it's distinguishable from other tools' annotations.
All annotations land on line=1 in v0 — verdicts are per-file. Per-line attribution would either need the LLM to return a line number (extends the tool schema) or a regex pass over the reason; deferred.
File paths are relativized against $GITHUB_WORKSPACE, falling back to the git working tree, falling back to the current directory.
The grouped per-file text report is suppressed under this format — the GitHub UI groups annotations per-file already, and a duplicate text dump would just clutter logs.
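
A minimal sketch of producing one annotation line in the format shown above. The escaping follows GitHub's documented workflow-command rules, but whether docs-ci escapes values this way is an assumption, and `annotation` is a hypothetical helper name.

```python
def _escape_data(text):
    """Escape the message part of a workflow command."""
    return text.replace("%", "%25").replace("\r", "%0D").replace("\n", "%0A")


def _escape_property(text):
    """Escape a property value; ':' and ',' would otherwise end the value."""
    return _escape_data(text).replace(":", "%3A").replace(",", "%2C")


def annotation(severity, path, rule_id, reason):
    """Format one failing verdict; line is fixed at 1, as in v0."""
    title = _escape_property("docs-ci/" + rule_id)
    props = f"file={_escape_property(path)},line=1,title={title}"
    return f"::{severity} {props}::{_escape_data(reason)}"
```

Multi-line reasons are folded into a single line by the `%0A` escape, which keeps each verdict to exactly one annotation.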

License

MIT — see LICENSE.
