skill-trust

Transparency verifier for Agent Skills — verify that skills declare their behavior honestly.

Don't just scan for threats — demand transparency.

What is skill-trust?

When you install an AI Agent Skill from a public registry, you're trusting code you haven't reviewed. Existing security scanners (Cisco Skill Scanner, Aguara, Snyk Agent Scan, etc.) look for known attack patterns — but they can't tell you whether a skill is being transparent about what it does.

skill-trust takes a fundamentally different approach:

1. Skill authors add a trust block to their SKILL.md, declaring what the skill accesses (network, filesystem, shell, environment) and where data flows.
2. skill-trust verify scans the actual code and compares its behavior against these declarations.
3. Inconsistencies are flagged as broken transparency contracts, not as malware.

Think of it like Android's permission manifest or a package.json engines field — not a virus scanner, but a behavioral contract between skill authors and users.
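
The declare-then-compare idea can be sketched in a few lines of TypeScript. This is a minimal illustration, not skill-trust's actual implementation: the type names (`Capability`, `Declared`, `Observed`, `Mismatch`) and the function `compareDeclaration` are hypothetical, and the scan results are supplied by hand rather than produced by a real scanner.

```typescript
// Hypothetical types: the names below are illustrative, not skill-trust's API.
type Capability = "network" | "filesystem" | "shell" | "environment";

// What the author declared in the trust block (true = the skill uses it).
type Declared = Record<Capability, boolean>;

// What a scan of the code observed (number of matches per capability).
type Observed = Record<Capability, number>;

interface Mismatch {
  capability: Capability;
  message: string;
}

// Report every capability that the code uses but the author did not declare.
function compareDeclaration(declared: Declared, observed: Observed): Mismatch[] {
  const capabilities: Capability[] = ["network", "filesystem", "shell", "environment"];
  const mismatches: Mismatch[] = [];
  for (const capability of capabilities) {
    if (!declared[capability] && observed[capability] > 0) {
      mismatches.push({
        capability,
        message: `${capability}: declared=false but found=${observed[capability]} matches`,
      });
    }
  }
  return mismatches;
}

// Hand-written inputs: network is undeclared but the scan found two matches.
const mismatches = compareDeclaration(
  { network: false, filesystem: true, shell: true, environment: false },
  { network: 2, filesystem: 1, shell: 3, environment: 0 },
);
console.log(mismatches.map((m) => m.message));
```

With these inputs, only the network capability is reported: shell and filesystem usage is declared, and no environment access was observed.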

Why does this matter?

Threat scanners answer: "Is this code doing something known-bad?"

skill-trust answers: "Is this code doing what the author says it does?"

A skill might not contain any malware, yet still:

- Access the network without telling you
- Write files outside its declared scope
- Read environment variables (potential credentials) silently
- Use obfuscation techniques that hide intent

skill-trust catches these transparency gaps — it's a trust layer that works alongside (not instead of) traditional security scanners.

Quick demo

$ npx skill-trust verify ./my-skill

  skill-trust v0.1.0

  ✅ Network:      declared=false  found=0 matches
  ✅ Shell:        declared=true   found=3 matches
  ⚠  Filesystem:  declared="outputs" but writes to "/tmp"
  ✅ Data flow:    no undeclared endpoints
  ✅ Obfuscation:  none detected

  Result: PARTIAL (1 warning)

Quick start

As a CLI tool

# Verify a local skill
npx skill-trust verify ./path/to/skill

# Strict mode — warnings become errors (useful in CI)
npx skill-trust verify ./path/to/skill --strict

# Output as JSON for programmatic use
npx skill-trust verify ./path/to/skill --format json

# Output as SARIF for GitHub Code Scanning
npx skill-trust verify ./path/to/skill --format sarif

# Verify all skills in a monorepo
npx skill-trust verify-all ./path/to/monorepo

# Generate a trust declaration interactively
npx skill-trust init ./my-skill

# Run with external scanner integration
npx skill-trust verify ./path/to/skill --scan cisco,aguara

# Generate a trust badge
npx skill-trust badge ./path/to/skill > badge.svg

# Look up a skill in the registry
npx skill-trust lookup csv-analyzer

As a GitHub Action

# .github/workflows/skill-trust.yml
name: Skill Trust Check
on:
  pull_request:
    paths:
      - '**/SKILL.md'
      - '**/scripts/**'

jobs:
  verify:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: Ryan-focus/skill-trust@v1
        with:
          skill_path: ./my-skill
          strict: false

The action posts a markdown summary to your PR and exports result, warnings, and failures as outputs.

How to declare trust

Add a trust block to your SKILL.md frontmatter:

---
name: pdf-processing
description: Extract text and tables from PDF files.
trust:
  permissions:
    network: false
    filesystem:
      read: true
      write: true
      scope: "outputs"        # "outputs" | "workspace" | "system"
    shell: true
    environment: false
  data-flow:
    exfiltration: none         # "none" | list of declared endpoints
  dependencies:
    runtime:
      - python3
    packages:
      - name: pdfplumber
        registry: pypi
  boundaries:
    - "Does not access files outside the output directory"
    - "Does not transmit any data externally"
---

See the full Trust Declaration Specification for all fields and details.
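
For readers who consume the parsed frontmatter in code, the trust block above might be modeled roughly as follows. This is a sketch under assumptions: the field names mirror the YAML example, but the type names, which fields are optional, and the `checkPermissions` helper are hypothetical and may differ from src/types.ts and from the Trust Declaration Specification.

```typescript
// Hypothetical model of the trust block; field names follow the YAML example.
type FilesystemScope = "outputs" | "workspace" | "system";

interface TrustBlock {
  permissions: {
    network: boolean;
    filesystem: { read: boolean; write: boolean; scope: FilesystemScope };
    shell: boolean;
    environment: boolean;
  };
  "data-flow": { exfiltration: "none" | string[] };
  // Assumption: these two sections are treated as optional in this sketch.
  dependencies?: { runtime?: string[]; packages?: { name: string; registry: string }[] };
  boundaries?: string[];
}

const SCOPES: readonly string[] = ["outputs", "workspace", "system"];

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null && !Array.isArray(value);
}

// Return human-readable problems for the permissions part of a parsed trust block.
function checkPermissions(trust: unknown): string[] {
  if (!isRecord(trust) || !isRecord(trust["permissions"])) {
    return ["trust.permissions is missing or not a mapping"];
  }
  const problems: string[] = [];
  const permissions = trust["permissions"];
  for (const key of ["network", "shell", "environment"]) {
    if (typeof permissions[key] !== "boolean") {
      problems.push(`permissions.${key} must be true or false`);
    }
  }
  const filesystem = permissions["filesystem"];
  if (!isRecord(filesystem)) {
    problems.push("permissions.filesystem must be a mapping");
  } else {
    const scope = filesystem["scope"];
    if (typeof scope !== "string" || SCOPES.indexOf(scope) === -1) {
      problems.push('permissions.filesystem.scope must be "outputs", "workspace", or "system"');
    }
  }
  return problems;
}

// The declaration from the example above, typed against the sketch.
const declared: TrustBlock = {
  permissions: {
    network: false,
    filesystem: { read: true, write: true, scope: "outputs" },
    shell: true,
    environment: false,
  },
  "data-flow": { exfiltration: "none" },
  boundaries: ["Does not transmit any data externally"],
};

// A malformed declaration: wrong type for environment, unknown scope.
const malformed = {
  permissions: {
    network: false,
    filesystem: { read: true, write: true, scope: "tmp" },
    shell: true,
    environment: "no",
  },
};

console.log(checkPermissions(declared));
console.log(checkPermissions(malformed));
```

The first call reports no problems; the second reports the `environment` value and the unknown `scope`.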

Trust levels

Based on verification results, each skill receives a trust level:

Level	Meaning	When
🟢 Verified	All checks pass	Declarations are present and match actual code behavior
🟡 Partial	Warnings exist	Minor inconsistencies detected (e.g., filesystem scope drift)
⚪ Undeclared	No trust block	Skill hasn't opted into transparency (not necessarily malicious)
🔴 Inconsistent	Checks fail	Code directly contradicts declared behavior
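
The table can be read as a simple decision procedure. The sketch below is one plausible reading, not the tool's confirmed logic: the `VerificationSummary` shape and the precedence (a missing trust block first, then errors, then warnings) are assumptions.

```typescript
type TrustLevel = "Verified" | "Partial" | "Undeclared" | "Inconsistent";

// Hypothetical summary of a verification run.
interface VerificationSummary {
  hasTrustBlock: boolean; // false when SKILL.md has no trust field
  errors: number;         // checks that failed outright
  warnings: number;       // minor inconsistencies
}

// Map a summary to a trust level, assuming the precedence described above.
function trustLevel(summary: VerificationSummary): TrustLevel {
  if (!summary.hasTrustBlock) return "Undeclared";
  if (summary.errors > 0) return "Inconsistent";
  if (summary.warnings > 0) return "Partial";
  return "Verified";
}

// The quick demo run: a trust block is present, no errors, one warning.
console.log(trustLevel({ hasTrustBlock: true, errors: 0, warnings: 1 })); // "Partial"
```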

Verification rules

Rule	What it checks	Severity
Network consistency	`network: false` but code contains `curl`, `fetch`, `requests`, `axios`, URLs	Error
Filesystem scope	Declared write scope vs. actual file write paths	Error / Warning
Shell consistency	`shell: false` but code uses `subprocess`, `exec`, `spawn`, `popen`	Error
Environment access	`environment: false` but code reads `process.env`, `os.environ`, `$ENV`	Error
Data flow	`exfiltration: none` but code has undeclared URLs or endpoints	Error
Obfuscation	base64 commands, `eval` with dynamic strings, hex-encoded payloads	Warning
Missing declaration	No `trust` field in SKILL.md frontmatter	Info
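
To make the consistency rules concrete, here is a deliberately naive, table-driven sketch of three of them. The patterns are taken from the table above, but everything else (`PatternRule`, `Finding`, `scan`, the rule ids) is hypothetical. Unlike the project's AST-aware scanning, this version matches plain text line by line, so it would also flag matches inside comments and string literals.

```typescript
type Severity = "error" | "warning";
type Permission = "network" | "shell" | "environment";

// Hypothetical rule shape: a pattern that is only allowed when the
// corresponding permission is declared as true.
interface PatternRule {
  id: string;
  permission: Permission;
  pattern: RegExp;
  severity: Severity;
}

const RULES: PatternRule[] = [
  { id: "network-consistency", permission: "network", pattern: /\b(curl|fetch|requests|axios)\b|https?:\/\//, severity: "error" },
  { id: "shell-consistency", permission: "shell", pattern: /\b(subprocess|exec|spawn|popen)\b/, severity: "error" },
  { id: "environment-access", permission: "environment", pattern: /process\.env|os\.environ|\$ENV/, severity: "error" },
];

interface Finding {
  ruleId: string;
  severity: Severity;
  line: number; // 1-based line number in the scanned source
  text: string;
}

// Flag every line that matches a rule whose permission is declared false.
function scan(source: string, declared: Record<Permission, boolean>): Finding[] {
  const findings: Finding[] = [];
  source.split("\n").forEach((text, index) => {
    for (const rule of RULES) {
      if (!declared[rule.permission] && rule.pattern.test(text)) {
        findings.push({ ruleId: rule.id, severity: rule.severity, line: index + 1, text: text.trim() });
      }
    }
  });
  return findings;
}

// A small Python script that reads an environment variable and runs a command.
const script = [
  "import os, subprocess",
  "token = os.environ['API_TOKEN']",
  "subprocess.run(['ls'])",
].join("\n");

const findings = scan(script, { network: false, shell: true, environment: false });
console.log(findings);
```

Because `shell` is declared `true`, the `subprocess` lines are accepted; the only finding is the undeclared `os.environ` read on line 2.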

How skill-trust fits in the ecosystem

skill-trust is complementary to existing security tools — it addresses a gap that threat scanners don't cover.

	skill-trust	Threat Scanners (Cisco, Aguara, Snyk, etc.)
Approach	Declaration verification	Threat / vulnerability detection
Question answered	"Is this skill honest about what it does?"	"Does this skill contain known-bad patterns?"
Requires author opt-in	Yes — `trust` block in SKILL.md	No
Developer-facing	✅ Self-check in CI before publish	Mostly post-install / post-hoc scanning
Catches	Undeclared network access, hidden file writes, scope violations, obfuscation	Prompt injection, malware payloads, CVEs, supply chain attacks
Best used	By skill authors during development	By users / security teams after install

Recommended workflow: Use skill-trust during development to ensure your skill is transparent, then pass it through a threat scanner (Cisco Skill Scanner, Aguara, Snyk Agent Scan) before publishing.

Output formats

Format	Use case	Flag
Terminal	Human-readable with colors	`--format terminal` (default)
JSON	Programmatic processing, CI pipelines	`--format json`
SARIF	GitHub Code Scanning integration	`--format sarif`
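
For orientation, the sketch below shows how findings could be mapped onto a minimal SARIF 2.1.0 log. The `ReportFinding` shape, the rule id, and the file path are hypothetical, and the report that skill-trust actually emits may contain more fields; only the SARIF property names (`version`, `runs`, `tool.driver`, `results`, `level`, `locations`) come from the SARIF format itself.

```typescript
// Hypothetical internal finding; not skill-trust's actual data model.
interface ReportFinding {
  ruleId: string;
  severity: "error" | "warning" | "info";
  message: string;
  file: string;
  line: number;
}

// SARIF result levels are "error", "warning", and "note" (plus "none").
function toSarifLevel(severity: ReportFinding["severity"]): "error" | "warning" | "note" {
  return severity === "info" ? "note" : severity;
}

// Build a minimal SARIF 2.1.0 log with one run and one result per finding.
function toSarif(findings: ReportFinding[], toolVersion: string) {
  return {
    version: "2.1.0",
    runs: [
      {
        tool: { driver: { name: "skill-trust", version: toolVersion } },
        results: findings.map((finding) => ({
          ruleId: finding.ruleId,
          level: toSarifLevel(finding.severity),
          message: { text: finding.message },
          locations: [
            {
              physicalLocation: {
                artifactLocation: { uri: finding.file },
                region: { startLine: finding.line },
              },
            },
          ],
        })),
      },
    ],
  };
}

// Example based on the quick demo's filesystem warning (path and line are made up).
const sarifLog = toSarif(
  [
    {
      ruleId: "filesystem-scope",
      severity: "warning",
      message: 'declared="outputs" but writes to "/tmp"',
      file: "scripts/process.py",
      line: 12,
    },
  ],
  "0.1.0",
);
console.log(JSON.stringify(sarifLog, null, 2));
```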

Project structure

skill-trust/
├── src/
│   ├── cli.ts                # CLI entry point (verify, verify-all, init, badge, lookup)
│   ├── parser.ts             # SKILL.md frontmatter parser & validator
│   ├── verifier.ts           # Core verification engine
│   ├── reporter.ts           # Output formatting (terminal, JSON, SARIF)
│   ├── badge.ts              # Trust badge SVG generator
│   ├── registry.ts           # Agent Skills registry integration
│   ├── monorepo.ts           # Multi-skill discovery & batch verification
│   ├── wizard.ts             # Interactive trust declaration generator
│   ├── types.ts              # TypeScript type definitions
│   ├── index.ts              # Public API exports
│   ├── ast/                  # AST-based code analysis
│   │   ├── context.ts        # Strips comments/strings for accurate scanning
│   │   └── analyzer.ts       # Extracts imports, function calls
│   ├── integrations/         # External scanner adapters
│   │   ├── cisco.ts          # Cisco Skill Scanner integration
│   │   ├── aguara.ts         # Aguara integration
│   │   └── combined-report.ts  # Merged trust + threat reports
│   ├── action/
│   │   └── index.ts          # GitHub Action entry point
│   └── rules/                # Verification rule implementations
│       ├── utils.ts           # AST-aware pattern scanning utilities
│       ├── network.ts        # Network access consistency
│       ├── filesystem.ts     # File write scope validation
│       ├── shell.ts          # Shell execution consistency
│       ├── environment.ts    # Environment variable access
│       ├── data-flow.ts      # Data exfiltration endpoints
│       └── obfuscation.ts    # Obfuscation technique detection
├── tests/                    # Vitest test suite
├── examples/
│   ├── trusted-skill/        # Example: CSV analyzer with proper declarations
│   └── untrusted-skill/      # Example: text formatter with intentional violations
├── CLAUDE.md                 # AI agent guide (Claude Code)
├── .cursorrules              # AI agent guide (Cursor)
├── .github/AGENTS.md         # AI agent guide (GitHub Copilot)
├── action.yml                # GitHub Action metadata
├── TRUST-SPEC.md             # Trust declaration specification (v0.1.0)
├── CONTRIBUTING.md           # Contribution guidelines
└── package.json

Roadmap

For AI Agents — Fork & Extend

This repo is designed to be immediately productive for AI agents:

File	Agent	Purpose
`CLAUDE.md`	Claude Code	Full architecture guide, extension patterns, conventions
`.cursorrules`	Cursor	Project rules, tech stack, do/don't
`.github/AGENTS.md`	GitHub Copilot	Build/test/extend instructions

Why agents love this repo:

- Strict TypeScript: src/types.ts defines the entire data model, so agents can work from explicit types rather than infer intent from comments
- Modular rules: each rule is an independent module implementing the Rule interface, so rules can be swapped or added without touching other code
- Adapter pattern: integrations follow a consistent pattern (see src/integrations/cisco.ts), so agents can add new scanners by copying an existing adapter
- Comprehensive tests: 125+ tests with helpers (makeSkill(), baseTrust()) that agents can reuse immediately

Contributing

Contributions are welcome! See CONTRIBUTING.md for guidelines.

License

MIT — see LICENSE for details.

Built by Raven — an independent developer who believes Agent Skills deserve the same transparency standards as the code we already write.
