Skip to content

richfrem/agent-plugins-skills

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1,205 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Universal Agent Plugins & Skills Ecosystem

123 skills · 23 plugins — a self-improving, cross-platform library of reusable AI agent capabilities for Claude Code, GitHub Copilot, Gemini CLI, and any compliant agent framework.


Platforms

A strictly cross-platform (Windows, Mac, Ubuntu) library — the universal upstream source for reusable AI agent plugins and skills across multiple IDEs and agent frameworks: Claude Code, GitHub Copilot, Gemini CLI, Antigravity, Roo Code, Windsurf, Cursor, and other compliant integrations.

All plugins deploy to the single .agents/ folder standard — no duplicate copies needed for .github, .gemini, .agent, etc.


Installation

Important

Start here — fresh clone or first-time setup. The single .agents/ environment directory is not committed to your repo. It will be empty by default.

All installation methods (uvx, bootstrap.py, npx skills, and Marketplace / Extension CLI) are now consolidated in a single authoritative guide:

Quick install (all plugins):

uvx --from git+https://github.com/richfrem/agent-plugins-skills plugin-add richfrem/agent-plugins-skills

Core Philosophy: Transitional Architectures & Decoupled Skills

This repository is built on a pragmatic acceptance of the current AI engineering landscape: the ecosystem changes weekly, and workflows that were revolutionary six months ago are obsolete today.

Frameworks like agent-agentic-os and spec-kitty are treated as Transitional Architectures — bridges between what agents need to do today and what native SDKs will eventually handle. When Anthropic, Google, and GitHub harden native memory persistence, execution safety, and multi-agent orchestration, large swaths of this tooling will be happily discarded.

Skills are Applications; the SDK is the OS. Individual skills must function in complete isolation — no hard dependencies on sibling plugins, no assumptions about which framework is running.


Architecture

Pillar 1: The Improvement OS (agent-agentic-os)

The OS implements an eval-gated improvement pipeline for autonomous skill evolution:

os-architect           ← intent classifier + ecosystem router
    ↓
os-improvement-loop    ← learning engine: orchestrates multi-iteration improvement
    ↓
os-eval-runner         ← inner gate: KEEP/DISCARD per iteration (evaluate.py)
    ↓
os-eval-backport       ← human gate: review before lab winner → production
    ↓
os-experiment-log      ← scientific backbone: longitudinal tracking + synthesis

Entry point: /os-architect — describe what you want in plain language. The agent classifies intent, audits the ecosystem, proposes Path A/B/C, and dispatches via your available CLI tools. os-evolution-planner writes the task plan + delegation prompt. os-architect-tester validates after any changes.

Karpathy Autoresearch Loop

Skills that score HIGH on the autoresearch viability rubric (objectivity + speed + frequency + utility) can run fully autonomous self-improvement loops:

mutate SKILL.md → evaluate.py → exit 0 (KEEP) or exit 1 (DISCARD) → repeat

Not all skills are good candidates — use eval-autoresearch-fit to score a skill before running a loop.

Live example — convert-mermaid skill, 26 iterations across 2 rounds: 0.61 → 1.00

convert-mermaid eval progress

Each blue diamond is a baseline anchor (one per session). Green = new best score. Amber = kept but not a record. The two-segment shape shows a fresh re-baseline for round 2.

Monitor a live run: python plugins/agent-agentic-os/scripts/plot_eval_progress.py --tsv <lab>/evals/ --live

Flywheel layers:

  • OUTER flywheel (os-improvement-loop): improves OS-level protocols and session ledgers between sessions
  • INNER flywheel (os-eval-runner): evaluate.py KEEP/DISCARD gate per iteration within a session

Pillar 2: Execution Patterns (agent-loops)

5 composable primitives used as the execution substrate by the Improvement OS and standalone by any agent workflow:

learning-loop · dual-loop · agent-swarm · red-team-review · triple-loop-learning

Pillar 3: Super-RAG 3-Tier Retrieval

O(1) RLM keyword → O(log N) vector semantic → wiki concept nodes.

Super-RAG stack: rlm-factory (O(1) keyword) + vector-db (O(log N) semantic) + obsidian-wiki-engine (full concept nodes)

Each plugin works standalone (Mode A) or combined for full Super-RAG power. Init agents detect what is installed in .agents/skills/ and configure only the available layers.

Hub-and-Spoke ADR

All shared scripts live once at plugins/<plugin>/scripts/. Skills reference them via file-level symlinks (skills/<skill>/scripts/script.py → ../../../scripts/script.py). Directory-level symlinks are forbidden — npx drops them on install.


Plugin Ecosystem (23 plugins · 123 skills)

Group 1: The Improvement OS

agent-agentic-os — Continuous Self-Improvement

The flagship operational framework. Eval-gated improvement loops, memory management, session lifecycle, and ecosystem evolution orchestration.

Skills (16): os-architect · os-evolution-planner · os-guide · os-improvement-loop · os-eval-lab-setup · os-eval-runner · os-eval-backport · os-environment-probe · os-evolution-verifier · os-experiment-log · os-memory-manager · os-improvement-report · os-init · os-clean-locks · todo-check · optimize-agent-instructions

Agents: os-architect-agent · os-architect-tester-agent · improvement-intake-agent · os-health-check · agentic-os-setup


Group 2: Engineering Workflows

spec-kitty-plugin — Spec-Driven Development

Enterprise-grade Spec → Plan → Tasks → Implement → Review → Merge pipeline.

Skills (19): spec-kitty-specify · spec-kitty-plan · spec-kitty-tasks · spec-kitty-implement · spec-kitty-review · spec-kitty-merge · spec-kitty-analyze · spec-kitty-accept · spec-kitty-clarify · spec-kitty-research · spec-kitty-dashboard · spec-kitty-status · spec-kitty-checklist · spec-kitty-constitution · spec-kitty-tasks-outline · spec-kitty-tasks-finalize · spec-kitty-tasks-packages · spec-kitty-workflow · spec-kitty-sync-plugin

Agents: spec-kitty-agent · spec-kitty-setup

exploration-cycle-plugin — Discovery & Requirements

Autonomous discovery loop: idea framing → business requirements → user stories → prototype → handoff into formal engineering specs.

Skills (11): exploration-workflow · exploration-session-brief · discovery-planning · business-requirements-capture · business-workflow-doc · user-story-capture · exploration-handoff · exploration-optimizer · prototype-builder · visual-companion · subagent-driven-prototyping

Agents (11): business-rule-audit-agent · discovery-planning-agent · exploration-cycle-orchestrator-agent · handoff-preparer-agent · intake-agent · planning-doc-agent · problem-framing-agent · prototype-builder-agent · prototype-companion-agent · requirements-doc-agent · requirements-scribe-agent


Group 3: Execution Patterns

agent-loops — Composable Loop Primitives

5 execution primitives used as the substrate for the Improvement OS and standalone agent workflows.

Skills (6): orchestrator · learning-loop · dual-loop · agent-swarm · red-team-review · triple-loop-learning

Agents: orchestrator


Group 4: Code Quality & Safety

coding-conventions — Standards Enforcement (34/40 HIGH)

Centralized rules engine for file headers, naming conventions, and linting across Python, TypeScript, and C#.

Skills (1): coding-conventions-agent · Agent: coding-conventions-agent

agent-scaffolders — Boilerplate & Audit (25 skills)

Interactive creators for exact file hierarchies + structured audit framework for plugin architectural maturity.

Scaffolding skills: create-plugin · create-skill · create-sub-agent · create-command · create-hook · create-github-action · create-agentic-workflow · create-azure-agent · create-docker-skill · create-mcp-integration · create-stateful-skill

Audit & analysis skills: audit-plugin · audit-plugin-l5 · l5-red-team-auditor · analyze-plugin · self-audit · mine-skill · mine-plugins · path-reference-auditor · fix-plugin-paths · synthesize-learnings · eval-autoresearch-fit · manage-marketplace · ecosystem-standards · ecosystem-authoritative-sources

link-checker — Documentation Hygiene

Continuous markdown hyperlink validation with multi-stage pipeline (inventory → extract → audit → fix).

Skills (2): link-checker-agent · symlink-manager · Agent: link-checker-agent


Group 5: CLI Sub-Agents

Dispatch specialized analysis to isolated fresh model contexts via CLI tools (security audits, architecture review, QA).

claude-cli

Skills (3): claude-cli-agent · claude-project-setup · optimize-context

Agents (3): architect-review · refactor-expert · security-auditor

copilot-cli

Skills (1): copilot-cli-agent — GPT-5 mini via Copilot CLI; used in autoresearch mutation delegation

Agents (3): architect-review · refactor-expert · security-auditor

gemini-cli

Skills (2): gemini-cli-agent · antigravity-project-setup

Agents (3): architect-review · refactor-expert · security-auditor

Execution Disciplines — Safety & Quality

Behavioural guardrails enforcing best practices on every coding session. These skills come from obra/superpowers — install that plugin to get them. This ecosystem builds on superpowers rather than duplicating it.

Install: uvx --from git+https://github.com/richfrem/agent-plugins-skills plugin-add obra/superpowers

Skills available via superpowers: verification-before-completion · test-driven-development · using-git-worktrees · systematic-debugging · finishing-a-development-branch · requesting-code-review


Group 6: Knowledge & Memory

obsidian-wiki-engine — Karpathy LLM Wiki + Super-RAG

Karpathy-style LLM wiki with cross-source concept synthesis. Transforms raw markdown sources into structured, queryable concept nodes. Full Obsidian vault CRUD, canvas, and graph traversal.

Wiki skills: obsidian-wiki-builder · obsidian-rlm-distiller · obsidian-query-agent · obsidian-wiki-linter

Vault skills: obsidian-init · obsidian-vault-crud · obsidian-canvas-architect · obsidian-graph-traversal · obsidian-markdown-mastery · obsidian-bases-manager

Setup agents: wiki-init-agent · wiki-build-agent · wiki-distill-agent · wiki-lint-agent · wiki-query-agent · super-rag-setup-agent

rlm-factory — O(1) Keyword Search

Dense per-file summaries with zero external dependencies. Works standalone or as Phase 1 of the Super-RAG stack.

Skills (6): rlm-init · rlm-curator · rlm-search · rlm-distill-agent · rlm-cleanup-agent · rlm-audit

Setup agent: rlm-factory-init-agent (guided setup, Modes A–D)

vector-db — Semantic Search

ChromaDB-driven semantic embedding indexing with Parent-Child retrieval. Supports In-Process mode (zero server setup) and HTTP Server mode. Works standalone or as Phase 2 of the Super-RAG stack.

Skills (6): vector-db-init · vector-db-launch · vector-db-ingest · vector-db-search · vector-db-cleanup · vector-db-audit

Setup agent: vector-db-init-agent — guided wizard (Modes A–D)

memory-management — Multi-Tiered Cognition

Multi-tiered cognition and context caching between long-term persistent storage and active memory.

Skills (1): memory-management


Group 7: Infrastructure & Utilities

plugin-manager — Ecosystem Sync

Skills (3): plugin-installer · plugin-remover · plugin-syncer

task-manager

Skills (1): task-agent

dependency-management — pip-compile Workflows

Cross-platform pip-compile with strict .in.txt lockfile discipline.

Skills (1): dependency-management

context-bundler — Context Packaging

Package deep directory contexts and code traces into single payloads for external LLM review.

Skills (2): context-bundler · red-team-bundler

mermaid-to-png

Skills (1): convert-mermaid (autoresearch score: 30/40 MEDIUM — eval run complete: 0.61 → 1.00 in 26 iterations)

adr-manager — Architecture Decision Records

Skills (1): adr-management

huggingface-utils — HuggingFace Lifecycle

Skills (2): hf-init · hf-upload

voice-writer — Tone & Humanization

Skills (1): humanize

rsvp-speed-reader

Token-stream speed reading with pause/resume, comprehension check-ins, and session management.

Skills (2): rsvp-reading · rsvp-comprehension-agent


Completed Experiments

Ecosystem Fitness Sweep v1 — COMPLETE (temp/ecosystem-fitness-sweep-v1/)

Scored all 116/120 production skills for Karpathy autoresearch loop viability using GPT-5 mini via Copilot CLI. Each skill scored on: objectivity (can a shell command measure it?), execution speed, frequency of use, and potential utility (max 40).

Top HIGH candidates:

Rank Skill Score Loop
1 superpowers/verification-before-completion 35/40 LLM_IN_LOOP
2 superpowers/test-driven-development 35/40 LLM_IN_LOOP
3 coding-conventions/coding-conventions-agent 34/40 HYBRID
4 superpowers/using-git-worktrees 33/40 DETERMINISTIC
5 spec-kitty-plugin/spec-kitty-status 33/40 DETERMINISTIC
6 agent-agentic-os/os-eval-runner 32/40 DETERMINISTIC

Full ranked results: summary-ranked-skills.json Top 20 opportunities with metrics + blockers: autoresearch-opportunities-report.md

Regenerate report:

python plugin-research/experiments/analyze-candidates-for-auto-reseaarch/skills/eval-autoresearch-fit/scripts/update_ranked_skills.py \
  --json-path plugin-research/experiments/analyze-candidates-for-auto-reseaarch/skills/eval-autoresearch-fit/assets/resources/summary-ranked-skills.json \
  --morning-report

Repository Structure

plugins/                    ← upstream source (23 plugins, 123 skills)
  <plugin>/
    plugin.json
    skills/<skill>/
      SKILL.md              ← skill definition (mutation target for autoresearch loops)
      evals/evals.json      ← routing evaluation suite (should_trigger boolean schema)
      evals/results.tsv     ← per-experiment score history
      autoresearch/         ← optional: evaluate.py + golden task set for improvement loops
      scripts/              ← file-level symlinks → ../../scripts/
    scripts/                ← canonical scripts (shared via symlinks, never duplicated)
    agents/                 ← sub-agent .md definitions
    assets/diagrams/        ← architecture diagrams

.agents/                    ← deployed skill copies (bridge installer output)
  skills/
  agents/

plugin-research/            ← experiments and autoresearch infrastructure
  experiments/
    analyze-candidates-for-auto-reseaarch/

temp/                       ← local scratch (gitignored except scripts)
  ecosystem-fitness-sweep-v1/

123 skills · 23 plugins · Improvement OS (os-architect) · Karpathy autoresearch loops · Super-RAG 3-tier retrieval