Skip to content

[Skill] summarize-doc command for LLM-friendly document cleanup #27

@anderskev

Description

@anderskev

Technology or framework

Document summarization/cleanup for LLM agent consumption.

Why is this needed?

Long-form documentation (whitepapers, guides, API docs) often contains noise that wastes LLM context:

  • Author credits, acknowledgements, ToCs
  • Marketing language, vendor-specific branding
  • Redundant explanations, verbose walkthroughs
  • Image placeholders, figure references

A command to distill these to actionable knowledge would make documents usable as skill references or agent context.

Key knowledge areas

  1. What to remove (credits, ToC, marketing, vendor names, verbose examples, citations, redundancy)
  2. What to retain (core concepts, architectural patterns, best practices, design patterns, security, evaluation approaches, code snippets)
  3. Formatting standards (hierarchical headers, bullet points, bold key terms, self-contained sections)
  4. Target 80-90% reduction while preserving actionable knowledge

Official documentation

N/A - this is a utility command.

Example use cases

  1. Condensing a 50-page whitepaper on AI agent architecture into a ~5-page skill reference
  2. Preparing vendor documentation (removing "Google Cloud", "AWS" branding) for generic agent design guidance
  3. Creating concise summaries of research papers for context-limited LLM interactions

Would you contribute this skill?

  • Yes, I can write this skill
  • Yes, I can help review
  • No, just requesting

Command Prompt

Clean up this document for use by LLM coding agents designing and implementing AI agents.

**Remove:**
- Author credits, acknowledgements, contributor lists
- Table of contents
- Image placeholders (<!-- image -->), figure references without content
- Marketing/promotional language
- Vendor-specific product names (keep the concepts, remove "Google Cloud", "Vertex AI", etc.)
- Verbose examples and walkthroughs (distill to the pattern)
- Endnotes, references, citations
- Redundant explanations that repeat the same concept
- Enterprise infrastructure specifics not relevant to agent design

**Retain (reorganize for clarity):**
- Core concepts and definitions
- Architectural patterns and components
- Implementation principles and best practices
- Design patterns with clear when-to-use guidance
- Security considerations
- Evaluation and debugging approaches
- Code snippets or pseudocode that illustrate patterns

**Format:**
- Use hierarchical markdown headers
- Bullet points for lists of options/choices
- Bold for key terms on first use
- Keep sections self-contained and scannable
- Target ~80-90% reduction in length while preserving all actionable knowledge

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions