Skip to content

[Wishlist] doc2video #208

@lotus2742

Description

@lotus2742

Type

Developer Tool / Codebase

Software / Service Name

doc2video

Link

https://github.com/HKUDS/CLI-Anything

Brief use-case

Convert documents (text, PDF, Markdown) to narrated MP4 videos.
Pipeline: parse doc → extract key points → TTS voiceover → render frames → ffmpeg merge.
AI agents can produce shareable video summaries from any document automatically.

Why should CLI-Anything support this?

"Doc to video" is a high-demand agentic workflow with no existing CLI-Anything harness.
The pipeline is fully automatable with open-source tools (edge-tts, Pillow, ffmpeg),
making implementation straightforward. Perfect fit for agent-native CLI design.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions