This directory contains build tools and the pipeline visualizer for Agent Mart.
The main entry point for running the ETL pipeline. Executes all pipeline steps in sequence with progress callbacks.

`npm run pipeline`

Features:
- Runs pipeline steps 01-08 in order
- Reports timing for each step
- Handles errors gracefully
- Supports the `REPO_LIMIT` environment variable for testing
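
The behaviour listed above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the actual runner: `runPipeline`, the `{ name, run }` step objects, and the shape of the `onProgress` events are hypothetical names chosen for the example, and stopping at the first failed step is an assumed policy.

```javascript
// Hypothetical sketch: run steps in order, time each one, and report progress.
// `steps` is an array of { name, run } objects, where run() may return a promise.
async function runPipeline(steps, onProgress = () => {}) {
  const results = [];
  for (const step of steps) {
    onProgress({ name: step.name, status: 'running' });
    const start = Date.now();
    try {
      await step.run();
      const event = { name: step.name, status: 'complete', durationMs: Date.now() - start };
      results.push(event);
      onProgress(event);
    } catch (error) {
      // Record the failure instead of crashing, then stop.
      // Assumption: later steps depend on the output of earlier ones.
      const message = error instanceof Error ? error.message : String(error);
      const event = { name: step.name, status: 'failed', durationMs: Date.now() - start, error: message };
      results.push(event);
      onProgress(event);
      break;
    }
  }
  return results;
}
```

A caller would pass the steps in the desired order and a callback that logs or renders each event. The returned array holds one entry per step that was attempted, so skipped steps are simply absent.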
The etl-visualizer/ directory contains tools for real-time pipeline monitoring.
For local development, use the visualizer to monitor pipeline progress in real-time:

`npm run pipeline:dev`

This opens a browser with a live-updating HTML report showing:
- Stage progress with timing
- Metrics before/after each stage
- Validation errors
- Data previews
The report is saved to scripts/etl-visualizer/output/pipeline-status.html.
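
To illustrate how a report like this might be produced, here is a minimal sketch that renders stage data into a self-refreshing HTML page. It is not the visualizer's actual code: `renderStatusHtml`, `escapeHtml`, the stage object fields, and the use of a `<meta http-equiv="refresh">` tag for live updates are all assumptions made for the example.

```javascript
// Hypothetical sketch: build a self-refreshing status page from stage data.

// Escape text so stage names or statuses cannot inject markup.
function escapeHtml(text) {
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;');
}

// `stages` is an array of { name, status, durationMs } objects (assumed shape).
function renderStatusHtml(stages, refreshSeconds = 2) {
  const rows = stages
    .map((stage) => {
      // Stages that have not finished yet have no duration to show.
      const duration =
        stage.durationMs == null ? '' : `${(stage.durationMs / 1000).toFixed(1)}s`;
      return (
        `<tr><td>${escapeHtml(stage.name)}</td>` +
        `<td>${escapeHtml(stage.status)}</td><td>${duration}</td></tr>`
      );
    })
    .join('\n');

  return [
    '<!DOCTYPE html>',
    '<html><head><meta charset="utf-8">',
    // Assumption: the browser reloads the file on a timer to pick up changes.
    `<meta http-equiv="refresh" content="${Number(refreshSeconds)}">`,
    '<title>Pipeline status</title></head><body>',
    '<table><tr><th>Stage</th><th>Status</th><th>Duration</th></tr>',
    rows,
    '</table></body></html>',
  ].join('\n');
}
```

In a setup like this, the string could be rewritten to the report file with `fs.writeFileSync` each time a stage reports progress, so the open browser tab shows the latest state on its next reload.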
| File | Description |
|---|---|
| `run.js` | Runs pipeline with visualization callbacks |
| `md-to-html.js` | Converts markdown reports to HTML |
| `output/` | Generated HTML reports |
┌─────────────────────────────────────────┐
│ Stage: 02-fetch-repos │
│ Status: Complete │
│ Duration: 2.3s │
│ Repos fetched: 45 │
│ Cache hits: 12 │
└─────────────────────────────────────────┘
| Variable | Description |
|---|---|
| `GITHUB_TOKEN` | Required for pipeline execution |
| `REPO_LIMIT` | Limit repos for testing (e.g., `REPO_LIMIT=3`) |
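
As an illustration of how a script might consume these variables, the sketch below validates them up front. The `readPipelineEnv` helper is hypothetical and not part of this directory. It assumes that an unset `REPO_LIMIT` means "no limit" and that only positive integers are valid.

```javascript
// Hypothetical helper: read and validate pipeline settings from an env object.
function readPipelineEnv(env = process.env) {
  const token = env.GITHUB_TOKEN;
  if (!token) {
    throw new Error('GITHUB_TOKEN is required for pipeline execution');
  }

  // REPO_LIMIT is optional; null means "process every repo" (assumption).
  let repoLimit = null;
  if (env.REPO_LIMIT !== undefined && env.REPO_LIMIT !== '') {
    repoLimit = Number(env.REPO_LIMIT);
    if (!Number.isInteger(repoLimit) || repoLimit <= 0) {
      throw new Error(`REPO_LIMIT must be a positive integer, got "${env.REPO_LIMIT}"`);
    }
  }

  return { token, repoLimit };
}
```

In a POSIX shell, the limit can be set for a single run by prefixing the command, for example `REPO_LIMIT=3 npm run pipeline`.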