Skip to content

Latest commit

 

History

History
72 lines (50 loc) · 1.91 KB

File metadata and controls

72 lines (50 loc) · 1.91 KB

Build Scripts

Back to main README

This directory contains build tools and the pipeline visualizer for Agent Mart.

Scripts

build.js - Pipeline Orchestrator

The main entry point for running the ETL pipeline. Executes all pipeline steps in sequence with progress callbacks.

npm run pipeline

Features:

  • Runs pipeline steps 01-08 in order
  • Reports timing for each step
  • Handles errors gracefully
  • Supports REPO_LIMIT environment variable for testing

ETL Visualizer

The etl-visualizer/ directory contains tools for real-time pipeline monitoring.

Using the Visualizer

For local development, use the visualizer to monitor pipeline progress in real-time:

npm run pipeline:dev

This opens a browser with a live-updating HTML report showing:

  • Stage progress with timing
  • Metrics before/after each stage
  • Validation errors
  • Data previews

The report is saved to scripts/etl-visualizer/output/pipeline-status.html.

Visualizer Components

File Description
run.js Runs pipeline with visualization callbacks
md-to-html.js Converts markdown reports to HTML
output/ Generated HTML reports

Example Output

┌─────────────────────────────────────────┐
│ Stage: 02-fetch-repos                   │
│ Status: Complete                        │
│ Duration: 2.3s                          │
│ Repos fetched: 45                       │
│ Cache hits: 12                          │
└─────────────────────────────────────────┘

Environment Variables

Variable Description
GITHUB_TOKEN Required for pipeline execution
REPO_LIMIT Limit repos for testing (e.g., REPO_LIMIT=3)