Skip to content

Add templates to flow#405

Open
Edwardvaneechoud wants to merge 3 commits intomainfrom
claude/flow-templates-adoption-yDLSl
Open

Add templates to flow#405
Edwardvaneechoud wants to merge 3 commits intomainfrom
claude/flow-templates-adoption-yDLSl

Conversation

@Edwardvaneechoud
Copy link
Copy Markdown
Owner

This pull request introduces several new data flow templates and a sample dataset to support data cleaning, enrichment, and joining operations. These templates are designed to demonstrate common data engineering tasks such as deduplication, enrichment, fuzzy matching, and directory cleanup using CSV files as input.

The most important changes are:

New Data Flow Templates:

  • Added employee_directory_cleanup.yaml template, which reads employee data, creates a full name column, filters to active employees, and sorts by department and salary.
  • Added customer_360.yaml template to join customers, orders, and support tickets, aggregate metrics, compute a customer score, and sort results.
  • Added order_enrichment.yaml template to join orders with product details, calculate total price, and select relevant columns.
  • Added customer_deduplication.yaml template to deduplicate customers by email and clean up column names.
  • Added product_fuzzy_match.yaml template to perform fuzzy matching between internal and supplier product catalogs and filter matches by score.

Sample Data:

  • Added a sample employees.csv dataset with fields such as emp_id, first_name, last_name, department, status, and salary, which is used by the new employee directory cleanup flow.

claude added 2 commits April 5, 2026 07:28
Introduces a curated gallery of 8 pre-built flow templates that showcase
Flowfile's ETL capabilities. Templates use read nodes pointing to CSV files
downloaded from GitHub and cached locally in ~/.flowfile/template_data/.

Backend:
- Template definitions with 8 flows (Beginner/Intermediate/Advanced)
- Data downloader that fetches CSVs from GitHub on first use
- GET /templates/ and POST /templates/create_from_template/ endpoints
- template_data_directory added to FlowfileStorage

Frontend:
- TemplatesView with category filtering and template cards
- Templates API service
- Sidebar navigation entry and designer empty state integration
- Transparency notice about data download from GitHub

Templates: Sales Overview, Customer Dedup, Employee Cleanup, Order Join,
Survey Pivot, Web Analytics Funnel, Customer 360, Fuzzy Match

https://claude.ai/code/session_01LPWpAV5Xqk9ZJUTSPQytaZ
Templates are now defined as YAML files in data/templates/flows/ instead of
being constructed programmatically in Python. This is cleaner, easier to
maintain, and consistent with how flows are stored throughout the project.

- 8 flow YAML files with embedded _template_meta and _required_csv_files
- template_definitions.py now loads and parses YAMLs with lazy registry
- data_downloader.py supports downloading flow YAMLs for PyPI installs
- New GET /templates/ensure_available/ endpoint for production environments
- Frontend calls ensureAvailable() before listing templates

https://claude.ai/code/session_01LPWpAV5Xqk9ZJUTSPQytaZ
@netlify
Copy link
Copy Markdown

netlify bot commented Apr 8, 2026

Deploy Preview for flowfile-wasm canceled.

Name Link
🔨 Latest commit e429f56
🔍 Latest deploy log https://app.netlify.com/projects/flowfile-wasm/deploys/69d697d00ab14f0008eece5b

@Edwardvaneechoud Edwardvaneechoud changed the title Claude/flow templates adoption y dl sl Add templates to flow Apr 8, 2026
Resolve conflict in routes.py: keep both new db_schemas/db_tables
endpoints from main and template endpoints from this branch.

https://claude.ai/code/session_01LPWpAV5Xqk9ZJUTSPQytaZ
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants