🎼 Figaro

An intelligent federated learning platform with AI-driven experiment orchestration

🎼 Figaro turns natural-language experiment descriptions into reproducible federated learning runs. Describe what you want to compare, and the agent expands, schedules, and summarizes the results for you.

⚡ Built on FastAPI + LangGraph + PyTorch, with a React UI for live progress and comparison views.

📢 News

2026-04-14 🚦 CI is live — GitHub Actions now runs pytest on the backend and tsc + vite build on the frontend for every PR.
2026-04-14 🧹 Major refactor: dropped the legacy attack/defense modules, renamed the project to Figaro, tightened reproducibility, and cleaned up log handling.
2026-04-10 🎉 Figaro v0 — first public cut of the intelligent federated learning platform with the LangGraph-based experiment agent.

✨ Key Features

🤖 Intelligent Experiment Agent — Describe experiments in natural language. The LangGraph-based agent plans, expands, launches, and summarizes runs automatically.

🖥️ Simulation Mode — Single-machine FL with CIFAR-10 / MNIST, Dirichlet non-IID partitioning, CNN / ResNet, FedAvg, CKKS homomorphic encryption, and Top-K sparsification.

🌐 Distributed Mode — Multi-node training over gRPC with a Server / Worker architecture and Docker one-click deployment.

📊 Full-Stack — FastAPI + PostgreSQL + SQLAlchemy backend, React + Vite + TypeScript frontend, and an Agent tab that shows experiment input, real-time progress, and comparison results.

🔁 Reproducibility First — Config-driven runs, pinned seeds, and stored artifacts so every experiment can be replayed.

🎯 What the Agent Can Do

"Compare CIFAR-10 non-IID with alpha=0.1, 0.3, 0.5"
→ Agent expands to 3 experiments → runs sequentially → outputs comparison table

"Compare 10 clients vs 20 clients with 5/10/20 rounds"
→ Agent expands to 2×3 = 6 experiments → Cartesian product → matrix view

🏗️ Architecture

┌──────────────────────────────────────────────────────────────┐
│                     React + Vite Frontend                    │
│              (Agent tab · progress · comparison)             │
└───────────────────────────┬──────────────────────────────────┘
                            │ REST / SSE
┌───────────────────────────▼──────────────────────────────────┐
│                      FastAPI Backend                         │
│   ┌────────────┐  ┌──────────────┐  ┌──────────────────┐     │
│   │ LangGraph  │→ │  Experiment  │→ │  Run Service     │     │
│   │   Agent    │  │   Service    │  │ (sim / dist)     │     │
│   └────────────┘  └──────────────┘  └────────┬─────────┘     │
│           ▲                                  │               │
│           │               PostgreSQL         ▼               │
│   ┌───────┴────────┐   ┌────────────┐   ┌──────────────┐     │
│   │  LLM Service   │   │ Job / Run  │   │ fl_core libs │     │
│   │ (OpenAI etc.)  │   │ Repository │   │ FedAvg · CKKS│     │
│   └────────────────┘   └────────────┘   └──────────────┘     │
└──────────────────────────────────────────────────────────────┘

⚙️ Setup

git clone https://github.com/spire-studio/figaro.git
cd figaro
uv sync

cp .env.example .env
# Edit .env and set:
#   OPENAI_API_KEY=your-api-key
#   POSTGRES_PASSWORD=postgres

🚀 Quick Start

Option A: Docker (all-in-one)

./scripts/docker-dev-up.sh

Option B: Manual startup (recommended for GPU)

Step 1 — PostgreSQL (Docker or local):

docker run -d --name figaro-pg -p 5433:5432 \
  -e POSTGRES_PASSWORD=postgres -e POSTGRES_DB=figaro postgres:16

Step 2 — Backend:

POSTGRES_HOST=localhost POSTGRES_PORT=5433 POSTGRES_PASSWORD=postgres POSTGRES_DB=figaro \
  PYTHONPATH=libs:apps/backend/runners \
  uv run uvicorn app.main:app --app-dir apps/backend --host 0.0.0.0 --port 8000 --reload

To use a specific GPU (e.g. GPU 1):

CUDA_VISIBLE_DEVICES=1 POSTGRES_HOST=localhost ... uv run uvicorn ...

Step 3 — Frontend (in another terminal):

cd apps/frontend && pnpm install && pnpm dev

Access:

Frontend: http://localhost:5173
Backend API: http://localhost:8000/docs

LLM PEFT smoke run

The LLM PEFT smoke config uses a tiny local Hugging Face-compatible model so it can run without downloading a base model. Generate it once before selecting configs/smoke/llm_peft_simulation.yaml in the Simulation UI:

uv run python scripts/create_tiny_llm_fixture.py

For a direct PowerShell smoke run:

$env:PYTHONPATH="libs;apps/backend/runners"
uv run python -c "from pathlib import Path; from runtime_dispatcher import run_runtime; raise SystemExit(0 if run_runtime(Path('configs/smoke/llm_peft_simulation.yaml')) else 1)"

Distributed deployment

./scripts/docker-dist-up.sh

🤖 Using the Agent

Open the Agent tab in the frontend and describe your experiment:

"Compare CIFAR-10 non-IID with alpha=0.1, 0.3, 0.5"
"Compare 10 clients vs 20 clients with FedAvg"
"Test training rounds 10, 20, 50 on accuracy"

The agent will parse your request, expand it into concrete experiments, run them, and generate a comparison report.

🛠️ Configuration

dataset:
  name: "CIFAR-10"
  distribution: "non_iid"
  alpha: 0.5

model:
  name: "CNN"
  input_shape: [32, 32, 3]
  num_classes: 10

federated:
  num_clients: 10
  num_rounds: 20
  clients_per_round: 5
  local_epochs: 5
  learning_rate: 0.01
  aggregation: "fedavg"

📁 Project Structure

figaro/
├── apps/
│   ├── backend/
│   │   ├── app/              # FastAPI application
│   │   │   ├── services/
│   │   │   │   ├── agent/    # LangGraph Agent (core)
│   │   │   │   ├── llm.py    # LLM service
│   │   │   │   └── simulation.py
│   │   │   ├── api/          # REST API
│   │   │   └── models/       # Database models
│   │   └── runners/          # FL runtime
│   └── frontend/             # React UI
├── libs/fl_core/             # FL core library
│   ├── federated/            # Server / Client / Aggregation
│   ├── models/               # CNN / ResNet
│   ├── data/                 # Data loading & partitioning
│   ├── privacy/              # CKKS encryption
│   ├── compression/          # Top-K sparsification
│   └── llm/                  # LLM PEFT runtime utilities
├── configs/                  # Experiment configs
├── datasets/llm/             # Local LLM SFT/evaluation JSONL files
├── models/llm/               # Local Hugging Face-compatible LLM directories
├── skill/                    # Local operational skill guides
├── scripts/                  # Docker deployment scripts
└── .github/workflows/        # CI pipelines

🤝 Contribute & Roadmap

PRs welcome! Figaro is meant to be a readable, research-friendly FL platform.

Roadmap:

Phase 1: Solidifying the Agentic Platform

Interactive Agent Planning — Multi-turn dialogue support for refining experiments, plus visual topology previews (Plan Preview) before execution.
Execution Transparency — Real-time tracking of node-level status during execution and automated natural-language interpretation of results.
Strict Configuration Engine — Implement strict Pydantic/JSON Schema validation to resolve historical inconsistencies between config_schema and underlying algorithms.
Advanced Experiment Tracking — Multi-dimensional search filtering (by metrics, hyperparameters, status) and configuration version control (diffing).

Phase 2: LLM Federated PEFT Fine-Tuning

Simulation LoRA/PEFT SFT Route — task.type=llm_peft_sft dispatches to a dedicated single-machine simulation runtime that loads a Hugging Face or local causal LM, applies LoRA adapters, and runs per-client supervised fine-tuning from JSONL data.
LLM/PEFT Configuration Surface — config_schema and runtime normalization now cover base model, tokenizer, max sequence length, precision, SFT dataset path/format, prompt template, LoRA hyperparameters, target modules, quantization mode, and adapter resume path.
JSONL SFT Data Pipeline — Supports prompt/completion and chat messages JSONL formats, deterministic client splitting, and prompt rendering for plain/chat-style templates.
Adapter-Only Federated Aggregation — Aggregates LoRA/PEFT adapter tensors by client example count, persists global adapter artifacts, records SHA-256 lineage, and supports warm-starting from a previous global adapter.
LLM Runtime Dependencies & Metrics — Core project dependencies include transformers, peft, accelerate, safetensors, and bitsandbytes; backend metrics include train loss, perplexity, token throughput, adapter size, runtime status, dataset summary, and adapter artifact lineage.
Frontend & Agent UX for LLM Runs — Exposes the LLM route in the schema-driven Simulation and Agent planning flows, with task-aware field visibility, LLM-specific metric charts, local model/dataset selection, and Adapter Lineage artifact views.
Evaluation Harness — Adds validation JSONL configuration, per-round evaluation loss/perplexity calculation after global adapter aggregation, normalized evaluation metrics, and a generated tiny-model smoke config for LLM PEFT runs.

Phase 3: Enterprise & Team Collaboration

Multi-Tenant Workspaces — Isolated project environments with Role-Based Access Control (RBAC) and comprehensive audit logging.
Robust Distributed Scheduling — Global GPU resource queuing, quota management, and enhanced fault tolerance for client reconnections/dropouts.
Cloud-Native Infrastructure — Native Kubernetes (K8s) Runner integration with auto-scaling workers based on queue volume.
Model Asset Registry — Centralized Artifact Registry to track complete data lineage from dataset versions to final aggregated weights.
Compliance & Governance — Automated privacy compliance reporting (e.g., auditing Differential Privacy parameters) to ensure enterprise-grade security.

_{Figaro is for research and educational use.}

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
apps		apps
configs		configs
datasets/llm		datasets/llm
libs/fl_core		libs/fl_core
models/llm		models/llm
scripts		scripts
skill		skill
.env.example		.env.example
.gitignore		.gitignore
Dockerfile.pydev		Dockerfile.pydev
README.md		README.md
README.zh-CN.md		README.zh-CN.md
config_schema.yaml		config_schema.yaml
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.dist.yml		docker-compose.dist.yml
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎼 Figaro

📢 News

✨ Key Features

🎯 What the Agent Can Do

🏗️ Architecture

📚 Table of Contents

⚙️ Setup

🚀 Quick Start

Option A: Docker (all-in-one)

Option B: Manual startup (recommended for GPU)

LLM PEFT smoke run

Distributed deployment

🤖 Using the Agent

🛠️ Configuration

📁 Project Structure

🤝 Contribute & Roadmap

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎼 Figaro

📢 News

✨ Key Features

🎯 What the Agent Can Do

🏗️ Architecture

📚 Table of Contents

⚙️ Setup

🚀 Quick Start

Option A: Docker (all-in-one)

Option B: Manual startup (recommended for GPU)

LLM PEFT smoke run

Distributed deployment

🤖 Using the Agent

🛠️ Configuration

📁 Project Structure

🤝 Contribute & Roadmap

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages