
Krisocer/FigureWeave


From method text to editable scientific figures

Python 3.10+ | MIT License | Local GPU SAM3 (CUDA recommended) | Providers: Gemini, OpenAI, Claude



Overview

FigureWeave is a research-engineering project for turning paper method descriptions into publication-style figures that remain editable as SVG.

This project is inspired by AutoFigure, but it is no longer a mirror of the original system. The current codebase has been reworked into a more practical figure authoring pipeline with:

  • local GPU SAM3 segmentation
  • split routing for image drafting and SVG reasoning
  • multi-candidate end-to-end generation
  • figure caption conditioning in addition to method text
  • SVG-first reconstruction with template, optimized template, and final assembly stages
  • CUDA-accelerated local post-processing for segmentation and background removal

FigureWeave is especially useful for:

  • method overviews
  • pipeline diagrams
  • system schematics
  • architecture figures
  • editable draft figures for papers, slides, and reports

It is not intended to replace precise plotting tools such as matplotlib, seaborn, ggplot, or Origin for charts driven by exact numeric data.


Code Layout

The project is no longer organized as one large monolithic script; the pipeline stages now live in separate modules. This split makes the codebase easier to extend, debug, and test without changing the public CLI usage.


What Is New In FigureWeave

Compared with the original AutoFigure-style workflow, this project adds several concrete contributions:

  1. Local SAM3 on GPU. Segmentation can run locally on CUDA instead of depending only on hosted APIs, which improves speed, privacy, and reproducibility for the icon-region extraction stage.

  2. Dual-provider model routing. Image drafting and SVG reasoning are decoupled, so the pipeline can use different providers for different stages, such as Gemini -> Gemini, OpenAI -> OpenAI, Gemini -> Anthropic Claude, or OpenAI -> Anthropic Claude.

  3. Multi-candidate generation. A single run can generate multiple full candidates, preserve each artifact bundle, write a candidate manifest, and promote a selected result as the default output.

  4. Figure caption conditioning. The system accepts both method text and a figure caption / figure brief, so the generator and reconstructor can be constrained by explicit stage structure, layout intent, and narrative emphasis.

  5. SVG-first reconstruction pipeline. Instead of treating the raster image as the final result, FigureWeave explicitly reconstructs an editable SVG template, optionally refines that template, and only then assembles the final SVG with extracted assets.

  6. CUDA-accelerated local post-processing. Background removal and other local visual post-processing stages use GPU-capable PyTorch when available, reducing the CPU bottleneck of the original workflow.

  7. More robust fallback behavior. The pipeline includes explicit fallback paths for no-icon cases, placeholder reduction, and provider-side failures, which makes batch generation more practical for real paper figure drafting.
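As a rough sketch, the candidate bookkeeping described in item 3 could look like the following. The manifest schema and the write_candidate_manifest helper are illustrative, not the project's actual implementation:

```python
import json
from pathlib import Path

def write_candidate_manifest(output_dir, candidates, selected):
    """Record each candidate bundle and mark the promoted default.

    The schema here is a guess at what candidates_manifest.json could
    contain; FigureWeave's real manifest may differ.
    """
    manifest = {
        "candidates": [
            {"id": f"candidate_{i:02d}",
             "dir": str(Path(output_dir) / f"candidate_{i:02d}")}
            for i in range(1, candidates + 1)
        ],
        # The promoted candidate becomes the default output of the run.
        "selected": f"candidate_{selected:02d}",
    }
    path = Path(output_dir) / "candidates_manifest.json"
    path.write_text(json.dumps(manifest, indent=2))
    return manifest
```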


Gallery: Editable Vectorization & Style Transfer

The current FigureWeave showcase uses the following assets from the multimodal_medical_report run:

  1. Draft image: img/case/multimodal_medical_report_draft.png
  2. Optimized SVG template: img/case/multimodal_medical_report_template.svg
  3. Final assembled SVG: img/case/multimodal_medical_report_final.svg


This showcase highlights the intended FigureWeave workflow:

  • figure.png as the model-generated draft
  • optimized_template.svg as the editable structural reconstruction
  • final.svg as the assembled showcase result

UI Preview

The browser-based FigureWeave interface is shown below with both the configuration view and the editable SVG canvas.

  1. Config view: img/UI/UI_1.png
  2. Editable canvas: img/UI/UI_2.png



How It Works

FigureWeave currently runs in five major stages:

  1. Image Draft. Generate a scientific-style draft figure from method text, an optional figure caption, and an optional reference image.

  2. Segmentation. Run local SAM3 or an API backend to detect icons and visual regions, producing:

    • samed.png
    • boxlib.json

  3. Asset Extraction. Crop detected regions and remove backgrounds to create transparent assets.

  4. SVG Reasoning and Reconstruction. Use a multimodal model to reconstruct the draft into an editable SVG template, then optionally refine it.

  5. Assembly. Replace placeholders with extracted assets and emit:

    • template.svg
    • optimized_template.svg
    • final.svg
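The five stages above can be sketched as a minimal, data-driven pipeline. The run_pipeline helper and the stage callables are hypothetical; only the stage order and artifact names come from this README:

```python
# The five FigureWeave stages and the artifacts each one emits, per the docs.
STAGES = [
    ("image_draft",        ["figure.png"]),
    ("segmentation",       ["samed.png", "boxlib.json"]),
    ("asset_extraction",   ["icons/"]),
    ("svg_reconstruction", ["template.svg", "optimized_template.svg"]),
    ("assembly",           ["final.svg"]),
]

def run_pipeline(stage_impls, context):
    """Run stage callables in order; each receives and returns a shared
    context dict, and the artifacts it should emit are recorded."""
    for name, artifacts in STAGES:
        context = stage_impls[name](context)
        context.setdefault("artifacts", []).extend(artifacts)
    return context
```

Real stage implementations would call the configured providers and write files; the sketch only captures the ordering and the artifact bookkeeping.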

Configuration


Image Draft Provider

  • Gemini
  • OpenAI

SVG Reasoning And Reconstruction Provider

  • Gemini
  • OpenAI
  • Anthropic Claude

Practical Note

Anthropic Claude is used here for understanding and reconstruction, not for native image generation. In this project, the image drafting stage should use Gemini or OpenAI.
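That constraint can be expressed as a small validation step. The provider sets and the validate_routing helper below are illustrative, not the project's actual API:

```python
# Providers that can draft images vs. providers that can do SVG reasoning,
# per the lists above. Claude is reconstruction-only.
IMAGE_PROVIDERS = {"gemini", "openai"}
SVG_PROVIDERS = {"gemini", "openai", "anthropic"}

def validate_routing(image_provider, svg_provider):
    """Reject routings that send image drafting to a non-image provider."""
    if image_provider not in IMAGE_PROVIDERS:
        raise ValueError(
            f"{image_provider!r} cannot be used for image drafting; "
            "use gemini or openai (Claude is reconstruction-only)"
        )
    if svg_provider not in SVG_PROVIDERS:
        raise ValueError(f"unknown SVG provider {svg_provider!r}")
    return image_provider, svg_provider
```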

Web Interface

Start the server:

python server.py

Then open:

http://127.0.0.1:8000

The main configuration page now includes:

  • Method Text
  • Figure Caption
  • Image Draft Provider
  • SVG Reasoning Provider
  • Candidates
  • Generation Mode
  • SAM3 Backend
  • Reference Image

The canvas page lets you:

  • inspect intermediate artifacts
  • switch between candidate SVGs
  • review logs
  • open the result in the embedded SVG editor

Quick Start

Basic

python figureweave.py \
  --method_file paper.txt \
  --output_dir outputs/demo \
  --image_provider gemini \
  --image_api_key YOUR_GEMINI_KEY \
  --svg_provider anthropic \
  --svg_api_key YOUR_ANTHROPIC_KEY

Single-Provider Fallback

If you want to use one provider for both stages, you can still use:

python figureweave.py \
  --method_file paper.txt \
  --output_dir outputs/demo \
  --provider gemini \
  --api_key YOUR_GEMINI_KEY

Multi-Candidate Generation

python figureweave.py \
  --method_file paper.txt \
  --output_dir outputs/demo_multi \
  --image_provider gemini \
  --image_api_key YOUR_GEMINI_KEY \
  --svg_provider openai \
  --svg_api_key YOUR_OPENAI_KEY \
  --num_candidates 3

Local SAM3

FigureWeave supports local SAM3 execution on GPU.

Typical setup:

git clone https://github.com/facebookresearch/sam3.git
cd sam3
pip install -e .

You also need:

  • an available NVIDIA GPU
  • CUDA-enabled PyTorch in the current environment
  • Hugging Face access to SAM3

If local SAM3 is unavailable, the codebase can still fall back to other segmentation paths depending on your configuration.
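A best-effort probe for the prerequisites above might look like this. The local_sam3_ready helper is hypothetical; FigureWeave's own fallback logic may differ:

```python
import importlib.util

def local_sam3_ready():
    """Check whether this environment can plausibly run SAM3 locally on GPU.

    Returns (ok, reason). Deliberately avoids importing torch unless it is
    actually installed, so the probe itself never fails.
    """
    if importlib.util.find_spec("torch") is None:
        return False, "PyTorch is not installed"
    import torch
    if not torch.cuda.is_available():
        return False, "no CUDA-capable GPU visible to PyTorch"
    if importlib.util.find_spec("sam3") is None:
        return False, "the sam3 package is not installed"
    return True, "local SAM3 prerequisites look satisfied"
```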


Installation

Python Environment

pip install -r requirements.txt

Environment Variables

At minimum, you will usually want:

HF_TOKEN=your_huggingface_token
ROBOFLOW_API_KEY=your_roboflow_key

Depending on your selected routing, you may also need:

  • Gemini API key
  • OpenAI API key
  • Anthropic API key
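A small pre-flight check can catch missing keys before a run. HF_TOKEN and ROBOFLOW_API_KEY come from the docs above; any provider-key variable names you pass in, and the missing_env helper itself, are illustrative:

```python
import os

# Always-wanted variables, per the environment-variable section above.
REQUIRED = ["HF_TOKEN", "ROBOFLOW_API_KEY"]

def missing_env(routing_keys=(), env=os.environ):
    """Return the names of required variables that are unset or empty.

    routing_keys lists whatever extra provider keys your chosen
    image/SVG routing needs (e.g. a Gemini or OpenAI key variable).
    """
    needed = REQUIRED + list(routing_keys)
    return [name for name in needed if not env.get(name)]
```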

Docker

Build and run:

docker compose up -d --build

Health checks:

docker compose ps
curl http://127.0.0.1:8000/healthz

Logs:

docker compose logs -f figureweave

Restart:

docker compose restart figureweave

Output Structure

Typical outputs include:

  • figure.png
  • samed.png
  • boxlib.json
  • icons/
  • template.svg
  • optimized_template.svg
  • final.svg
  • candidates_manifest.json

When multi-candidate mode is enabled, each candidate is stored in its own subdirectory:

  • candidate_01/
  • candidate_02/
  • candidate_03/
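To enumerate candidate bundles programmatically, a sketch like the following can be used (the list_candidates helper is hypothetical, assuming the candidate_NN naming shown above):

```python
from pathlib import Path

def list_candidates(output_dir):
    """Return candidate_NN subdirectories of a multi-candidate run, in order."""
    root = Path(output_dir)
    return sorted(
        p for p in root.iterdir()
        if p.is_dir() and p.name.startswith("candidate_")
    )
```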

Credits

FigureWeave is inspired by AutoFigure and builds on the broader idea of converting scientific method descriptions into figure drafts.

The current project extends that direction with:

  • local GPU segmentation
  • dual-provider routing
  • multi-candidate generation
  • in-browser SVG refinement
  • a more complete engineering workflow

License

This repository is released under the MIT License; see the LICENSE file.

About

Generate editable scientific SVG figures from method text with local SAM3 and dual-provider routing.
