V1.1 by alexmorinvt · Pull Request #136 · Murali-group/Beeline

alexmorinvt · 2026-03-04T19:48:54Z

BEELINE v1.1 Release Notes

Breaking Changes

Config files

Config file format has changed. Config files from v1.0 are not immediately compatible with v1.1. If migrating, configs should be updated, per the following changes:
- The dataset_dir parameter has been removed.
- The name fields for datasets and algorithms are replaced by dataset_id / algorithm_id.
- Each dataset now has a runs list with individual run_id entries.
- should_run is now a top-level field on each dataset and algorithm entry rather than nested inside params.
- image specifies the Docker image used for running an algorithm. This enables users to swap multiple image versions without rebuilding or retagging images.
- scan_run_subdirectories automatically searches for subdirectories of a dataset directory. If runs are not enumerated in the config file, they will be skipped (non-breaking).
- Datasets may be labelled with an optional nickname parameter for plotting(non-breaking).
Zenodo dataset version v4 is required. The input data directory structure has changed. Users on earlier versions of BEELINE should continue to use Zenodo dataset version v3.

Data Structure Changes

The default ground truth network filename is GroundTruthNetwork.csv (previously refNetwork.csv).
The ground truth network is specified once at the dataset level in config files (previously specified per-run as trueEdges).
Note: The following changes are specific to the Zenodo datasets
Curated datasets are now split into separate directories by dropout level (GSD, GSD-q50, GSD-q70), each with a shared GroundTruthNetwork.csv at the dataset directory level.
Synthetic datasets have an added cell-count subdirectory level (e.g. dyn-BF/dyn-BF-100/dyn-BF-100-1/).

New Features

Runner

Each algorithm is now implemented as a class inheriting from an abstract Runner base class, replacing the v1.0 function-map pattern.
BLRunner.py now checks for existing working directories before running and prompts for confirmation before overwriting them.
scan_run_subdirectories option on dataset entries allows BLRunner.py to auto-discover run subdirectories rather than requiring an explicit runs list.
A new experiment_id field in output_settings inserts a named path segment in the output directory, allowing multiple experiments to share one output root.

Algorithms

Added PEARSON (Pearson correlation baseline).
Added SCSGL and JUMP3.
SCNS is excluded from this release due to prohibitively long run time. SCNS is still included as a buildable Docker container, and users may manually migrate the SCNS runner code from BEELINE v1.0 if they desire.
Updated Dockerfiles as needed such that containers build locally as of Docker version 28.5.1 and Ubunutu 24.04.2

Evaluation

The evaluation subsystem is rewritten as a hierarchy of callable classes (AUPRC, AUROC, EarlyPrecision, etc.), each responsible for loading, computing, and writing its own output.
Evaluation data now mirrors the dataset config hierarchy and infers grouping from it.

Plotting

Plotting is rebuilt from 2 files to a full framework: PlotAUPRC, PlotAUROC, PlotEPR, PlotSummaryHeatmap, PlotEPRHeatmap, and shared helpers in plotter.py and _heatmap.py.
BLPlotter.py now has a proper CLI with --auprc, --auroc, --epr, --summary, --epr-summary, and --all flags.

Utilities

Moved and upgraded utils/generateExpInputs.py for preprocessing experimental scRNA-seq datasets.
Moved and upgraded utils/initialize.sh and utils/setupAnacondaVENV.sh
Moved and upgraded utils/environment.yml for direct conda environment creation.
Created utils/buildDocs.sh for locally buidling the documentation.

Documentation

Documentation is built with Sphinx.
Added a new Reproducing BEELINE Results page with step-by-step instructions for recreating Figures 2 and 4 from Pratapa et al. (2020).

Disclaimer

Portions of this codebase and documentation were prepared with the assistance of Claude Sonnet 4.6, an AI assistant developed by Anthropic. All content has been reviewed and approved by the authors, who take full responsibility for its accuracy.

…. Fix bug that did not parse params correctly if in list format.

…output/errors/keep cli clean

…user with a warning that this will happen.

…sary prints about directories from runners

…a directory under output_dir

…iles in the --help text

…fy which docker image + tag they use. This allows multiple versions of the same algorithm or local builds to be used.

… build and run properly

…h subdirectories of dataset/dataset_id

…experiment_id to deconflict. Add scan_run_subdirectories to optionally, automatically iterate through all one-level subdirectories of a dataset that sets this value to true.

…tils folder

- Set up sphinx-apidoc with sphinx_rtd_theme for BLRun, BLEval, BLPlot - Port and update BEELINE and BoolODE docs from old_docs, fixing config format, repo structure, broken labels, image paths, and algorithm table - Add BLRun/__init__.py to make BLRun a proper Python package - Fix PathStats.py docstring RST substitution conflict - Add utils/buildDocs.sh to rebuild docs with a single command - Add sphinx and sphinx_rtd_theme to utils/environment.yml Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

… on the size of algorithm-name strings, instead of width of the plot.

…hanging width with the width of the plot

alexmorinvt added 30 commits February 21, 2026 02:36

recreated BLRunner.py

c39564d

Break out should_run param into the algorithm level instead of params…

19f6f89

…. Fix bug that did not parse params correctly if in list format.

add license

5a3fd34

bug fix for writing genie3 expression data

ce6cfb2

initial commit of BLEvaluator with running AUROC and AUPRC examples

b29aac5

standardize writing of output data frame to file

3916ad3

add EPR and SEPR

404371d

print status message per runner

6cf86f0

added initial time, borda, jaccard, motifs, and spearman evaluators.

38710bb

pipe all docker output to files in a standardized manner to preserve …

0ebf921

…output/errors/keep cli clean

Erase working directories each time that BLRunner is run. Prompt the …

8ddce98

…user with a warning that this will happen.

Improve check for existing working directories and remove not unneces…

d154c2e

…sary prints about directories from runners

workaround for SCODE df sorting error

f453fe7

add path stats evaluator

bfb9900

move more duplicate code to runner.py

efd2182

GroundTruthNetwork moved to the dataset level, not ground level

58bd455

minor BLEval bugs

a85ccd3

initial BLPlotter commit

761b936

correcting minor inconsistencies in the plots and adding EPR boxplots

faa365e

Modify runner files to add new run_id parameter. It specifies an extr…

060610c

…a directory under output_dir

code reuse updates

4f0671d

fix string - int comparison bug in BLEval path stats

db1ba4f

give the scsgl runner access to copy the ground truth network file

2ead5b8

make white and grey rows wider for Overview.pdf

045dd8b

Change Overview.pdf to Summary.pdf and mention names of output plot f…

6006e20

…iles in the --help text

bugfix: label column minima for EPRSummary plot

e34207c

change minimum to near 1.0 and gradient from rocket to magma

9e5e22b

minor change to whitespace

98c20b0

add README and docs images

637c3eb

add plotter examples

6b70a21

alexmorinvt and others added 20 commits February 24, 2026 16:24

initial commit with updated algorithm folders.

3348427

add a image parameter to the config that requires that the user speci…

baf2f6c

…fy which docker image + tag they use. This allows multiple versions of the same algorithm or local builds to be used.

add image description to config.yaml

0bd750f

sort plot AUROC and AUPRC by value, not algorithm name

9e0cead

Add Pearson correlation as an available base algorithm for comparison

880b33f

update initialize scripts with more --flag options and dockerfiles to…

a432f5f

… build and run properly

bug fix where some evaluation methods were resorting inappropriately.

9b77c5c

add nickname config value to enable short plotter labels

7b777ff

split multi-dataset pdfs into directories

ce14df0

remove dead dead per previous commit

0a12986

add scan_run_subdirectories parameter that when true, iterates throug…

4f4b7a2

…h subdirectories of dataset/dataset_id

remove dataset_dir as an option and rename output_settings:run_id to …

20ee7b9

…experiment_id to deconflict. Add scan_run_subdirectories to optionally, automatically iterate through all one-level subdirectories of a dataset that sets this value to true.

add generateExpInputs.py and move repo-level utility scripts to the u…

4b302c3

…tils folder

remove now-unnecessary import for an R package from setupAnacondaVENV.sh

902d1cd

make the width of grey and white rorws in the heatmap plots dependent…

20a7d7b

… on the size of algorithm-name strings, instead of width of the plot.

updates to docs for a more sensible page organization structure

c2a7274

set the width of the summary heatmap legend to be static instead of c…

e026c73

…hanging width with the width of the plot

Add documentation for reproducing results

41bb177

Merge branch 'v1.1-dev'

1cec85f

alexmorinvt force-pushed the v1.1 branch from cf75271 to 1cec85f Compare March 4, 2026 20:01

tiny bug fix: resolve input/output paths properly in the runner

fd7f690

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

V1.1#136

V1.1#136
alexmorinvt wants to merge 51 commits intomasterfrom
v1.1

alexmorinvt commented Mar 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

alexmorinvt commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

BEELINE v1.1 Release Notes

Breaking Changes

Config files

Data Structure Changes

New Features

Runner

Algorithms

Evaluation

Plotting

Utilities

Documentation

Disclaimer

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

alexmorinvt commented Mar 4, 2026 •

edited

Loading