Chiron3D: an interpretable deep learning framework for understanding the DNA code of chromatin looping

Chiron3D is a DNA-only deep learning model that predicts cell-type–specific CTCF HiChIP contact maps from 524,288 bp genomic windows. The model uses a frozen Borzoi backbone with lightweight adapters and a C.Origami-style pairwise head to output 105 × 105 contact matrices at 5 kb resolution.

This repository currently focuses on:

Training Chiron3D on the A673 CTCF HiChIP dataset
Evaluating trained checkpoints and reproducing the main quantitative and qualitative results from the figures

Overview of proposed pipeline

Data

Chiron3D is trained on CTCF HiChIP from the A673 wild-type Ewing sarcoma cell line on the hg19 reference genome.

All preprocessed inputs required to run the scripts in this repository are made available via Zenodo.

Setup

Download and unpack the Zenodo archive and have the following layout in the top level directory.

data/
  A673_WT_CTCF_5000.cool      # 5 kb binned CTCF HiChIP contact map (hg19)
  borzoi/                     # Weights of backbone from borzoi-pytorch
  chiron-model.ckpt           # pretrained checkpoint for evaluation + downstream
  chromosomes/                # hg19 FASTA files per chromosome (e.g. chr1.fa, ...)
  ctcf/                       # CTCF feature track (ChIP-seq)
  extruding_loops.csv         # Dataset of extruding loops classified by Tweed
  stable_loops.csv            # Dataset of stable loops classified by FitHiChIP
  windows_hg19.bed            # 524,288 bp windows tiled with 50 kb stride

Create a Python environment and install the package in editable mode:

conda create -n chiron python=3.10
conda activate chiron
pip install -r requirements.txt
pip install -e .

Training and evaluation

Chiron3D can be run directly from the command line. All commands below are intended to be run from the repository root.

Training from the command line

For a full run on 4 GPUs to replicate training results, run the following command:

python3 -m src.models.training.train \
  --seed 2077 \
  --save_path checkpoints \
  --regions-file data/windows_hg19.bed \
  --fasta-dir data/chromosomes \
  --cool-file data/A673_WT_CTCF_5000.cool \
  --genom-feat-path data/ctcf \
  --num-genom-feat 0 \
  --patience 7 \
  --max-epochs 25 \
  --save-top-n 1 \
  --num-gpu 4 \
  --batch-size 4 \
  --ddp-disabled \
  --num-workers 16 \
  --borzoi

This prints the best checkpoint path at the end of training.

Evaluation

The Zenodo link already contains a pre-trained model. To replicate our evaluation results, or run another trained model checkpoint, run the following command after inserting the appropiate checkpoint location.

python3 -m src.models.evaluation.evaluation \
  --regions-file data/windows_hg19.bed \
  --fasta-dir data/chromosomes \
  --cool-file data/A673_WT_CTCF_5000.cool \
  --genomic-feature data/ctcf \
  --num-genom-feat 0 \
  --ckpt-path checkpoints/models/<checkpoint>.ckpt \
  --borzoi

Optional SLURM wrappers

For SLURM-based cluster runs, example wrappers are provided in:

scripts/model-training.sh
scripts/model-evaluation.sh

These scripts are examples for our internal cluster setup and may need adaptation for your environment (e.g. conda environment name, absolute repository path, GPU partition, and resource requests).

GPU configuration

In on our experiments, we used 4 × NVIDIA RTX 4090 or 3090 with 24GB of memory for training. Training to convergence takes about a day. For evaluation (across three test chromosomes), 1 × NVIDIA RTX 4090 or 3090 with 24GB completes within an hour.

Downstream Task: Loop editing

The notebooks folder showcases four examples of using the ledidi-based editing framework to suggest in silico edits. The outputs of the runs can be viewed in the respective notebook and the corresponding example folders. Make sure to activate the environment and start jupyter from the notebooks folder.

On our SLURM cluster, the following command is used to run from within the notebooks directory:

srun --job-name jupyter -p gpu --gres=gpu:rtx4090:1 --time 01:00:00 --cpus-per-task 16 --mem 128G bash -c 'source ~/.bashrc && conda activate chiron && cd /path/to/main/folder/Chiron3D && pip install -e . && cd notebooks && jupyter lab --ip $(hostname -i) --no-browser'.

Citation

If you use this repository, please cite our preprint:

Hoenig, S., Grover, A., Neri, P., Surdez, D., & Boeva, V. (2026).
*Chiron3D: an interpretable deep learning framework for understanding the DNA code of chromatin looping*.
bioRxiv. https://doi.org/10.64898/2026.03.20.713211

Bibtex:

@article{hoenig2026chiron3d,
  title   = {Chiron3D: an interpretable deep learning framework for understanding the DNA code of chromatin looping},
  author  = {Hoenig, Sebastian and Grover, Aayush and Neri, Piero and Surdez, Delphine and Boeva, Virginie},
  journal = {bioRxiv},
  year    = {2026},
  doi     = {10.64898/2026.03.20.713211}
}

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
checkpoints		checkpoints
config		config
data		data
notebooks		notebooks
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chiron3D: an interpretable deep learning framework for understanding the DNA code of chromatin looping

Overview of proposed pipeline

Data

Setup

Training and evaluation

Training from the command line

Evaluation

Optional SLURM wrappers

GPU configuration

Downstream Task: Loop editing

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Chiron3D: an interpretable deep learning framework for understanding the DNA code of chromatin looping

Overview of proposed pipeline

Data

Setup

Training and evaluation

Training from the command line

Evaluation

Optional SLURM wrappers

GPU configuration

Downstream Task: Loop editing

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages