There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models

This is the implementation for the paper "There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models". We show that DDIM-inverted latents exhibit patterns of the input image, and we propose replacing the first inversion steps with the forward diffusion process, which improves image editability and interpolation with diffusion models. Please refer to our paper for more details.

⚙️ Setup

Installation

This project uses uv for environment and dependency management. Install uv (see the docs), then from the repository root run:

uv sync

This creates a .venv/ environment (Python 3.10, PyTorch wheels built for CUDA 12.1).

To also install the optional notebook dependencies:

uv sync --extra notebooks

Download pre-trained models

To use ADM models, download the following checkpoints and place them in the res/openai-models/ directory:

LDM, DiT, Deepfloyd-IF, and SDXL models are downloaded automatically.

🛠️ Configuration (Hydra)

Every experiment script is configured with Hydra. Default configs live in configs/. Any field can be overridden on the command line with key=value syntax, for example:

uv run accelerate launch taba/scripts/sampling/run_dit_sampling.py n_prompts=1280 with_inversion=true with_reconstruction=true

Boolean flags are set with key=true / key=false, paths default to null, and lists use 'key=[a,b]'.
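As a rough illustration of the override syntax above, the toy parser below mimics how key=value strings could map onto config fields. It is only a sketch and not Hydra's actual parser: `parse_value`, `apply_overrides`, and the values in `defaults` are hypothetical names and placeholders, not the contents of configs/.

```python
def parse_value(raw: str):
    """Convert an override value string into a Python value (toy rules only)."""
    text = raw.strip()
    lowered = text.lower()
    if lowered == "true":
        return True
    if lowered == "false":
        return False
    if lowered == "null":
        return None
    if text.startswith("[") and text.endswith("]"):
        inner = text[1:-1].strip()
        # Flat lists only; nested lists and quoted commas are not handled here.
        return [parse_value(item) for item in inner.split(",")] if inner else []
    for cast in (int, float):
        try:
            return cast(text)
        except ValueError:
            continue
    return text


def apply_overrides(defaults: dict, overrides: list[str]) -> dict:
    """Return a copy of `defaults` with key=value overrides applied."""
    config = dict(defaults)
    for item in overrides:
        key, _, raw = item.partition("=")
        if key not in config:
            raise KeyError(f"unknown config field: {key}")
        config[key] = parse_value(raw)
    return config


# Hypothetical defaults, chosen only to mirror the fields used in the command above.
defaults = {
    "n_prompts": 16,
    "with_inversion": False,
    "with_reconstruction": False,
    "save_dir": None,
}
config = apply_overrides(
    defaults,
    ["n_prompts=1280", "with_inversion=true", "with_reconstruction=true"],
)
```

In the real scripts Hydra does this work itself; the sketch only shows why booleans, null paths, and lists are written the way they are on the command line.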

🧪 Run experiments

Sampling, inversion, and reconstruction

ADM models:

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_adm_sampling.py model_name=cifar_pixel_32 num_inference_steps=100 with_inversion=true with_reconstruction=true seed=420 batch_size=256 n_samples=10240 save_dir=experiments/sample_invert_reconstruct/adm32

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_adm_sampling.py model_name=imagenet_pixel_64 num_inference_steps=100 with_inversion=true with_reconstruction=true seed=420 batch_size=128 n_samples=10240 save_dir=experiments/sample_invert_reconstruct/adm64

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_adm_sampling.py model_name=imagenet_pixel_256 num_inference_steps=100 with_inversion=true with_reconstruction=true seed=420 batch_size=64 n_samples=10240 save_dir=experiments/sample_invert_reconstruct/adm256

# use internal=true to collect intermediate steps
# use n_parts and part_idx to split the dataset to multiple parts
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_noise_path=PATH to sample from a tensor of sampled noises instead of torch.randn()
# use input_image_path=PATH to start with inversion from provided images
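The n_parts and part_idx options split one run into several independent jobs. A minimal sketch of one way such a split could work is shown below, assuming contiguous, near-equal chunks; the helper `part_bounds` is hypothetical and the repository's actual partitioning may differ.

```python
def part_bounds(n_samples: int, n_parts: int, part_idx: int) -> tuple[int, int]:
    """Return the half-open [start, end) sample range handled by one part."""
    if n_parts < 1:
        raise ValueError("n_parts must be at least 1")
    if not 0 <= part_idx < n_parts:
        raise ValueError("part_idx must be in {0, ..., n_parts - 1}")
    base, remainder = divmod(n_samples, n_parts)
    # The first `remainder` parts each take one extra sample.
    start = part_idx * base + min(part_idx, remainder)
    end = start + base + (1 if part_idx < remainder else 0)
    return start, end


# Example: n_samples=10240 split into 4 parts.
ranges = [part_bounds(10240, 4, idx) for idx in range(4)]
```

Each part can then be launched as a separate job with its own part_idx, and the outputs concatenated afterwards.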

LDM model:

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_ldm_sampling.py num_inference_steps=100 with_inversion=true with_reconstruction=true seed=420 batch_size=128 n_samples=10240 save_dir=experiments/sample_invert_reconstruct/ldm

# use internal=true to collect intermediate steps
# use n_parts and part_idx to split the dataset to multiple parts
# use input_noise_path=PATH to sample from a tensor of sampled noises instead of torch.randn()

DiT model:

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_dit_sampling.py seed=420 noises_per_prompt=8 n_prompts=1280 batch_size=128 num_inference_steps=100 guidance_scale=1.0 cond_seed=10 with_inversion=true with_reconstruction=true save_dir=experiments/sample_invert_reconstruct/dit

# use internal=true to collect intermediate steps
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_path=PATH and input_cond_path=PATH2 to sample from a ready tensor of sampled noises with given conditioning

Deepfloyd-IF model:

$ uv run accelerate launch --num_processes 1 taba/scripts/sampling/run_if_sampling.py seed=420 noises_per_prompt=8 n_prompts=1024 batch_size=64 num_inference_steps=100 guidance_scale=1.0 prompts_dataset=dataset cond_seed=10 with_inversion=true with_reconstruction=true save_dir=experiments/sample_invert_reconstruct/if

# use internal=true to collect intermediate steps
# use prompts_dataset=null to sample with null prompt embedding
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_path=PATH to sample from a ready tensor of sampled noises

SDXL model:

$ uv run accelerate launch --num_processes 1 taba/scripts/sdxl/sdxl_ddim_sample_inv_recon.py seed=88 n_noises_per_prompt=4 n_prompts=512 batch_size=4 num_inference_steps=50 guidance_scale=1.0 cond_seed=11

# results are written under experiments/sdxl/ddim
# to use our forward-diffusion inversion, see the "Invert with forward diffusion" section below

Replacing the first inversion predictions with the ground-truth ones

swap_before_t is the number of first diffusion-model predictions to replace during inversion. Example: swap_before_t=5 replaces the first five.
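The idea can be sketched on a 1-D toy problem. The code below is an illustration under stated assumptions, not the repository's implementation: `toy_eps` is a closed-form noise predictor for Gaussian data that stands in for the network, the noise schedule is arbitrary, and the indexing of `eps_trace` (the prediction recorded when sampling moved from step i to step i-1) is a guess at how the saved predictions line up with inversion steps.

```python
import math


def ddim_step(x, eps, a_from, a_to):
    """Deterministic DDIM update moving x from noise level a_from to a_to."""
    x0_pred = (x - math.sqrt(1 - a_from) * eps) / math.sqrt(a_from)
    return math.sqrt(a_to) * x0_pred + math.sqrt(1 - a_to) * eps


def toy_eps(x, a, mu=2.0, var=0.25):
    """Closed-form noise prediction for 1-D data ~ N(mu, var); replaces the network."""
    return math.sqrt(1 - a) * (x - math.sqrt(a) * mu) / (a * var + 1 - a)


def sample(x_T, alphas):
    """Run DDIM sampling and record the prediction used at every step."""
    eps_trace = [None] * len(alphas)
    x = x_T
    for i in range(len(alphas) - 1, 0, -1):
        eps = toy_eps(x, alphas[i])
        eps_trace[i] = eps  # prediction used to move from step i to step i-1
        x = ddim_step(x, eps, alphas[i], alphas[i - 1])
    return x, eps_trace


def invert(x0, alphas, eps_trace, swap_before_t):
    """DDIM inversion where the first swap_before_t predictions are the recorded ones."""
    x = x0
    for i in range(len(alphas) - 1):
        if i < swap_before_t:
            eps = eps_trace[i + 1]  # ground-truth prediction from sampling
        else:
            eps = toy_eps(x, alphas[i])  # usual approximation: predict at x_i
        x = ddim_step(x, eps, alphas[i], alphas[i + 1])
    return x


# alphas[0] is the clean end of the schedule, alphas[-1] the noisy end.
num_steps = 20
alphas = [math.cos(0.49 * math.pi * i / num_steps) ** 2 for i in range(num_steps + 1)]

x_T = 0.7
x0, eps_trace = sample(x_T, alphas)
latent_plain = invert(x0, alphas, eps_trace, swap_before_t=0)
latent_swapped = invert(x0, alphas, eps_trace, swap_before_t=5)
latent_full = invert(x0, alphas, eps_trace, swap_before_t=num_steps)
```

In this toy setting, swapping every prediction makes each inversion step the exact algebraic inverse of the corresponding sampling step, so `latent_full` matches `x_T` up to floating-point error. Swapping only the first few steps isolates how much of the inversion error comes from those early predictions.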

LDM model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_swap/run_ldm_invert_swap.py seed=420 batch_size=128 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/ldm/samples.pt swap_path=experiments/sample_invert_reconstruct/ldm/all_t_eps_samples.pt swap_before_t=5 swap_type=eps save_dir=experiments/invert_swap/ldm5

# use internal=true to collect intermediate steps
# use n_parts and part_idx to split the dataset to multiple parts
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# input_image_path is a tensor of generated images
# use swap_type=eps to replace model predictions, use swap_type=xt to replace the whole step (which avoids machine-precision issues)

DiT model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_swap/run_dit_invert_swap.py seed=420 batch_size=128 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/dit/samples.pt input_cond_path=experiments/sample_invert_reconstruct/dit/conds.pt swap_path=experiments/sample_invert_reconstruct/dit/all_t_eps_samples.pt swap_before_t=5 swap_type=eps save_dir=experiments/invert_swap/dit5

# use internal=true to collect intermediate steps
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_image_path=PATH and input_cond_path=PATH2 to make sure that inversion is done with the same conditions
# use swap_type=eps to replace model predictions, use swap_type=xt to replace the whole step (which avoids machine-precision issues)

Deepfloyd-IF model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_swap/run_if_invert_swap.py seed=420 batch_size=64 num_inference_steps=100 guidance_scale=1.0 internal=true with_reconstruction=true input_samples_path=experiments/sample_invert_reconstruct/if/samples.pt input_prompts_path=experiments/sample_invert_reconstruct/if/prompts.pkl swap_path=experiments/sample_invert_reconstruct/if/all_t_eps_samples.pt swap_type=eps swap_before_t=5 save_dir=experiments/invert_swap/if5

# use internal=true to collect intermediate steps
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_samples_path=PATH and input_prompts_path=PATH2 to make sure that inversion is done with the same conditions
# use swap_type=eps to replace model predictions, use swap_type=xt to replace the whole step (which avoids machine-precision issues)

Invert with forward diffusion (our method)

forward_before_t is the number of first inversion steps to replace with forward diffusion. Example: forward_before_t=3 replaces the first three.
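A minimal 1-D sketch of this procedure follows, written under several assumptions: `toy_eps` is a closed-form stand-in for the network, the schedule is arbitrary, and replacing the first K steps is interpreted as jumping straight to `alphas[K]` by adding Gaussian noise drawn from a generator seeded with forward_seed. The function name `invert_with_forward` is hypothetical and the scripts in taba/scripts/invert_forward/ may differ in detail.

```python
import math
import random


def ddim_step(x, eps, a_from, a_to):
    """Deterministic DDIM update moving x from noise level a_from to a_to."""
    x0_pred = (x - math.sqrt(1 - a_from) * eps) / math.sqrt(a_from)
    return math.sqrt(a_to) * x0_pred + math.sqrt(1 - a_to) * eps


def toy_eps(x, a, mu=2.0, var=0.25):
    """Closed-form noise prediction for 1-D data ~ N(mu, var); replaces the network."""
    return math.sqrt(1 - a) * (x - math.sqrt(a) * mu) / (a * var + 1 - a)


def invert_with_forward(x0, alphas, forward_before_t, forward_seed):
    """Replace the first forward_before_t inversion steps with forward diffusion."""
    if not 0 <= forward_before_t <= len(alphas) - 1:
        raise ValueError("forward_before_t is out of range for this schedule")
    x = x0
    if forward_before_t > 0:
        # Forward diffusion: noise the clean input directly to step K.
        rng = random.Random(forward_seed)
        a_k = alphas[forward_before_t]
        x = math.sqrt(a_k) * x0 + math.sqrt(1 - a_k) * rng.gauss(0.0, 1.0)
    # Continue with ordinary DDIM inversion for the remaining steps.
    for i in range(forward_before_t, len(alphas) - 1):
        x = ddim_step(x, toy_eps(x, alphas[i]), alphas[i], alphas[i + 1])
    return x


# alphas[0] is the clean end of the schedule, alphas[-1] the noisy end.
num_steps = 20
alphas = [math.cos(0.49 * math.pi * i / num_steps) ** 2 for i in range(num_steps + 1)]

x0 = 1.5
latent_ddim = invert_with_forward(x0, alphas, forward_before_t=0, forward_seed=999)
latent_ours = invert_with_forward(x0, alphas, forward_before_t=3, forward_seed=999)
```

With forward_before_t=0 the sketch reduces to plain DDIM inversion. The injected noise uses its own generator, which mirrors the advice to keep forward_seed different from seed so that the forward noise is not tied to the noise used for sampling.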

ADM model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_adm_invert_forward.py model_name=cifar_pixel_32 seed=420 batch_size=256 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/adm32/samples.pt forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/adm32_3
$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_adm_invert_forward.py model_name=imagenet_pixel_64 seed=420 batch_size=128 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/adm64/samples.pt forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/adm64_3
$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_adm_invert_forward.py model_name=imagenet_pixel_256 seed=420 batch_size=64 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/adm256/samples.pt forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/adm256_3

# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_image_path=PATH to make sure that inversion is done with the same images
# make sure forward_seed is different from seed
# if needed, divide the dataset to multiple parts with n_parts=N and part_idx={0, ..., N-1}

DiT model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_dit_invert_forward.py seed=420 batch_size=128 num_inference_steps=100 guidance_scale=1.0 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/dit/samples.pt input_cond_path=experiments/sample_invert_reconstruct/dit/conds.pt forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/dit3

# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_image_path=PATH and input_cond_path=PATH2 to make sure that inversion is done with the same conditions
# make sure forward_seed is different from seed

LDM model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_ldm_invert_forward.py seed=420 batch_size=128 num_inference_steps=100 with_reconstruction=true input_image_path=experiments/sample_invert_reconstruct/ldm/samples.pt forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/ldm_3

# use internal=true to collect intermediate steps
# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_image_path=PATH to make sure that inversion is done with the same images
# make sure forward_seed is different from seed

Deepfloyd-IF model:

$ uv run accelerate launch --num_processes 1 taba/scripts/invert_forward/run_if_invert_forward.py seed=420 batch_size=64 num_inference_steps=100 guidance_scale=1.0 with_reconstruction=true input_samples_path=experiments/sample_invert_reconstruct/if/samples.pt input_prompts_path=experiments/sample_invert_reconstruct/if/prompts.pkl forward_before_t=3 forward_seed=999 save_dir=experiments/invert_forward/if3

# use --num_processes N (an accelerate flag) to sample data split with N GPUs
# use input_samples_path=PATH and input_prompts_path=PATH2 to make sure that inversion is done with the same conditions
# make sure forward_seed is different from seed

SDXL model:

$ uv run accelerate launch --num_processes 1 taba/scripts/sdxl/sdxl_ddim_sample_inv_recon.py seed=88 n_noises_per_prompt=4 n_prompts=512 batch_size=4 num_inference_steps=50 guidance_scale=1.0 cond_seed=11 with_forward=true forward_before_t=3 forward_seed=999

# the SDXL sample/invert/reconstruct script applies forward diffusion in-place via with_forward=true
# use forward_before_t=K to set the number of first inversion steps to replace with forward diffusion
# make sure forward_seed is different from seed

🎨 Editing real images with our inversion

We provide two real-image editing pipelines that plug our forward-diffusion inversion into existing attention-based editing methods.

MasaCtrl + forward diffusion (SD1.5)

Edit a single user image by combining MasaCtrl with our inversion:

$ uv run python taba/scripts/masactrl/run_masactrl_edit_real.py \
    image_path=/path/to/image.png \
    source_prompt="" \
    target_prompt="a photo of a tiger" \
    num_inference_steps=50 guidance_scale=7.5 \
    forward_t=2 forward_seed=2115 \
    masactrl_step=4 masactrl_layer=10 \
    output_dir=experiments/masactrl/edit_real

forward_t is the number of first inversion steps replaced with forward diffusion (set forward_t=0 to fall back to standard DDIM inversion). Outputs are written to output_dir.

StyleAligned + forward diffusion (SDXL)

Transfer the style of a user image to one or more target prompts by combining StyleAligned with our inversion:

$ uv run python taba/scripts/style_aligned/transfer_style.py image_path=https://luk-st.github.io/img_ls.png p_source="a photo of a man in photography style" 'ps_target=["a photo of penguin in photography style"]' use_forward_diffusion=true forward_t=2 forward_seed=999 num_inference_steps=100 output_dir=results/style_transfer_demo/penguin

image_path can be a local path or a URL. Pass several target prompts via 'ps_target=["...","..."]'. Set use_forward_diffusion=false to fall back to standard DDIM inversion.

💗 Acknowledgements

This repository is based on openai/guided-diffusion and the diffusers 🧨 DDIM Scheduler implementation. The real-image editing pipelines build on MasaCtrl and StyleAligned.

✒️ Citation

If you find our work helpful, please consider citing:

@inproceedings{
     staniszewski2026there,
     title={There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models},
     author={{\L}ukasz Staniszewski and {\L}ukasz Kuci{\'n}ski and Kamil Deja},
     booktitle={The Fourteenth International Conference on Learning Representations},
     year={2026},
     url={https://openreview.net/forum?id=8PaDdLuVKN}
}
