TADA! Tuning Audio Diffusion Models through Activation Steering

_{TLDR; We show where (through activation patching) and how (through benchmarking steering methods) to intervene Audio Diffusion Models for Musical Concept Modulation.}

⚙️ Installation

Install with uv:

uv sync
source .venv/bin/activate

Copy the env template and set the project root:

cp .env.example .env
# Edit .env: set PROJECT_ROOT. HF token + cluster vars are optional.
set -a; source .env; set +a

Model checkpoints:

ACE-Step weights (≈ 7.8 GB) will be downloaded to ${ACE_STEP_CACHE} on first run; AudioLDM2 / Stable Audio Open checkpoints pull from their respective HF repos on first use.

CLAP (music) checkpoint (≈ 2.2 GB) is required for every evaluation script. Download music_audioset_epoch_15_esc_90.14.pt from HuggingFace:

mkdir -p res/clap/pretrained
wget -O res/clap/pretrained/music_audioset_epoch_15_esc_90.14.pt https://huggingface.co/lukewys/laion_clap/resolve/main/music_audioset_epoch_15_esc_90.14.pt

🔍 Localization

Activation patching (causal tracing) with NNSight to identify which layers in each audio-diffusion architecture encode specific musical features (e.g. "violin", "female vocal", "slow tempo").

Counterfactual prompt dataset

We release lukasz-staniszewski/patching-music-musiccaps-prompts — 3,246 rows of (clean, corrupted) prompt pairs over 21 features (female, male, fast, slow, happy, sad, reggae, metal, opera, jazz, violin, trumpet, saxophone, drums, cello, bongos, flute, maracas, harmonica, trombone, xylophone). Each row is a real MusicCaps caption mentioning a feature, paired with the same caption after a word-level counterfactual swap (e.g. violin→trumpet, fast→slow).

To load:

from datasets import load_dataset

ds = load_dataset("lukasz-staniszewski/patching-music-musiccaps-prompts")
violin_rows = ds["train"].filter(lambda r: r["original_feature"] == "violin")
print(violin_rows[0])
# {'original_feature': 'violin',
#  'clean_prompt':     'A folk piece with violin and ...',
#  'corrupted_prompt': 'A folk piece with trumpet and ...'}

Each row contains: original_feature (filter on this for a per-feature run), clean_prompt (un-patched activations), and corrupted_prompt (source activations to patch in). Feature words and their counterfactual swap maps live in src/preprocess/features.py (MUSICCAPS_ORIGINAL_FEATURES, MUSICCAPS_COUNTERFACTUAL_FEATURES, MUSICCAPS_SWAPS_FEATURES).

Regenerating the MusicCaps prompts

To rebuild the CSV locally, edit src/preprocess/features.py, download the MusicCaps caption file from the HuggingFace mirror and place it at data/music_caps.csv:

mkdir -p data
wget -O data/music_caps.csv https://huggingface.co/datasets/google/MusicCaps/resolve/main/musiccaps-public.csv

Then run:

python src/preprocess/prepare_prompts.py \
    --input_file data/music_caps.csv \
    --limit 256 \
    --output_file data/generated_prompts.csv

To republish on the Hub afterwards:

python scripts/hub/push_localization_prompts.py \
    --repo-id {repo_id}/patching-music-musiccaps-prompts \
    --csv data/generated_prompts.csv

Running activation patching

The patching driver is src/patch_layers.py, configured via Hydra. The top-level config is configs/generate_audio_patch.yaml; per-architecture overrides live under configs/patch_model/<arch>_patch.yaml, per-feature data overrides under configs/patch_data/musiccaps/<feature>[_<arch>].yaml, and per-block layer presets under configs/patch_layers/<arch>/<preset>.yaml. After generation, src/eval_audio.py computes CLAP / MUQ-T metrics over the patched audio.

A typical sweep patches each block in turn (or none, the un-patched baseline) and compares CLAP/MuQ scores across blocks.

Per-architecture examples

ACE-Step

# 1. Generate patched audio (one block per run, then aggregate).
python src/patch_layers.py \
    patch_model=ace_patch \
    patch_config=ace \
    patch_data=musiccaps/violin_ace \
    patch_layers=ace/tf7 \
    paths.output_dir=outputs/ace/patching/violin/tf7

# prompt_limit=2 for low scale tests

# 2. Score CLAP / MUQ-T against the clean baseline.
bash sh_scripts/localization/eval_feature_ace.sh violin violin_summary none tf5 tf6 tf7 all

Available patch_layers presets for ACE-Step: ace/all, ace/none, ace/tf{0..23}, plus combos like ace/tf5tf6, ace/tf5tf6tf7, ace/tf6tf7. Browse configs/patch_layers/ace/ for the full list.

AudioLDM 2

python src/patch_layers.py \
    patch_model=audioldm2_patch \
    patch_data=musiccaps/violin \
    patch_layers=audioldm2/up1tf5 \
    paths.output_dir=outputs/audioldm2/patching/violin/up1tf5

bash sh_scripts/localization/eval_feature_audioldm2.sh violin violin_summary up1tf5

Available presets: audioldm2/all, audioldm2/none, audioldm2/up0, audioldm2/up1, audioldm2/up2, audioldm2/mid, audioldm2/down{1,2,3}, plus fine-grained sub-blocks like audioldm2/up1tf5, audioldm2/up1tf5attn0, audioldm2/up1tf9attn1, audioldm2/up1tf2tf5tf9tf10 (combos), etc. See configs/patch_layers/audioldm2/.

Stable Audio Open

python src/patch_layers.py \
    patch_model=stableaudio_patch \
    patch_config=stableaudio \
    patch_data=musiccaps/violin \
    patch_layers=stableaudio/tf11 \
    paths.output_dir=outputs/stableaudio/patching/violin/tf11

bash sh_scripts/localization/eval_feature_stableaudio.sh violin violin_summary tf11

Available presets: stableaudio/all, stableaudio/none, stableaudio/tf{0..N}, plus per-projection sub-blocks like stableaudio/tf11v, stableaudio/tf11k, stableaudio/tf12k, and combos like stableaudio/tf11tf12. See configs/patch_layers/stableaudio/.

🎛️ Activation Steering

Every method shares the same interface.

A typical workflow: (1) compute steering artifacts once with src/steering/run_compute.py and (2) sweep alphas with src/steering/run_eval.py. Both accept --config <path.yaml> — see configs/steering/.

Method	What it does	Needs compute step?	Wrapper
PCI	Switch to positive/negative prompt for the last \|α\| diffusion steps	no	`PCISteeringController`
Text Embeddings	Interpolate in T5 embedding space	no	`TextEmbSteeringController`
Token Embeddings	Per-concept direction at the concept-token position	yes	`TokEmbSteeringController`
FreeSliders	3-pass noise blending past `split_step`	no	`FreeSlidersSteeringController`
Concept Sliders	LoRA adapter trained with the Concept-Sliders loss	yes	`ConceptSlidersSteeringController`
AUSteer	Sparse activation-momentum scoring over freq bins	yes	`AUSteerSteeringController`
CAA	Contrastive Activation Addition (mean-diff steering vector)	yes	`CAASteeringController`
SAE	Sparse-autoencoder feature interventions	yes	`SAESteeringController`

Pretrained artifacts live in the ACE-Step Audio Steering Suite. Every example below uses concept="piano"; pass target_layers="tf6tf7" (or any list of block names) to restrict to localized layers.

Prompt Conditioned Intervention (PCI)

from src.steering import SteerableACEModel, PCISteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = PCISteeringController(concept="piano", alpha=5)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Text Embeddings (TextEmb)

from src.steering import SteerableACEModel, TextEmbSteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = TextEmbSteeringController(concept="piano", alpha=0.8, te_split_step=3)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Token Embeddings (TokEmb)

from src.steering import SteerableACEModel, TokEmbSteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = TokEmbSteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-tokemb-piano", alpha=1.0, te_split_step=3,
)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Compute your own: python src/steering/run_compute.py --config configs/steering/ace/tokemb/compute_piano.yaml.

FreeSliders (FreeSliders)

from src.steering import SteerableACEModel, FreeSlidersSteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = FreeSlidersSteeringController(concept="piano", alpha=2.0, split_step=3)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Concept Sliders (CS)

from src.steering import SteerableACEModel, ConceptSlidersSteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = ConceptSlidersSteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-cs-piano-r8-all", alpha=1.0,
)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Train your own:

python src/steering/run_compute.py \
    --scorer concept_slider --concept piano --no-model \
    --output steering_vectors/concept_slider/ace_piano_r8_eta7_500_all \
    --scorer-kwargs '{"iterations":500,"eta":7,"layers":"all"}'

AUSteer

from src.steering import SteerableACEModel, AUSteerSteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = AUSteerSteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-austeer-piano-all", alpha=15.0,
)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Compute your own:

python src/steering/run_compute.py \
    --scorer austeer --concept piano --no-model \
    --output steering_vectors/austeer/ace_piano_all \
    --scorer-kwargs '{"layers":"all","num_inference_steps":30,"guidance_scale":5.0,"seed":10}'

Contrastive Activation Addition (CAA)

from src.steering import SteerableACEModel, CAASteeringController

model = SteerableACEModel(device="cuda"); model.pipeline.load()
ctrl = CAASteeringController.from_pretrained(
    "lukasz-staniszewski/ace-step-caa-piano", alpha=20.0,
)
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Compute your own: python src/steering/run_compute.py --config configs/steering/ace/caa/compute_piano.yaml.

Sparse Autoencoder (SAE)

Two SAEs (one per hookpoint) + per-concept score tables, blended via top-tau features per diffusion step.

from src.steering import SteerableACEModel, SAESteeringController, LayerSpec
from src.steering.methods.sae import load_features_from_score_cache
from src.steering.methods.sae.lib.sae.sae import Sae

model = SteerableACEModel(device="cuda"); model.pipeline.load()

sae_tf7 = Sae.load_from_hub(
    "lukasz-staniszewski/ace-step-sae-tf7-cross-attn",
    hookpoint="transformer_blocks.7.cross_attn", device="cuda",
)
sae_tf6 = Sae.load_from_hub(
    "lukasz-staniszewski/ace-step-sae-tf6-cross-attn",
    hookpoint="transformer_blocks.6.cross_attn", device="cuda",
)

top20_tf7 = load_features_from_score_cache(
    "lukasz-staniszewski/ace-step-sae-scores-piano",
    score_filename="tf7_scores.pkl", top_k=20,
)
top20_tf6 = load_features_from_score_cache(
    "lukasz-staniszewski/ace-step-sae-scores-piano",
    score_filename="tf6_scores.pkl", top_k=20,
)

ctrl = SAESteeringController({
    "transformer_blocks.7.cross_attn": LayerSpec(
        sae=sae_tf7, features_per_timestep=top20_tf7,
        intervention_mode="steering_vector", multiplier=10.0,
    ),
    "transformer_blocks.6.cross_attn": LayerSpec(
        sae=sae_tf6, features_per_timestep=top20_tf6,
        intervention_mode="steering_vector", multiplier=10.0,
    ),
})
with model.steer(ctrl):
    audio = model.generate(prompt="instrumental music", lyrics="[inst]",
                           audio_duration=10.0, infer_step=30, manual_seed=0)

Compute your own — per-concept feature-selection scores (tf{6,7}_scores.pkl):

python src/steering/run_compute.py --config configs/steering/ace/sae/compute_piano.yaml
# → steering_vectors/sae/ace_piano/{tf7_scores.pkl, tf6_scores.pkl}

To train the SAEs themselves first — cache activations, then train:

python src/steering/run_compute.py \
    --scorer sae-activations --output activations/ace_piano \
    --scorer-kwargs '{"hook_names":["transformer_blocks.7.cross_attn","transformer_blocks.6.cross_attn"],
                     "dataset_name":"data/music_caps.csv","audio_length_in_s":10.0}'

python -m src.steering.methods.sae.lib.scripts.train_ace --help

📊 Evaluation

Experiments have a corresponding YAML config in configs/steering/.

Per-method alpha sweep

python src/steering/run_eval.py --config configs/steering/ace/caa/eval_loc_piano.yaml
# → outputs/eval/caa_loc_piano/{run_config.json, alpha_<value>/p<i>.wav}

Each method has up to three eval variants per concept (mirroring the paper):

eval_all_<concept>.yaml — steer all 24 transformer blocks
eval_loc_<concept>.yaml — steer only the localized tf6 + tf7 layers
eval_ablated_<concept>.yaml — CAA only; steer all except tf6/tf7 (sanity check)

Every hparam in the YAML — alphas, layers, steer-mode, duration, steps, seed, guidance-scale, plus the method-specific method-kwargs dict — is also a CLI flag (CLI overrides YAML when both are supplied). The evaluation set defaults to the 100-prompt TEST_PROMPTS set in src/steering/eval/test_prompts.py; override with prompts: [...] inline or prompts-file: at a .txt (one per line) or .csv (named column via prompts-column:). See configs/steering/ace/caa/eval_piano_custom.yaml for an annotated template.

Compute concept-alignment + preservation metrics

src/steering/eval/eval_steering_protocol.py computes CLAP, MUQ-T, LPAPS, FAD, and Audiobox aesthetics for one sweep directory and writes protocol_results/ next to it.

python src/steering/eval/eval_steering_protocol.py \
    --steering_dir outputs/eval/caa_piano \
    --concept piano

Preservation-vs-delta-alignment plots

After running eval_steering_protocol.py on multiple methods for the same concept, plot the preservation (LPAPS-based) vs. delta-alignment curves and visualise the AUC integral:

# Alignment-vs-LPAPS curves per (concept, sign), one panel per method.
python src/steering/eval/plot_alignment_lpaps_curves.py

# Preservation × delta-alignment curves with the AUC area shaded.
python src/steering/eval/plot_preservation_delta_alignment_auc.py

Both scripts read from the standard sweep tree (outputs/<run>/ace_step/concept_<name>/{all|loc}/<method>/protocol_results/) and write PDFs into outputs/plots/.

AUC, Smoothness (CSM), Audio Quality

src/steering/eval/auc.py is the single entry point that aggregates per-method protocol_results/ directories into a LaTeX-ready table:

AUC — area under the preservation × delta-alignment curve (per direction pos/neg, and overall). Higher is better.
CSM (Conceptual Smoothness) — std of consecutive delta_alignment differences across alphas. Lower is better.
Audio Quality — Audiobox aesthetics (CE / CU / PC / PQ) interpolated at a fixed preservation level so the comparison across methods is at matched audio-quality budget.

# Compare all 8 methods on one concept (each path is a protocol_results/ dir).
python src/steering/eval/auc.py \
    "outputs/eval/{caa,sae,austeer,concept_slider,freesliders,textemb,pci,tokemb}_piano/protocol_results" \
    --direction both \
    --auto_label

For a single-method or single-direction breakdown, narrow the glob and pass --direction pos or --direction neg. --latex_only emits a LaTeX tabular row for the paper.

🙏 Credits

This repository builds on: ACE-Step, DDPM Inversion for Audio, CASteer, Universal DiffSAE.

📚 BibTeX

@article{staniszewski2026tada,
  title={TADA! Tuning Audio Diffusion Models through Activation Steering},
  author={Staniszewski, {\L}ukasz and Zaleska, Katarzyna and Modrzejewski, Mateusz and Deja, Kamil},
  journal={arXiv preprint arXiv:2602.11910},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
configs		configs
docs		docs
editing		editing
external_codebases		external_codebases
scripts/hub		scripts/hub
sh_scripts/localization		sh_scripts/localization
src		src
.env.example		.env.example
.gitignore		.gitignore
.project-root		.project-root
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
pyrightconfig.json		pyrightconfig.json
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TADA! Tuning Audio Diffusion Models through Activation Steering

⚙️ Installation

🔍 Localization

Counterfactual prompt dataset

Regenerating the MusicCaps prompts

Running activation patching

Per-architecture examples

ACE-Step

AudioLDM 2

Stable Audio Open

🎛️ Activation Steering

Prompt Conditioned Intervention (PCI)

Text Embeddings (TextEmb)

Token Embeddings (TokEmb)

FreeSliders (FreeSliders)

Concept Sliders (CS)

AUSteer

Contrastive Activation Addition (CAA)

Sparse Autoencoder (SAE)

📊 Evaluation

Per-method alpha sweep

Compute concept-alignment + preservation metrics

Preservation-vs-delta-alignment plots

AUC, Smoothness (CSM), Audio Quality

🙏 Credits

📚 BibTeX

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TADA! Tuning Audio Diffusion Models through Activation Steering

⚙️ Installation

🔍 Localization

Counterfactual prompt dataset

Regenerating the MusicCaps prompts

Running activation patching

Per-architecture examples

ACE-Step

AudioLDM 2

Stable Audio Open

🎛️ Activation Steering

Prompt Conditioned Intervention (PCI)

Text Embeddings (TextEmb)

Token Embeddings (TokEmb)

FreeSliders (FreeSliders)

Concept Sliders (CS)

AUSteer

Contrastive Activation Addition (CAA)

Sparse Autoencoder (SAE)

📊 Evaluation

Per-method alpha sweep

Compute concept-alignment + preservation metrics

Preservation-vs-delta-alignment plots

AUC, Smoothness (CSM), Audio Quality

🙏 Credits

📚 BibTeX

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages