improvements to wav2aug by gfdb · Pull Request #5 · gfdb/wav2aug

gfdb · 2026-01-08T22:29:11Z

This PR brings significant performance improvements to wav2aug augmentations, aligning behavior with SpeechBrain's implementations while dramatically reducing compute time.

Key Changes

speed_perturb

Use integer percentages (90, 100, 110) instead of float multipliers (0.9, 1.0, 1.1) and then round. This ensures good GCD with sample rates, making the sinc resampling filter much smaller
Cache torchaudio.transforms.Resample objects via lru_cache to avoid recomputing filter kernels

chunk_swap

Replaced nested Python loops with fully vectorized gather/scatter operations
Eliminated per-sample iteration entirely

NoiseLoader

New preload mode
New class that preloads all noise files into CPU RAM at initialization
Configurable storage_dtype (default float16) for memory efficiency w extremely tiny perf. degradation
Noise sampling becomes a fast tensor slice with zero I/O
Memory: ~650MB for pointsource_noises pack

Other improvements
freq_drop: Ported SpeechBrain's notch filter implementation for correctness
rand_amp_clip: Fixed normalization and uses single clip value per batch (matches SpeechBrain)
time_dropout: Vectorized implementation
Wav2Aug: Simplified interface, uses NoiseLoader by default

This reverts commit 362cb47.

This reverts commit 00cceea.

gfdb and others added 17 commits December 11, 2025 15:22

align w speechbrain

618785d

refactor chunk_swap, freq_drop, time_drop

a917ca3

allow for same augmentation selection

4f83f06

single augment test

28b0533

fix circ

33d315e

add noise workers

d22e15a

update clip, freq_drop, and noise

362cb47

Revert "update clip, freq_drop, and noise"

00cceea

This reverts commit 362cb47.

Reapply "update clip, freq_drop, and noise"

149f756

This reverts commit 00cceea.

fix torchaudio version #3 (#4)

80ef1f1

added preload for add_noise

2b72925

improve speed_pert and chunk_swap

bbbe446

clean up

98a3b33

fixed speed_pert bad gcd issue

2f76609

formatting

7d68fe3

fixed tests

a578183

update readme

834e472

gfdb merged commit 7ad55cf into main Jan 11, 2026
2 checks passed

gfdb deleted the match-perf branch January 12, 2026 21:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

improvements to wav2aug#5

improvements to wav2aug#5
gfdb merged 17 commits intomainfrom
match-perf

gfdb commented Jan 8, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

gfdb commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Key Changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

gfdb commented Jan 8, 2026 •

edited

Loading