This is an implementation of a Deep Convolutional Generative Adversarial Network (DCGAN) modified to generate 128×128 spectrograms that can be converted back into WAV files using the Analysis & Resynthesis Sound Spectrograph (ARSS).
- install ffmpeg
- install arss
- pip install tqdm
Put wav files (songs of the same genre) in a folder and run
python process.py
Follow the prompts after running. process.py splits the WAV files into 2-second segments and converts each segment into a BMP spectrogram using ARSS. Since the BMP files are greyscale, they are then converted to single-channel PNGs in a folder called png.
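The two external-tool steps can be sketched as command builders like the ones below. This is not the contents of process.py; the ffmpeg segment-muxer flags are standard, but the ARSS flag names and frequency range are assumptions (check `arss --help`), and the PNG conversion step is only noted in a comment:

```python
import subprocess
from pathlib import Path

def split_wav(src: Path, out_dir: Path, seconds: int = 2) -> list[str]:
    """Build the ffmpeg command that splits src into fixed-length WAV segments."""
    out_dir.mkdir(parents=True, exist_ok=True)
    return [
        "ffmpeg", "-i", str(src),
        "-f", "segment", "-segment_time", str(seconds),
        "-c", "copy", str(out_dir / f"{src.stem}_%03d.wav"),
    ]

def wav_to_bmp(src: Path, dst: Path) -> list[str]:
    """Build the ARSS analysis command (flag names/values are assumptions)."""
    return ["arss", "-q", str(src), str(dst), "--analysis",
            "--min-freq", "27", "--max-freq", "20000"]

# Each command would be run per segment, e.g.:
# subprocess.run(split_wav(Path("song.wav"), Path("segments")), check=True)
# The greyscale BMPs can then be saved as single-channel PNGs,
# e.g. with Pillow: Image.open(bmp_path).convert("L").save(png_path)
```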
python train.py
will prompt for the name of the image data folder (in this case png) and for an appropriate batch size and number of epochs.
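For orientation, a DCGAN generator for 128×128 single-channel images might look like the sketch below. The layer widths and the `nz`/`ngf` hyperparameters are assumptions following the standard DCGAN recipe, not necessarily what model.py uses:

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    """DCGAN-style generator producing 128x128 single-channel spectrograms.
    Sizes are assumptions modeled on the standard DCGAN architecture."""
    def __init__(self, nz: int = 100, ngf: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(nz, ngf * 16, 4, 1, 0, bias=False),      # 4x4
            nn.BatchNorm2d(ngf * 16), nn.ReLU(True),
            nn.ConvTranspose2d(ngf * 16, ngf * 8, 4, 2, 1, bias=False), # 8x8
            nn.BatchNorm2d(ngf * 8), nn.ReLU(True),
            nn.ConvTranspose2d(ngf * 8, ngf * 4, 4, 2, 1, bias=False),  # 16x16
            nn.BatchNorm2d(ngf * 4), nn.ReLU(True),
            nn.ConvTranspose2d(ngf * 4, ngf * 2, 4, 2, 1, bias=False),  # 32x32
            nn.BatchNorm2d(ngf * 2), nn.ReLU(True),
            nn.ConvTranspose2d(ngf * 2, ngf, 4, 2, 1, bias=False),      # 64x64
            nn.BatchNorm2d(ngf), nn.ReLU(True),
            nn.ConvTranspose2d(ngf, 1, 4, 2, 1, bias=False),            # 128x128, 1 channel
            nn.Tanh(),
        )

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        return self.net(z)

g = Generator()
out = g(torch.randn(2, 100, 1, 1))  # out.shape == (2, 1, 128, 128)
```

Each stride-2 transposed convolution doubles the spatial size, so five doublings from a 4×4 seed reach the 128×128 spectrogram resolution.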
After the trained model is saved to a .pth file, generate.py and then bmp_to_wav.py will produce new spectrograms and WAV files.
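The generation side amounts to mapping the generator's tanh output back to 8-bit greyscale pixels and handing the BMP to ARSS for synthesis. A minimal sketch (the ARSS `--sine` flag and frequency range are assumptions; check `arss --help`):

```python
import numpy as np

def to_uint8(spec: np.ndarray) -> np.ndarray:
    """Map a generator output in [-1, 1] (tanh range) to 8-bit greyscale pixels."""
    return np.clip((spec + 1.0) * 127.5, 0, 255).astype(np.uint8)

def bmp_to_wav_cmd(bmp: str, wav: str) -> list[str]:
    """Build an ARSS sine-synthesis command (flag names are assumptions)."""
    return ["arss", "-q", bmp, wav, "--sine",
            "--min-freq", "27", "--max-freq", "20000"]
```

The uint8 array would be saved as a greyscale BMP (e.g. via Pillow) into bmp_generated, and the synthesis command run per file to fill wav_generated.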
128DCGAN-Spectogram/
├── data/
│   ├── bmp/
│   ├── png/
│   ├── bmp_generated/
│   └── wav_generated/
├── bmp_to_wav.py
├── data.py
├── generate.py
├── model.py
├── process.py
└── train.py

