PaKoML IDOLGAN Project

Introduction

Welcome to this Github repository, which presents a generative AI project utilizing a modified DCGAN model and minibatch discrimination technique to produce random Kpop idol portraits. The project has been trained on a dataset of Kpop female idols' images, and through deep learning algorithms, it has acquired the ability to generate realistic and unique images of Kpop idols. The modified DCGAN model and minibatch discrimination techniques have been incorporated into the project to enhance its ability to create high-quality and diverse images. With this project, users can effortlessly generate random Kpop idol faces and explore the extensive creative possibilities offered by generative AI. We invite you to join us on an exciting journey to explore the infinite potential of AI-generated art.

Architecture

Generator

The generator consists of a series of transposed convolutional layers. The initial layer takes the input noise vector $z\sim \mathcal{N}(\boldsymbol{0}, 0.02^2 \boldsymbol{I}_{512})$ and applies transposed convolution with a kernel size of 4, stride of 1, and padding of 0. Subsequent layers increase the spatial resolution by a factor of 2 while reducing the depth. The final layer generates a 64x64 RGB image using transposed convolution with a kernel size of 4, stride of 2, and padding of 1. Batch normalization and ReLU activation are applied throughout the model. The minibatch discrimination technique is used to enhance image diversity. The output image is resized to 64x64 using bilinear interpolation.

Discriminator

The discriminator is consists of convolutional layers, each followed by batch normalization and LeakyReLU activation. These layers progressively reduce the spatial dimensions while increasing the number of channels in the tensor. The resulting tensor is flattened and passed through a fully connected layer, which maps it to a lower-dimensional feature space. A minibatch discrimination module is added in the middle of the discriminator architecture to enhance diversity among generated samples by comparing features within a minibatch.

Datasets

The model is trained on about 5000 images of kpop idol's face image. The figures in the images is not evenly distributed, so the output result may be similar to some specific celebrity.

References

[1] Radford, A., Metz, L., & Chintala, S. (2015). Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434.

[2] Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., & Chen, X. (2016). Improved techniques for training gans. Advances in neural information processing systems, 29.

[3] Minibatch discrimination module in PyTorch. (n.d.). Gist. https://gist.github.com/t-ae/732f78671643de97bbe2c46519972491

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
frontend		frontend
model		model
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PaKoML IDOLGAN Project

Introduction

Architecture

Generator

Discriminator

Datasets

References

Developers

About

Uh oh!

Releases

Packages

Languages

License

PaKoML/IDOLGAN

Folders and files

Latest commit

History

Repository files navigation

PaKoML IDOLGAN Project

Introduction

Architecture

Generator

Discriminator

Datasets

References

Developers

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages