Skip to content

PaKoML/IDOLGAN

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PaKoML IDOLGAN Project

license repo size in bytes GitHub contributors GitHub commit GitHub commit interval

Introduction

Welcome to this Github repository, which presents a generative AI project utilizing a modified DCGAN model and minibatch discrimination technique to produce random Kpop idol portraits. The project has been trained on a dataset of Kpop female idols' images, and through deep learning algorithms, it has acquired the ability to generate realistic and unique images of Kpop idols. The modified DCGAN model and minibatch discrimination techniques have been incorporated into the project to enhance its ability to create high-quality and diverse images. With this project, users can effortlessly generate random Kpop idol faces and explore the extensive creative possibilities offered by generative AI. We invite you to join us on an exciting journey to explore the infinite potential of AI-generated art.

Architecture

Generator

The generator consists of a series of transposed convolutional layers. The initial layer takes the input noise vector $z\sim \mathcal{N}(\boldsymbol{0}, 0.02^2 \boldsymbol{I}_{512})$ and applies transposed convolution with a kernel size of 4, stride of 1, and padding of 0. Subsequent layers increase the spatial resolution by a factor of 2 while reducing the depth. The final layer generates a 64x64 RGB image using transposed convolution with a kernel size of 4, stride of 2, and padding of 1. Batch normalization and ReLU activation are applied throughout the model. The minibatch discrimination technique is used to enhance image diversity. The output image is resized to 64x64 using bilinear interpolation.

Discriminator

The discriminator is consists of convolutional layers, each followed by batch normalization and LeakyReLU activation. These layers progressively reduce the spatial dimensions while increasing the number of channels in the tensor. The resulting tensor is flattened and passed through a fully connected layer, which maps it to a lower-dimensional feature space. A minibatch discrimination module is added in the middle of the discriminator architecture to enhance diversity among generated samples by comparing features within a minibatch.

Datasets

The model is trained on about 5000 images of kpop idol's face image. The figures in the images is not evenly distributed, so the output result may be similar to some specific celebrity.

References

[1] Radford, A., Metz, L., & Chintala, S. (2015). Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434.

[2] Salimans, T., Goodfellow, I., Zaremba, W., Cheung, V., Radford, A., & Chen, X. (2016). Improved techniques for training gans. Advances in neural information processing systems, 29.

[3] Minibatch discrimination module in PyTorch. (n.d.). Gist. https://gist.github.com/t-ae/732f78671643de97bbe2c46519972491

Developers

About

Generate my unique k-pop idol's portrait

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published