Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Network (DCGAN) implementation

This is an implementation of the research paper "Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks" written by Alec Radford, Luke Metz, Soumith Chintala.

Inspiration

Unsupervised Learning with CNNs has not recieved it's due attention. In thier paper, Alec Radford, Luke Metz and Soumith Chintala have tried to "bridge the gap between the success of CNNs for supervised learning and unsupervised learning". DCGAN, as introduced in this paper, is an architecture that demonstrates the strength of CNNs as a candidate for Unsupervised Learning.

Dataset

The original paper had used three datasets for training the DCGAN namely - Large-scale Scene Understanding (LSUN) (Yu et al., 2015), Imagenet-1k and a newly assembled Faces dataset. However due to computational and other limitations, we have used Large-scale CelebFaces Attributes (CelebA) Dataset.

Citation

@inproceedings{liu2015faceattributes,
 title = {Deep Learning Face Attributes in the Wild},
 author = {Liu, Ziwei and Luo, Ping and Wang, Xiaogang and Tang, Xiaoou},
 booktitle = {Proceedings of International Conference on Computer Vision (ICCV)},
 month = {December},
 year = {2015} 
}

Guidelines to download, setup and use the dataset

The CelebA dataset may be downloaded here as a file named img_align_celeba.zip.

Please write the following commands on your terminal to extract the file in the proper directory

$ mkdir celeba
$ unzip </path/to/img_align_celeba.zip> -d </path/to/celeba>

The resulting directory structure should be:

/path/to/celeba
    -> img_align_celeba
        -> 188242.jpg
        -> 173822.jpg
        -> 284702.jpg
        -> 537394.jpg
           ...

Note: You may use any other dataset of your choice. However, please ensure that the directory structure remains the same for the code to be compatible with it.

Model Components

The DCGAN Architecture has the following components:

The Generator uses fractional-strided convolutions followed by batch normalisation and ReLU activation for all layers except for the last that uses tanh activation.
The Discriminator uses strided convolutions followed by batch normalisation and LeakyReLU activation for all layers except for a single sigmoid output.

Implementation Details

The given batch of images are first upsampled through the fractional-strided convolutions of the Generator.
Batch Normalization is applied to boost the learning process.
The normalized images are passed thorugh a bounded activation function (ReLU) that helps in a faster convergence of the model and allows it to cover the entire spatial extent of the images.
The images enter the Discriminator after passing through a final tanh layer. The discriminator uses strided convolutions for downsampling the images following a set of similar actions as the Generator except that LeakyReLU is used.

Installation and Usage Guidelines

To use the repo and generate your own images please follow the guidelines below

Cloning the Repository:

  git clone https://github.com/indiradutta/DC_GAN

Entering the directory:
```
  cd DC_GAN
```
Setting up the Python Environment with dependencies:
```
  pip install -r requirements.txt
```
Running the file for training:
```
  python3 train_gen.py
```

The train_gen.py file takes the path to the dataset as "/content/dcgan/celeba" dataset by default. Please initialize the DCGAN module with your desired dataset path and train as:

dc_gan = DCGAN(data = <path/to/dataset>)
img_list, G_losses, D_losses = dc_gan.train(<path/to/save/model>)

Incase you have either no GPU (0) or more than 1 GPU on your machine, consider changing the ngpu parameter while initializing the DCGAN module with your desired dataset path and train as:

dc_gan = DCGAN(data = <path/to/dataset>, ngpu = <number of GPUs available>)
img_list, G_losses, D_losses = dc_gan.train(<path/to/save/model>)

Check out the standalone demo notebook and run dc_gan here.

Note: Is is advisable to use a GPU for training because training the DCGAN is computationally very expensive.

Running the file for inference:
```
  python3 test.py
```

The test.py file takes the path to the pre-trained as "model.pth" by default. Please initialize the Deep_Conv_GAN with the desired path to the model and get thr inferences as:

Deep_Conv_GAN.inference(Deep_Conv_GAN, set_weight_dir='model.pth' , set_gen_dir='<path/to/save/inferences>')

Check out the standalone demo notebook and run inferences here.

Results from Implementation

Plot to see how D and G’s losses changed during training

Batches of fake data from G

Training progression of G

demo.mp4

Please check out the result documentation.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
config		config
dcgan		dcgan
demo		demo
docs		docs
results		results
LICENSE		LICENSE
README.md		README.md
model.py		model.py
requirements.txt		requirements.txt
test.py		test.py
train_gen.py		train_gen.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Network (DCGAN) implementation

Inspiration

Dataset

Citation

Guidelines to download, setup and use the dataset

Model Components

Implementation Details

Installation and Usage Guidelines

Results from Implementation

Contributors

About

Releases

Packages

Contributors 2

Languages

License

indiradutta/DC_GAN

Folders and files

Latest commit

History

Repository files navigation

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Network (DCGAN) implementation

Inspiration

Dataset

Citation

Guidelines to download, setup and use the dataset

Model Components

Implementation Details

Installation and Usage Guidelines

Results from Implementation

Contributors

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages