RC-ViTGAN

Dataset

RC500: https://drive.google.com/file/d/1x3tZmPw0IS9fxoKXel4xGT38nK8zd9hR/view?usp=sharing

We randomly selected 50 of the 583 test images as the test set for the comparative experiments:

test_dataset: https://drive.google.com/file/d/1Y3_4tuNcGbzj5bYr2JWhniVvm6W6J-Gb/view?usp=drive_link

We also selected 250 of the 583 test images as the test set for the ablation experiments, to better verify the role of each component:

original_images: https://drive.google.com/file/d/1i9hv2yrG8cImo7In3KUiFnusn7GNC-7e/view?usp=drive_link

Experimental details

Environment

This project uses Python 3.7.12, PyTorch 1.8.0, torchvision 0.9.0, and CUDA 11.1.
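A quick way to confirm that a local setup matches these versions is to compare the installed version strings against the ones listed above. The sketch below is only an illustration: the helper `find_version_mismatches` and the `EXPECTED` dictionary are hypothetical names, not part of this repository, and the example values are written by hand so the snippet runs without PyTorch installed.

```python
# Expected versions, copied from the Environment section above.
EXPECTED = {
    "python": "3.7.12",
    "torch": "1.8.0",
    "torchvision": "0.9.0",
    "cuda": "11.1",
}


def find_version_mismatches(found, expected=EXPECTED):
    """Return {name: (found, expected)} for every entry that differs.

    Local build suffixes such as "+cu111" are ignored, so "1.8.0+cu111"
    counts as a match for "1.8.0". Missing entries are reported as None.
    """
    mismatches = {}
    for name, want in expected.items():
        have = found.get(name)
        normalized = have.split("+")[0] if have is not None else None
        if normalized != want:
            mismatches[name] = (have, want)
    return mismatches


# Hand-written example values (assumption: a CUDA 11.1 build of PyTorch).
example = {
    "python": "3.7.12",
    "torch": "1.8.0+cu111",
    "torchvision": "0.9.0+cu111",
    "cuda": "11.1",
}
print(find_version_mismatches(example))
```

On a real machine, the `found` dictionary could be filled from `platform.python_version()`, `torch.__version__`, `torchvision.__version__`, and `torch.version.cuda`.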

Hardware conditions

We train the model on four GeForce RTX 3060 GPUs.

Hyperparameters

batch size (bs) = 8

learning rate (lr) = 0.0001

beta for EMA = (0.0, 0.99)

Supervised pre-training: max_steps = 100000

Adversarial training: max_steps = 20000
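For reference, the values above can be gathered into a single configuration object. This is a minimal sketch and not the repository's own config format: the class `TrainConfig` and all field names are hypothetical, and the `total_steps` property assumes that the two training stages run back to back.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class TrainConfig:
    # Field names are hypothetical; the values are the ones listed above.
    batch_size: int = 8                           # bs
    learning_rate: float = 1e-4                   # lr
    ema_beta: Tuple[float, float] = (0.0, 0.99)   # "beta for EMA", kept as given
    pretrain_max_steps: int = 100_000             # supervised pre-training
    adversarial_max_steps: int = 20_000           # adversarial training

    @property
    def total_steps(self) -> int:
        """Steps across both stages, assuming they run one after the other."""
        return self.pretrain_max_steps + self.adversarial_max_steps


cfg = TrainConfig()
print(cfg.total_steps)
```

Freezing the dataclass keeps the settings from being changed by accident between the two stages.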

The quality of references has a significant impact on the model

When the colors of the reference differ significantly from those of the input image, the recolored image shows color distortion and looks unnatural.

When an unrealistic color palette is provided, the model generates semantically unreasonable images, such as trees recolored blue.
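One rough way to spot a risky reference before recoloring is to measure how far its average color lies from that of the input image. The sketch below is a crude heuristic under stated assumptions, not something the model or this repository uses: the function names, the mean-RGB distance, and the `0.3` threshold are all hypothetical choices made for illustration.

```python
import math


def mean_rgb(pixels):
    """Average color of an iterable of (r, g, b) tuples with values in 0..255."""
    pixels = list(pixels)
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))


def color_discrepancy(input_pixels, reference_pixels):
    """Euclidean distance between the two mean RGB colors, scaled to 0..1."""
    a = mean_rgb(input_pixels)
    b = mean_rgb(reference_pixels)
    dist = math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    max_dist = math.sqrt(3 * 255 ** 2)  # distance between black and white
    return dist / max_dist


# Toy data: greenish "tree" pixels versus a bluish reference.
tree_pixels = [(34, 139, 34), (50, 160, 50)]
blue_reference = [(30, 60, 200), (20, 40, 180)]

THRESHOLD = 0.3  # hypothetical cut-off, chosen only for this example
score = color_discrepancy(tree_pixels, blue_reference)
if score > THRESHOLD:
    print("reference colors differ strongly from the input image")
```

A mean-color distance ignores where colors appear in the image, so it can only flag gross mismatches; a perceptual color space such as LAB would likely give a more meaningful distance.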
