Transformer-based text to image synthesis An educational project, inspired by OpenAI DALL-e project. The global intention is to explore the scalability of this approach to smaller datasets References: Generating Diverse High-Fidelity Images with VQ-VAE-2 Generating long sequences with sparse transformers DALL-E open source repository DALL-E blogpost