Transformers. This repository contains scripts for building transformer models for different data modalities, e.g: image, video, audio.