The paper proposes two architectural variants of CAST: one for image classification and one for segmentation. Both architectures can be pre-trained with self-supervised learning. In the paper, we use the MoCo-v3 framework for all self-supervised learning experiments.
We provide bash scripts for running the self-supervised experiments. By default, we use CAST-S. You can use a larger model, e.g. CAST-B, by replacing `-a cast_small` with `-a cast_base` in the bash scripts. A sketch of the kind of command these scripts wrap appears after the list below.
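For example, the substitution can be made in place with `sed`; this is just a sketch, using one of the scripts below as the target:

```bash
# Sketch: switch the backbone from CAST-S to CAST-B by swapping the
# architecture flag inside a training script (pick the one you run).
sed -i 's/-a cast_small/-a cast_base/' scripts/moco/train_imagenet1k_cast.sh
```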
- Self-supervised learning of CAST on ImageNet-1K:
> bash scripts/moco/train_imagenet1k_cast.sh
- Self-supervised learning of CAST on ImageNet-100:
> bash scripts/moco/train_imagenet100_cast.sh
- Self-supervised learning of ViT on ImageNet-1K:
> bash scripts/moco/train_imagenet1k.sh
- Self-supervised learning of ViT on ImageNet-100:
> bash scripts/moco/train_imagenet100.sh
- In the paper, we ablate the efficacy of our `Graph Pooling` module by replacing it with the `Token Merging` module. Both models use `superpixel` tokens. Run the following bash script to reproduce our ablation study of the `Token Merging` module on ImageNet-100 (see the diff sketch after this list):
> bash scripts/moco/train_imagenet100_tome.sh
- Self-supervised learning of CAST on COCO:
> bash scripts/moco/train_coco_cast.sh
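The scripts above wrap a MoCo-v3-style training launch. As a rough sketch of the kind of command inside them, assuming the standard MoCo-v3 entry point `main_moco.py` and its usual flags (the hyperparameter values here are placeholders, not the paper's settings; consult the scripts for the real ones):

```bash
# Sketch only: a MoCo-v3-style launch, assuming the standard
# main_moco.py entry point; the values below are placeholders, and
# the actual settings live in the scripts under scripts/moco/.
python main_moco.py \
  -a cast_small -b 1024 \
  --epochs=100 --warmup-epochs=10 \
  --moco-m-cos --moco-t=.2 \
  --dist-url 'tcp://localhost:10001' \
  --multiprocessing-distributed --world-size 1 --rank 0 \
  /path/to/imagenet
```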
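For the `Token Merging` ablation, a quick way to see exactly how the ablation run differs from the CAST baseline is to diff the two ImageNet-100 scripts:

```bash
# Sketch: compare the Token Merging ablation script against the CAST
# baseline to see which settings (e.g. the pooling module) differ.
diff scripts/moco/train_imagenet100_cast.sh scripts/moco/train_imagenet100_tome.sh
```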