Getting started with part segmentation of CAST on PartImageNet

We employ open-vocabulary segmentation to predict parts-and-whole labels based on the CAST segments. In this paper, we apply the OVSeg framework OVSeg, which predicts labels for masked images, except we did not fine-tune CLIP on these masked images.

We provide jupyter notebooks for predicting segmentation maps and conducting evaluations. We save the segmentations first and reuse them in subsequent evaluations.

Installation

SAM

> pip install git+https://github.com/facebookresearch/segment-anything.git

OVSeg. Follow the installation guide of OVSeg.

Data preparation

Download the PartImageNet_OOD dataset from the github. Decompress the zip file and put them under ./data

Expected directory layout

./data/PartImageNet
            |------ annotations/
            |          |------ val.json
            |          |------ train.json
            |          |------ test.json
            |
            |------ images/
                       |------ val/
                       |------ train/
                       |------ test/

Apply CAST for open-vocabulary segmentation

Save hierarchical segmentation:

CAST
ViT

Visualize open-vocabulary segmentation:

CAST/ViT
SAM

Evaluate open-vocabulary segmentation:

CAST/ViT
SAM

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GETTING_STARTED_PartImageNet.md

GETTING_STARTED_PartImageNet.md

Getting started with part segmentation of CAST on PartImageNet

Installation

Data preparation

Expected directory layout

Apply CAST for open-vocabulary segmentation

Files

GETTING_STARTED_PartImageNet.md

Latest commit

History

GETTING_STARTED_PartImageNet.md

File metadata and controls

Getting started with part segmentation of CAST on PartImageNet

Installation

Data preparation

Expected directory layout

Apply CAST for open-vocabulary segmentation