The official implementation of GPFM. This project is based on the following great projects:
- Huggingface lib is under preparing
from models import get_model, get_custom_transformer
from PIL import Image
model = get_model('GPFM', 0, 1)
transformer = get_custom_transformer('GPFM')
img = Image.open('docs/demo_pathology.jpg') # we prefer image with size of 512*512 (extracted from 40X)
img = transformer(img)
img = img[None] # to 4D tensor
img = img.cuda() # load images into gpu
feat = model(img) # [N, 1024]
Take the UBC_OCEAN dataset as an example
Step 1: segment the tissue of the WSI
cd Patching
bash get_coor_scripts/UBC_OCEAN.sh
Step 2: extract the features using foundation model. Currently, we support ResNet, Ctranspath, PLIP, Phikon, CONCH, UNI, and GPFM.
# download the weights of released GPFM and put it at `models/ckpts/`.
cd root_dir_of_project
bash extract_scripts/UBC_OCEAN.sh
Click here to download pretrained GPFM.
If you want to use other foundation models, please download the weights and put them at models/ckpts/
. You can see the models/__init__.py
for the supported model. You can also simply define your own model (see models/phikon.py
).
see https://huggingface.co/majiabo/GPFM for details. (Note: under preparing)
If you use Slurm system, you could use pretrain/train_script.sh
to pretrain your model. Note that the default Experts used in this project is UNI
, CONCH
, and Phikon
. Please remeber to download the weights of these models and put them at pretrain/dinov2/weights/
.
If you want to change the expert, you could modify the code at pretrain/dinov2/train/ssl_meta_arch.py
(see line 57).
This part is base on the CLAM project.
Step 3: run mil model to perform WSI classificatino or survival analysis.
cd root_dir_of_project
bash train_scripts/UBC_OCEAN.sh
For WSI classification with train-val-test (7:1:2) splits, please use splits712
For WSI classification with K-fold validation, please use splits
For Survival Analysis tasks, please use splits82
see HistGen for details.
Step 1: extract features using FM.
cd ROI_tasks
bash ROI_tasks/scripts/extract_feat.sh
- ROI classification
bash ROI_tasks/scripts/linear.sh
- ROI retrieval
bash ROI_tasks/scripts/roi.sh
If you find this repository useful, please consider giving a star ⭐ and citation 🦖:
@article{GPFM,
title={Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation},
author={Ma, Jiabo and Guo, Zhengrui and Zhou, Fengtao and Wang, Yihui and Xu, Yingxue and Cai, Yu and Zhu, Zhengjie and Jin, Cheng and Jiang, Xinrui and Lin, Yi and Han, Anjia and others},
journal={arXiv preprint arXiv:2407.18449},
year={2024}
}