This is the GitHub repository for our MIDL 2025 paper submission, *Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings* (https://openreview.net/forum?id=pv1D6CWgrY).
This repo contains a Jupyter notebook with the code to reproduce our results. The embeddings of the NLST dataset can be obtained through Google's CT Foundation tool: https://research.google/blog/taking-medical-imaging-embeddings-3d/. The demographic information (in CSV format) is available from TCIA: https://www.cancerimagingarchive.net/collection/nlst/.
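Before running the notebook, the embeddings and the demographic CSV need to be joined on the NLST patient ID. The snippet below is a minimal sketch of that step only; the file names (`embeddings.npz`, `nlst_demographics.csv`) and the `pid` key/column are placeholder assumptions, not the notebook's actual paths.

```python
# Minimal sketch: join CT Foundation embeddings with the NLST demographic table
# from TCIA. File names and the "pid" key/column are assumed placeholders.
import numpy as np
import pandas as pd

emb = np.load("embeddings.npz")                    # assumed keys: "pid", "embedding"
emb_df = pd.DataFrame(emb["embedding"],
                      columns=[f"f{i}" for i in range(1408)])  # 1408-D CT Foundation embeddings
emb_df["pid"] = emb["pid"]

demo_df = pd.read_csv("nlst_demographics.csv")     # demographic CSV downloaded from TCIA

# Inner join so every embedding row carries its demographic labels.
data = emb_df.merge(demo_df, on="pid", how="inner")
print(data.shape)
```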
Self-supervised learning has revolutionized medical imaging by enabling efficient and generalizable feature extraction from large-scale unlabeled datasets. Recently, self-supervised foundation models have been extended to three-dimensional (3D) computed tomography (CT) data, producing compact, information-rich 1408-dimensional embeddings that achieve state-of-the-art performance on downstream tasks such as intracranial hemorrhage detection and lung cancer risk forecasting. However, these embeddings have been shown to encode demographic information, such as age, sex, and race, which poses a significant risk to the fairness of clinical applications.
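One simple way to see this encoding is to fit a linear probe that predicts a demographic attribute directly from the raw embeddings; performance well above chance means the attribute is encoded. The sketch below does this for sex with scikit-learn. It is an illustration only: the file names and the NLST `gender` coding (1 = male, 2 = female) are assumptions, and this is not the paper's evaluation protocol.

```python
# Hedged sketch: linear probe for sex on the raw 1408-D embeddings.
# File names and the "gender" coding are assumptions (see the loading sketch above).
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

emb = np.load("embeddings.npz")
X = np.asarray(emb["embedding"])                                   # (n_subjects, 1408)
demo = pd.read_csv("nlst_demographics.csv").set_index("pid")
y = (demo.loc[emb["pid"], "gender"].to_numpy() == 2).astype(int)   # assumed: 1=male, 2=female

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0, stratify=y)
probe = LogisticRegression(max_iter=2000).fit(X_tr, y_tr)
print("sex probe AUC:", roc_auc_score(y_te, probe.decision_function(X_te)))
```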
In this work, we propose a Variational Autoencoder (VAE)-based adversarial debiasing framework that transforms these embeddings into a new latent space where demographic information is no longer encoded, while maintaining performance on critical downstream tasks. We validated our approach on the NLST lung cancer screening dataset, demonstrating that the debiased embeddings effectively eliminate multiple encoded demographic attributes and improve fairness without compromising predictive accuracy for lung cancer risk at 1-year and 2-year intervals. Additionally, our approach ensures the embeddings are robust against adversarial bias attacks. These results highlight the potential of adversarial debiasing techniques to ensure fairness and equity in clinical applications of self-supervised 3D CT embeddings, paving the way for their broader adoption in unbiased medical decision-making.
Conceptual illustration of our framework: the original 3D CT Foundation Model embedding encodes demographic information, which can lead to bias in downstream tasks. Our VAE debiases multiple demographic attributes while preserving downstream task performance.
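The sketch below shows the core training idea in PyTorch: a VAE encoder/decoder reconstructs the original 1408-D embedding while an adversary tries to predict a demographic attribute from the latent code, and the VAE is updated to defeat that adversary. Layer sizes, loss weights, and the alternating update scheme are illustrative assumptions, not the exact configuration in the notebook.

```python
# Illustrative sketch of VAE-based adversarial debiasing (not the notebook's
# exact architecture or hyperparameters).
import torch
import torch.nn as nn
import torch.nn.functional as F

EMB_DIM, LATENT_DIM, N_CLASSES = 1408, 256, 2   # N_CLASSES: e.g. sex; sizes are assumed

class VAE(nn.Module):
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(EMB_DIM, 512), nn.ReLU())
        self.mu = nn.Linear(512, LATENT_DIM)
        self.logvar = nn.Linear(512, LATENT_DIM)
        self.dec = nn.Sequential(nn.Linear(LATENT_DIM, 512), nn.ReLU(),
                                 nn.Linear(512, EMB_DIM))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        return self.dec(z), mu, logvar, z

vae = VAE()
adversary = nn.Sequential(nn.Linear(LATENT_DIM, 128), nn.ReLU(),
                          nn.Linear(128, N_CLASSES))
opt_vae = torch.optim.Adam(vae.parameters(), lr=1e-4)
opt_adv = torch.optim.Adam(adversary.parameters(), lr=1e-4)
lam, beta = 1.0, 1e-3   # adversarial and KL weights (assumed)

def train_step(x, demo_label):
    """One alternating update on a batch of embeddings x and demographic labels."""
    # 1) Train the adversary to predict the demographic attribute from z.
    with torch.no_grad():
        _, _, _, z = vae(x)
    adv_loss = F.cross_entropy(adversary(z), demo_label)
    opt_adv.zero_grad(); adv_loss.backward(); opt_adv.step()

    # 2) Train the VAE to reconstruct the embedding, stay close to the prior,
    #    and make the latent uninformative for the adversary (maximize its loss).
    recon, mu, logvar, z = vae(x)
    recon_loss = F.mse_loss(recon, x)
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    vae_loss = recon_loss + beta * kl - lam * F.cross_entropy(adversary(z), demo_label)
    opt_vae.zero_grad(); vae_loss.backward(); opt_vae.step()
    return recon_loss.item(), adv_loss.item()
```

After training, the latent representation (e.g., the mean `mu`) serves as the debiased embedding for downstream tasks such as lung cancer risk prediction; see the notebook for the actual pipeline and hyperparameters.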