(*Note: This is an ongoing project, hence the full code and strategy have not yet been open-sourced by the author.)
We present a new multi-task learning strategy using Vision Transformers (ViTs). Our approach exploits the class token and self-attention mechanism of Vision Transformers to train multiple tasks through a single ViT, improving efficiency under a limited computational budget.
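
Since the full strategy is not yet released, here is a minimal, hypothetical sketch of one plausible reading of the idea: a shared ViT encoder with one learnable class token per task, where each task's head reads only its own token after self-attention. The class names, dimensions, and layer choices below are illustrative assumptions, not the author's actual implementation.

```python
# Hypothetical sketch: one class token per task, a single shared ViT
# encoder, and a lightweight head per task. Assumed design, not the
# author's released code.
import torch
import torch.nn as nn


class MultiTaskViT(nn.Module):
    def __init__(self, num_tasks, task_num_classes, img_size=224,
                 patch_size=16, dim=384, depth=6, heads=6):
        super().__init__()
        num_patches = (img_size // patch_size) ** 2
        # Shared patch embedding: conv with stride = patch size.
        self.patch_embed = nn.Conv2d(3, dim, patch_size, stride=patch_size)
        # One learnable class token per task instead of a single [CLS].
        self.task_tokens = nn.Parameter(torch.zeros(1, num_tasks, dim))
        self.pos_embed = nn.Parameter(
            torch.zeros(1, num_patches + num_tasks, dim))
        layer = nn.TransformerEncoderLayer(dim, heads, dim * 4,
                                           batch_first=True, norm_first=True)
        self.encoder = nn.TransformerEncoder(layer, depth)
        self.norm = nn.LayerNorm(dim)
        # A small classification head per task.
        self.task_heads = nn.ModuleList(
            [nn.Linear(dim, c) for c in task_num_classes])

    def forward(self, x):
        b = x.size(0)
        patches = self.patch_embed(x).flatten(2).transpose(1, 2)  # (B, N, D)
        tokens = self.task_tokens.expand(b, -1, -1)               # (B, T, D)
        # Class tokens attend to all patches (and each other) through
        # the shared self-attention layers.
        z = torch.cat([tokens, patches], dim=1) + self.pos_embed
        z = self.norm(self.encoder(z))
        # Task t's prediction comes from its own attended class token.
        return [head(z[:, t]) for t, head in enumerate(self.task_heads)]


if __name__ == "__main__":
    model = MultiTaskViT(num_tasks=2, task_num_classes=[10, 5])
    outs = model(torch.randn(2, 3, 224, 224))
    print([o.shape for o in outs])  # [torch.Size([2, 10]), torch.Size([2, 5])]
```

Under this reading, all tasks share the encoder's parameters and compute, so adding a task costs only one extra token and one linear head rather than a full backbone.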