Running pretrained convvit on larger image sizes #18

Open
JonathanBhimaniBurrows opened this issue Sep 27, 2022 · 1 comment

@JonathanBhimaniBurrows

Hi,
I am looking to see how well the pretrained base model runs on my own dataset, but the current model is configured for an image size of 224.
In the original MAE code, the 'interpolate_pos_embed' function lets the user resize the positional embedding to allow for larger input images.
In your linear probing code, that same call is commented out, and (obviously) it doesn't work the same way, since there are multiple positional embeddings to take care of.
Do you have a function that allows the pretrained model to run on different image sizes?
Thanks

@gaopengpjlab

To use an image resolution other than 224, you need to modify the interpolate_pos_embed function. We will release a new codebase that allows fine-tuning at different resolutions.
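For readers who need something in the meantime, here is a minimal sketch of the kind of change described above. It is not the maintainers' implementation. It assumes each positional embedding is a learned tensor of shape `(1, num_extra_tokens + grid * grid, dim)` laid out on a square grid, and it resizes that grid with bicubic interpolation, in the spirit of MAE's `interpolate_pos_embed`. The helper names `resize_pos_embed` and `resize_all_pos_embeds`, the `num_extra_tokens` parameter, and the checkpoint key in the usage note are all hypothetical; check the real key names and shapes in your checkpoint before relying on this.

```python
import torch
import torch.nn.functional as F


def resize_pos_embed(pos_embed: torch.Tensor, new_grid: int,
                     num_extra_tokens: int = 0) -> torch.Tensor:
    """Resize a square grid of learned position embeddings (hypothetical helper).

    pos_embed: (1, num_extra_tokens + old_grid**2, dim)
    returns:   (1, num_extra_tokens + new_grid**2, dim)
    """
    # Tokens such as a class token carry no spatial position; keep them as-is.
    extra = pos_embed[:, :num_extra_tokens]
    grid = pos_embed[:, num_extra_tokens:]
    dim = grid.shape[-1]

    old_grid = int(round(grid.shape[1] ** 0.5))
    if old_grid * old_grid != grid.shape[1]:
        raise ValueError("position embedding does not form a square grid")
    if old_grid == new_grid:
        return pos_embed

    # (1, N, dim) -> (1, dim, H, W) so that F.interpolate sees a 2D feature map.
    grid = grid.reshape(1, old_grid, old_grid, dim).permute(0, 3, 1, 2)
    grid = F.interpolate(grid, size=(new_grid, new_grid),
                         mode="bicubic", align_corners=False)
    # Back to (1, new_grid**2, dim).
    grid = grid.permute(0, 2, 3, 1).reshape(1, new_grid * new_grid, dim)
    return torch.cat([extra, grid], dim=1)


def resize_all_pos_embeds(state_dict, new_grids, num_extra_tokens=0):
    """Apply resize_pos_embed to several checkpoint entries.

    new_grids maps a checkpoint key (assumed name) to the target grid side
    for that embedding, since a multi-stage model may hold more than one.
    """
    out = dict(state_dict)
    for key, new_grid in new_grids.items():
        out[key] = resize_pos_embed(state_dict[key], new_grid, num_extra_tokens)
    return out
```

As a usage example, if the final-stage tokens sit on a 14 x 14 grid at 224 input (which would be the case for an overall patch stride of 16, an assumption here), then a 384 input would need a 24 x 24 grid, so one might call `resize_all_pos_embeds(checkpoint, {"pos_embed": 24})` before `load_state_dict`. Any other positional embeddings in the model would each need their own entry with the grid size that matches their stage.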
