-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducibility question #4
Comments
Hi @ytaek-oh. Thanks for your interest! It is weird to see this, can you try using --context_length 248? We use 248 for the main results for a fair comparison to the baselines (e.g. Long-CLIP). |
have you reproduced the result in paper? I've tried both context_length=248 and context_length=200, the result got the same. The "ckpt.pt" is download from https://huggingface.co/mderakhshani/TULIP/tree/main |
@5RJ Setting the context length to 248 while switching the activation layer to QuickGELU led to a slight improvement, though it did not fully bridge the gap.
|
Hi, can you please check the value of the flag --lit_style? It should be false. |
Thanks for sharing the code and checkpoint for the insightful work!
I ran the evaluation code using the provided checkpoint and got a slight gap between my evaluation results and those reported in the paper.
Specifically, I tested retrieval tasks on coco 2017 and Urban1k, and obtained following results:
Reported Results in the Paper (TULIP, ViT-L-14):
COCO:
I2T R@1: 62.6, I2T R@5: 84.7
T2I R@1: 46.1, T2I R@5: 71.1
Urban1k:
I2T R@1: 90.1, T2I R@1: 91.1
I used the following command for evaluation:
Am I missing any configuration or steps in the evaluation process?
Thanks,
The text was updated successfully, but these errors were encountered: