Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reproducing proteingym #13

Open
sohviluukkonen opened this issue Sep 17, 2024 · 1 comment
Open

Reproducing proteingym #13

sohviluukkonen opened this issue Sep 17, 2024 · 1 comment

Comments

@sohviluukkonen
Copy link

Hi!

I'm trying to run your proteingym pipeline, and I'm having a couple of issues. I'm using the proteingym_branch and your ProtMamba_long_fondation model.

Firstly, in proteingym.py you are supposed to import prepare_target from tests/proteingym/utils.py but that function doesn't exist in there, so I'm currently importing it from ProtMamba_ssm/utils.py. Is that ok, or are this functions supposed to be different?

Secondly, in ProtMamba_ssm/modules.py it seems that you have updated the mamba version as you have changed
from mamba_ssm.modules.mamba_simple import Block from mamba_ssm.ops.triton.layer_norm import RMSNorm, layer_norm_fn, rms_norm_fn
to
from mamba_ssm.modules.block import Block from mamba_ssm.ops.triton.layernorm import RMSNorm, layer_norm_fn, rms_norm_fn.
I'm still using the old Mamba version. Do you think this is problematic?

With these two (minor) changes I'm getting very different results from the ones reported in the paper with an average Spearman correlation of ~0.10. Here are Spearman values per protein that I find: ProtMamba_long_foundation_proteingym_msalength_200_spearman.json
Any idea where this huge difference could come from?

Thanks for your help!

@CyrilMa
Copy link
Collaborator

CyrilMa commented Sep 25, 2024

Hi @sohviluukkonen,

I have updated the code to make it more clear (https://github.com/Bitbol-Lab/ProtMamba-ssm/blob/proteingym_branch/tests/proteingym/proteingym.py). This should solve a lot of issues. In particular, one problem that could happen is if you don't have checkpoint_mixer = False when you load the model.

Otherwise, it's not clear for me if using the old version or not is problematic, maybe we could investigate that if your problem is not solved later.

Hope this will help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants