-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding DESC to data integration benchmarking #28
Comments
Because DESC's dependency on other packages, such as TensorFlow,scanpy and keras, we are updating and testing our desc algorithm to be compatible with tensorflow2.0, scanpy 1.4.5+. Hopefully, the latest algorithm can be uploaded into GitHub and PyPI tomorrow. The single resolution (0.8 or 1.0) is ok for DESC. Thanks. |
Hi @eleozzr, That is great news! If you have it online by tomorrow, we will be able to add it to the benchmark on time :). I am looking forward to a |
Hi LuckyMD, I have already updated our desc algorithm.
Hope this helps. Thanks. |
hi @eleozzr, Thanks for the updates. However, I see that the |
Also, what is the limit to python 3.7 compatability? |
All the script was tested in python3.5 or python3.6, but I think the script works in python3.7. |
Are you still planning to fix the install requirements in: Lines 21 to 23 in 9acb047
Then I could just install from |
Could you install by |
I hadn't tried to install via |
Install for 2.0.3 worked, thanks :). Is there a reason you are limited to keras 2.1? Just a question... I can work with that as well ;). |
Hi @eleozzr, I have another question about using DESC. I can't see where you pass batch information to the algorithm so that it can perform data integration. Do you explicitly integrate data across batches or just produce a low-dimensional embedding that is less affected by batch? If DESC doesn't explicitly do data integration, but only produces a low-dimensional embedding which is less affected by batch effects than the high-dimensional data, maybe we shouldn't be comparing it to other data integration tools? I guess that comparison might not be fair to DESC. What do you think? |
Sometimes, the version of TensorFlow and Keras needs to match. So I directly limit the version to avoid unnecessary issues due to the unmatch of Tensorflow and Keras. |
The only batch information for
Hope this helps. |
Thanks for the example! The parametrization is slightly different to your example notebook. I will use something closer to this if you think this is a better default parametrization for datasets of 10k+ cells? I will then add batch-specific scaling and then make a PR for the Benchmarking data integration repo here. Would it be okay if I tagged you in that PR so you could check that DESC is used as you think is correct? |
If you only need the embedding of desc, you can set And it will be okay if you tagged me. |
Yes, I've already changed this :). I will test the code, make a PR and then you can tell me if I'm doing something stupid ;). |
Thing seem to be running for me now, thanks! I just quickly wanted to highlight 2 things:
|
Thanks. |
It would be great if you could look over our PR here: theislab/scib#131 Thanks! |
Hi @eleozzr, |
It would be great to get an input on the above question of how to turn off saving the weights. |
Hi @eleozzr,
We were thinking about adding DESC to our benchmark of data integration tools (https://github.com/theislab/scib). We would be running our own pre-processing for the input to DESC for this, which is reliant on Scanpy version 1.4.5+. Do you think it would be possible to use just the
desc.train()
function if we remove the Scanpy requirement and install via github? Would this also be okay for using Keras 2.2.4?Also, to compare the methods properly we would not be able to use the clustering output you provide, but instead we would use the embedding at a default clustering resolution (resolution=0.8 as in your tutorial). Would this be a suitable way of evaluating DESC?
Kind regards,
The text was updated successfully, but these errors were encountered: