About the output values? #85
Replies: 3 comments
-
Hi @OnlyBelter, thanks for reaching out! What I'd recommend is to use the low dimensional Scanorama embeddings (from See this comment and the surrounding discussion for more clarity: scverse/anndata#458 (comment). Basically, the modified values from Hope that helps! |
Beta Was this translation helpful? Give feedback.
-
This is interesting and a useful discussion. I'm experiencing failure using corrected expression features for supervised learning. The same model converges/generalizes when trained and evaluated on uncorrected, preprocessed data. Is there evidence that corrected features will produce comparable results for supervised learning? Is my assumption wrong that corrected gene expression values reflect the distributional characteristics of uncorrected features? I understand the embeddings may be more useful for this task but I am specifically wondering about the utility of "corrected features." |
Beta Was this translation helpful? Give feedback.
-
@wconnell I think the consensus in the field now is that corrected features and the integrated embeddings are useful when computing distances among cells, e.g., for constructing a KNN graph for clustering, visualization, etc. When interpreting the actual feature values, e.g., for differential expression analysis, it's best to be conservative. I'm not sure about your specific supervised task and how you are defining "failure" etc., but I'd also recommend a conservative approach to dealing with potential batch effect. |
Beta Was this translation helpful? Give feedback.
-
Hi, very impressive tool.
What's the output should be expected?
If I input log-transformed data, does it output log-transformed data? I mean if I want to use it for downstream analysis, such as differential expression analysis, then how can I calculate fold change (log or non-log)?
What I have seen is that the output values are much smaller than what I input. How can I recover the corrected values back as much as like input data (such as counts per million, CPM)?
Beta Was this translation helpful? Give feedback.
All reactions