Why is the Welch t-test used for ranking velocity genes when data is not normally distributed #710
Unanswered
paula-tataru
asked this question in
Q&A
Replies: 1 comment
-
@VolkerBergen, any comments/thoughts on this? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
According to the API, you use the Welch t-test to rank the velocity genes.
https://scvelo.readthedocs.io/scvelo.tl.rank_velocity_genes/#scvelo.tl.rank_velocity_genes
This test assumes that data is normally distributed. I got curious to see if the data meets this requirement, so I looked at the pancreas data set and tested for normality the velocity estimates for the first cluster for the top velocity gene according to the ranking.
Here are the two plots:
And the p-value is about 1.75e-9.
The data looks to be far from being normally distributed. Why was the Welch t-test used?
/Paula
Beta Was this translation helpful? Give feedback.
All reactions