-
Good afternoon. For input, I am using VST (variance stabilizing transformation) counts from DESeq2 (C), the "PANDA-ready motif mappings" provided by Dr. Glass (W), and STRING-db physical interactions without a score threshold (P). I see edge weight scores split most strongly by the presence (1) or absence (0) of the edge in the motif prior. as well as for individual sample edge weights in LIONESS. Does this distribution of values impact the ability to determine the "real" and/or significantly different edges, in PANDA via the inverse cumulative distribution (pandaDiffEdges R command) or in LIONESS via LIMMA (linear modeling)? Additionally, I just discovered this article where it is clarified that it is expected for the binary nature of the motif prior to cause this strong separation in the weights. It also suggests a possible change in the analysis with the substitution of continuous values instead of binary in the motif prior. Since the posting of that article (~3 years ago), do you all know of any updated suggestions for determining input values for W other than the their suggestion of motif distance to TSS? Thank you all in advance, |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 6 replies
-
Hi Luke! I think this question goes along the lines of discussion #304. I agree that we see this often in our data, where the motif is sort of leading the edge weights, but I do not have a definitive answer on how to filter them or threshold them. I think @kimberlyglass might have a better idea about thresholding and I am also summoning @marouenbg for the discussion about non-binary priors. |
Beta Was this translation helpful? Give feedback.
-
Hi @LPotter21 , |
Beta Was this translation helpful? Give feedback.
Hi @LPotter21 ,
Thank you for your question. On continuous motif priors, we did use a continuous motif prior to mitigate some of the constraining effects of binary priors.
You can find the motif here and you're welcome to use it but we're planning to release a deeper analysis of these motifs and how they relate to TF binding being a 'continuous' phenomenon rather than on/off. These motifs are based on the distance to TSS, but I've seen some biophysical models that compute TF affinity to the underlying promoter sequence to derive a continuous score.
Best,
Marouen