Alignment process #896
Replies: 3 comments 1 reply
-
Hi David, Thanks for your question. The Z matrix represents the latent variables of the model. That is, the individual In fact, the way the EM algorithm (and similarly the VBEM algorithm) works is that, since the Note that this defines the expected value of the where So, as to the question of where lightweight mapping comes into play. Note that when we evaluate I note that the above is a slight simplification, as it describes the basic EM, but salmon can use either this or the Variational Bayesian EM. Second, if you look at the treatment in e.g. the RSEM paper, you'll note that they discuss Anyway, I hope this helps. The TLDR to your question is that you're right, the Then, given these expected Best, |
Beta Was this translation helpful? Give feedback.
-
Thank you very much for your comprehensive response. It totally make sense. Is that right to say the main job here is an "good" estimation of Cheers, |
Beta Was this translation helpful? Give feedback.
-
I very appreciate your help for explaining that comprehensively 🙏. Sincerely, |
Beta Was this translation helpful? Give feedback.
-
Hi @rob-p ,
I looked at discussions here and found very useful questions and answers.
I would like to thank you for making such a profound package.
I have a question about the alignment process and I will be very thankful if you could help me.
I understand that Salmon does "lightweight" alignment using probabilities and distributions.
However, in the paper related to Salmon which describes about Salmon's method, it defines a Matrix Z, with entries z_ij, which is 1 if fragment j belongs to transcript i. How does Salmon detect if it belongs to t_i at first step?
I understand that P(f_j | t_i, z_ij=0)=0 uniformly as the paper describes.
How does Salmon detects entries of Z?
Does it do a matching first (finding the entries of Z first) and then lightweight alignment based on the fragments distributions for that transcript?
I mean, we know that p( f_j | t_i, z_ij=1) is the defined based on the distribution of the fragments in t_i ( in other word, probability of drawing a fragment from t_i, given that it is from t_i.
When we say given that it is from t_i, to me sounds like there is a "pre checking" for matchings and then calculating the probabilities under the condition that it belongs to t_i ( the distribution of the fragments that match to t_i)
I understand that when f_j belongs to t_i, then we calculate the bias terms to see if it is " really" from t_i.
Or, does Salmon pick entries of Z randomly without actually checking that if fragment f_j matches to transcript t_i?
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5600148/
Thanks in advance for your help.
David
Beta Was this translation helpful? Give feedback.
All reactions