Positive splicing events are not being detected #228

Open
ym-chen opened this issue Feb 24, 2023 · 2 comments


ym-chen commented Feb 24, 2023

Hi,
I am using LeafCutter to analyze a batch of samples. One sample has a splicing event in the region chr7:116411552-116415165, and the other samples are negative. However, I can't find any splicing event in that region in LeafCutter's result files, and I would like to understand how LeafCutter processes this data. I ran LeafCutter with the following parameters:

leafcutter_cluster.py -m 2 -M 1 -l 500000
leafcutter_ds.R --min_samples_per_intron=1 --min_samples_per_group=0 --timeout=300 --min_coverage=0

In the first dataset, I can find the junctions in positive.junc and the corresponding intron cluster in perind_numers.counts.gz. The zeros in perind_numers.counts.gz are the junction counts of the negative samples (11 negative samples in total). But in DAS_cluster_significance.txt, the cluster is filtered out. It seems that some parameter is responsible for this. How should I change my parameters?

## positive.junc
chr7    116411708       116414934       .       7       -
chr7    116411708       116411902       .       4       -
chr7    116411708       116414934       .       2       +

## perind_numers.counts.gz
chr7:116411708:116411903:clu_63826_NA 0 0 0 0 0 4 0 0 0 0 0 0
chr7:116411708:116414935:clu_63826_NA 0 0 0 0 0 9 0 0 0 0 0 0

## DAS_cluster_significance.txt
chr7:clu_63826_NA       <=1 sample with coverage>0      NA      NA      NA      NA      MET
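The status text above suggests the cluster was dropped because only one sample has any reads in it. As a rough check, the counts rows can be tallied per cluster to see how many samples have nonzero coverage. This is only a sketch of that tally on the two rows pasted above, not LeafCutter's actual filtering code; the function name `samples_with_coverage` is made up for illustration.

```python
# Rows copied from perind_numers.counts.gz above (intron ID, then one count per sample).
counts_lines = """\
chr7:116411708:116411903:clu_63826_NA 0 0 0 0 0 4 0 0 0 0 0 0
chr7:116411708:116414935:clu_63826_NA 0 0 0 0 0 9 0 0 0 0 0 0
""".splitlines()


def samples_with_coverage(lines):
    """Return {cluster_id: number of samples whose summed cluster count is > 0}.

    Hypothetical helper: it mirrors the wording of the status message,
    not the real leafcutter_ds.R implementation.
    """
    totals = {}
    for line in lines:
        fields = line.split()
        intron_id = fields[0]
        counts = [int(x) for x in fields[1:]]
        # The cluster ID is the last colon-separated part of the intron ID.
        cluster = intron_id.rsplit(":", 1)[1]
        previous = totals.get(cluster, [0] * len(counts))
        totals[cluster] = [a + b for a, b in zip(previous, counts)]
    return {c: sum(1 for t in per_sample if t > 0)
            for c, per_sample in totals.items()}


covered = samples_with_coverage(counts_lines)
print(covered)  # {'clu_63826_NA': 1}
```

With a single positive sample, the cluster has exactly one sample with coverage, which is consistent with the "<=1 sample with coverage>0" status.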

In the second dataset, I can find the junctions in positive.junc, but I can't find the intron cluster in positive.junc.Batch.sorted.gz or perind_numers.counts.gz. I set --minreads=1, so why is it still not clustered?

## positive.junc
chr7    116412043       116414934       .       1       +
chr7    116411708       116414934       .       11      -
chr7    116411708       116414934       .       11      +
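To know which intron IDs to search for in the clustering output, the junction rows can be pooled by coordinates. The sketch below assumes two things inferred only from the first dataset in this issue, not from LeafCutter's source: reads on both strands are summed (7 + 2 = 9), and the intron end in the counts file is the .junc end plus one (116411902 becomes 116411903). The name `pooled_intron_counts` and its `end_offset` parameter are hypothetical.

```python
from collections import Counter

# Rows copied from the second positive.junc above:
# chrom, start, end, name, read count, strand.
junc_lines = """\
chr7    116412043       116414934       .       1       +
chr7    116411708       116414934       .       11      -
chr7    116411708       116414934       .       11      +
""".splitlines()


def pooled_intron_counts(lines, end_offset=1):
    """Sum junction reads across strands, keyed as chrom:start:end+offset.

    Assumption: strand pooling and the +1 end offset are inferred from the
    first dataset in this thread; they may not hold for other versions or options.
    """
    pooled = Counter()
    for line in lines:
        chrom, start, end, _name, reads, _strand = line.split()[:6]
        pooled[f"{chrom}:{start}:{int(end) + end_offset}"] += int(reads)
    return dict(pooled)


expected_ids = pooled_intron_counts(junc_lines)
print(expected_ids)
# {'chr7:116412043:116414935': 1, 'chr7:116411708:116414935': 22}
```

These IDs can then be searched for in positive.junc.Batch.sorted.gz and perind_numers.counts.gz to see at which step the junctions disappear.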

This is very important to me, and I hope you can help.

goldenflaw (Collaborator) commented Feb 24, 2023 via email

I believe there have been other reports that some LeafCutter parameters do not work as intended, including when the minimum number of reads is reduced too much. We are rewriting LeafCutter in Python 3 to add functionality and bug fixes, but we only expect to release it late this year.

ym-chen (Author) commented Mar 2, 2023

I checked the log file but still can't find the exact reason. I may do more tests in the future. Thanks for your reply.
