p-value and q-values are the same for top hit #23

Open
kw10 opened this issue Jun 29, 2023 · 4 comments

@kw10

kw10 commented Jun 29, 2023

I am using the latest R version and I only have 1 mutually exclusive pair, for which the p and q values are the same. I am using the FDR method DBH. Why are the p and q values the same? When I look at the next best result, the p value and q value are different (but not significant). For example:

number of pairs tested: 190
proportion of true null hypotheses: 1
number of significant pairs at a maximum FDR of 1 : 190
            gene1         gene2    p.value    q.value
39         GENEXXX        GENEYYY 0.08714019 0.08714019
42         GENEZZZ        GENEYYY 0.30103083 0.91313014

Thanks
Kim

@scanisius
Member

The cases in which I have seen this were situations with limited statistical power due to low mutation frequencies. If you look at the number of mutated tumours for each gene, do you see that all genes except for GENEXXX and GENEYYY have low mutation frequencies?

@kw10
Author

kw10 commented Jun 30, 2023

There are 41 samples in total; one gene is mutated in 22/41 (53.7%) and the other gene is mutated in 4/41 (9.8%).

@scanisius
Member

If I understand you correctly, these are the mutation frequencies for GENEXXX and GENEYYY. And do all other genes have lower frequencies? What you are observing is most likely the result of low statistical power. The q.value estimate is correct. The intuitive explanation is that the multiple testing correction does not penalize your first gene pair, because none of the other gene pairs can attain a p value lower than 0.087, even if none of their mutations co-occur in the same tumours. The discrete Benjamini-Hochberg procedure takes this into account when estimating q values.
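
To make the intuition above concrete, here is a minimal Python sketch of how low mutation counts put a floor on the p value a pair can reach. It is only an illustration under stated assumptions: it uses a simple one-sided hypergeometric test as a stand-in, not the test DISCOVER itself applies, so the numbers will not match the package's output. The gene names and all mutation counts other than 22/41 and 4/41 are hypothetical, and `min_attainable_p` is a helper made up for this example.

```python
from itertools import combinations
from math import comb


def min_attainable_p(n_samples, n_mut_a, n_mut_b):
    """Smallest p-value a one-sided hypergeometric exclusivity test can give.

    The test statistic is the number of samples mutated in both genes, and
    the p-value is P(overlap <= observed). It is smallest when the overlap
    takes its lowest possible value, which is usually zero.
    """
    k_min = max(0, n_mut_a + n_mut_b - n_samples)
    return (comb(n_mut_a, k_min) * comb(n_samples - n_mut_a, n_mut_b - k_min)
            / comb(n_samples, n_mut_b))


# Hypothetical mutation counts in 41 samples. Only 22 and 4 come from the
# thread; the remaining counts are invented for illustration.
N_SAMPLES = 41
mutation_counts = {"GENE_A": 22, "GENE_B": 4, "GENE_C": 3, "GENE_D": 2, "GENE_E": 2}

# Best-case (lowest possible) p-value for every gene pair.
floors = {
    (g1, g2): min_attainable_p(N_SAMPLES, mutation_counts[g1], mutation_counts[g2])
    for g1, g2 in combinations(mutation_counts, 2)
}

# Pairs whose best-case p-value is at or below the top hit's p-value
# (0.087 is borrowed from the thread purely as a reference level).
top_p = 0.087
reachable = [pair for pair, floor in floors.items() if floor <= top_p]

for pair, floor in sorted(floors.items(), key=lambda item: item[1]):
    print(pair, round(floor, 4))
print("pairs able to reach p <=", top_p, ":", reachable)
```

In this toy setup only the pair with the two most frequently mutated genes can reach the reference level, even with zero co-occurrence. A correction that accounts for discreteness therefore has essentially nothing to penalize at that level, which is consistent with a q value equal to the p value for the top hit.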

@kw10
Author

kw10 commented Jul 4, 2023

Yes, all other genes have lower frequencies. Thanks for the explanation!
