Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Batch-specific contaminant detection or not? #150

Open
zztin opened this issue Aug 7, 2024 · 0 comments
Open

Batch-specific contaminant detection or not? #150

zztin opened this issue Aug 7, 2024 · 0 comments

Comments

@zztin
Copy link

zztin commented Aug 7, 2024

Hi Benjamin,

Thank you for developing this package. We are using it to exclude contaminants for the purpose of identifying sepsis-causative bacteria in newborn foals.
Can we ask your advice on whether to use a batched analysis or not in our analysis?

We have 32 foals in total, across 6 isolation batches, and 3 library prep batches (or xxx batches per library prep kit). We prefer to do a non-batched analysis AND a batched analysis and exclude all contaminants that were found in any of the comparisons (frequency-based method). However, some batches are too small (n = 3), thus we excluded these batches and performed the isContaminants with batch parameter on the remaining batches. The batch sizes are still quite small (n=4-11). On the other hand, we don't want to miss batch-specific contaminants.

  1. Would you recommend a combined approach like we proposed in this case? Or a batched approach/non-batch specific approach?

Attached is a figure with the histogram distribution of p-values of all species that was present in at least 6 samples. On the left, p-value distribution without batch-specific detection. On the right, p-value distribution with batch-specific detection ('minimum').

  1. It is hard to find a trough in the p-value distribution. We suspect that it is because we are working with low-biomass sample (with little bacterial DNA in the samples). To be lenient, we proposed to use 0.25 as a p-value cutoff to exclude contaminants. However, we would also like to hear your input.
Screenshot 2024-08-07 at 15 40 51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant