-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
very cross-contaminated samples #131
Comments
What sample type is this? Is it a sample type in which many samples may be effectively "sterile" (i.e. have no true resident microbiome)? The alternative approach to removing contaminants is to instead try to "rule things in" as non-contaminants, and only work in the fraction of ASVs that have solid evidence aren't just contamination. The intermingling of your extraction negatives (which are vastly preferable to PCR negatives) and real samples in terms of read depth, and the long flat distribution of decontam scores after a small but noticeable mode at score ~ 1 (strong evidence not a contaminant) would be consistent with the scenario I described. |
The DNA is from isolated unicellular organisms? And you are amplifying some gene from there? Or this is some other sample type, and you are amplifying a gene that is a marker for some class of small unicellular organisms of which you are interested?
That would be a red flag for this sequencing data perhaps. When you look at the ASVs with the highest scores (i.e. the best evidence are not contaminants) what do you see? What you expect? Or, skin microbes?
I don't know what you mean here. "batches" are not related to different types of negative controls. The extraction controls are better for decontam. |
Thank you so much for your reply again!
It is prokaryotic DNA extracted from a marine eukaryotic organisms. So basically I am studying the bacteria that live inside those eukaryotes. The sequences are one variable region of the 16S rRNA gene, which is typically used to sequence prokaryotes.
The vast majority of them are expected marine bacteria, which is good, but then for example I have an ASV with pval of 0.96 that is a skin microbe (with 100% of taxonomic assignment). The same way, marine cyanobacteria should not be considered as contaminants but I have one, for example, which has a pval of 0.03.
Sorry I expressed myself wrong. So you recommend running the decontamination using only the extraction negatives? |
If the vast majority look correct, then you are pretty safe to go forward. One can do additional filtering based on taxonomy, or at this point I would mostly do post-hoc investigation on any interesting ASVs that pop up in your analyses. It is worth double-checking taxonomic assignments with wide spectrum BLAST as well, in case it is a taxonomic misassignment.
Either just the Extraction negatives, or pooling the PCR negatives and Extraction negatives together. I don't have strong recommendation of one approach over the other given the numbers/design you described. In general though, more Extraction negatives and less PCR negatives (which only captures part of the contamination process) is better. |
Thank you so much four your advice, I will follow it :) |
Hi!
I am trying to use this package to remove as much contaminants as I can from an already cross contaminated samples.
For context, this data comes from NovaSeq sequencing, so I am working with ESVs. I have pcr and extraction negatives.
The number of reads is very variable, and as you can see the negatives have sometimes the same or more than the samples:
The histogram and the prevalence plot (0.5) look like this:
At this point I am a bit lost on what to do... I can see it will be very difficult for me to remove many contaminants, but I would like to remove as much as I can.. What threshold would you recommend?
any help would be much appreciated!
The text was updated successfully, but these errors were encountered: