-
Notifications
You must be signed in to change notification settings - Fork 6
Common experience & wisdom
swo edited this page Sep 30, 2014
·
2 revisions
Some things to keep in mind here are:
- Were yours the only samples in the sequencing lane? If not, then you will expect that a smaller fraction of the total reads will map to your known barcodes.
- How many reads did you get? The forward and reverse
.fastq
files should be the same length. You can get its total length withwc -l
and divide by 4 (but don't do this on the head node!); or, you can get the first 1000 entries (head -4000 your.fastq
), check its size (ls -lah
), and then extrapolate the number of reads in the big fastq.- New MISEQ runs should have about 25 million reads total. Does this include ones that go to phiX?
- What do your qualities look like? The BMC data includes a file
...fastqc_report.html
that shows how the quality of the read depends on nucleotide position. - For MiSeq, you should be mostly in the green up to about 100 nucleotides.
- What fraction of your total reads were dropped?
- How many index reads mapped to known barcodes?
- Were the most poorly-matched index reads the most lowly abundant?