This project was developed during my internship at Institut Pasteur Guadeloupe, as part of my master's degree in Bioinformatics.

The aim of this project is to perform a metabarcoding analysis of Illumina FASTQ sequencing data for the 16S, 18S and ITS barcodes. The pipeline assigns taxonomy to the sequences found in each sample and supports analysis of sample composition.

The Bash and R programs clean the FASTQ data (primer removal, end truncation, etc.) and perform taxonomic assignment, using tools such as cutadapt, FIGARO and the R package DADA2.
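
As an illustration of the primer-removal step, here is a minimal sketch of how a paired-end cutadapt call can be assembled. It is not the project's actual script: the function name `build_cutadapt_command`, the file names and the primer sequences are assumptions chosen for the example. Only standard cutadapt options are used (`-g`/`-G` for the 5' primers of R1/R2, `-o`/`-p` for the outputs, `--discard-untrimmed`).

```python
import subprocess


def build_cutadapt_command(r1_in, r2_in, r1_out, r2_out, fwd_primer, rev_primer):
    """Build a paired-end cutadapt command that trims 5' primers.

    Hypothetical helper for illustration; all arguments are supplied by the caller.
    """
    return [
        "cutadapt",
        "-g", fwd_primer,        # 5' primer expected at the start of R1
        "-G", rev_primer,        # 5' primer expected at the start of R2
        "--discard-untrimmed",   # drop read pairs in which no primer was found
        "-o", r1_out,            # trimmed R1 output
        "-p", r2_out,            # trimmed R2 output
        r1_in,
        r2_in,
    ]


# Example with placeholder file names and commonly used 16S V4 primers
# (515F / 806R); the primers used in this project may differ.
cmd = build_cutadapt_command(
    "sample_R1.fastq.gz",
    "sample_R2.fastq.gz",
    "sample_R1.trimmed.fastq.gz",
    "sample_R2.trimmed.fastq.gz",
    "GTGCCAGCMGCCGCGGTAA",
    "GGACTACHVGGGTWTCTAAT",
)
print(" ".join(cmd))

# To actually run it (requires cutadapt to be installed):
# subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems when file names contain spaces.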

The Python program analyses the abundance tables produced by the DADA2 analysis and interactively displays various statistics (alpha and beta diversity) and graphs.
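
To make the diversity statistics concrete, here is a minimal sketch of one alpha-diversity measure (the Shannon index) and one beta-diversity measure (Bray-Curtis dissimilarity) computed from an abundance table. The choice of these two measures, the function names and the toy counts are assumptions for illustration; the project's Python program may compute them differently or offer other indices.

```python
import math


def shannon_index(counts):
    """Shannon index H = -sum(p_i * ln(p_i)) over taxa with non-zero counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two samples with the same taxa order."""
    if len(a) != len(b):
        raise ValueError("samples must have the same number of taxa")
    denominator = sum(a) + sum(b)
    if denominator == 0:
        return 0.0
    return sum(abs(x - y) for x, y in zip(a, b)) / denominator


# Toy abundance table (hypothetical data): one list of counts per sample,
# with columns corresponding to the same taxa in the same order.
abundance = {
    "sample_A": [10, 10, 0, 0],
    "sample_B": [5, 5, 5, 5],
}

for name, counts in abundance.items():
    print(name, "Shannon:", round(shannon_index(counts), 4))

print("Bray-Curtis A vs B:",
      bray_curtis(abundance["sample_A"], abundance["sample_B"]))
```

Alpha diversity summarises each sample on its own, while beta diversity compares pairs of samples, so the first function takes one row of the table and the second takes two.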