Code associated with "Domestication shapes the pig gut microbiome and immune traits from the scale of lineage to population." https://doi.org/10.1111/jeb.14227
Description: These files will allow you to replicate the bioinformatic and statistical analyses we used to assess the impact of domestication on the pig gut microbiome and immune state.
The following files should be used in order to process raw 16S rRNA gene sequencing data:
- Cutadapt to remove primers
- Dada2 to learn errors, dereplicate, infer ASVs, merge amplicons, make sequence tables, and assign taxonomy
- Decontam to remove contaminants, mitochondria, and chloroplasts
- Tables to create final counts and taxonomy tables
- Tree to create a phylogenetic tree
The output will be an ASV decontaminated counts table, an ASV taxonomy table, and a phylogenetic tree. The R markdown file (16S_analyses.Rmd) will allow you to replicate any statistical analyses and visualizations for 16S rRNA gene sequencing. The excel file, fecal_metadata_Apr2022.xlsx, contains all the metadata needed for these analyses.
###############################################################################################
The following files should be used in order to process raw shotgun metagenomic data:
- Quality filtering and trimming using Fastp
- Map sequences against reference genome
- Assemble to contigs using megahit
- Gene prediction with prodigal
- Dereplication with cd-hit
- Align to UniProt and taxonomic classification
- Annotation with EggNog, CARD, and VFDB
- Salmon gene quantification
Outputs will be annotation files and quantification files. The .RMD file will allow for replication of statistical analyses and visualizations. The excel file, "dom_cat_metagenomic_metadata.xlsx," contains all the metadata associated with the samples.
The SNP.R file along with the plink.vcf, PIG2.gds, and SNP_metadata.xlsx files can be used to replicate all SNP analyses.