-
Notifications
You must be signed in to change notification settings - Fork 43
Home
metaGEM
is a Snakemake-based workflow designed to generate high quality metagenome assembled genomes from short read paired-end data, reconstruct genome scale metabolic models, and perform community metabolic interaction simulations on high performance computing clusters. In the metaGEM paper we apply the workflow to lab culture, human gut, plant associated, bulk soil, and TARA oceans metagenomic samples.
The fastest way for you to start submitting jobs with metaGEM
is to follow this updated setup guide. Alternatively, you may have a look at the older quickstart guide or the manual setup guide.
A detailed tutorial of the metaGEM
core workflow is demonstrated here.
You can find answers to FAQs here.
The workflow below is adapted from Figure 1 of the metaGEM
manuscript:
In the following pages you can find documentation regarding the implementation of each tool within the Snakefile. For a more hands on demonstration of how to use metaGEM
please see the tutorial above.
- Quality filter reads with fastp
- Assembly with megahit
- Draft bin sets with CONCOCT, MaxBin2, and MetaBAT2
- Refine & reassemble bins with metaWRAP
- Taxonomic assignment with GTDB-tk
- Relative abundances with bwa
- Reconstruct & evaluate genome-scale metabolic models with CarveMe and memote
- Species metabolic coupling analysis with SMETANA
The following image is adapted from supplementary figure 2 of the metaGEM manuscript, highlighting the advantage of using the bin refinement and reassembly technique. It can also serve as somewhat anecdotal yet quantitative comparison of binning tool performances:
metaGEM: reconstruction of genome scale metabolic models directly from metagenomes
Francisco Zorrilla, Filip Buric, Kiran R Patil, Aleksej Zelezniak
Nucleic Acids Research, 2021; gkab815, https://doi.org/10.1093/nar/gkab815
- Quality filter reads with fastp
- Assembly with megahit
- Draft bin sets with CONCOCT, MaxBin2, and MetaBAT2
- Refine & reassemble bins with metaWRAP
- Taxonomic assignment with GTDB-tk
- Relative abundances with bwa
- Reconstruct & evaluate genome-scale metabolic models with CarveMe and memote
- Species metabolic coupling analysis with SMETANA