BEEM

BEEM is an approach to infer models for microbial community dynamics based on metagenomic sequencing data (16S or shotgun-metagenomics). It is based on the commonly used generalized Lotka-Volterra modelling (gLVM) framework. BEEM uses an iterative EM algorithm to simultaneously infer scaling factors (microbial biomass) and model parameters (microbial growth rate and interaction terms) from longitudinal data and can thus work directly with the relative abundance values that are obtained with metagenomic sequencing.

Note: BEEM stands for Biomass Estimation and model inference with an Expectation Maximization algorithm. We have since extended the BEEM framework to work with cross-sectional data (BEEM-static, available as a separate R package).
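To make the relationship between absolute abundance, biomass and relative abundance concrete, here is a minimal sketch of a two-taxon gLV system in base R. It is not BEEM's implementation: the parameter values, the forward-Euler integrator and the helper name `glv_step` are all assumptions chosen for illustration.

```r
# Hypothetical 2-taxon gLV system; every value here is an illustrative assumption.
mu   <- c(0.8, 0.5)                                  # growth rates
beta <- matrix(c(-1.0, -0.3,
                 -0.2, -0.8), 2, 2, byrow = TRUE)    # beta[i, j]: effect of taxon j on taxon i

# One forward-Euler step of dx_i/dt = x_i * (mu_i + sum_j beta_ij * x_j)
glv_step <- function(x, mu, beta, dt = 0.01) {
  x + dt * x * (mu + as.vector(beta %*% x))
}

x <- c(0.1, 0.1)                                     # absolute abundances at t = 0
for (step in 1:1000) x <- glv_step(x, mu, beta)

biomass  <- sum(x)       # the scaling factor that sequencing does not report
rel_abun <- x / biomass  # the relative abundances that sequencing does report
```

Sequencing yields only `rel_abun`. The model is defined on `x`, which is why the scaling factor has to be inferred alongside the growth and interaction parameters.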

Dependencies

BEEM was written in R (>=3.3.1) and requires the following packages:

foreach
doMC (currently works only on macOS and Linux)
lokern
pspline
monomvn

You can install BEEM as an R package using devtools:

devtools::install_github('csb5/beem')
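Before installing, it can help to check which of the listed dependencies are already present. The sketch below uses only base R. `missing_pkgs` is a hypothetical helper name, and the commented-out `install.packages` call assumes the packages are available from CRAN.

```r
# Packages listed under Dependencies (doMC is not available on Windows).
deps <- c("foreach", "doMC", "lokern", "pspline", "monomvn")

# Return the subset of `pkgs` that cannot be loaded (hypothetical helper).
missing_pkgs <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_pkgs(deps)
if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)         # uncomment to install (assumes CRAN)
}
# devtools::install_github('csb5/beem')  # then install BEEM itself
```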

Input data

The input files for BEEM should have the same format as described in the manual for MDSINE. The following two files are required by BEEM:

OTU table

This should be a tab-delimited text file whose first row has the sample IDs and the first column has the OTU IDs (or taxonomic annotations). Each row should then contain the relative abundance of one OTU across all samples and each column should contain the relative abundances of all OTUs in that sample.
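As an illustration of this layout, the sketch below builds a toy table in base R, converts counts to relative abundances and writes it as tab-delimited text. The OTU and sample names and all values are made up. The top-left header cell is left empty here, which is an assumption; consult the MDSINE manual for the exact header convention.

```r
# Toy table: 3 OTUs (rows) x 4 samples (columns); values are invented.
otu <- matrix(c(10, 20, 30, 40,
                 5,  5, 10, 20,
                85, 75, 60, 40),
              nrow = 3, byrow = TRUE,
              dimnames = list(c("OTU1", "OTU2", "OTU3"),
                              c("S1", "S2", "S3", "S4")))

# Convert each sample (column) to relative abundances that sum to 1.
rel <- sweep(otu, 2, colSums(otu), "/")

# Write with sample IDs in the first row and OTU IDs in the first column.
otu_file <- tempfile(fileext = ".txt")
write.table(rel, otu_file, sep = "\t", quote = FALSE, col.names = NA)

# Read the file back to confirm the layout survives a round trip.
back <- as.matrix(read.table(otu_file, header = TRUE, sep = "\t",
                             row.names = 1, check.names = FALSE))
```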

Metadata

The metadata file should be a tab-delimited text file with the following columns:

sampleID    isIncluded    subjectID    measurementID

sampleID: sample IDs matching the first row of the OTU table
isIncluded: whether the sample should be included in the analysis (1-include, 0-exclude)
subjectID: indicator for which biological replicate the sample belongs to
measurementID: time in standardized units from the start of the experiment
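A metadata table with these four columns could be assembled and checked along the following lines. The sample IDs, subjects and time points are invented, and the consistency checks are a suggestion rather than something BEEM is known to enforce.

```r
# Hypothetical sample IDs; these must match the first row of the OTU table.
sample_ids <- c("S1", "S2", "S3", "S4")

metadata <- data.frame(
  sampleID      = sample_ids,
  isIncluded    = c(1, 1, 0, 1),   # 1 = include, 0 = exclude
  subjectID     = c(1, 1, 2, 2),   # biological replicate
  measurementID = c(0, 1, 0, 1),   # time since the start of the experiment
  stringsAsFactors = FALSE
)

# Basic consistency checks before writing the file.
stopifnot(
  !anyDuplicated(metadata$sampleID),
  all(metadata$isIncluded %in% c(0, 1)),
  all(sample_ids %in% metadata$sampleID)
)

meta_file <- tempfile(fileext = ".txt")
write.table(metadata, meta_file, sep = "\t", quote = FALSE, row.names = FALSE)
```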

Sample data

We have provided several sample input files that were also analyzed in our manuscript.

Data from Props et al. (2016)

OTU count table: vignettes/props_et_al_analysis/counts.sel.txt
Metadata: vignettes/props_et_al_analysis/metadata.sel.txt

Data from Gibbons et al. (2017)

OTU count table: vignettes/gibbons_et_al_analysis/{DA,DB,M3,F4}.counts.txt
Metadata: vignettes/gibbons_et_al_analysis/{DA,DB,M3,F4}.metadata.txt

Usage

Basic Usage (R commands)

## Load functions
library(beem)
## Read inputs (OTU table and metadata, formatted as described above)
counts <- read.table('counts.txt', header=FALSE, row.names=1)
metadata <- read.table('metadata.txt', header=TRUE)
## Run BEEM on the OTU table read above
res <- EM(dat=counts, meta=metadata)
## Estimate biomass and write one value per line
biomass <- biomassFromEM(res)
write.table(biomass, 'biomass.txt', col.names=FALSE, row.names=FALSE, quote=FALSE)
## Estimate gLVM parameters and write them as a tab-delimited table
gLVparameters <- paramFromEM(res, counts, metadata)
write.table(gLVparameters, 'gLVparameters.txt', col.names=TRUE, row.names=FALSE, sep='\t', quote=FALSE)

Output format

The parameters estimated by BEEM are returned as an R data.frame (a table) with the following columns, in order:

parameter_type: growth_rate or interaction
source_taxon: source taxon for interaction (NA if parameter_type is growth_rate)
target_taxon: target taxon for interaction or growth rate
value: parameter value
significance: confidence level of the inferred interaction (only meaningful for interactions)
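For downstream work it is often convenient to split this long-format table into a growth-rate table and an interaction matrix. The sketch below does so on a mock data.frame that follows the documented columns. The taxon names, values and significance scores are invented, and the choice of rows for targets and columns for sources is a convention of this example only.

```r
# Mock parameter table with the documented columns; all values are invented.
params <- data.frame(
  parameter_type = c("growth_rate", "growth_rate", "interaction", "interaction"),
  source_taxon   = c(NA, NA, "OTU1", "OTU2"),
  target_taxon   = c("OTU1", "OTU2", "OTU2", "OTU1"),
  value          = c(0.8, 0.5, -0.3, 0.1),
  significance   = c(NA, NA, 0.9, 0.2),
  stringsAsFactors = FALSE
)

growth <- params[params$parameter_type == "growth_rate", ]
inter  <- params[params$parameter_type == "interaction", ]

# Interaction matrix: rows = target taxon, columns = source taxon.
taxa <- sort(unique(params$target_taxon))
beta <- matrix(0, length(taxa), length(taxa), dimnames = list(taxa, taxa))
beta[cbind(inter$target_taxon, inter$source_taxon)] <- inter$value
```

Pairs that do not appear in the table stay at 0 in `beta`. A filter on `significance` could be applied to `inter` before filling the matrix, with a threshold suited to the analysis.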

Analyses in the manuscript

The commands for reproducing the analyses reported in the manuscript are presented as Jupyter notebooks: (1) a demo of the gLVM simulation, (2) the analysis of the Props et al. data and (3) the analysis of the Gibbons et al. data.

Citation

C Li, K R Chng, J S Kwah, T V Av-Shalom, L Tucker-Kellogg & N Nagarajan. (2019). An expectation-maximization algorithm enables accurate ecological modeling using longitudinal metagenome sequencing data. Microbiome.

Contact

Please direct any questions or feedback to Chenhao Li (lich@gis.a-star.edu.sg) and Niranjan Nagarajan (nagarajann@gis.a-star.edu.sg).
