Skip to content

abcwcm/Espinosa-Carrasco2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Bioinformatic methods for Espinosa-Carrasco et al.

The quality of the sequenced reads was assessed with FastQC and QoRTs. Unless stated otherwise, plots involving high- throughput sequencing data were created using R and ggplot2.

RNA-seq data:

Alignment and gene counting

DNA sequencing reads were aligned with default parameters to the mouse reference genome (GRCm38.p6) using STAR. Gene expression estimates were obtained with featureCount using composite gene models (union of the exons of all transcript isoforms per gene) from Gencode (version M17).

DEGs

DEGs were determined using DESeq2 with Wald tests with a q-value cutoff of 0.05 (Benjamini–Hochberg correction).

Heatmaps

Heatmaps were created using DESeq2 normalized read counts after variance stabilizing transformation of genes identified as differentially expressed by DESeq2. Rows were centered and scaled.

Pathway and GO term enrichment analyses

Gene set enrichment analyses were done using fgsea with the fgseaMultilevel function. Genes were ranked based on the DESeq2 Wald statistic. Gene sets with an FDR < 0.05 were considered enriched. Gene ontology analysis was performed on up- and down-regulated DEGs using the clusterProfiler R package. Only GO categories enriched using a 0.05 false discovery rate cutoff were considered.

ATAC-seq data:

Alignment and creation of peak atlas

Reads were aligned to the mouse reference genome (version GRCm38) with BWA-backtrack v0.7.17. Post-alignment filtering was done with samtools and Picard tools to remove unmapped reads, improperly paired reads, nonunique reads, and duplicates. Peaks were called with MACS2, ,and peaks with adjusted P values smaller than 0.01 were excluded. Consensus peak sets were generated for each condition if a peak was found in at least two replicates. Reproducible peaks from each condition were merged with DiffBind to create an atlas of accessible peaks, which was used for downstream analyses. The peak atlas was annotated using the ChIPseeker and TxDb.Mmusculus.UCSC.mm10.knownGene. Blacklisted regions were excluded (https://sites.google.com/site/anshulkundaje/projects/blacklists).

Differentially accessible regions

Regions where the chromatin accessibility changed between different conditions were identified with DESeq2, and only Benjamini–Hochberg corrected P values < 0.05 were considered statistically significant.

Coverage files

Genome coverage files were normalized for differences in sequencing depth (RPGC normalization) with bamCoverage from deepTools. Replicates were averaged together using UCSC-tools bigWigMerge. Merged coverage files were used for display in Integrated Genomics Viewer.

Heatmaps

Heatmaps based on the differentially accessible peaks identified between TCROT1(-CD4) and TCROT1(+CD4) were created using profileplyr and ComplexHeatmap, by binning the region +/− 1kb around the peak summits in 20bp bins. To improve visibility, bins with read counts greater than the 75th percentile + 1.5*IQR were capped at that value.

Motif analyses

For identifying motifs enriched in differentially accessible peaks, we utilized HOMER via marge. HOMER was run separately on hyper- or hypo-accessible peaks with the flags -size given and -mask. Motifs enriched in hyper- or hypo-accessible peaks were determined by comparing the rank differences (based on P value). The consensus peakset identified by DiffBind was used as the background set.

Software used

Software Version Authors URL
STAR v2.6.0c Dobin et al. GitHub
featureCounts v1.6.2 Liao et al. Subread
R v4.1.0 R Core Team CRAN
DESeq2 v1.34.0 Love et al. Bioconductor
ggplot2 v3.4.1 Wickham CRAN
pheatmap v1.0.12 Kolde CRAN
fgsea v1.20.0 Korotkevich et al. Bioconductor
clusterProfiler v4.2.2 Guangchuang et al. Bioconductor
BWA v0.7.17 Li et al. Bio-BWA
samtools v1.8 Danecek et al. HTSlib
Picard tools v2.18.9 Broad Institute GitHub
MACS v2.1.1 Zhang et al. GitHub
DiffBind v3.4.11 Stark and Brown Bioconductor
ChIPseeker v1.30.3 Wang et al. Bioconductor
deepTools v3.1.0 Ramírez et al. Documentation
bigWigMerge UCSC KentUtils Kuhn et al. GitHub
IGV - Robinson et al. Documentation
profileplyr v1.10.2 Carroll and Barrows Bioconductor
ComplexHeatmap v2.15.1 Gu Bioconductor
HOMER v4.10-0 Heinz et al. HOMER
marge v0.0.4 Amezquita GitHub

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published