In this tutorial, we will guide you through the process of bulk RNA-seq data analysis (transcriptomic data analysis) using pair end samples from the NCBI GEO dataset. We will utilize the Ensembl database (GRCh38.p14) for the reference genome and reference annotation files (GFF, GTF).
For this Tutorial we have using this in the Anaconda Environment in linux
-
Download Anaconda from the website based on your system version
-
Installation
bash anaconda.sh
-
Then Create a new environment:
conda create -n Rnaseq_analysis
-
Activate the created environment
conda activate Rnaseq_analysis
Make sure you have the following tools installed in the working conda environment(Rnaseq_analysis) :
conda install -c bioconda fastqc
conda install -c bioconda trimmomatic
conda install -c bioconda bwa
conda install -c bioconda subread