Reference Genomes

Examples for downloading reference genomes with the wget command.


The current example is based on the human genome (Homo sapiens - GRCh38, release 106) from Ensembl (http://ensemblgenomes.org).

Other genomes can be downloaded the same way. Find the genome of interest in the list below and copy its download link:

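(1) For transcriptome analysis using STAR, we will need both the DNA FASTA file and a GTF or GFF3 annotation file. Note that the current_fasta path points to the latest Ensembl release, which may be newer than the release-106 annotation used here.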

wget http://ftp.ensembl.org/pub/current_fasta/homo_sapiens/dna/Homo_sapiens.GRCh38.dna_sm.primary_assembly.fa.gz

wget http://ftp.ensembl.org/pub/release-106/gtf/homo_sapiens/Homo_sapiens.GRCh38.106.abinitio.gtf.gz
# or
wget http://ftp.ensembl.org/pub/release-106/gff3/homo_sapiens/Homo_sapiens.GRCh38.106.abinitio.gff3.gz
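The URLs above follow a regular pattern, so they can be generated for another species or release instead of being copied by hand. The following is a minimal sketch, not part of the original workflow: the function names are hypothetical, the release-specific fasta/ directory is assumed to mirror current_fasta/, and the primary_assembly and abinitio file names are taken from the human example and may not exist for every species.

```shell
#!/bin/sh
# Sketch: build Ensembl FTP URLs pinned to a single release.
# All function names here are hypothetical helpers, not Ensembl tooling.

# Capitalize the first letter: homo_sapiens -> Homo_sapiens
capitalize() {
    first=$(printf '%s' "$1" | cut -c1 | tr '[:lower:]' '[:upper:]')
    rest=$(printf '%s' "$1" | cut -c2-)
    printf '%s%s' "$first" "$rest"
}

# $1 = species (e.g. homo_sapiens), $2 = assembly (e.g. GRCh38), $3 = release (e.g. 106)
# Assumption: release-N/fasta/ has the same layout as current_fasta/.
ensembl_dna_url() {
    printf 'http://ftp.ensembl.org/pub/release-%s/fasta/%s/dna/%s.%s.dna_sm.primary_assembly.fa.gz\n' \
        "$3" "$1" "$(capitalize "$1")" "$2"
}

# Same arguments; reproduces the GTF file name used in this README.
ensembl_gtf_url() {
    printf 'http://ftp.ensembl.org/pub/release-%s/gtf/%s/%s.%s.%s.abinitio.gtf.gz\n' \
        "$3" "$1" "$(capitalize "$1")" "$2" "$3"
}

# Usage (commented out so that sourcing this file does not start a download):
# wget "$(ensembl_dna_url homo_sapiens GRCh38 106)"
# wget "$(ensembl_gtf_url homo_sapiens GRCh38 106)"
```

Building both URLs from the same release number keeps the FASTA and the annotation consistent with each other.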

(2) For transcriptome analysis using Kallisto or Salmon, we will only need the cDNA FASTA file.

wget http://ftp.ensembl.org/pub/current_fasta/homo_sapiens/cdna/Homo_sapiens.GRCh38.cdna.abinitio.fa.gz
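Before handing the cDNA file to an indexing step, it can help to check that it contains the sequences you expect. The sketch below counts FASTA records (header lines starting with ">") without decompressing the file to disk. The function name is hypothetical.

```shell
#!/bin/sh
# Sketch: count the records in a gzip-compressed FASTA file.
# count_fasta_records is a hypothetical helper name.

# $1 = path to a .fa.gz file
count_fasta_records() {
    # Stream the decompressed file and count header lines, which start with '>'
    gzip -cd "$1" | grep -c '^>'
}

# Usage (after the download above has finished):
# count_fasta_records Homo_sapiens.GRCh38.cdna.abinitio.fa.gz
```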

(3) For other types of analysis involving non-coding RNA, this is the file:

wget http://ftp.ensembl.org/pub/current_fasta/homo_sapiens/ncrna/Homo_sapiens.GRCh38.ncrna.fa.gz
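Some of these files, the genome FASTA in particular, are large, and an interrupted transfer can leave a truncated archive behind. As a hedged sketch, the wrapper below resumes partial downloads with wget -c and then tests the gzip archive before it is used. The function names and the optional output-directory argument are illustrative assumptions.

```shell
#!/bin/sh
# Sketch: resumable download followed by a gzip integrity test.
# is_valid_gz and fetch_gz are hypothetical helper names.

# Return success only if the file is a complete, readable gzip archive.
is_valid_gz() {
    gzip -t "$1" 2>/dev/null
}

# $1 = URL, $2 = output directory (optional, defaults to the current directory)
fetch_gz() {
    fetch_url=$1
    fetch_dir=${2:-.}
    fetch_file="$fetch_dir/$(basename "$fetch_url")"
    # -c resumes a partial file, -P sets the directory to save into
    wget -c -P "$fetch_dir" "$fetch_url" || return 1
    if ! is_valid_gz "$fetch_file"; then
        echo "corrupt or incomplete download: $fetch_file" >&2
        return 1
    fi
}

# Usage (commented out so that sourcing this file does not start a download):
# fetch_gz http://ftp.ensembl.org/pub/current_fasta/homo_sapiens/ncrna/Homo_sapiens.GRCh38.ncrna.fa.gz
```

If the integrity test fails, running the same command again lets wget continue from where the previous transfer stopped.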