Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 775 Bytes

SRA_random_1000sample.md

File metadata and controls

17 lines (13 loc) · 775 Bytes

Random selection of 1000 SRA run

Prepare data

First, we collect all the SRA runs, excluding Severe Acute Respiratory Syndrome-related Coronavirus due to its large proportion. The search command in SRA advanced search is as follows.

(("oxford nanopore"[Platform]) NOT Severe acute respiratory syndrome-related coronavirus[Organism]) AND ("2010/01/01"[Publication Date] : "2024/01/09"[Publication Date])

The exported accession file from the following search is at ../data/SraAccList_NOCOVID_20240109.csv

Random selection

python random_select_SRARUN.py -i ../data/SraAccList_NOCOVID_20240109.csv -o ../results/Sampled_1000_srarun.txt;

We manually review the first 100 runs in Sampled_1000_srarun.txt.