Schumer lab: Download data to Sherlock from NCBI's SRA

From OpenWetWare
Jump to navigationJump to search

Get and convert data

NCBI's SRA is a huge raw data database that can be very useful. To download data directly to Sherlock:

1) Find the dataset you want to download on the [SRA]

2) Find the run id under Runs

For example, in this [link] the run id is: SRR7525606

3) On Sherlock, navigate to the directory you want to download data to.

construct the appropriate wget command based on the SRA run id.

For example, for SRR7525606 the command would be:


and enter it into the command line

4) Convert the SRA file into fastq.gz files using fastq-dump from the sratoolkit

/home/groups/schumer/shared_bin/fastq-dump SRR7525606.sra --gzip --split-files