Schumer lab: Download data to Sherlock from UW (PacBio)

From OpenWetWare
Revision as of 18:28, 11 December 2024 by Tododge (talk | contribs) (Created page with "==Download data with Globus== 1) info for JJ to fill in the important files are *hifi_reads.bc*.bam and *hifi_reads.bc*.bam.pbi # download all pacbio data from globus. the important files are *hifi_reads.bc*.bam and *hifi_reads.bc*.bam.pbi ==Delete unnecessary files== 1) delete fail_reads/ directory and unassigned.bam files <span style="font-family:Courier;"> cd /path/to/parent/directory/ <span style="font-family:Courier;"> rm -r fail_reads/ <span style="font-fa...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Download data with Globus

1) info for JJ to fill in


the important files are *hifi_reads.bc*.bam and *hifi_reads.bc*.bam.pbi

  1. download all pacbio data from globus. the important files are *hifi_reads.bc*.bam and *hifi_reads.bc*.bam.pbi

Delete unnecessary files

1) delete fail_reads/ directory and unassigned.bam files

cd /path/to/parent/directory/

rm -r fail_reads/

rm hifi_reads/*.hifi_reads.unassigned.bam*

Rename files

The files we get back from UW are named using a unique barcode that they assign, but is meaningless to us. We want to rename all of these files based on the sample names we provided them. The information connecting barcodes to sample names can be found in the SequencingReport.pdf that they send (e.g. 240724_UW_LongRead_SequencingReport_Baczenas.J(Stanford).pdf).


2) copy the renaming script to the directory with reads

cd /path/to/directory/with/reads/

cp /home/groups/schumer/shared_bin/Lab_shared_scripts/rename_PacBio_UW_reads.sh .


3) make a file called sample.key which contains the UW barcodes and the sample name, separated by "@".

$ cat sample.key

bc2008@Xvar-JUCH-6-V-24-S265-M-01

bc2014@Xvar-JUCH-6-V-24-S178-M-01

bc2015@Xvar-JUCH-6-V-24-S250-M-01

bc2016@Xvar-JUCH-6-V-24-S250-M-02


3) run the renaming. it should take ~3 min per sample

sh rename_PacBio_UW_reads.sh


4) you are all done!