User:Lindenb/Notebook/UMR915/20101104
From OpenWetWare
SOAPAligner
testing soap aligner:
soap2.20release/soap -a xx_3_1.fastq.gz -bxx_3_2.fastq.gz -D hg18.fa.index -o ~/soap_results.data -2 ~/unpaired.data File Error: unrecognized file
hum... it doesn't like the gzipped fastq ?... :-/ unzippinf...
/usr/local/package/soap2.20release/soap -a XX_3_1.fastq -b XX_3_2.fastq -D hg18.fa.index -o soap_results.data -2 unpaired.data -m 200 -x 600
Begin Program SOAPaligner/soap2 Thu Nov 4 11:18:24 2010 Reference: hg18.fa.index Query File a: 4366_3_1.fastq Query File b: 4366_3_2.fastq Output File: soap_results.data unpaired.data Load Index Table ... Load Index Table OK Begin Alignment ... 131072 ok 86.16 sec 262144 ok 84.23 sec 393216 ok 93.31 sec 524288 ok 90.48 sec (...) 70254592 ok 62.22 sec 70385664 ok 58.87 sec 70516736 ok 59.30 sec 70647808 ok 58.74 sec 70778880 ok 59.80 sec 70909952 ok 55.67 sec 71041024 ok 55.21 sec 71172096 ok 52.78 sec 71303168 ok 54.85 sec 71434240 ok 59.92 sec 71473618 ok 17.14 sec Total Pairs: 35736809 PE Paired: 14641084 (40.97%) PE Singled: 34692762 (48.54%) SE Total Elapsed Time: 33175.70 - Load Index Table: 14.96 - Alignment: 33160.74 SOAPaligner/soap2 End Thu Nov 4 21:57:26 2010
Results
[lindenb@srv-clc-02 lindenb]$ ls -lah soap_results.data unpaired.data -rw-r--r-- 1 lindenb users 4.8G Nov 4 21:57 soap_results.data -rw-r--r-- 1 lindenb users 5.7G Nov 4 21:57 unpaired.data
file soap_results.data unpaired.data soap_results.data: ASCII text unpaired.data: ASCII text
[lindenb@srv-clc-02 lindenb]$ more soap_results.data IL36_4366:3:1:1056:15626/1 CCAAGGGAGCTTATTAGTCCCTTCCACCATGTGAGGACGCAAGAAGGCATCATC 9305(3-=<8)<?/9:+48?;(5:6=@C:;4<==-?;C9<?8<ABA3CBA?A5E 1 a 54 - chr7 36927164054M 54 IL36_4366:3:1:1056:15626/2 GAAATGCCGTCTTGCTTGAAAAGTCCTTCTTAACTCTCCCACAAGTCACTCTCT E-E4EEAE?59>=C9A7?A:=E?@A:;)4:<8488=;B<=6*.2;*8<+4)=98 1 b 54 + chr7 36926895054M 54 IL36_4366:3:1:1059:7264/1 TCTTTCTTCAGCTTCTTGGGCGAAACAGGGAGTCTTTCCTGTGGACTCAGCTTG 8437CE=AA=+C<?>:36.11@@@.CFBFA9<B>@==@>C;55;A<-1>;F;-? 2 a 54 - chr17 40983027054M 54 IL36_4366:3:1:1059:7264/2 TCTTCAAGGGTCTCTGGATTTTGAGTTTCGGGCTCTAGATGGAATTGAGAAGGT D>DD7DB8BAA-BD?@=94B4AAAA3>@AD<;6-6=DDA8*=7>4)<=<<9<3= 2 b 54 + chr17 40982790054M 54
[lindenb@srv-clc-02 lindenb]$ more unpaired.data IL36_4366:3:1:1053:17041/1 AGACAGCAGTTAGTTGGTTGGTGAGTTCTTATCCATTCTGCTGTTCTGTATCTT =<@=;@B>;30</C99*>0@=D4E7>:>8@+1@8BDD:ABAC@<CDABDBE8DB 4 a 54 - chr4 92885278 2 A->41T3 G->9T-13 54M 9G31A12 IL36_4366:3:1:1053:7122/1 GGCAGTGCCTCTACCCTCCTCCTTAAGTTTCTATGGTCCCTGCTGCTCCTGGCC 5)8(6+8?@>@--9?;,4>3=E?A@@8@6;?;+?B*?;?@;5A@'>2>:>40>A 1 a 54 - chr6 31233127 0 54M 54 IL36_4366:3:1:1053:7122/2 GAAAGAGACAGACTGAGAGGGGCTCTGAGGCTTTACTCATACTTTCAGGATTCT $6&')$'&4926<-..6%,%-)%68/8=84+-4<(-*/*.6(52*2.D/&-5/% 1 b 54 + chr6 31233072 0 54M 54
Galaxy
wget "http://bitbucket.org/galaxy/galaxy-dist/get/tip.zip" unzip tip.zip cd galaxy-dist sh setup.sh Copying datatypes_conf.xml.sample to datatypes_conf.xml Copying reports_wsgi.ini.sample to reports_wsgi.ini Copying tool_conf.xml.sample to tool_conf.xml Copying tool_data_table_conf.xml.sample to tool_data_table_conf.xml Copying universe_wsgi.ini.sample to universe_wsgi.ini Copying tool-data/alignseq.loc.sample to tool-data/alignseq.loc Copying tool-data/annotation_profiler_options.xml.sample to tool-data/annotation_profiler_options.xml Copying tool-data/annotation_profiler_valid_builds.txt.sample to tool-data/annotation_profiler_valid_builds.txt Copying tool-data/bfast_indexes.loc.sample to tool-data/bfast_indexes.loc Copying tool-data/binned_scores.loc.sample to tool-data/binned_scores.loc Copying tool-data/blastdb.loc.sample to tool-data/blastdb.loc Copying tool-data/bowtie_indices.loc.sample to tool-data/bowtie_indices.loc Copying tool-data/bowtie_indices_color.loc.sample to tool-data/bowtie_indices_color.loc Copying tool-data/encode_datasets.loc.sample to tool-data/encode_datasets.loc Copying tool-data/liftOver.loc.sample to tool-data/liftOver.loc Copying tool-data/maf_index.loc.sample to tool-data/maf_index.loc Copying tool-data/maf_pairwise.loc.sample to tool-data/maf_pairwise.loc Copying tool-data/microbial_data.loc.sample to tool-data/microbial_data.loc Copying tool-data/phastOdds.loc.sample to tool-data/phastOdds.loc Copying tool-data/quality_scores.loc.sample to tool-data/quality_scores.loc Copying tool-data/regions.loc.sample to tool-data/regions.loc Copying tool-data/sam_fa_indices.loc.sample to tool-data/sam_fa_indices.loc Copying tool-data/srma_index.loc.sample to tool-data/srma_index.loc Copying tool-data/twobit.loc.sample to tool-data/twobit.loc Copying tool-data/shared/ucsc/builds.txt.sample to tool-data/shared/ucsc/builds.txt Creating database/files Creating database/community_files Creating database/tmp Creating database/compiled_templates Creating database/job_working_directory Creating database/import Creating database/pbs Creating static/genetrack/plots Creating tool-data/shared/jars One or more of the python eggs necessary to run Galaxy couldn't be downloaded automatically. You can try building them by hand (all at once) with: python scripts/scramble.py Or individually: python scripts/scramble.py Mako python scripts/scramble.py Babel python scripts/scramble.py Whoosh python scripts/scramble.py Tempita python scripts/scramble.py Cheetah python scripts/scramble.py lrucache python scripts/scramble.py sqlalchemy_migrate python scripts/scramble.py NoseHTML python scripts/scramble.py pexpect python scripts/scramble.py bx_python python scripts/scramble.py PasteDeploy python scripts/scramble.py WebHelpers python scripts/scramble.py docutils python scripts/scramble.py numpy python scripts/scramble.py pysqlite python scripts/scramble.py Beaker python scripts/scramble.py SVGFig python scripts/scramble.py SQLAlchemy python scripts/scramble.py simplejson python scripts/scramble.py WebError python scripts/scramble.py python_lzo python scripts/scramble.py wchartype python scripts/scramble.py twill python scripts/scramble.py Routes python scripts/scramble.py elementtree python scripts/scramble.py decorator python scripts/scramble.py pycrypto python scripts/scramble.py Paste python scripts/scramble.py wsgiref python scripts/scramble.py nose python scripts/scramble.py amqplib python scripts/scramble.py WebOb python scripts/scramble.py PasteScript
tomcat already running on port 8080: edit universe_wsgi.ini and change
port=8020
but problem with the proxy
GEM
> gem-mapper Welcome to GEM-mapper build 544 (beta) - (2009/10/07 02:50:12 GMT) (c) 2008-2010 Paolo Ribeca <paolo.ribeca@gmail.com> ************************************************************************ * WARNING: this is a beta version, provided for testing purposes only; * * check for updates at <http://www.paoloribeca.net/software/GEM>. * ************************************************************************ Names of index, input and output file are mandatory Usage: gem-mapper -I <index_prefix> (mandatory) -c|--colorspace (index is colorspace-encoded) -e|--emulate-complement (for indices lacking it) -i <input_file> (mandatory, FASTA or FASTQ) -q|--quality-format 'ignore'|'phred'|'solexa' (mandatory with FASTQ input) --reads-per-block <number> (default=10000) -o <output_prefix> (mandatory) -t <thread_number> (default=1) -m <max_mismatch_number> (default=2) --mismatch-alphabet <symbols> (default="ACGT") -Q|--quality-model 'gem'|'flat' (default='gem') --gem-quality-threshold <number> (default=26, that is e<=2e-3) -d|--decoding-threshold 'all'|<number> (default=10) --filtering-threshold <number> (default=40) --max-indel-length <number> (default=5) --disable-accelerators (for debugging purposes) -h|--help (print usage)
no paired-end sequencing ?
gem-mapper -i XXXX_1.fastq -o gemout -I gemhg18 -q phred Welcome to GEM-mapper build 544 (beta) - (2009/10/07 02:50:12 GMT) (c) 2008-2010 Paolo Ribeca <paolo.ribeca@gmail.com> ************************************************************************ * WARNING: this is a beta version, provided for testing purposes only; * * check for updates at <http://www.paoloribeca.net/software/GEM>. * ************************************************************************ Thu Nov 4 12:46:54 2010 -- Loading index (likely to take long)... done. Thu Nov 4 12:46:58 2010 -- Loading locations... done. Thu Nov 4 12:46:58 2010 -- Pre-scanning input file... done -- (FASTQ, it contains 35736809 reads). Thu Nov 4 12:48:47 2010 -- #0 sequences processed Thu Nov 4 12:48:52 2010 -- #10000 sequences processed Thu Nov 4 12:48:57 2010 -- #20000 sequences processed (...) Thu Nov 4 17:37:57 2010 -- #35690000 sequences processed Thu Nov 4 17:38:02 2010 -- #35700000 sequences processed Thu Nov 4 17:38:06 2010 -- #35710000 sequences processed Thu Nov 4 17:38:11 2010 -- #35720000 sequences processed Thu Nov 4 17:38:15 2010 -- #35730000 sequences processed Thu Nov 4 17:38:18 2010 -- #35736809 sequences processed
results
ls -lah gemout.0.map -rw-r--r-- 1 lindenb users 6.0G Nov 4 17:38 gemout.0.map IL36_4366:3:1:1053:17041/1 AAGATACAGAACAGCAGAATGGATAAGAACTCACCAACCAACTAACTGCTGTCT BD8EBDBADC<@CABA:DDB8@1+@8>:>7E4D=@0>*99C/<03;>B@;=@<= 0:1:3 chr1:F180110233T45@18/1,chr1 3:F36979712C13C52@65/2,chr6:R120088529T13T45@52/2,chr4:R92885278T13C45@52/2 IL36_4366:3:1:1053:7122/1 GGCCAGGAGCAGCAGGGACCATAGAAACTTAAGGAGGAGGGTAGAGGCACTGCC A>04>:>2>'@A5;@?;?*B?+;?;6@8@@A?E=3>4,;?9--@>@?8+6(8)5 1:0:0 chr6:R31233127@0/0 IL36_4366:3:1:1056:15626/1 GATGATGCCTTCTTGCGTCCTCACATGGTGGAAGGGACTAATAAGCTCCCTTGG E5A?ABC3ABA<8?<9C;?-==<4;:C@=6:5(;?84+:9/?<)8<=-3(5039 1:0:0 chr7:R36927164@0/0 IL36_4366:3:1:1056:1782/1 AAGATTCCAAATGGAAAAATAAAACTTTTGCCTTCTACTTGTTATTTTAGCACT :8A*@>$6<<AB>B?;?(;BB94,5BB;B<*2),)<6+@3>(88/-@@?8;97% 0:0:1 chr18:R3177698C4C54@13/2