Short read toolbox: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
No edit summary
Line 39: Line 39:
*[http://bowtie-bio.sourceforge.net/index.shtml Bowtie] - Bowtie.
*[http://bowtie-bio.sourceforge.net/index.shtml Bowtie] - Bowtie.
*[http://bio-bwa.sourceforge.net/ BWA] - Burrows-Wheeler aligner.
*[http://bio-bwa.sourceforge.net/ BWA] - Burrows-Wheeler aligner.
*[http://rga.cgrb.oregonstate.edu/ RGA] - Perl script which calls blat to assemble short reads.


*[http://rga.cgrb.oregonstate.edu/ RGA] - Perl script which calls blat to assemble short reads.
=List of open source reference guided / de novo "hybrid" assemblers=
*[http://www.bx.psu.edu/miller_lab/] - YASRA
*[[Liston:Computer_Scripts]] - scripts for post-processing of YASRA contigs


=List of assembly viewers=
=List of assembly viewers=

Revision as of 15:54, 23 July 2010

Short read toolbox

This page has been created to help list resources for working with next generation sequence data.


Online short-read resources

List of short-read quality control software

List of sequence format information

List of alignment format information

List of open source de novo assemblers

  • Velvet - Implements De Bruijn Graphs in C. Requires 64 bit Linux OS.
  • Edena - 32 and 64 bit Linux.
  • ABySS - Multi-threaded de novo assembly.
  • Ray - Multi-threaded de novo assembly.
  • QSRA - Utilizes quality scores.

List of open source reference guided assemblers

  • MAQ - Mapping and Assembly with Qualities.
  • Bowtie - Bowtie.
  • BWA - Burrows-Wheeler aligner.
  • RGA - Perl script which calls blat to assemble short reads.

List of open source reference guided / de novo "hybrid" assemblers

List of assembly viewers

List of alignment programs

List of query programs

Perl

A very brief example to demonstrate file input/output.

Code:

#!/usr/bin/perl
use strict;
use warnings;
my (@temp, $in, $out);
my $inf = "data.fq";
my $outf = "data_out.fq";
open($in, "<", $inf) or die "Can't open $inf: $!";
open($out, ">", $outf) or die "Can't open $outf: $!";
while(<$in>){
  chomp($temp[0]=$_); # First line is an identifier.
  chomp($temp[1]=<$in>); # Second line is sequence.
  chomp($temp[2]=<$in>); # Third line is an identifier.
  chomp($temp[3]=<$in>); # Fourth line is quality.
  print $out join("\t", @temp)."\n";
}
close $in or die "$in: $!";
close $out or die "$out: $!";
  • perlintro - Introduction to perl with links to other documentation.
  • BioPerl beginners - Introduction to BioPerl (be prepared for object oriented code).

Python

R project

  • R project - Statistical programming environment.
  • Bioconductor - R for biologists (micro-array and next generation data).
  • APE - Analysis of phylogenetics and evolution R package.

Useful links