Difference between revisions of "20.109(S13):Phylogenetic analysis (Day7)"

From OpenWetWare
Jump to: navigation, search
(Part 2: Identify species from sequences)
(Part 3: Construct tree)
Line 33: Line 33:
===Part 3: Construct tree===
===Part 3: Construct tree===
MEGA program
==For next time==
==For next time==

Revision as of 08:09, 21 January 2013

20.109(S13): Laboratory Fundamentals of Biological Engineering

S13 109-OWW-frontpage.jpg

Home        Schedule Spring 2013        Assignments       
DNA Engineering        Protein Engineering        Cell Engineering              


phylogenetic analysis, utility


Part 1: Prepare sequences for analysis

The data from Genewiz is available at this link. Choose the "Login" link and then use "astachow@mit.edu" and "be20109" to log in. At the bottom right should be a link to download your sequencing results. Select order date XXX for T/R results and XXX for W/F results. The quickest way to start working with your data is to follow the "View" link under the Seq File heading. For ambiguous data, you may want to look directly at the Trace File as well.

  1. Begin by downloading this file, which contains the DNA sequence of the vector we are using in GenBank format. Open the file in ApE (A plasmid Editor, created by M. Wayne Davis at the University of Utah), which is found on your desktop. Three items of interested are highlighted: the forward priming site, the reverse priming site and the two basepairs between which your sequence should be inserted.
  2. Paste the forward sequence of your first candidate into a new ApE file. Locate where the vector ends and the insert begins; trim away the vector.
  3. Paste the reverse sequence of your first candidate into yet another ApE file. Use Edit → Reverse Complement to adjust the sequence, and again trim away the vector.
  4. Use ToolsAlign Sequence to find where the forward and reverse sequences overlap. Combine them into one sequence with no repeated parts; when both forward and reverse sequence have coverage of the gene, choose whatever combination has the fewest Ns (ideally none!). Save this sequence as a new file called 20109_YourTeamDay-YourTeamColor_YourSampleID-"C"Candidate Number (e.g., 20109_WF-Purple_G737-C1).
  5. Finally, depending on the orientation of your insert, you may want to reverse complement the entire sequence. Use the original sequences of the forward and reverse 16S primers to guide your decision.


Part 2: Identify species from sequences

Align with "nucleotide blast" from NCBI

  1. The alignment program can be accessed through the NCBI BLAST page or directly from this link.
  2. Paste the sequence text that you prepared above into the "Query" box. If there were ambiguous areas of your sequencing results, these will be listed as "N" rather than "A" "T" "G" or "C" and it's fine to include Ns in the query.
  3. Under "choose search set," select "16S ribosomal RNA sequences (Bacteria and Archaea)" from the database pulldown menu.
  4. Click on the BLAST button. Matches will be shown by vertical lines between the aligned sequences, while mismatches and gaps will be shown with a dash.
  5. Because this gene is highly conserved, a number of species should come up as highly matched. However, one should (usually) be a best choice. Write down this strain and its accession number, its associated max score, query coverage, and max identity; write down these parameters for the second most closely matched species if it is close. Particularly if there are Ns in your sequence, you might need to scroll down and closely observe the aligned sequences to truly know which is best.
  6. You should print a screenshot of each [OR MAYBE JUST ONE EXAMPLE?] alignment to pdf (and to paper if you desire). These will be used to prepare a figure showing what you found today. You might want to email yourself the alignment screen shots or post them to your wiki userpage.

Part 3: Construct tree

MEGA program

For next time

Reagent list

write something here or not accessible to edit