Mking44 Week 5
Purpose
The purpose of this week's assignment was to compare HIV sequence diversity within and between subjects by obtaining sequence data, building phylogenetic trees, counting sequence differences, and calculating diversity statistics. This work is the starting point for developing a new HIV evolution project.
Combined Methods and Results
Activity 1: Looking at NCBI resources and HIV sequence data
Part 1: PubMed
- To find the PubMed entry, I first clicked the NCBI link in the Week 5 protocol (http://www.ncbi.nlm.nih.gov).
- From there, I clicked the PubMed link on the right side of the page (https://www.ncbi.nlm.nih.gov/pubmed/).
- Then, in the search bar, I typed the title and year, "Patterns of HIV-1 evolution in individuals with differing rates of CD4 T cell decline 1998", and it took me straight to the entry.
- Alternative ways to find the article include searching by author and year or by subject and year.
- The entry page also provides related information, including similar papers, author information, articles that cite the paper, nucleotide sequences, protein sequences, the paper's references, and searchable keywords related to the article.
Part 2: GenBank
- To get to GenBank, I clicked on the Nucleotide link on the right of the page with the Markham paper.
- Then, I clicked the GenBank link for one of the clones (GenBank: AF016818.2). The FASTA format is shown by clicking the FASTA link, and the GenBank sequence is at the bottom of the page.
- The clone I chose was subject 2, visit 4, clone 2.
- The source section under the FEATURES heading shows information about who the HIV was collected from.
- To save the FASTA files of the sequences, click the Send To button, choose File, and click Create File.
- To open the file, right-click it, click Open with, then select Notepad.
- The five sequences are listed below:
>AF016818.2 HIV-1 subject 2, visit 4 clone 2 from USA, envelope glycoprotein V3 region (env) gene, partial cds
GAGGTAGTAATTAGATCTGAAAATTTCACGAACAATGCTAAAATCATAATAGTACAGCTAAATGAATCTG
TAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGTATACATATAAGACCAGGTAGAGCATT
TTATACAACAAGAGACATAATAAGAGATATAAGACAAGCATATTGTAACATTAGTAGAGCAGAATGGAAT
AACACTTTAAAACAGATAGTTATAAAATTAAGAGAACACTTTGGGAATAAAACAATAGTCTTTAATCACT
CCTCA
>AF089118.1 HIV-1 isolate S3V3-6 from USA envelope glycoprotein (env) gene, partial cds
GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTGAATGAAACTG
TAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACTCTAGGACCAGGCAGAGTATA
CTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCACATTGTAACCTTAGTAGAGCAGATTGGAGT
AACACTTTAGAAAGAATAGCTATAAAATTAAGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAAT
CCTCA
>AF089134.1 HIV-1 isolate S3V5-3 from USA envelope glycoprotein (env) gene, partial cds
GATGTAGTAATTAGATCCGCCAATTTCACAGACAATGCTAAAACCATACTAGTACAGCTGAATGAAACTG
TAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACTCTAGGACCAGGCAGAGTATA
CTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCACATTGTAACCTTAGTAGAGCGGGTTGGAAT
AACACTTTAGAAAGGATAGCTATAAAATTAAGAGAACAATTTCAGAATAAAACAATAGGCTTTAATCAAT
CCTCA
>AF089522.1 HIV-1 isolate S11V4-3 from USA envelope glycoprotein (env) gene, partial cds
GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTGAATGAATCTG
TAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATACATATAGGACCAGGGAGACC
ATTCTATACAACAGGAATAAAAGGAGATATAAGACAAGCACATTGTAACGTTAGTAGAGGACAATGGAAT
AAAACTTTAGAACAGGTAGTTAAAAAATTAAGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGC
AACCCATA
>AF089542.1 HIV-1 isolate S12V4-1 from USA envelope glycoprotein (env) gene, partial cds
GAGGTAGTAATTAGATCCAAGAATTTCACGGATAATGCTAAAATCATAATAGTACAGCTAAATGAGACTG
TAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGTATACCTATAGGACCAGGCAGAGCATT
TTATACAACAGGAGAAATAATAGGAGATATAAGACAAGCACATTGTACCCTTAGTAGAGCAAAATGGAAT
GAAACTTTAAAACAGATAGTTATAAAATTAAAAGAACAATTTAGGAATAAGACAATAGTCTTTAGTCCAT
CCTCA
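As an alternative to saving each record by hand with Send To, the same FASTA records could in principle be requested from NCBI's E-utilities `efetch` endpoint. The sketch below is not part of the Week 5 protocol: it only builds the request URL and parses FASTA text, and the helper names `efetch_fasta_url` and `parse_fasta` are hypothetical. The actual download step is left out on purpose.

```python
from urllib.parse import urlencode

# NCBI E-utilities efetch endpoint (an alternative to the web interface).
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accessions):
    """Build an efetch URL requesting FASTA text for GenBank accessions.

    Hypothetical helper for illustration; it does not contact NCBI.
    """
    params = {
        "db": "nuccore",
        "id": ",".join(accessions),
        "rettype": "fasta",
        "retmode": "text",
    }
    return EFETCH + "?" + urlencode(params)


def parse_fasta(text):
    """Parse FASTA text into {identifier: sequence}.

    The identifier is the first word after '>' (for example AF016818.2);
    wrapped sequence lines are joined into one string.
    """
    chunks = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {key: "".join(parts) for key, parts in chunks.items()}


if __name__ == "__main__":
    ids = ["AF016818.2", "AF089118.1", "AF089134.1", "AF089522.1", "AF089542.1"]
    print(efetch_fasta_url(ids))
```

Pasting the text of a saved FASTA file into `parse_fasta` gives one sequence string per accession, which makes it easy to check sequence lengths before submitting to Phylogeny.fr.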
Part 3: Introduction to Phylogeny.fr
- I clicked on the Phylogeny.fr link found on the Week 5 protocol
- Then, I clicked on the One Click mode under the Phylogeny Analysis heading
- I copied and pasted my sequences in the box and clicked Submit. The results of my analysis can be viewed by clicking the link below.
- To obtain the alignment in Clustal format, I clicked on the Alignment tab, then under Outputs I clicked on Alignment in Clustal Format.
CLUSTAL FORMAT: MUSCLE (3.8) multiple sequence alignment AF089118.1 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTG AF089134.1 GATGTAGTAATTAGATCCGCCAATTTCACAGACAATGCTAAAACCATACTAGTACAGCTG AF089522.1 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG AF016818.2 GAGGTAGTAATTAGATCTGAAAATTTCACGAACAATGCTAAAATCATAATAGTACAGCTA AF089542.1 GAGGTAGTAATTAGATCCAAGAATTTCACGGATAATGCTAAAATCATAATAGTACAGCTA ** *** ********** ****** * * ********** **** ********** AF089118.1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGG---GTA AF089134.1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGG---GTA AF089522.1 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA AF016818.2 AATGAATCTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGT---ATA AF089542.1 AATGAGACTGTAGAAATTAATTGTACAAGACCCAACAACAATACAAGAAAAAGT---ATA ***** ****** *** *************** ***** ** ** * **** ** AF089118.1 ACTCTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAA AF089134.1 ACTCTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAA AF089522.1 CATATAGGACCAGGGAGACCATTCTATACAACAGGA---ATAAAAGGAGATATAAGACAA AF016818.2 CATATAAGACCAGGTAGAGCATTTTATACAACAAGAGACATAATAAGAGATATAAGACAA AF089542.1 CCTATAGGACCAGGCAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGACAA * ** ******* *** ** ********* ** **** * *********** ** AF089118.1 GCACATTGTAACCTTAGTAGAGCAGATTGGAGTAACACTTTAGAAAGAATAGCTATAAAA AF089134.1 GCACATTGTAACCTTAGTAGAGCGGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAA AF089522.1 GCACATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAAAAAA AF016818.2 GCATATTGTAACATTAGTAGAGCAGAATGGAATAACACTTTAAAACAGATAGTTATAAAA AF089542.1 GCACATTGTACCCTTAGTAGAGCAAAATGGAATGAAACTTTAAAACAGATAGTTATAAAA *** ****** * ********* **** * * ****** ** *** ** **** AF089118.1 TTAAGAGAACAATTTCA---GAATAGAACAATAGTCTTTAATCAATCCTCA AF089134.1 TTAAGAGAACAATTTCA---GAATAAAACAATAGGCTTTAATCAATCCTCA AF089522.1 TTAAGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA AF016818.2 TTAAGAGAACACTTTGG---GAATAAAACAATAGTCTTTAATCACTCCTCA AF089542.1 
TTAAAAGAACAATTTAG---GAATAAGACAATAGTCTTTAGTCCATCCTCA **** ****** * * **** ******* ***** * ** *
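The Clustal output above is split into blocks of 60 columns, so each sequence has to be stitched back together before it can be analyzed further. A minimal sketch of that step follows, assuming the alignment has been saved as ordinary Clustal text with one sequence per line and consensus lines that begin with whitespace. The function name `parse_clustal` is hypothetical.

```python
def parse_clustal(text):
    """Join the blocks of a Clustal-format alignment into full sequences.

    Assumes the usual layout: a header line, then blocks in which each
    line is '<name> <aligned chunk>' and the consensus line (asterisks)
    starts with whitespace.
    """
    sequences = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # blank separator between blocks
        if line.startswith(("CLUSTAL", "MUSCLE")):
            continue  # header line
        if line[0].isspace():
            continue  # consensus line of asterisks
        parts = line.split()
        if len(parts) < 2:
            continue  # not a '<name> <chunk>' line
        name, chunk = parts[0], parts[1]
        sequences[name] = sequences.get(name, "") + chunk
    return sequences


if __name__ == "__main__":
    example = (
        "CLUSTAL FORMAT: MUSCLE (3.8) multiple sequence alignment\n"
        "\n"
        "cloneA   GAT-\n"
        "cloneB   GATC\n"
        "         *** \n"
        "\n"
        "cloneA   AA\n"
        "cloneB   AT\n"
        "         * \n"
    )
    for name, seq in parse_clustal(example).items():
        print(name, seq)
```

Gap characters (`-`) are kept in the joined sequences so that all sequences stay the same length and columns remain comparable.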
- To save the phylogenetic tree as an image, I used the Snipping Tool app.
- Figure 1: Phylogenetic tree of 5 different clones from different subjects
Activity 2: Looking at sources of HIV among subjects
Part 1: Looking at clustering across subjects
- Three clones from four subjects were chosen and a multiple sequence alignment was made following the steps from the previous activity.
- Figure 2: Phylogenetic tree of 4 subjects
- I created a table listing the three clones chosen for each of the four subjects, using the table syntax found on the Week 5 protocol.
Subject | Clone # |
3 | Visit 1 Clone 3 |
3 | Visit 1 Clone 4 |
3 | Visit 1 Clone 5 |
5 | Visit 1 Clone 6 |
5 | Visit 1 Clone 7 |
5 | Visit 1 Clone 8 |
11 | Visit 1 Clone 1 |
11 | Visit 1 Clone 2 |
11 | Visit 1 Clone 3 |
10 | Visit 1 Clone 1 |
10 | Visit 1 Clone 2 |
10 | Visit 1 Clone 3 |
- Some of the clones from each subject cluster together. For example, in subject 3, clones 3 and 4 cluster together, and in subject 5, clones 7 and 8 cluster together. In addition, subjects 3 and 5 are closely related to each other, as are subjects 10 and 11.
- Yes, some of the subjects' clones show more diversity. As stated above, divergence within a subject led to two clones clustering together and the third sitting by itself.
- No, clones from different subjects did not cluster together; each subject's clones were in their own clade.
- The tree shows that subjects 3 and 5 had closely related clones at the first visit, as did subjects 10 and 11. Each subject's clones are in their own clade, so clones from multiple subjects did not cluster together. In subject 3, clones 3 and 4 clustered together, and in subject 5, clones 7 and 8 clustered together. The confidence value for the split of the two paired subjects into their own clades is only 0.50, so it is hard to confirm that this split is real.
Part 2: Quantifying diversity within and between subjects
- To quantify diversity within and between subjects, I chose subjects 3, 10 and 11 for analysis.
- I downloaded all clones for each subject from the Nucleotide Sequence Data found on the Week 5 Assignment page and aligned them with Phylogeny.fr following the instructions shown previously. The phylogenetic trees and CLUSTAL alignments are shown below.
- Then, to calculate the diversity, I used the S statistic, counting the number of positions that differ between the clones for each subject, and calculated θ using the formula found on the Week 5 assignment page.
- To count the number of position differences, I copied and pasted the asterisk lines into Word for each section and subtracted the character count without spaces from the character count with spaces.
- To calculate the harmonic sum, I used the calculator found on the University of Utah Department of Mathematics website, which Dr. Dahlquist recommended on the Week 5 Assignment page. For the number of terms, I used the number of clones.
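The counting steps above can also be sketched in code. This is a hedged illustration, not the procedure used in this journal: it assumes the aligned sequences are available as equal-length strings, it takes θ to be S divided by a harmonic sum, and all function names are hypothetical. The formula on the Week 5 assignment page is not reproduced here, so the number of terms is left as an explicit argument. This journal used the number of clones, while the standard Watterson estimator sums over n - 1 terms for n sequences.

```python
def count_variable_positions(aligned_sequences):
    """Return S, the number of alignment columns that are not identical.

    Equivalent to counting the columns with no asterisk in the Clustal
    consensus line. Columns containing a gap ('-') in only some
    sequences are counted as variable.
    """
    sequences = list(aligned_sequences)
    if len({len(seq) for seq in sequences}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return sum(1 for column in zip(*sequences) if len(set(column)) > 1)


def harmonic_sum(terms):
    """Return 1 + 1/2 + ... + 1/terms."""
    return sum(1.0 / i for i in range(1, terms + 1))


def theta(s, terms):
    """Return S divided by the harmonic sum over the given number of terms.

    The caller chooses `terms`; see the note above about n versus n - 1.
    """
    return s / harmonic_sum(terms)


if __name__ == "__main__":
    # Made-up toy alignment, not data from the Markham study.
    toy = ["GATTACA", "GACTACA", "GATTAGA"]
    s = count_variable_positions(toy)
    print("S =", s)
    print("theta (terms = number of clones) =", theta(s, len(toy)))
    print("theta (terms = n - 1) =", theta(s, len(toy) - 1))
```

Keeping `terms` explicit makes it easy to compare the value obtained with the online calculator against the n - 1 convention for the same S.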
Phylogenetic Trees
- Figure 3: All visits and clones from Subject 3
- Figure 4: All visits and clones from Subject 10
- Figure 5: All visits and clones from Subject 11
Clustal Formats
CLUSTAL FORMAT: MUSCLE SUBJECT 3 multiple sequence alignment S3V6-2 GATGTAGTAATCAGATCTGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V6-4 GATATAGTAATTAGATCTGCCAATTTCTCGGACAATGCTAAAACCATATTAGTACAGCTG S3V6-5 GATATAGTAATTAGATCTGCCAATTTCTCGGACAATGCTAAAACCATATTAGTACAGCTG S3V6-3 GATATAGTAATTAGATCTGCCAATTTCTCGGACAATGCTAAAACCATATTAGTACAGCTG S3V6-6 GATATAGTAATTAGATCTGCCAATTTCTCGGACAATGCTAAAACCATATTAGTACAGCTG S3V6-1 GATATAGTAATTAGATCTGCCAATTTCTCGGACAATGCTAAAACCATATTAGTACAGCTG S3V4-5 GAGGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V3-2 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATTCTAAAACCATAATAGTACAGCTG S3V3-1 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S3V1-3 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATACTAGTACAGCTG S3V1-4 GATGTAGTAATCAGATCCGCCAATTTCACGAACAATGCTAAAACCATACTAGTACAGCTG S3V5-3 GATGTAGTAATTAGATCCGCCAATTTCACAGACAATGCTAAAACCATACTAGTACAGCTG S3V4-7 GATGTAGTAATCAGATCCGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V5-2 GATGTAGTAATCAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-9 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-1 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATACTAGTACAGCTG S3V5-10 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-7 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-8 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-4 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-5 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V5-6 GATGTAGTAATTAGATCCGCCAATTTCACGAACAATGCTAAAACCATATTAGTACAGCTG S3V3-6 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTG S3V3-10 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTG S3V3-9 GATGTAGTAATTAGATCCGCCAATTTCACAGACAATGCTAAAATCATAATAGTACAGCTG S3V3-3 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S3V3-7 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATACTAGTACAGCTG S3V4-8 GATGTAGTAATTAGATCCGCCAATTTCGCGGACAATGCTAAAACCATACTAGTACAGCTG S3V1-1 
GATGTAGTAATTAGATCCGCCAATTTCTCGGACAATGCTAAAACCATACTAGTACAGCTG S3V1-2 GATGTAGTAATTAGATCCGCCAATTTCTCGGACAATGCTAAAACCATACTAGTACAGCTG S3V4-2 GATGTAGTAATTAGATCCGCCAATTTCGCGGACAATGCTAAAACCATACTAGTACAGCTG S3V4-3 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V4-1 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V4-4 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAGACCATATTAGTACAGCTG S3V3-5 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTG S3V4-9 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAACCATATTAGTACAGCTG S3V3-8 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATATTAGTACAGCTG S3V3-4 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S3V4-6 GATGTAGTAATTAGATCCGCCAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG ** ******* ***** ********* * ***** **** * **** *********** S3V6-2 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V6-4 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V6-5 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V6-3 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V6-6 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V6-1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-5 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGAGTAACT S3V3-2 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V1-3 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGAGTAACT S3V1-4 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGAGTAACT S3V5-3 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-7 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-2 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-9 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-10 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-7 
AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-8 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-4 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-5 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V5-6 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-6 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-10 AATGAAACTGTAGTAATGAATTGTACAAGACCCGACAACAATACAAGAAAAAGGGTAACT S3V3-9 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-3 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-7 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-8 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V1-1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V1-2 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-2 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-3 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAATAATACAAGAAAAAGGGTAACT S3V4-1 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-4 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-5 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-9 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-8 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V3-4 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT S3V4-6 AATGAAACTGTAGTAATGAATTGTACAAGACCCGGCAACAATACAAGAAAAAGGGTAACT ********************************** *** ************** ****** S3V6-2 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V6-4 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V6-5 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V6-3 CTAGGACCAGGCAGAGTATACTATACCACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V6-6 CTAGGACCAGGCAGAGTATACTATACCACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V6-1 CTAGGACCAGGCAGAGTATACTATACCACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-5 
CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-2 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-1 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V1-3 CTAGGACCAGGCAAAGTATACTACACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V1-4 CTAGGACCAGGCAAAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-3 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-7 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-2 CTAGGACCGGGCAGAGTATACTATACAACAGGACAAATAATAGGGGATATAAGAAAAGCA S3V5-9 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATGAGAAAAGCA S3V5-1 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-10 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGGGATATAAGAAAAGCA S3V5-7 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-8 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-4 CTAGGACCAGGCAGAGTATACTATACCACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-5 CTAGGACCAGGCAGAGTATACTATACCACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V5-6 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-6 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-10 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-9 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-3 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-7 CTAGGACCAGGCAGAGTATACTATACAATAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-8 CTAGGACCAGGCAAAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V1-1 CTAGGACCAGGCAAAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V1-2 CTAGGACCAGGCAAAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-2 CTAGGACCAGGCAAAGTATATTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-3 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-1 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-4 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-5 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-9 
CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-8 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V3-4 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA S3V4-6 CTAGGACCAGGCAGAGTATACTATACAACAGGACAAATAATAGGAGATATAAGAAAAGCA ******** **** ****** ** ** * *************** ***** ********* S3V6-2 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V6-4 CATTGTAACCTTAGTAGAGCGGATTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V6-5 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V6-3 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V6-6 CATTGTAACCTTAGTAGAGCAGGTTGGAATAGCACTTTAGAAAGGATAGCTATAAAATTA S3V6-1 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V4-5 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAACAAGGATAGCTATAAAATTA S3V3-2 CATTGTAACCTTAGTAGAGCAGGTTGGGCTAACACTTTAGAAAGGATAGCTGTAAAATTA S3V3-1 CATTGTAACCTTAGTAGAACAGGTTGGAGTAACACTTTAAAAAGGATAGCTGTAAAATTA S3V1-3 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V1-4 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V5-3 CATTGTAACCTTAGTAGAGCGGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V4-7 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAAAAGGGATAGCTATAAAATTA S3V5-2 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V5-9 CATTGTAACCTTAGTTGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V5-1 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V5-10 CATTGTAACCTTAGTAGAGCAGGTTGAAATAACACTTTAGAAAGAATAGCTATAAAATTA S3V5-7 CATTGTAACCTTAGTAGAGCAGGTTGAAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V5-8 CATTGTAACCTTAGTAGAGCAGGTTGAAATAACACTTTAGAAAGAATAGCTATAAAATTA S3V5-4 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V5-5 CATTGTAACCTTAGTAGAGCAGGTTGGACTAACACTTTAGAAAGAATAGCTATAAAATTA S3V5-6 CATTGTAACCTTAGTAGAGCAGGTTGGACTAACACTTTAGAAAGGATAGCTATAAAATTA S3V3-6 CATTGTAACCTTAGTAGAGCAGATTGGAGTAACACTTTAGAAAGAATAGCTATAAAATTA S3V3-10 CATTGTAACCTTAGCAGAGCAGGTTGGACTAACACTTTAGAAAGAATAGCTATAAAATTA S3V3-9 
CATTGTAACCTTAGTAGAGCAGGTTGGAGTAACACTTTAGAAAGGATAGCTATAAAATTA S3V3-3 CATTGTAACCTTAGTAGAGCAGGTTGGACTAACACTTTAGAAAGAATAGCTATAAAATTA S3V3-7 CATTGTAACCTTAGTAGAGCAGGTTGGACTAACACTTTAGAAAGAATAGCTATAAAATTA S3V4-8 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V1-1 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V1-2 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V4-2 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V4-3 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V4-1 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V4-4 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAGAAAGGATAGCTATAAAATTA S3V3-5 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V4-9 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V3-8 CATTGTAACCTTAGTAGAGCAGATTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V3-4 CATTGTAACCTTAGTAGAGCAAGTTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA S3V4-6 CATTGTAACCTTAGTAGAGCAGGTTGGAATAACACTTTAAAAAGGATAGCTATAAAATTA ************** ** * *** ** ******* * * ****** ******** S3V6-2 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V6-4 TGAGAACAATTTCAGAATAAAACAATAGGCTTTAATCAATCCTCA S3V6-5 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V6-3 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAACCAATCCTCA S3V6-6 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V6-1 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V4-5 AGAGAACAATTTCAGAATAGAACAATAGGCTTTAATCAATCCTCA S3V3-2 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-1 AGAGAACAATTTCAGAATAGAACAATAGGCTTTAATCAATCCTCA S3V1-3 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V1-4 AGAGAACAATTTCAGAATAAAACAATAGTCTTTAATCAATCCTCA S3V5-3 AGAGAACAATTTCAGAATAAAACAATAGGCTTTAATCAATCCTCA S3V4-7 TGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V5-2 AGAGAACAATTTCAGAATAAAACAATATTCTTTAATCAATCCTCA S3V5-9 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V5-1 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V5-10 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V5-7 
AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V5-8 AGAGAACAATTTCAGAATAAAACAATATTCTTTAATCAATCCTCA S3V5-4 AGAGAACAATTTCAGAATAAAACAATATTCTTTAATCAATCCTCA S3V5-5 AGAGAACAATTTCAGAATAAAACAATATTCTTTAATCAATCCTCA S3V5-6 AGAGAACAATTTCAGAATAAAACAATAGTCTTTAATCAATCCTCA S3V3-6 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-10 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-9 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-3 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-7 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V4-8 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V1-1 AGAGAACAATTTCAGAATAAAACAATAGTCTTTAATCAATCCTCA S3V1-2 AGAGAACAATTTCAGAATAAAACAATAGCCTTTAATCAATCCTCA S3V4-2 AGAGAACAATTTCAGAATAAAACAATAGGCTTTAATCAATCCTCA S3V4-3 AGAGAACAATTTCAGAATAGAACAATATTCTTTAATCAATCCTCA S3V4-1 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V4-4 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-5 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V4-9 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-8 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V3-4 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA S3V4-6 AGAGAACAATTTCAGAATAGAACAATAGTCTTTAATCAATCCTCA ****************** ******* ****** *********
CLUSTAL FORMAT: MUSCLE SUBJECT 10 multiple sequence alignment S10V6-2 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V5-3 GAGGTAGTAATAAGATCTAAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-8 GAGGTAGTAATTAGATCTGAAAACTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-15 GAGGTAGTAATTAGATCTGAAGATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-4 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAA---ATAATAGTACAGCTG S10V4-11 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-7 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-9 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-10 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-3 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-12 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-13 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-2 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-5 GAGGTAGTAATAAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-6 GAGGTAGTAATAAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-8 GAGGTAGTAATCAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-6 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-1 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V5-1 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-2 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-14 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V6-7 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V4-5 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-16 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTATAGCTG S10V6-6 GAGGTAGCAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V4-9 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V6-3 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V6-9 
GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V6-10 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V6-1 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V2-2 GAGATAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S10V6-8 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V4-4 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V1-5 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V2-1 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V2-3 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V2-6 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S10V2-4 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAATCATAATAGTACAGCTG S10V1-2 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V1-3 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V1-4 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAATACAGCTG S10V1-6 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V1-7 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V1-1 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V2-5 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-7 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V4-10 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACAGCTG S10V6-4 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG S10V6-5 GAGGTAGTAATTAGATCTGAAAATTTCACGGACAATGCTAAAACCATAATAGTACATCTG *** *** *** ****** ** * ****************** ****** ** * *** S10V6-2 AATAAATCTGTAGAAATTAATTGCACAAGACCCAATAACAATACAAGAAGAAGTATAAAT S10V5-3 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAAAATAAGT S10V4-8 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V4-15 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V5-4 AATAAAGCTGTAGAAATCAATTGCACAAGACCCAACAACAATAAAAAAAGAAGAATAAGT S10V4-11 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V5-7 
AATAAAGCTGTAGAAATTAATTGCACAAGACCCAGCAACAATACAAGAAGAAGTATAAAT S10V5-9 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAGCAATACAAGAAGAAGAATAAGT S10V5-10 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGACGAAGAATAAGT S10V4-3 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V4-12 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAGAAGAAGAAGAATAAGT S10V4-13 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAGAAGAAGAAGAATAAGT S10V5-2 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAGAAGAAGAAGAATAAGT S10V5-5 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAAAAGAAGAAGAATAAGT S10V5-6 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAAAAGAAGAAGAATAAGT S10V5-8 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATAGAAGAAGAAGAATAAGT S10V4-6 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAACACAAGAAGAAGAATAAGT S10V4-1 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V5-1 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V4-2 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V4-14 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGAATAAGT S10V6-7 AATAAAGCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V4-5 AATGAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGCATACAT S10V4-16 AATGAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-6 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V4-9 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-3 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-9 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-10 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-1 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V2-2 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-8 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V4-4 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-5 AATAAATCTGTAGAAATTAATCGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V2-1 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT 
S10V2-3 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACGAGAAGAAGTATAAAT S10V2-6 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V2-4 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-2 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-3 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-4 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-6 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-7 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V1-1 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V2-5 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V4-7 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V4-10 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATACAAGAAGAAGTATAAAT S10V6-4 AATGAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATAAAAGAAAAAGAATCAAT S10V6-5 AATAAATCTGTAGAAATTAATTGCACAAGACCCAACAACAATAAAAGAAAAAGAATAACT *** ** ********** *** ************ * *** * * * ** ** * S10V6-2 ATGGGACCAGGGAGAGCATTCTATACAACAGGAGACATAATAGGAGATATAAGGCAAGCA S10V5-3 ATGGGACCAGGGAGAGTACTTTATACAACAGGAGAAATATTAGGAGATATAAGGCAAGCA S10V4-8 ATGGGACCAGGGAGAGTACTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-15 ATGGGACCAGGGAGAGTACTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-4 ATGGGACCAGGGAGAGTACTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-11 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-7 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-9 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-10 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-3 ATGGGACCAGGGAGAGTACTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-12 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATGATAGGAGATATAAGGCAAGCA S10V4-13 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-2 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGAAAGCA S10V5-5 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA 
S10V5-6 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V5-8 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-6 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-1 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCGAGCA S10V5-1 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCGAGCA S10V4-2 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-14 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V6-7 ATGGGACCAGGGAGAGCATTTTATGCAACAGGAGACATAATAGGAGATATAAGGCAAGCA S10V4-5 ATGGGACCAGGGAGAGCATTTTATGCAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-16 ATGGGACCAGGGAGAGCATTCTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V6-6 ATGGGACCAGGGAGAGCATTTTATGCAACAGGAGACATAATAGGAGATATAAGGCAAGCA S10V4-9 ATGGGACCAGGGAGAGCATTCTATACAACAGGAGACATAATAGGAGATATAAGGCAAGCA S10V6-3 ATGGGACCAGGGAGAGCATTGTATACAACAGGAGACATAATAGGAGATATAAGGCAAGCA S10V6-9 ATGGGACCAGGGAGAGTATTTTATGCAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V6-10 ATGGGACCAGGGAGAGCATTTTATGCAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V6-1 ATGGGACCAGGGAGAGCATTTTATGCAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V2-2 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V6-8 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-4 ATGGGACCAGGGAGAGTATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V1-5 ATGGGGCCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V2-1 ATGGGACCAGGGAGAGCACTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V2-3 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAGTAATAGGAGATATAAGGCAAGCA S10V2-6 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V2-4 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V1-2 ATGGGACCAGGGAGAGCATTATATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V1-3 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAACAGGAGATATAAGGCAAGCA S10V1-4 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V1-6 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGGAATAATAGGAGATATAAGGCAAGCA S10V1-7 
ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V1-1 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V2-5 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-7 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAATAGGAGATATAAGGCAAGCA S10V4-10 ATGGGACCAGGGAGAGCATTTTATACAACAGGAGAAATAGTAGGAGATATAAGGCAAGCA S10V6-4 ATGGGACCAGGGAGAGTATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAAGA S10V6-5 ATGGGACCAGGGAGAGCATTGTATACAACAGGAGAAATAATAGGAGATATAAGGCAAAGA ***** ********** * * *** ********* * ************* * * S10V6-2 CATTGTAACCTTAGTAGAACAGAATGGAATGACACTTTAAAACAGGCTGTCGACAAATTA S10V5-3 CATCGTAACCTTAGTAGAACAGAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-8 CACTGTAGCCTTAGTAGAACAAAATGGCATGACACTTTAAAACAGGTAGTTGACAAGTTA S10V4-15 CATTGTAACCTTAGTAGAACAAAATGGCATGACACTTTAAAACAGGTAGTTGACAAGTTA S10V5-4 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTAACAAATTA S10V4-11 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAGGTTA S10V5-7 CATTGTAACCTTAGTAGAACAGAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V5-9 CATTGTAACCTTAGTTGAACGAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V5-10 CATTGTAACCTTAGTTGAACGAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-3 CATTGTAACCTTAGTAGAACAGAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-12 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-13 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V5-2 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V5-5 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTAACAAATTA S10V5-6 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGCTGACAAATTA S10V5-8 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGCTGACAAATTA S10V4-6 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-1 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V5-1 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTAACAAATTA S10V4-2 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-14 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAGAACAGGTAGTTGACAAATTA S10V6-7 
CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGCTGTTGCCAAATTA S10V4-5 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-16 CATTGTAACCTTAGTAGAACAAAATGGCATGACACTTTAAAACAGGTAGTTGACAAGTTA S10V6-6 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAAGTAGTTGACAAATTA S10V4-9 CATTGTAACCTTAGTAGAACAAAATGGCATGACACTTTAAAACAGGTAGTTGACAAATTA S10V6-3 CATTGTAACATTAGTAGAACAGAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V6-9 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGCCAAATTA S10V6-10 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V6-1 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGCCAAATTA S10V2-2 CATTGTAACCTTAGTAGAACAAAATGGAACGACACTTTAAAACAGGTAGTTGACAAATTA S10V6-8 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-4 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-5 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V2-1 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACGAATTA S10V2-3 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V2-6 CATTGAAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V2-4 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-2 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-3 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-4 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-6 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-7 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V1-1 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V2-5 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-7 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V4-10 CATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGTAGTTGACAAATTA S10V6-4 TATTGTAACCTTAGTAGAACAGAATGGAATGACACTTTAAAACAAGTAGTTGACAAATTA S10V6-5 TATTGTAACCTTAGTAGAACAAAATGGAATGACACTTTAAAACAGGGTGCTGACAAATTA * * * * ***** **** ***** * ********* **** * * * *** S10V6-2 
AGAGAACAATTTAGGAACAAAACAATAATCTTTAATCAATCCTCA S10V5-3 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-8 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-15 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-4 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-11 AGAGAACAATTTAGGAATAAAGCAATAATCGTTAATCAATCCTCA S10V5-7 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-9 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-10 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-3 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-12 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-13 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-2 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-5 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-6 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-8 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-6 AGGGAACAATTTAGGAATAAAACAATAATCCTTAATCAATCCTCA S10V4-1 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V5-1 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-2 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-14 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-7 AGAGAACAATTTAGGAACAAAACAATAATCTTTACTCAATCCTCA S10V4-5 AGAGAACAATTTAGGAATAAAACAATAATCTTTAAGCAATCCTCA S10V4-16 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-6 AGAGAACAATTTAGGAACAAAACAATAATCTTTAATCAATCCTCA S10V4-9 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-3 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-9 AGAGAACAATTTAGGAATAAAACAATAATCTTTACTCAATCCTCA S10V6-10 AGAGAACAATTTAGGAATAAAACAATAATCTTTACTCAATCCTCA S10V6-1 AGAGAACAATTTAGGAATAAAACAATAATCTTTACTCAATCCTCA S10V2-2 AGAGAGCAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-8 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-4 AGGGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-5 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V2-1 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V2-3 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V2-6 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA 
S10V2-4 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-2 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-3 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-4 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-6 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-7 GGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V1-1 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V2-5 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-7 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V4-10 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-4 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA S10V6-5 AGAGAACAATTTAGGAATAAAACAATAATCTTTAATCAATCCTCA * ** *********** *** ******** *** *********
CLUSTAL FORMAT: MUSCLE SUBJECT 11 multiple sequence alignment S11V1-4 GAGGTAATAATTAGATCTAAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-8 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-2 GAGGTAATAATTAGATCTGAGAATTTCTCAAATAATGCTAAAAACATAATAGTACAGCTG S11V3-5 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-2 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-2 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-6 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-5 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-1 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-9 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-7 GAGGTAATAATTAGATCTAAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-1 GAGGTAATAATTAGATCTAAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-4 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-1 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V4-3 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-10 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-8 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-5 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-6 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-4 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-3 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-5 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-6 GAGGTAATAATTAGGTCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-1 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-3 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-4 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-6 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-9 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG 
S11V4-7 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V1-3 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V2-2 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG S11V3-7 GAGGTAATAATTAGATCTGAGAATTTCTCAAACAATGCTAAAAACATAATAGTACAGCTG ************** *** ************* *************************** S11V1-4 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-8 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-2 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACCAAGGATAATA S11V3-5 AATAAATCTGTAGTAATTAATTGTACAAGACCCGATAACACTATAAAACAAAGGATAATA S11V1-2 AATGAATCTGTAGTAATTACTTGTACAAGACCCGACAACACTATCAAACAAAGGATAATA S11V4-2 AATGAATCTGTAGTAATTAATTGTGCAAGACCCGACTACACTATAAAACAAAGGATAATA S11V4-6 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACTACACTATAAAACAAAGGATAATA S11V4-5 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-1 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-9 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACTACACTATAAAACAAAGGATAATA S11V1-7 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V1-1 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-4 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-1 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-3 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-10 AATGAATCTGTAGTAATTAATTGTACGAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-8 AATGAATCTGTAGTAATTAATTGTACCAGACCCGACAACACTATAAAACAAAGGATAATA S11V1-5 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V1-6 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-4 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-3 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-5 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-6 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-1 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA 
S11V3-3 AATGGATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-4 AATGAATCTGTAGTGATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-6 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-9 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V4-7 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V1-3 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V2-2 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA S11V3-7 AATGAATCTGTAGTAATTAATTGTACAAGACCCGACAACACTATAAAACAAAGGATAATA *** ********* **** **** * ******** ******* **** ********** S11V1-4 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-8 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-2 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAGATATAAGACAAGCA S11V3-5 CATATAGGACCAGGGAGACCATTCTATACAACAGAAATAAAAGGAAATATAAGACAAGCA S11V1-2 CATATAGGACCAGGGAGACCATTCTATACGACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-2 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-6 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-5 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V2-1 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-9 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V1-7 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V1-1 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-4 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V4-1 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAGATATAAGACAAGCA S11V4-3 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAGATATAAGACAAGCA S11V3-10 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-8 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V1-5 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V1-6 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V2-4 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V2-3 
CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V2-5 CATATAGGACCAGGGAGACCATTCTATACAACAAGAATAAAAGGAAATATAAGACAAGCA S11V2-6 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-1 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-3 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-4 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-6 CATGTAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-9 CATATAGGACCAGGGAGACCATTCTATACAGCAGGAATAAAAGGAAATATAAGACAAGCA S11V4-7 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V1-3 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V2-2 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA S11V3-7 CATATAGGACCAGGGAGACCATTCTATACAACAGGAATAAAAGGAAATATAAGACAAGCA *** ************************* ** ********** ************** S11V1-4 CATTGTAACGTTAGTGAGGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-8 CATTGTAACGTTAGTGGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAAAAACTTA S11V3-2 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-5 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V1-2 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-2 CATTGTAACGTTAGTGGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-6 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTATAAAATTA S11V4-5 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V2-1 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTAAGAAAATTA S11V4-9 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAAAAAATTA S11V1-7 CATTGTAACGTTAGTAGAGGACAATGGGATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V1-1 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-4 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-1 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-3 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAAAAAATTA S11V3-10 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-8 
CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V1-5 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTGGAAAATTA S11V1-6 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAGATTA S11V2-4 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAGATTA S11V2-3 CATTGTAACGTTAGTAGAGGACAATGGAATAGAACTTTAGAACAGGTAGTTAGAAAATTA S11V2-5 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V2-6 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-1 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-3 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-4 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-6 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-9 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V4-7 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V1-3 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V2-2 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA S11V3-7 CATTGTAACGTTAGTAGAGGACAATGGAATAAAACTTTAGAACAGGTAGTTAGAAAATTA *************** ********* *** ****************** ** *** S11V1-4 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-8 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-2 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-5 AGAGAACAATATGGACCGAATAAAACAATAGTCTTTAAGCAACCCATA S11V1-2 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-2 AGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-6 AGAGAACAATATGGGCCTAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-5 AGAGAACAATATGGGCCTAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-1 AGAGAACAATATGGACTGAACAAAACAATAGTCTTTAAGCAACCCATA S11V4-9 AGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA S11V1-7 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V1-1 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-4 AGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-1 AGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-3 
AGAGAACAATATGGACTAAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-10 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-8 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V1-5 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V1-6 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-4 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-3 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-5 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-6 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-1 AGAGAACAATATGGACCGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-3 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-4 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-6 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-9 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V4-7 AGAGAACAATATGGACTGAATAAAACAATAGCCTTTAAGCAACCCATA S11V1-3 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V2-2 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA S11V3-7 AGAGAACAATATGGACTGAATAAAACAATAGTCTTTAAGCAACCCATA ************** * ** ********** ****************
Theta Calculations
Subject | Number of Clones | S | Theta |
---|---|---|---|
3 | 39 | 42 | 9.87 |
10 | 49 | 73 | 16.3 |
11 | 32 | 33 | 8.13 |
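The arithmetic behind the table can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the toy alignment in the usage lines is made up. The table values appear to match S divided by the harmonic sum 1 + 1/2 + ... + 1/n, with n equal to the number of clones. The conventional Watterson estimator stops the sum at n - 1 and would give slightly larger values (about 9.93 instead of 9.87 for subject 3), so the sketch makes the number of terms a switch.

```python
from typing import Sequence


def count_segregating_sites(aligned: Sequence[str]) -> int:
    """Count alignment columns in which at least two sequences differ.

    Assumes every sequence has already been aligned to the same length.
    A gap character, if present, is treated as just another state.
    """
    if not aligned:
        return 0
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")
    return sum(1 for column in zip(*aligned) if len(set(column)) > 1)


def harmonic_sum(terms: int) -> float:
    """Return 1 + 1/2 + ... + 1/terms."""
    return sum(1.0 / i for i in range(1, terms + 1))


def theta(s: int, n_clones: int, watterson: bool = False) -> float:
    """Divide S by a harmonic sum.

    watterson=False sums to n_clones (appears to match the table above);
    watterson=True sums to n_clones - 1 (the conventional estimator).
    """
    terms = n_clones - 1 if watterson else n_clones
    if terms < 1:
        raise ValueError("need at least two clones")
    return s / harmonic_sum(terms)


# Values copied from the table: subject -> (number of clones, S)
table = {3: (39, 42), 10: (49, 73), 11: (32, 33)}
for subject, (n, s) in table.items():
    print(subject, round(theta(s, n), 2), round(theta(s, n, watterson=True), 2))

# Made-up toy alignment, only to show how S is counted
print(count_segregating_sites(["ACGT", "ACGA", "ACCT"]))
```

Printing both variants side by side makes it easy to see how much the choice of n versus n - 1 changes each theta.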
Activity 3: Defining your HIV evolution project
- To develop my HIV evolution project, I looked at the complete data from the HIV paper. (Markham et al. 1998 data table)
- Looking at some of the graphs and the main table in Markham et al. 1998, we observed that the rapid and moderate progressors could potentially fall into the same category.
The question
- Should moderate progressors and rapid progressors fall into the same group of subjects?
Hypothesis
- Yes, they can be placed in the same category because the authors found no statistically significant difference between the rapid and moderate progressors, and because outliers present in both groups may have led to the two groups being categorized separately.
Data
- To carry out the experiment, we will look only at the rapid and moderate progressors. We will exclude subjects 4 (rapid) and 7 (moderate) altogether because they started with over 1,000 CD4 cells, which could skew the data. We will look only at the first visit and the 2-year mark for each subject because most of the rapid progressors did not make it past 2 years, so the data will be more uniform between the rapid and moderate progressors. We will also exclude subject 1 (rapid) for a similar reason: they had only three visits, which may also imbalance the data.
- Therefore, the subjects used will be 3, 10, 11, and 15 for rapid, and 5, 6, 8, 9, and 14 for moderate.
- Looking at the clones for each subject's first visit (Nucleotide Sequence Data, found on the Week 5 Assignment), we will choose 3 clones per subject. We decided on 3 because subject 6 has the fewest clones (3), and we want to keep the data as uniform as possible. We will choose the clones randomly with a calculator so that the selection is as unbiased as possible.
- Looking at the clones for the 2-year mark (for some subjects this is visit 4, for others visit 5), we will choose 6 clones per subject. We decided on 6 because subject 8 has the fewest clones (6), and we want to keep the data as uniform as possible.
Notes
- Rapid (number of clones available):
- Subject 3: first visit: 4 | 2-year mark (V4): 9
- Subject 10: first visit: 7 | 2-year mark (V4): 16
- Subject 11: first visit: 7 | 2-year mark (V4): 9
- Subject 15: first visit: 12 | 2-year mark (V4): 10
- Moderate (number of clones available):
- Subject 5: first visit: 8 | 2-year mark (V4): 7
- Subject 6: first visit: 3 | 2-year mark (V5): 9
- Subject 8: first visit: 5 | 2-year mark (V4): 6
- Subject 9: first visit: 5 | 2-year mark (V5): 9
- Subject 14: first visit: 6 | 2-year mark (V5): 7
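The random selection described under Data (3 clones per subject at the first visit, 6 at the 2-year mark) could also be done with a short script instead of a calculator. The sketch below is one possible approach, not the procedure actually used. The clone counts are copied from the Notes above. The function names and the seed are hypothetical, and it assumes clones are numbered 1 through the count, which may not match the actual clone labels for every subject.

```python
import random

# Clone counts copied from the Notes: subject -> (first visit, 2-year mark)
CLONE_COUNTS = {
    3: (4, 9), 10: (7, 16), 11: (7, 9), 15: (12, 10),   # rapid
    5: (8, 7), 6: (3, 9), 8: (5, 6), 9: (5, 9), 14: (6, 7),  # moderate
}
FIRST_VISIT_SAMPLE = 3  # subject 6 has only 3 first-visit clones
TWO_YEAR_SAMPLE = 6     # subject 8 has only 6 clones at the 2-year mark


def pick_clones(available: int, k: int, rng: random.Random) -> list:
    """Pick k distinct clone numbers from 1..available without replacement."""
    return sorted(rng.sample(range(1, available + 1), k))


def select_all(seed: int = 0) -> dict:
    """Return subject -> (first-visit picks, 2-year picks).

    A fixed seed makes the random choice repeatable, so the same
    selection can be regenerated and checked later.
    """
    rng = random.Random(seed)
    selection = {}
    for subject, (first, two_year) in CLONE_COUNTS.items():
        selection[subject] = (
            pick_clones(first, FIRST_VISIT_SAMPLE, rng),
            pick_clones(two_year, TWO_YEAR_SAMPLE, rng),
        )
    return selection


for subject, (first_picks, two_year_picks) in select_all(seed=0).items():
    print(f"Subject {subject}: first visit {first_picks}, 2-year mark {two_year_picks}")
```

Recording the seed alongside the results would let a partner reproduce exactly the same clone choices.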
Data and Files
All of the files below are phylogenetic trees made on the Phylogeny.fr website and uploaded to the wiki:
- File:Mkphylotreehiv1.PNG
- Phylogenetic tree of five different clones: S3V3C6, S3V5C3, S12V4C1, S2V4C2, and S11V4C3.
- File:Mkphylogenetictreeactivity2.PNG
- Phylogenetic tree of three clones from four subjects (3, 5, 10, and 11).
- File:Mkphylotreesubject3.PNG
- Phylogenetic tree of all clones from every visit from subject 3.
- File:Mkphylotreesubject10.PNG
- Phylogenetic tree of all clones from every visit from subject 10.
- File:Mkphylotreesubject11.JPG
- Phylogenetic tree of all clones from every visit from subject 11.
Scientific Conclusion
Browsing the NCBI resources and the HIV sequence data makes one familiar with finding sequences using bioinformatics tools, which is needed to carry out a research project. Making phylogenetic trees from different combinations of sequences and analyzing them prepares the student to analyze the data in their own project. Lastly, learning to calculate a statistic such as theta, which estimates the genetic diversity among a set of sequences, helps determine whether an experiment's results are meaningful and add to what is known about HIV-1 evolution. Together, these activities give the student the experience needed to begin their own project.
Acknowledgements
- I copied and modified the protocol from Week 5 for this assignment.
- I used the syntax for tables from Week 5 for this assignment.
- I used the Phylogeny.fr website to make phylogenetic trees for clones from the HIV subjects and to obtain the alignments in CLUSTAL format.
- I asked my homework partner, Maya, questions about the protocol and formatting, and worked with her on our HIV evolution project outside of class.
- I emailed Dr. Dahlquist a question about the formatting of the data and files section.
- I met with Dr. Dahlquist outside of class for help on our research project.
- I used the harmonic sum calculator found at https://www.math.utah.edu/~carlson/teaching/calculus/harmonic.html in order to calculate theta.
- Except for what is noted above, this individual journal entry was completed by me and not copied from another source.
Mking44 (talk) 18:26, 16 February 2020 (PST)
References
- OpenWetWare. (2020). BIOL368/S20:Week 5. Retrieved February 13, 2020, from https://openwetware.org/wiki/BIOL368/S20:Week_5
- The Harmonic Series. (n.d.). Retrieved February 16, 2020, from https://www.math.utah.edu/~carlson/teaching/calculus/harmonic.html