BIOL368/F14:Nicole Anguiano Week 4: Difference between revisions

From OpenWetWare
Jump to navigationJump to search
(→‎Results: Fleshed out activity 2 part 1 results)
(→‎Exploring HIV Evolution In-Class Activity: removed activity 2 part 2 questions)
(4 intermediate revisions by the same user not shown)
Line 20: Line 20:
===Results===
===Results===
=====Activity 1, Part 3=====
=====Activity 1, Part 3=====
[[Image:Sequence_Alignment_ClustalW.png]]
[[Image:Sequence_Alignment_ClustalW.png | thumb | none | <b>Figure 1</b>: The results from the sequence alignment from the Clustal W of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2. ]]
*<b>Figure 1</b>: The results from the sequence alignment from the Clustal W of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2.  
*AF016767.2 and AF016768.2 come from subject 1, but AF016768.2 is slightly more similar to AF016818.2 than AF016767.2, having 6 differences with AF016767.2 and  5 differences with AF016818.2. AF016767.2 and AF016818.2 are different, having 11 differences between the two of them. AF089153.2 is the most different from all of them, having 21 locations where it is completely different from the other 3, three of them being areas that do not exist in the gene (Fig. 1).  
*AF016767.2 and AF016768.2 come from subject 1, but AF016768.2 is slightly more similar to AF016818.2 than AF016767.2, having 6 differences with AF016767.2 and  5 differences with AF016818.2. AF016767.2 and AF016818.2 are different, having 11 differences between the two of them. AF089153.2 is the most different from all of them, having 21 locations where it is completely different from the other 3, three of them being areas that do not exist in the gene (Fig. 1).  
[[Image:A1P3_ClustalW_Dendrogram.png]]
[[Image:A1P3_ClustalW_Dendrogram.png | thumb | none | <b>Figure 2</b>: The unrooted tree resulting from the ClustalW of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2. ]]
*<b>Figure 2</b>: The unrooted tree resulting from the ClustalW of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2.  
*Of the four genes used, three of the four (AF016768.2, AF016767.2, and AF016818.2) have a relatively high degree of similarity, which can be seen in the unrooted tree. Those three are clustered relatively closely together, with AF016768.2 having a closer similarity to AF016818.2 and AF016767.2 than AF016818.2 and AF016767.2 have with each other. AF089153.2 is far away from the others, indicating a large amount of dissimilarity between it and the other three genes (Fig. 2).
*Of the four genes used, three of the four (AF016768.2, AF016767.2, and AF016818.2) have a relatively high degree of similarity, which can be seen in the unrooted tree. Those three are clustered relatively closely together, with AF016768.2 having a closer similarity to AF016818.2 and AF016767.2 than AF016818.2 and AF016767.2 have with each other. AF089153.2 is far away from the others, indicating a large amount of dissimilarity between it and the other three genes (Fig. 2).
=====Activity 2, Part 1=====
=====Activity 2, Part 1=====
[[Image:A2P1_ClustalW.png]]
[[Image:A2P1_ClustalW.png | thumb | none | <b>Figure 3</b>: The sequence alignment resulting from the comparison of 12 genes, 3 each from subjects 7, 15, 13, and 1 (S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9).]]
*<b>Figure 3</b>: The sequence alignment resulting from the comparison of 12 genes, 3 each from subjects 7, 15, 13, and 1 (S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9).
*The most similarity is seen between clones from the same subject, as expected. However, a higher degree of dissimilarity than would be expected is seen among the clones of subject 15, with 24 areas in which 1 clone is different from the rest. Conversely, the clones from subject 13 are extremely similar (which had 1 area in which 1 clone is different from the rest), as are the clones from subject 1 (which had 2 areas in which 1 clone is different from the rest. Subject 7 lies in between the group, with 14 areas in which 1 clone is different from the rest.  
*The most similarity is seen between clones from the same subject, as expected. However, a higher degree of dissimilarity than would be expected is seen among the clones of subject 15, with 24 areas in which 1 clone is different from the rest. Conversely, the clones from subject 13 are extremely similar (which had 1 area in which 1 clone is different from the rest), as are the clones from subject 1 (which had 2 areas in which 1 clone is different from the rest. Subject 7 lies in between the group, with 14 areas in which 1 clone is different from the rest.  
[[Image:A2P1_Tree.png]]
[[Image:A2P1_Tree.png | thumb | none | <b>Figure 4</b>: The tree resulting from the comparison of the 12 genes S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9.]]
*<b>Figure 4</b>: The tree resulting from the comparison of the 12 genes S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9.
*The tree confirms what we saw in the sequence alignment. Subject 15 has the highest degree of difference, and its branches are quite far apart. The clones from subject 1 and the clones on subject 13 are extremely similar and are almost entirely overlapping due to their similarity. The clones from subject 7 are not as similar as those from subjects 1 and 13, but not as different as the clones from subject 15.
*The tree confirms what we saw in the sequence alignment. Subject 15 has the highest degree of difference, and its branches are quite far apart. The clones from subject 1 and the clones on subject 13 are extremely similar and are almost entirely overlapping due to their similarity. The clones from subject 7 are not as similar as those from subjects 1 and 13, but not as different as the clones from subject 15.
=====Activity 2, Part 2=====
=====Activity 2, Part 2=====
<!--[[Image:A2P2_S13_SA.png | thumb | none | <b>Figure 5</b>: The sequence alignment from the comparison of all clones from Subject 13, used to calculate S and θ in Table 1.]]
[[Image:A2P2_S13_CDM.png | thumb | none | <b>Figure 6</b>: The clustal distance matrix from the Clustal W of all the clones from Subject 13, used to calculate the min and max difference in Table 1.]]
[[Image:A2P2_S1_SA.png | thumb | none | <b>Figure 7</b>: The sequence alignment from the comparison of all clones from Subject 1, used to calculate S and θ in Table 1.]]
[[Image:A2P2_S1_CDM.png | thumb | none | <b>Figure 8</b>: The clustal distance matrix from the Clustal W of all the clones from Subject 1, used to calculate the min and max difference in Table 1.]]
[[Image:A2P2_S7_SA.png | thumb | none | <b>Figure 9</b>: The sequence alignment from the comparison of all clones from Subject 7, used to calculate S and θ in Table 1.]]
[[Image:A2P2_S7_CDM.png | thumb | none | <b>Figure 10</b>: The clustal distance matrix from the Clustal W of all the clones from Subject 7, used to calculate the min and max difference in Table 1.]]
Commented out due to wiki preventing image upload.
-->
{| border="1"
|-
! Subject !! Number of Clones !! S !! Theta !! Min difference !! Max difference
|-
| 13 || 4 || 3 || 2 || 1 || 2
|-
| 1 || 13 || 26 || <math>\textstyle\frac{55440}{6617}</math> || 1 || 14
|-
| 7 || 10 || 28 || <math>\textstyle\frac{70560}{7129}</math> || 2 || 18
|} <!-- Many thanks to http://www.calculatorsoup.com/calculators/math/fractions.php for helping with the fraction calculations and to http://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style/Dates_and_numbers#Fractions with helping me figure out how to do fractions in wiki formatting-->
*<b>Table 1</b>: The S, θ (theta), min difference, and max difference values from subjects 13, 1, and 7. S is calculated using the number of differences present in the sequence alignment. θ is calculated using the formula <math>S/(\sum_{i=1}^{n-1} \frac{1}{i})</math>. <!--Many thanks to http://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style/Mathematics--> The min and max differences are calculated by taking the lowest and highest numbers in the clustal distance matrix respectively and multiplying them by the number of base pairs, which was 285 in every case. The number is then rounded to the nearest integer.
*The values varied depending on how similar each subject's clones were to one another. The most similar clones were found among subject 13, and the most diverse among subject 7.
<!--[[Image:A2P2_S2S4_CMD.png | thumb | none | <b>Figure 11</b>: The clustal distance matrix between the clones of subject 2 and subject 4, used to calculate the min and max difference in Table 2.]]
[[Image:A2P2_S5S10_CDM.png | thumb | none | <b>Figure 12</b>: The clustal distance matrix between the clones of subject 5 and subject 10, used to calculate the min and max difference in Table 2.]]
[[Image:A2P2_S6S12_CDM.png | thumb | none | <b>Figure 13</b>: The clustal distance matrix between the clones of subject 6 and subject 12, used to calculate the min and max difference in Table 2.]]
Commented out due to wiki preventing image upload.
-->
{| border="1"
|-
! Pair of subjects being compared !! Min difference !! Max difference
|-
| 2 and 4 || 21 || 26
|-
| 5 and 10 || 30 || 36
|-
| 6 and 12 || 25 || 28
|}
*<b>Table 2</b>: The min and max differences from the clones of Subjects 2 and 4, 5 and 10, and 6 and 12. The differences were taken comparing across subjects, not within each subject's clones. The min and max differences were calculated the same was as in Table 1.


===Interpretations===
===Interpretations===
Line 87: Line 119:
#Copy and paste your tree from the <i>Biology Workbench</i> into a word processor and print it out to share with the class.
#Copy and paste your tree from the <i>Biology Workbench</i> into a word processor and print it out to share with the class.
#*See figure 4 above.
#*See figure 4 above.
=====Activity 2, Part 2=====
===Conclusions===
===Conclusions===
=====Activity 1, Part 2=====
=====Activity 1, Part 2=====

Revision as of 18:35, 21 September 2014

Exploring HIV Evolution In-Class Activity

Methods

Activity 1, Part 2
  • First, I opened the link to the NCBI website and searched for the Markham, et al article. Once found, I switched the search type to nucleotide and searched again using the same search criteria that had led me to the original paper (Markham RB[AUTHOR]). Then, I selected a gene from the list. I saved it in fasta format.
  • I selected 3 other files, one from subject 4, one from subject 1, and a second from subject 1, and saved all of them in fasta format.
Activity 1, Part 3
  • I navigated to the Biology Workbench site and created an account, then entered the workbench and selected "nucleic tools". I selected "Add New Nucleic Sequence" and hit "Run". I uploaded the fasta file I had created before and saved it to my workbench under the label "S2V4C2".
  • I then imported the rest of the files that I had saved, and saved them under the same labeling scheme (Subject, Visit, Clone number).
  • I selected all of the files I'd uploaded into my workbench, chose the option "ClustalW - Multiple Sequence Alignment", and ran it. I submitted it with the default settings.
  • I clicked "Session Tools" and started a new session.
Activity 2, Part 1
Activity 2, Part 2
  • I selected the four clones from Subject 13 (S13V1-1, S13V1-2, S13V1-3, S13V1-4), and ran a ClustalW on them using the default settings. I counted the number of differences in the sequence analysis. I then selected the "Import Alignment(s)" button. I selected the alignment I had just imported and ran a "CLUSTALDIST - Generate Distance Matrix with Clustal W" and ran it using the default settings. I used the values from the Clustal Distance Matrix to calculate the min and max difference by multiplying the smallest and largest numbers by the total number of base pairs, respectively.
  • I repeated the above steps with the 13 clones from subject 1, then the 10 clones from subject 7, then the combined 9 clones of subjects 2 and 4 (6 from 2, 3 from 4), then the combined 15 clones from subjects 5 and 10 (8 from 5 and 7 from 10), then the combined 7 clones from subjects 6 and 12 (3 from 6 and 4 from 12).

Results

Activity 1, Part 3
Figure 1: The results from the sequence alignment from the Clustal W of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2.
  • AF016767.2 and AF016768.2 come from subject 1, but AF016768.2 is slightly more similar to AF016818.2 than AF016767.2, having 6 differences with AF016767.2 and 5 differences with AF016818.2. AF016767.2 and AF016818.2 are different, having 11 differences between the two of them. AF089153.2 is the most different from all of them, having 21 locations where it is completely different from the other 3, three of them being areas that do not exist in the gene (Fig. 1).
Figure 2: The unrooted tree resulting from the ClustalW of the 4 genes AF016768.2, AF016818.2,AF016767.2, and AF089153.2.
  • Of the four genes used, three of the four (AF016768.2, AF016767.2, and AF016818.2) have a relatively high degree of similarity, which can be seen in the unrooted tree. Those three are clustered relatively closely together, with AF016768.2 having a closer similarity to AF016818.2 and AF016767.2 than AF016818.2 and AF016767.2 have with each other. AF089153.2 is far away from the others, indicating a large amount of dissimilarity between it and the other three genes (Fig. 2).
Activity 2, Part 1
Figure 3: The sequence alignment resulting from the comparison of 12 genes, 3 each from subjects 7, 15, 13, and 1 (S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9).
  • The most similarity is seen between clones from the same subject, as expected. However, a higher degree of dissimilarity than would be expected is seen among the clones of subject 15, with 24 areas in which 1 clone is different from the rest. Conversely, the clones from subject 13 are extremely similar (which had 1 area in which 1 clone is different from the rest), as are the clones from subject 1 (which had 2 areas in which 1 clone is different from the rest. Subject 7 lies in between the group, with 14 areas in which 1 clone is different from the rest.
Figure 4: The tree resulting from the comparison of the 12 genes S7V1-1, S7V1-4, S7V1-8, S15V1-3, S15V1-7, S15V1-3, S13V1-1, S13V1-2, S13V1-3, S1V1-3, S1V1-6, and S1V1-9.
  • The tree confirms what we saw in the sequence alignment. Subject 15 has the highest degree of difference, and its branches are quite far apart. The clones from subject 1 and the clones on subject 13 are extremely similar and are almost entirely overlapping due to their similarity. The clones from subject 7 are not as similar as those from subjects 1 and 13, but not as different as the clones from subject 15.
Activity 2, Part 2
Subject Number of Clones S Theta Min difference Max difference
13 4 3 2 1 2
1 13 26 [math]\displaystyle{ \textstyle\frac{55440}{6617} }[/math] 1 14
7 10 28 [math]\displaystyle{ \textstyle\frac{70560}{7129} }[/math] 2 18
  • Table 1: The S, θ (theta), min difference, and max difference values from subjects 13, 1, and 7. S is calculated using the number of differences present in the sequence alignment. θ is calculated using the formula [math]\displaystyle{ S/(\sum_{i=1}^{n-1} \frac{1}{i}) }[/math]. The min and max differences are calculated by taking the lowest and highest numbers in the clustal distance matrix respectively and multiplying them by the number of base pairs, which was 285 in every case. The number is then rounded to the nearest integer.
  • The values varied depending on how similar each subject's clones were to one another. The most similar clones were found among subject 13, and the most diverse among subject 7.
Pair of subjects being compared Min difference Max difference
2 and 4 21 26
5 and 10 30 36
6 and 12 25 28
  • Table 2: The min and max differences from the clones of Subjects 2 and 4, 5 and 10, and 6 and 12. The differences were taken comparing across subjects, not within each subject's clones. The min and max differences were calculated the same was as in Table 1.

Interpretations

Activity 1, Part 2
Activity 1, Part 3
Activity 2, Part 1
Activity 2, Part 2

Questions

Activity 1, Part 2
  1. What was the accession number of the sequence you chose?
    • The Accession number was AF016818.
  2. Which subject of the study was that HIV sequence from? Which section of the record contains information about who the HIV was collected from?
    • The sequence was from subject 2. The "Definition" section of the record contained the information about where the HIV was collected from.
Activity 2, Part 1
  • As a preliminary analysis you should generate a multiple sequence alignment and distance tree for 12 of these sequences (3 clones from each of 4 subjects).
    • See figures 3 and 4 above in results.
  • Use the data table below (Table 2) to keep a record of the data you analyzed.
Subject Clone #
1 3
1 6
1 9
7 1
7 4
7 8
13 1
13 2
13 3
15 3
15 7
15 11
  1. Do the clones from each subject cluster together?
    • The clones from each subject are in clusters that range from very loose to tight. The clones from subjects 1 and 13 are clustered very close together. The clones from subject 7 are a little farther apart than those seen in 1 and 13, with clone numbers 1 and 4 being more close together than clone 8. However, the clones from subject 15 are very far apart from each other, indicating that they are only very loosely related to one another.
  2. Do some subjects clones show more diversity than others?
    • Yes. Subject 7's clones shows more diversity than subjects 1 and 13. Subject 15's clones show more diversity than all of the other clones, with each of the branches being very long and far apart.
  3. Write a brief description of your tree and how you interpret the clustering pattern with respect to the similarities and potential evolutionary relationships between the subject's HIV sequences.
    • My tree contains a relatively short central trunk, with branches of varying length extending off. The branch with subject 13's clones is very distant from the rest, indicating it is the most different from all of the other subject's clones. However, the clones from subject 13 are extremely close together, with clone 2 being located at the branching point in which the rest separate off. This indicates that it may be a common ancestor of the two, or that it may be equally related to both of them. Clone 1 is a very short distance from clone 2, and is located on an extremely tiny branch that is so small that it is hard to see, indicating that it is very closely related to the other clones. Clone 3 is the furthest up the branch, indicating the most diversity. Overall, the clones from subject 13 are very closely related and not very diverse. The next longest branch of the main trunk is Subject 7. Subject 7's clones are much more loosely related than subject 13's. However, the clones are relatively close together, indicating that they are closely related. Clones 1 and 4 are more closely related than clone 8, as clone 8 has a branch to itself while clones 1 and 4 stem off of the same terminal point. The next longest branch off the trunk is Subject 1. Like subject 13, subject 1's clones are very closely related. Clone number 3 is located on the branching point that splits off into clones 6 and 9, indicating that it may be the common ancestor of 6 and 9, and 6 and 9 are also closely related. The final branch is for subject 15, the most dissimilar of the subjects. Each of the clones of subject 15 is located on a longer branch from those around it, indicating that they are not very similar and are much more diverse. Clone 3 is the most diverse of the clones, having diverged first and being on a branch of its own. Clones 7 and 11 split off the same branching point, but their respective branches are very long, indicating that they are not very similar.
  4. Copy and paste your tree from the Biology Workbench into a word processor and print it out to share with the class.
    • See figure 4 above.

Conclusions

Activity 1, Part 2
Activity 1, Part 3
Activity 2, Part 1
Activity 2, Part 2

Overall Conclusions

Electronic Lab Notebook

Links

Nicole Anguiano
BIOL 368, Fall 2014

Assignment Links
Individual Journals
Class Journals