Harvard:Biophysics 101/2007/Notebook:Kaull/2007-3-15: Difference between revisions
Revision as of 05:33, 15 March 2007
Homework 3/15/07
See http://openwetware.org/wiki/Harvard:Biophysics_101/2007/03/13 for the assignment.
Part 1: Walk-Through of Analyzing a Gene
- Acquire sequence of interest
>example1
CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG
CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA
CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC
CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT
ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG
CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC
GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC
- Send to BLAST - identify gene, and note mutations
>ref|NT_030059.12|Hs10_30314 Homo sapiens chromosome 10 genomic contig, reference assembly
Length=44617998

 Features flanking this part of subject sequence:
   3895 bp at 5' side: hypothetical protein
   425 bp at 3' side: HtrA serine peptidase 1

 Score = 787 bits (397), Expect = 0.0
 Identities = 400/401 (99%), Gaps = 0/401 (0%)
 Strand=Plus/Plus

Query  1         CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG  60
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968870  CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG  42968929

Query  61        CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA  120
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968930  CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA  42968989

Query  121       CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC  180
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968990  CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC  42969049

Query  181       CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT  240
                 |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Sbjct  42969050  CGCGGACGCTGCCTTCGTCCGGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT  42969109

Query  241       ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG  300
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42969110  ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG  42969169

Query  301       CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC  360
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42969170  CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC  42969229

Query  361       GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC  401
                 |||||||||||||||||||||||||||||||||||||||||
Sbjct  42969230  GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC  42969270
- View location of contig to assess where the mutation is located - in this case, it seems to be within a coding region.
- Find the reading frame that covers the mutated region, and determine what the mutation encodes in that frame.
In this case, the SNP (A in the query versus G in the reference, at query position 201) changes GGC -> AGC, or Gly -> Ser, at amino acid 120.
- Search OMIM for the identified region - no disease matches.
In this case, there is a nonconservative substitution in a coding region. Glycine and serine have very different properties, so this change would likely disrupt the protein's function. However, the coding region belongs to a "hypothetical protein", which has no known function or disease association. The patient is not advised to worry at this time.
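The mismatch and codon steps of the walk-through can be sketched in Python. This is only an illustration under stated assumptions, not the method used for the homework: the name `find_mismatches` is hypothetical, the two rows are copied from the 181-240 block of the BLAST alignment, the alignment is taken to be gap-free (the report shows Gaps = 0/401), and the reading frame is assumed from the GGC -> AGC statement above, which places the variant base first in its codon.

```python
# Sketch only: the names below are hypothetical, not part of the homework.

# Query and subject rows for query positions 181-240, copied from the BLAST hit.
QUERY_START = 181
SBJCT_START = 42969050
query_row = "CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT"
sbjct_row = "CGCGGACGCTGCCTTCGTCCGGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT"

# Only the two codons needed here; a real analysis would use a full codon table.
CODON_TO_AA = {"GGC": "Gly", "AGC": "Ser"}


def find_mismatches(query, sbjct, query_start, sbjct_start):
    """List (query_pos, sbjct_pos, sbjct_base, query_base) for each mismatch.

    Assumes a gap-free alignment, so column i maps to start + i on both rows.
    """
    hits = []
    for i, (q, s) in enumerate(zip(query, sbjct)):
        if q != s:
            hits.append((query_start + i, sbjct_start + i, s, q))
    return hits


mismatches = find_mismatches(query_row, sbjct_row, QUERY_START, SBJCT_START)

# Assumption: the variant base is the first base of its codon (GGC -> AGC).
for query_pos, sbjct_pos, ref_base, alt_base in mismatches:
    i = query_pos - QUERY_START
    ref_codon = sbjct_row[i:i + 3]
    alt_codon = query_row[i:i + 3]
    print(query_pos, sbjct_pos, ref_codon, "->", alt_codon,
          CODON_TO_AA.get(ref_codon, "?"), "->", CODON_TO_AA.get(alt_codon, "?"))
```

The partial codon table keeps the sketch short. For any other sequence the frame would have to be established independently, for example from the annotated coding sequence, rather than assumed.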
Part 2: Contribute a Test Case
>KFG (Kay's Favorite Gene)
ATTGCCCCGGTGCTGAGCGGCGCCGCGAGTCGGCCCGAGGCCTCCGGGGACTGCCGTGCCGGGCGGGAGA
CCGCCATGGCGACCCTGGAAAAGCTGATGAAGGCCTTCGAGTCCCTCAAGTCCTTCCAGCAGCAGCAGCA
GCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCCG
CCACCGCCGCCGCCGCCGCCGCCTCCTCAGCTTCCTCAGCCGCCGCCGCAGGCACAGCCGCTGCTGCCTC
AGCCGCAGCCGCCCCCGCCGCCGCCCCCGCCGCCACCCGGCCCGGCTGTGGCTGAGGAGCCGCTGCACCG
ACCGTGAGTTTGGGCCCGCTGCAGCTCCCTGTCCCGGCGGGTCCCAGGCTACGGCGGGGATGGCGGTAAC
CCTGCAGCCTGCGGGCCGGCGACACGAACCCCCGGCCCCGCAGAGACAGAGTGACCCAGCAACCCAGAGC
CCATGAGGGACACCCGCCCCCTCCTGGGGCGAGGCCTTCCCCCACTTCAGCCCCGCTCCCTCACTTGGGT
CTTCCCTTGTCCTCTCGCGAGGGGAGGCAGAGCCTTGTTGGGGCCTGTCCTGAATTCACCGAGGGGAGTC
ACGGCCTCAGCCCTCTCGCCCTTCGCAGGATGCGAAGAGTTGGGGCGAGAACTTGTTTCTTTTTATTTGC