Harvard:Biophysics 101/2007/Notebook:Kaull/2007-3-15


Revision as of 05:33, 15 March 2007

Homework 3/15/07

See http://openwetware.org/wiki/Harvard:Biophysics_101/2007/03/13 for the assignment.

Part 1: Walk-Through of Analyzing Gene

  • Acquire sequence of interest
>example1                                                                      
CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG                    
CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA                    
CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC                    
CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT                    
ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG                    
CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC                    
GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC
  • Submit the sequence to BLAST to identify the gene and note any mutations
>ref|NT_030059.12|Hs10_30314  Homo sapiens chromosome 10 genomic contig, reference assembly
Length=44617998

 Features flanking this part of subject sequence:
   3895 bp at 5' side: hypothetical protein
   425 bp at 3' side: HtrA serine peptidase 1


 Score =  787 bits (397),  Expect = 0.0
 Identities = 400/401 (99%), Gaps = 0/401 (0%)
 Strand=Plus/Plus

Query  1         CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG  60
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968870  CACCCTCGCCAGTTACGAGCTGCCGAGCCGCTTCCTAGGCTCTCTGCGAATACGGACACG  42968929

Query  61        CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA  120
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968930  CATGCCACCCACAACAACTTTTTAAAAGAATCAGACGTGTGAAGGATTCTATTCGAATTA  42968989

Query  121       CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC  180
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42968990  CTTCTGCTCTCTGCTTTTATCACTTCACTGTGGGTCTGGGCGCGGGCTTTCTGCCAGCTC  42969049

Query  181       CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT  240
                 |||||||||||||||||||| |||||||||||||||||||||||||||||||||||||||
Sbjct  42969050  CGCGGACGCTGCCTTCGTCCGGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT  42969109

Query  241       ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG  300
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42969110  ACCGGGGGCAGAACCAGCGCGTGACCGGGGTCCGCGGTGCCGCAACGCCCCGGGTCTGCG  42969169

Query  301       CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC  360
                 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  42969170  CAGAGGCCCCTGCAGTCCCTGCCCGGCCCAGTCCGAGCTTCCCGGGCGGGCCCCCAGTCC  42969229

Query  361       GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC  401
                 |||||||||||||||||||||||||||||||||||||||||
Sbjct  42969230  GGCGATTTGCAGGAACTTTCCCCGGCGCTCCCACGCGAAGC  42969270
  • View the location of the hit on the contig to assess where the mutation lies. In this case, it seems to be within a coding region.
  • Find the reading frame relevant to the mutant region, and what the mutation represents in that frame.
      In this case, the SNP (query position 201; G in the reference, A in the query) results in GGC -> AGC, or Gly -> Ser, at amino acid 120.
  • Search OMIM for the identified region. No disease matches were found.

In this case, there is a nonconservative substitution in a coding region. Glycine and serine differ in side-chain properties (glycine has only a hydrogen as its side chain, while serine carries a polar hydroxyl group), so this change could disrupt a protein's function. However, the coding region belongs to a "hypothetical protein", which has no known function or disease association. The patient is not advised to worry at this time.
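
The mismatch-finding and codon-translation steps of the walk-through can be sketched in Python. This is a minimal illustration, not a replacement for BLAST: it uses only the one alignment row that contains the mismatch, copied from the BLAST output above. The helper names (`find_mismatches`, `codon_change`) are hypothetical, and the reading frame is an assumption taken from the GGC -> AGC change stated above, i.e. the mismatch is treated as the first base of its codon.

```python
# Alignment row containing the mismatch, copied from the BLAST output above.
QUERY_START = 181  # 1-based query coordinate of the first base in this row
QUERY = "CGCGGACGCTGCCTTCGTCCAGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT"
SBJCT = "CGCGGACGCTGCCTTCGTCCGGCCGCAGAGGCCCCGCGGTCAGGGTCCCGCGTGCGGGGT"

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def find_mismatches(query, sbjct, start=1):
    """Return (query_position, reference_base, query_base) for each mismatch.

    Assumes an ungapped alignment row, so both strings have equal length.
    """
    return [
        (start + i, s, q)
        for i, (q, s) in enumerate(zip(query, sbjct))
        if q != s
    ]


def codon_change(query, sbjct, index, codon_offset=0):
    """Translate the reference and query codons covering a mismatch.

    index is the 0-based position of the mismatch within the row;
    codon_offset (0, 1, or 2) is the position of the mismatch inside
    its codon. The default of 0 is an assumption, not derived from data.
    """
    begin = index - codon_offset
    ref_codon = sbjct[begin:begin + 3]
    query_codon = query[begin:begin + 3]
    return ref_codon, CODON_TABLE[ref_codon], query_codon, CODON_TABLE[query_codon]


mismatches = find_mismatches(QUERY, SBJCT, start=QUERY_START)
print(mismatches)  # [(201, 'G', 'A')]

position = mismatches[0][0]
change = codon_change(QUERY, SBJCT, index=position - QUERY_START)
print(change)  # ('GGC', 'G', 'AGC', 'S'), i.e. Gly -> Ser
```

The amino acid number (120) cannot be derived from this row alone; it depends on where the coding sequence starts, which this sketch does not model.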


Part 2: Contribute a Test Case

>KFG (Kay's Favorite Gene)
ATTGCCCCGGTGCTGAGCGGCGCCGCGAGTCGGCCCGAGGCCTCCGGGGACTGCCGTGCCGGGCGGGAGA
CCGCCATGGCGACCCTGGAAAAGCTGATGAAGGCCTTCGAGTCCCTCAAGTCCTTCCAGCAGCAGCAGCA
GCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACAGCCG
CCACCGCCGCCGCCGCCGCCGCCTCCTCAGCTTCCTCAGCCGCCGCCGCAGGCACAGCCGCTGCTGCCTC
AGCCGCAGCCGCCCCCGCCGCCGCCCCCGCCGCCACCCGGCCCGGCTGTGGCTGAGGAGCCGCTGCACCG
ACCGTGAGTTTGGGCCCGCTGCAGCTCCCTGTCCCGGCGGGTCCCAGGCTACGGCGGGGATGGCGGTAAC
CCTGCAGCCTGCGGGCCGGCGACACGAACCCCCGGCCCCGCAGAGACAGAGTGACCCAGCAACCCAGAGC
CCATGAGGGACACCCGCCCCCTCCTGGGGCGAGGCCTTCCCCCACTTCAGCCCCGCTCCCTCACTTGGGT
CTTCCCTTGTCCTCTCGCGAGGGGAGGCAGAGCCTTGTTGGGGCCTGTCCTGAATTCACCGAGGGGAGTC
ACGGCCTCAGCCCTCTCGCCCTTCGCAGGATGCGAAGAGTTGGGGCGAGAACTTGTTTCTTTTTATTTGC