CH391L/S13/Ancestral Sequence Reconstruction
- 1 What is Ancestral Sequence Reconstruction?
- 1.1 Pipeline for Generating Ancestral Genes
- 1.2 Examples of Ancestral Sequence Reconstructions
- 1.3 Precambrian β lactamases
- 1.4 References
What is Ancestral Sequence Reconstruction?
Ancestral Sequence Reconstruction is a method which enables scientists to synthesize biomolecules from extinct organisms. Sequence information (Nucleic Acid and Protein) from extant species can be used to infer the sequences of common ancestor species which can be synthesized and tested in the lab. The method was originally discussed by Pauling and Zuckerkandl in 1963, almost 30 years before the theory was experimentally tested.
Pipeline for Generating Ancestral Genes
- Sequences from extant species of the desired common ancestral gene and outgroup genes are aligned.
- The ancestral gene is inferred based on evolutionary models (typically maximum parsimony or maximum likelihood).
- Ancestral genes are synthesized via overlapping oligonucleotides and PCR assembly.
- Ancestral genes are cloned and tested for function.
Methods of Inferring Ancient Sequences
- Consensus Sequence - the most frequently occurring residue of a extant organisms is assumed to be the ancestral state.
- Maximum Parsimony - minimization of the total number of changes required to account for the terminal sequences.
- Maximum Likelihood - ancestral states are evaluated at each internal node in the tree based on the likelihood of each mutation. This process uses a statistical framework of molecular evolution which takes into account differences in certain types of mutations. The generated ancestral sequence gives the probability that each residue is correctly predicted.
Modern advances in oligonucleotide synthesis allow for the cheap construction of synthetic ancestral genes. As of January 2012, oligonucleotides can be purchased for 0.35$/base. Using overlapping PCR assembly most synthetic genes can be constructed for several hundred dollars.
Testing Ancestral Variants
After synthesis is complete, ancestral genes can be tested for their function. For example, the ancestral genes can be cloned into bacterial expression vectors and transformed into E.coli. The proteins can be purified and tested for their activity, whether that be fluorescence spectra, binding affinity, thermoactivity, etc.
Examples of Ancestral Sequence Reconstructions
As the technology for ancestral sequence reconstruction has advanced the technique has popularized. It offers a unique way to peer into the past and revisit the biomolecules of our forefathers. The sensitivity of genetic information to degrade limits our understanding to extant organisms or circumstantially preserved ones, such as specimens preserved by ice. The ability to resurrect sequences from the dead and test them relieves us from these temporal limitations.
Evolution of Coral Pigments
One visceral example of ancestral sequence reconstruction was done by the Matz group (currently residing at the University of Texas at Austin). Fluorescent proteins from related coral species had wavelengths corresponding to Cyan, Green, and Red. The details of the evolution of fluorescent color in the GFP superfamily was not fully understand. That is, what fluorescent spectra did the common ancestors of the modern corals have? Sequences for the common ancestor nodes were synthesized and tested for their activity. The ancestral sequences revealed an interesting evolutionary history. The common ancestor to all the superfamily had a green emission peak. The more recent common ancestor of Green/Red had two emission peaks; a strong green peak and a smaller red peak. This evolution of this ancestor resolved into either a green or red peak, losing the emission bimodality and specializing.
Inferring the Paleoenvironment of ancient Earth
Ancestral sequence reconstruction has been used to infer the environmental conditions on the early Earth. The study sought to explore the evolutionary history of a translation factor EFtu. This elongation factor functions optimally at the temperature which the organism lives, for example thermophilic organisms have an EFtu optimized for high temperatures. By resurrecting the EFtu protein in the common ancestors of bacteria, the temperature profile might elucidate what the environmental conditions were. Interestingly the EFtu common ancestor to all mesophilic bacterium (~1BYA) has an optimal temperature of a thermophile. This suggests that the hypothesis of a hot early Earth is true.
Precambrian β lactamases
- Ugalde JA, Chang BS, and Matz MV. Evolution of coral pigments recreated. Science. 2004 Sep 3;305(5689):1433. DOI:10.1126/science.1099597 |
- Gaucher EA, Thomson JM, Burgan MF, and Benner SA. Inferring the palaeoenvironment of ancient bacteria on the basis of resurrected proteins. Nature. 2003 Sep 18;425(6955):285-8. DOI:10.1038/nature01977 |
- Thornton JW. Resurrecting ancient genes: experimental analysis of extinct molecules. Nat Rev Genet. 2004 May;5(5):366-75. DOI:10.1038/nrg1324 |
, L. and E. Zuckerkandl. 1963. Chemical paleogenetics: molecular restoration studies of extinct forms of life. Acta Chem. Scand 17:89. CrossRef doi=10.3891