- What proportion of newly synthesized proteins fail to fold, and why?
- How do cells decide when a misfolded should be degraded rather than given another chance to fold?
- What is the cost of producing a protein that misfolds, compared to the cost if that protein folds properly?
- Why are misfolded proteins costly?
- How does inaccuracy in the translational apparatus (ribosomes, aa-tRNA synthetases, etc.) shape the evolution of coding sequences and proteins?
The fitness cost of protein misfolding
Coming soon...supporting website here.
Mistranslation-induced misfolding and gene evolution
Strikingly consistent correlations between rates of coding-sequence evolution and gene expression levels are apparent across taxa, but the biological causes behind the selective pressures on coding-sequence evolution remain controversial. Here we demonstrate conserved patterns of simple covariation between sequence evolution, codon usage, and mRNA level in E. coli, yeast, worm, fly, mouse, and human that suggest that all observed trends stem largely from a unified underlying selective pressure. In metazoans, these trends are strongest in tissues composed of neurons, whose structure and lifetime confer extreme sensitivity to protein misfolding. We propose, and demonstrate using a molecular-level evolutionary simulation, that selection against toxicity of misfolded proteins generated by ribosome errors suffices to create all the observed covariation. The mechanistic model of molecular evolution which emerges yields testable biochemical predictions, calls into question use of nonsynonymous-to-synonymous substitution ratios (Ka/Ks) to detect functional selection, and suggests how mistranslation may contribute to neurodegenerative disease.
- Drummond DA and Wilke CO. Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution. Cell. 2008 Jul 25;134(2):341-52. DOI:10.1016/j.cell.2008.05.042 |
Evolution and expression data
These tab-delimited files include gene and ortholog identifiers, dN, dS, ts/tv ratio, expression level, Fop, and (for the multicellular organisms) intronic guanine/cytosine (GC) content.
- Evolution and expression data [ZIP archive, ~800K]
Coding sequence alignments
Alignments are in FASTA format, ZIP-compressed.
- E. coli vs. S. typhimurium (~1.8MB)
- S. cerevisiae vs. S. paradoxus (~3.8MB)
- C. elegans vs. C. briggsae (~4.7MB)
- D. melanogaster vs. D. yakuba (~5.3MB)
- M. musculus vs. R. norvegicus (~8.7MB)
- H. sapiens vs. C. familiaris (~8.3MB)