Letter | Published:

Robust signals of coevolution of interacting residues in mammalian proteomes identified by phylogeny-aided structural analysis

Nature Genetics volume 37, pages 13671371 (2005) | Download Citation

Subjects

Abstract

The structure of a protein depends critically on the complex interactions among its amino acid residues. It has long been hypothesized that interacting residues might tend to coevolve, but it is not known whether such coevolution is a general phenomenon across the proteome. Here, we describe a novel methodology called phylogeny-aided structural analysis, which uncovers robust signals of interacting-residue coevolution in mammalian proteomes. Furthermore, this new method allows the magnitude of coevolution to be quantified. Finally, it facilitates a comprehensive evaluation of various factors that affect interacting-residue coevolution, such as the physicochemical properties of the interactions between residues, solvent accessibility of the residues and their secondary structure context.

Access optionsAccess options

Rent or Buy article

Get time limited or full article access on ReadCube.

from$8.99

All prices are NET prices.

References

  1. 1.

    , & Lessons from an evolving rRNA: 16S and 23S rRNA structures from a comparative perspective. Microbiol. Rev. 58, 10–26 (1994).

  2. 2.

    , , & Further perspective on the catalytic core and secondary structure of ribonuclease P RNA. Proc. Natl. Acad. Sci. USA 91, 2527–2531 (1994).

  3. 3.

    et al. RNA secondary structure and compensatory evolution. Genes Genet. Syst. 74, 271–286 (1999).

  4. 4.

    & Mechanisms and convergence of compensatory evolution in mammalian mitochondrial tRNAs. Nat. Genet. 36, 1207–1212 (2004).

  5. 5.

    The role of compensatory neutral mutations in molecular evolution. J. Genet. 64, 7–19 (1985).

  6. 6.

    , & Protein structure relationships revealed by mutational analysis. Science 146, 1593–1594 (1964).

  7. 7.

    , , , & Ancestral lysozymes reconstructed, neutrality tested, and thermostability linked to hydrocarbon packing. Nature 345, 86–89 (1990).

  8. 8.

    & Complementary advantageous substitutions in the evolution of an antiviral RNase of higher primates. Proc. Natl. Acad. Sci. USA 99, 5486–5491 (2002).

  9. 9.

    The molecular clock may be an episodic clock. Proc. Natl. Acad. Sci. USA 81, 8009–8013 (1984).

  10. 10.

    , & Dobzhansky-Muller incompatibilities in protein evolution. Proc. Natl. Acad. Sci. USA 99, 14878–14883 (2002).

  11. 11.

    & Why are some human disease-associated mutations fixed in mice? Trends Genet. 19, 678–681 (2003).

  12. 12.

    , , & Correlated mutations and residue contacts in proteins. Proteins 18, 309–317 (1994).

  13. 13.

    How frequent are correlated changes in families of protein sequences? Proc. Natl. Acad. Sci. USA 91, 98–102 (1994).

  14. 14.

    , & Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? Protein Eng. 7, 349–358 (1994).

  15. 15.

    & Compensating changes in protein multiple sequence alignments. Protein Eng. 7, 341–348 (1994).

  16. 16.

    & Sequence evolution and the mechanism of protein folding. Biophys. J. 79, 1787–1799 (2000).

  17. 17.

    , & Detecting compensatory covariation signals in protein evolution using reconstructed ancestral sequences. J. Mol. Biol. 319, 729–743 (2002).

  18. 18.

    & Effectiveness of correlation analysis in identifying protein residues undergoing correlated evolution. Protein Eng. 10, 647–657 (1997).

  19. 19.

    & Exploring a phylogenetic approach for the detection of correlated substitutions in proteins. Mol. Biol. Evol. 17, 1753–1759 (2000).

  20. 20.

    , , & Placental mammal diversification and the Cretaceous-Tertiary boundary. Proc. Natl. Acad. Sci. USA 100, 1056–1061 (2003).

  21. 21.

    , & Analysis and assessment of comparative modeling predictions in CASP4. Proteins (Suppl.) 5, 22–38 (2001).

  22. 22.

    Toward defining the course of evolution: minimum change for a specific tree topology. Syst. Zool. 20, 406–416 (1971).

  23. 23.

    , , , & The Protein Data Bank and structural genomics. Nucleic Acids Res. 31, 489–491 (2003).

  24. 24.

    & Protein structure prediction and structural genomics. Science 294, 93–96 (2001).

  25. 25.

    , , , & Molecular docking of substrates and inhibitors in the catalytic site of CYP6B1, an insect cytochrome p450 monooxygenase. Protein Eng. 16, 577–587 (2003).

  26. 26.

    Empirical force fields for biological macromolecules: overview and issues. J. Comput. Chem. 25, 1584–1604 (2004).

  27. 27.

    , , & Hydrogen bonding in globular proteins. J. Mol. Biol. 226, 1143–1159 (1992).

  28. 28.

    & Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).

Download references

Acknowledgements

We thank C.M. Malcom, P.A. Rice, T.R. Sosnick, E.J. Vallender, G.J. Wyckoff and X. Yang for technical help, discussions and comments on the manuscript.

Author information

Author notes

    • Sun Shim Choi
    •  & Weimin Li

    These authors contributed equally to this work.

Affiliations

  1. Howard Hughes Medical Institute, Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA.

    • Sun Shim Choi
    • , Weimin Li
    •  & Bruce T Lahn

Authors

  1. Search for Sun Shim Choi in:

  2. Search for Weimin Li in:

  3. Search for Bruce T Lahn in:

Competing interests

The authors declare no competing financial interests.

Corresponding author

Correspondence to Bruce T Lahn.

Supplementary information

PDF files

  1. 1.

    Supplementary Fig. 1

    Declining signal of coevolution as the sequence identity between the modeled protein and the template declines.

  2. 2.

    Supplementary Table 1

    Proteins in the rat-human-dog data set.

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/ng1685

Further reading Further reading