High-resolution structure prediction and the crystallographic phase problem

Qian, Bin; Raman, Srivatsan; Das, Rhiju; Bradley, Philip; McCoy, Airlie J.; Read, Randy J.; Baker, David

doi:10.1038/nature06249

Article
Published: 14 October 2007

High-resolution structure prediction and the crystallographic phase problem

Bin Qian¹^na1,
Srivatsan Raman¹^na1,
Rhiju Das¹^na1,
Philip Bradley¹,
Airlie J. McCoy²,
Randy J. Read² &
…
David Baker¹

Nature volume 450, pages 259–264 (2007)Cite this article

3189 Accesses
251 Citations
23 Altmetric
Metrics details

Abstract

The energy-based refinement of low-resolution protein structure models to atomic-level accuracy is a major challenge for computational structural biology. Here we describe a new approach to refining protein structure models that focuses sampling in regions most likely to contain errors while allowing the whole structure to relax in a physically realistic all-atom force field. In applications to models produced using nuclear magnetic resonance data and to comparative models based on distant structural homologues, the method can significantly improve the accuracy of the structures in terms of both the backbone conformations and the placement of core side chains. Furthermore, the resulting models satisfy a particularly stringent test: they provide significantly better solutions to the X-ray crystallographic phase problem in molecular replacement trials. Finally, we show that all-atom refinement can produce de novo protein structure predictions that reach the high accuracy required for molecular replacement without any experimental phase information and in the absence of templates suitable for molecular replacement from the Protein Data Bank. These results suggest that the combination of high-resolution structure prediction with state-of-the-art phasing tools may be unexpectedly powerful in phasing crystallographic data for which molecular replacement is hindered by the absence of sufficiently accurate previous models.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Figure 1: **Overview of the rebuilding-and-refinement method.**

Figure 2: **Improvement in model accuracy produced by rebuilding and refinement.**

Figure 3: **Improvement in electron density using models from rebuilding and refinement in molecular replacement searches.**

Figure 4: ***Ab initio*** **phasing by** ***ab initio*** **modelling.**

Highly accurate protein structure prediction for the human proteome

Article Open access 22 July 2021

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Article Open access 23 October 2020

Extended experimental inferential structure determination method in determining the structural ensembles of disordered protein states

Article Open access 09 June 2020

References

Misura, K. M. & Baker, D. Progress and challenges in high-resolution refinement of protein structure models. Proteins 59, 15–29 (2005)
Article CAS PubMed Google Scholar
Pieper, U. et al. MODBASE: a database of annotated comparative protein structure models and associated resources. Nucleic Acids Res. 34, D291–D295 (2006)
Article ADS CAS PubMed Google Scholar
Moult, J. A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. Curr. Opin. Struct. Biol. 15, 285–289 (2005)
Article CAS PubMed Google Scholar
Schwarzenbacher, R., Godzik, A., Grzechnik, S. K. & Jaroszewski, L. The importance of alignment accuracy for molecular replacement. Acta Crystallogr. D 60, 1229–1236 (2004)
Article PubMed Google Scholar
Giorgetti, A., Raimondo, D., Miele, A. E. & Tramontano, A. Evaluating the usefulness of protein structure models for molecular replacement. Bioinformatics 21 (suppl. 2). ii72–ii76 (2005)
Article CAS PubMed Google Scholar
Chen, Y. W., Dodson, E. J. & Kleywegt, G. J. Does NMR mean “not for molecular replacement”? Using NMR-based search models to solve protein crystal structures. Structure 8, R213–R220 (2000)
Article CAS PubMed Google Scholar
Strop, P., Brzustowicz, M. R. & Brunger, A. T. Ab initio molecular-replacement phasing for symmetric helical membrane proteins. Acta Crystallogr. D 63, 188–196 (2007)
Article CAS PubMed PubMed Central Google Scholar
Rossmann, M. G. Ab initio phase determination and phase extension using non-crystallographic symmetry. Curr. Opin. Struct. Biol. 5, 650–655 (1995)
Article CAS PubMed Google Scholar
Kuhlman, B. et al. Design of a novel globular protein fold with atomic-level accuracy. Science 302, 1364–1368 (2003)
Article ADS CAS PubMed Google Scholar
McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 (2007)
Article CAS PubMed PubMed Central Google Scholar
Perrakis, A., Morris, R. & Lamzin, V. S. Automated protein model building combined with iterative structure refinement. Nature Struct. Biol. 6, 458–463 (1999)
Article CAS PubMed Google Scholar
Terwilliger, T. C. Automated main-chain model building by template matching and iterative fragment extension. Acta Crystallogr. D 59, 38–44 (2003)
Article PubMed Google Scholar
Bradley, P., Misura, K. M. & Baker, D. Toward high-resolution de novo structure prediction for small proteins. Science 309, 1868–1871 (2005)
Article ADS CAS PubMed Google Scholar
Rohl, C. A., Strauss, C. E., Misura, K. M. & Baker, D. Protein structure prediction using Rosetta. Methods Enzymol. 383, 66–93 (2004)
Article CAS PubMed Google Scholar
Leaver-Fay, A., Kuhlman, B. & Snoeyink, J. Rotamer-pair energy calculations using a Trie data structure. In Algorithms in Bioinformatics (eds Casadio, R. & Myers, G.) 389 (Springer, Berlin, 2005)
Chapter Google Scholar
Wales, D. J. & Scheraga, H. A. Global optimization of clusters, crystals, and biomolecules. Science 285, 1368–1372 (1999)
Article CAS PubMed Google Scholar
Wallner, B. & Elofsson, A. Identification of correct regions in protein models using structural, alignment, and consensus information. Protein Sci. 15, 900–913 (2006)
Article CAS PubMed PubMed Central Google Scholar
Glover, F. & Laguna, M. Tabu Search (Kluwer, Norwell, Massachusetts, 1997)
Book Google Scholar
Lee, J., Liwo, A. & Scheraga, H. A. Energy-based de novo protein folding by conformational space annealing and an off-lattice united-residue force field: application to the 10–55 fragment of staphylococcal protein A and to apo calbindin D9K. Proc. Natl Acad. Sci. USA 96, 2025–2030 (1999)
Article ADS CAS PubMed PubMed Central Google Scholar
Doreleijers, J. F., Rullmann, J. A. & Kaptein, R. Quality assessment of NMR structures: a statistical survey. J. Mol. Biol. 281, 149–164 (1998)
Article CAS PubMed Google Scholar
Grishaev, A. & Bax, A. An empirical backbone-backbone hydrogen-bonding potential in proteins and its applications to NMR structure refinement and validation. J. Am. Chem. Soc. 126, 7281–7292 (2004)
Article CAS PubMed Google Scholar
Rieping, W., Habeck, M. & Nilges, M. Inferential structure determination. Science 309, 303–306 (2005)
Article ADS CAS PubMed Google Scholar
Zemla, A. LGA: A method for finding 3D similarities in protein structures. Nucleic Acids Res. 31, 3370–3374 (2003)
Article CAS PubMed PubMed Central Google Scholar
Lovell, S. C. et al. Structure validation by Cα geometry: φ, ψ and Cβ deviation. Proteins 50, 437–450 (2003)
Article CAS PubMed Google Scholar
Das, R. et al. Structure prediction for CASP7 targets using extensive all-atom refinement with Rosetta@home. Proteins doi: 10.1002/prot.21636 (25 September 2007)
Berman, H., Henrick, K., Nakamura, H. & Markley, J. L. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 35, D301–D303 (2007)
Article CAS PubMed Google Scholar
Andrade, S. L., Dickmanns, A., Ficner, R. & Einsle, O. Crystal structure of the archaeal ammonium transporter Amt-1 from Archaeoglobus fulgidus . Proc. Natl Acad. Sci. USA 102, 14994–14999 (2005)
Article ADS CAS PubMed PubMed Central Google Scholar
Pannu, N. S. & Read, R. J. Improved structure refinement through maximum likelihood. Acta Crystallogr. A 52, 659–668 (1996)
Article Google Scholar
Dauter, Z. New approaches to high-throughput phasing. Curr. Opin. Struct. Biol. 12, 674–678 (2002)
Article CAS PubMed Google Scholar
Englander, J. J. et al. Protein structure change studied by hydrogen-deuterium exchange, functional labeling, and mass spectrometry. Proc. Natl Acad. Sci. USA 100, 7057–7062 (2003)
Article ADS CAS PubMed PubMed Central Google Scholar
Young, M. M. et al. High throughput protein fold identification by using experimental constraints derived from intramolecular cross-links and mass spectrometry. Proc. Natl Acad. Sci. USA 97, 5802–5806 (2000)
Article ADS CAS PubMed PubMed Central Google Scholar
Takamoto, K. & Chance, M. R. Radiolytic protein footprinting with mass spectrometry to probe the structure of macromolecular complexes. Annu. Rev. Biophys. Biomol. Struct. 35, 251–276 (2006)
Article CAS PubMed Google Scholar
Zhang, Y. & Skolnick, J. TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 33, 2302–2309 (2005)
Article CAS PubMed PubMed Central Google Scholar
Ortiz, A. R., Strauss, C. E. & Olmea, O. MAMMOTH (matching molecular models obtained from theory): an automated method for model comparison. Protein Sci. 11, 2606–2621 (2002)
Article CAS PubMed PubMed Central Google Scholar
Simons, K. T., Kooperberg, C., Huang, E. & Baker, D. Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. J. Mol. Biol. 268, 209–225 (1997)
Article CAS PubMed Google Scholar
Canutescu, A. A. & Dunbrack, R. L. Cyclic coordinate descent: A robotics algorithm for protein loop closure. Protein Sci. 12, 963–972 (2003)
Article CAS PubMed PubMed Central Google Scholar
Lazaridis, T. & Karplus, M. Effective energy function for proteins in solution. Proteins 35, 133–152 (1999)
Article CAS PubMed Google Scholar
Dunbrack, R. L. & Cohen, F. E. Bayesian statistical analysis of protein side-chain rotamer preferences. Protein Sci. 6, 1661–1681 (1997)
Article CAS PubMed PubMed Central Google Scholar
Engh, R. A. & Huber, R. Accurate bond and angle parameters for X-ray protein structure refinement. Acta Crystallogr. A 47, 392–400 (1991)
Article Google Scholar
Wang, C., Schueler-Furman, O. & Baker, D. Improved side-chain modeling for protein-protein docking. Protein Sci. 14, 1328–1339 (2005)
Article CAS PubMed PubMed Central Google Scholar
Li, Z. & Scheraga, H. A. Monte Carlo-minimization approach to the multiple-minima problem in protein folding. Proc. Natl Acad. Sci. USA 84, 6611–6615 (1987)
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Garbuzynskiy, S. O., Melnik, B. S., Lobanov, M. Y., Finkelstein, A. V. & Galzitskaya, O. V. Comparison of X-ray and NMR structures: is there a systematic difference in residue contacts between X-ray- and NMR-resolved protein structures? Proteins 60, 139–147 (2005)
Article CAS PubMed Google Scholar
Ginalski, K., Elofsson, A., Fischer, D. & Rychlewski, L. 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics 19, 1015–1018 (2003)
Article CAS PubMed Google Scholar
Chivian, D. & Baker, D. Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection. Nucleic Acids Res. 34, e112 (2006)
Article PubMed PubMed Central Google Scholar
Sali, A. & Blundell, T. L. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234, 779–815 (1993)
Article CAS PubMed Google Scholar
Bonneau, R., Strauss, C. E. & Baker, D. Improving the performance of Rosetta using multiple sequence alignment information and global measures of hydrophobic core formation. Proteins 43, 1–11 (2001)
Article CAS PubMed Google Scholar
Moult, J., Fidelis, K., Rost, B., Hubbard, T. & Tramontano, A. Critical assessment of methods of protein structure prediction (CASP)–round 6. Proteins 61 (suppl. 7). 3–7 (2005)
Article CAS PubMed Google Scholar
Petsko, G. A. The grail problem. Genome Biol. 1, COMMENT002 (2000)
CAS PubMed PubMed Central Google Scholar
Plewczynski, D., Pas, J., Von Grotthuss, M. & Rychlewski, L. Comparison of proteins based on segments structural similarity. Acta Biochim. Pol. 51, 161–172 (2004)
CAS PubMed Google Scholar
Kuhlman, B. & Baker, D. Native protein sequences are close to optimal for their structures. Proc. Natl Acad. Sci. USA 97, 10383–10388 (2000)
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Rosetta@home participants for contributing computing power that made testing of many new ideas possible; the DOE INCITE program for access to Blue Gene/L at Argonne National Laboratory and the IBM Blue Gene Watson supercomputers; and the NCSA, SDSC and Argonne National Laboratory supercomputer centres for computer time and help with porting Rosetta to Blue Gene. We thank D. Kim and K. Laidig for developing the computational infrastructure underlying Rosetta@home; J. Abendroth for help with RESOLVE and ARP/wARP software; M. Kennedy of NESG for the NMR structure coordinates of protein 1xpw and for help with the molecular replacement calculations; and J. Abendroth, J. Bosch, J. Havranek and C. Wang for comments on the manuscript. We also thank the CASP organizers and contributing structural biologists for providing an invaluable test set for new structure refinement methods. This work was funded by the National Institute of General Medical Sciences, National Institutes of Health (to D.B.), the Wellcome Trust, UK (to R.J.R.), the Howard Hughes Medical Institute (D.B.), a Leukemia and Lymphoma Society Career Development fellowship (to B.Q.), and a Jane Coffin Childs fellowship (to R.D.). Rosetta software and source code are available to academic users free of charge at http://www.rosettacommons.org/software/.

Author Contributions B.Q., S.R. and R.D. contributed equally to this work. Structure predictions for NMR-based, comparative-model-based and de novo predictions were carried out by S.R., B.Q. and R.D. respectively, with advice and software from D.B. and P.B. Phasing trials were performed by R.J.R., B.Q., S.R. and R.D., with advice from R.J.R. and A.J.M. All authors discussed results and commented on the manuscript.

Author information

Bin Qian, Srivatsan Raman and Rhiju Das: These authors contributed equally to this work.

Authors and Affiliations

Department of Biochemistry and Howard Hughes Medical Institute, University of Washington, Box 357350, Seattle 98195, USA,
Bin Qian, Srivatsan Raman, Rhiju Das, Philip Bradley & David Baker
Department of Haematology, University of Cambridge, Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Hills Road, Cambridge CB2 0XY, UK,
Airlie J. McCoy & Randy J. Read

Authors

Bin Qian
View author publications
You can also search for this author in PubMed Google Scholar
Srivatsan Raman
View author publications
You can also search for this author in PubMed Google Scholar
Rhiju Das
View author publications
You can also search for this author in PubMed Google Scholar
Philip Bradley
View author publications
You can also search for this author in PubMed Google Scholar
Airlie J. McCoy
View author publications
You can also search for this author in PubMed Google Scholar
Randy J. Read
View author publications
You can also search for this author in PubMed Google Scholar
David Baker
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Baker.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

The file contains Supplementary Tables S1-S2 and Supplementary Figures S1-S5 with Legends and additional acknowledgements. (PDF 985 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Qian, B., Raman, S., Das, R. et al. High-resolution structure prediction and the crystallographic phase problem. Nature 450, 259–264 (2007). https://doi.org/10.1038/nature06249

Download citation

Received: 08 May 2007
Accepted: 13 September 2007
Published: 14 October 2007
Issue Date: 08 November 2007
DOI: https://doi.org/10.1038/nature06249

This article is cited by

Molecular mechanisms underlying menthol binding and activation of TRPM8 ion channel
- Lizhen Xu
- Yalan Han
- Fan Yang
Nature Communications (2020)
Molecular basis for heat desensitization of TRPV1 ion channels
- Lei Luo
- Yunfei Wang
- Ren Lai
Nature Communications (2019)
Fast design of arbitrary length loops in proteins using InteractiveRosetta
- William F. Hooper
- Benjamin D. Walcott
- Christopher Bystroff
BMC Bioinformatics (2018)
Structure of the peptidoglycan polymerase RodA resolved by evolutionary coupling analysis
- Megan Sjodt
- Kelly Brock
- Andrew C. Kruse
Nature (2018)
The conformational wave in capsaicin activation of transient receptor potential vanilloid 1 ion channel
- Fan Yang
- Xian Xiao
- Jie Zheng
Nature Communications (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

High-resolution structure prediction and the crystallographic phase problem

Abstract

Access options

Similar content being viewed by others

Highly accurate protein structure prediction for the human proteome

Sampling of the conformational landscape of small proteins with Monte Carlo methods

Extended experimental inferential structure determination method in determining the structural ensembles of disordered protein states

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Supplementary Information

Rights and permissions

About this article

Cite this article

This article is cited by

Molecular mechanisms underlying menthol binding and activation of TRPM8 ion channel

Molecular basis for heat desensitization of TRPV1 ion channels

Fast design of arbitrary length loops in proteins using InteractiveRosetta

Structure of the peptidoglycan polymerase RodA resolved by evolutionary coupling analysis

The conformational wave in capsaicin activation of transient receptor potential vanilloid 1 ion channel

Comments

Protein predictions

Search

Quick links

Abstract

Access options

Similar content being viewed by others

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links