Phyre2 is a suite of tools available on the web to predict and analyze protein structure, function and mutations. The focus of Phyre2 is to provide biologists with a simple and intuitive interface to state-of-the-art protein bioinformatics tools. Phyre2 replaces Phyre, the original version of the server for which we previously published a paper in Nature Protocols. In this updated protocol, we describe Phyre2, which uses advanced remote homology detection methods to build 3D models, predict ligand binding sites and analyze the effect of amino acid variants (e.g., nonsynonymous SNPs (nsSNPs)) for a user's protein sequence. Users are guided through results by a simple interface at a level of detail they determine. This protocol will guide users from submitting a protein sequence to interpreting the secondary and tertiary structure of their models, their domain composition and model quality. A range of additional available tools is described to find a protein structure in a genome, to submit large number of sequences at once and to automatically run weekly searches for proteins that are difficult to model. The server is available at http://www.sbg.bio.ic.ac.uk/phyre2. A typical structure prediction will be returned between 30 min and 2 h after submission.
At a glance
- Genome-wide protein structure prediction. in Multiscale Approaches to Protein Modeling (ed. Kolinski, A.) Ch. 11, 255–279 (Springer, 2010). , , & .
- The structure of the protein universe and genome evolution. Nature 420, 218–223 (2002). , &
- Protein structure prediction on the web: a case study using the Phyre server. Nat. Protoc. 4, 363–371 (2009). &
- Functional assignment of Mycobacterium tuberculosis proteome by genome-scale fold-recognition. Tuberculosis 1, 93 (2013). et al.
- Genome3D: a UK collaborative project to annotate genomic sequences with predicted 3D structures based on SCOP and CATH domains. Nucl. Acids Res. 41, D499–D507 (2013). et al.
- ePlant and the 3D data display initiative: integrative systems biology on the world wide web. PLoS ONE 6, e15237 (2010). et al.
- Critical assessment of methods of protein structure prediction (CASP)—round X. Proteins 82 S2: 1–6 (2014). , , , &
- I-TASSER: a unified platform for automated protein structure and function prediction. Nat. Protoc. 5, 725–738 (2010). , &
- The SWISS-MODEL Workspace: a web-based environment for protein structure homology modelling. Bioinformatics 22, 195–201 (2006). , , &
- Protein homology detection by HMM-HMM comparison. Bioinformatics 21, 951–960 (2005).
- pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination. Bioinformatics. 25, 1761–1767 (2009). , &
- Structure prediction for CASP8 with all-atom refinement using Rosetta. Proteins 77 (suppl. 9), 89–99 (2009).
- Template-based protein structure modeling using the RaptorX web server. Nat. Protoc. 7, 1511–1522 (2012). et al.
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997). et al.
- HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9, 173–175 (2012). , , &
- Protein secondary structure prediction based on position-specific scoring matrices. J Mol Biol. 292, 195–202 (1999).
- Cyclic coordinate descent: a robotics algorithm for protein loop closure. Protein Sci. 12, 963–972 (2003). &
- Protein folding requires crowd control in a simulated cell. J. Mol. Biol. 397, 1329–1338 (2010). , &
- Fast procedure for reconstruction of full-atom protein models from reduced representations. J. Comput. Chem. 29, 1460–1465 (2008). &
- Residue-rotamer-reduction algorithm for the protein side-chain conformation problem. Bioinformatics 22, 188–194 (2006). &
- Improved model quality assessment using ProQ2. BMC Bioinformatics 13, 224 (2012). , &
- MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic acids Res 35 (suppl. 2), W375–W383 (2007). et al.
- Fpocket: online tools for protein ensemble pocket detection and tracking. Nucleic acids Res 38 (suppl. 2), W582–W589 (2010). , , &
- The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data. Nucleic acids Res 32 (suppl. 1), D129–D133 (2004). , &
- SuSPect: enhanced prediction of single amino acid variant (SAV) phenotype using network features. J. Mol. Biol. 426, 2692–2701 (2014). , , &
- Predicting functionally important residues from sequence conservation. Bioinformatics 23, 1875–1882 (2007). &
- PiSite: a database of protein interaction sites using multiple binding states in the PDB. Nucleic Acids Res. 37 (Database issue): D360–D364 (2009). , &
- CDD: conserved domains and protein three-dimensional structure. Nucleic Acids Res 41 (D1): D348–D352 (2013). et al.
- A method and server for predicting damaging missense mutations. Nat. Methods 7, 248–249 (2010). et al.
- SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic acids Res. 40 W1: W452–W457 (2012). et al.
- Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. Am. J. Hum. Genet. 88, 440–449 (2011). &
- Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. J. Mol. Biol. 337, 635–645 (2004). , , , &
- MaxSub: an automated measure for the assessment of protein structure prediction quality. Bioinformatics. 16, 776–785 (2000). , , &
- 3DLigandSite: predicting ligand-binding sites using similar structures. Nucleic Acids Res. 38, W469–W473 (2010). , &
- Improving the accuracy of transmembrane protein topology prediction using evolutionary information. Bioinformatics 3, 538–544 (2007).