Visualization and analysis of non-covalent contacts using the Protein Contacts Atlas

Kayikci, Melis; Venkatakrishnan, A. J.; Scott-Brown, James; Ravarani, Charles N. J.; Flock, Tilman; Babu, M. Madan

doi:10.1038/s41594-017-0019-z

Technical Report
Published: 15 January 2018

Visualization and analysis of non-covalent contacts using the Protein Contacts Atlas

Nature Structural & Molecular Biology volume 25, pages 185–194 (2018)Cite this article

6990 Accesses
87 Citations
40 Altmetric
Metrics details

Subjects

Abstract

Visualizations of biomolecular structures empower us to gain insights into biological functions, generate testable hypotheses, and communicate biological concepts. Typical visualizations (such as ball and stick) primarily depict covalent bonds. In contrast, non-covalent contacts between atoms, which govern normal physiology, pathogenesis, and drug action, are seldom visualized. We present the Protein Contacts Atlas, an interactive resource of non-covalent contacts from over 100,000 PDB crystal structures. We developed multiple representations for visualization and analysis of non-covalent contacts at different scales of organization: atoms, residues, secondary structure, subunits, and entire complexes. The Protein Contacts Atlas enables researchers from different disciplines to investigate diverse questions in the framework of non-covalent contacts, including the interpretation of allostery, disease mutations and polymorphisms, by exploring individual subunits, interfaces, and protein–ligand contacts and by mapping external information. The Protein Contacts Atlas is available at http://www.mrc-lmb.cam.ac.uk/pca/ and also through PDBe.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Visualization modes in the Protein Contacts Atlas.**

**Fig. 2: Protein Contacts Atlas framework.**

**Fig. 3: Visualization of protein–protein and protein–DNA interaction interfaces.**

**Fig. 4: Visualization and analysis of protein–ligand contacts.**

**Fig. 5: Comparison of the residue contact networks in the rhodopsin cycle.**

ProtCID: a data resource for structural information on protein interactions

Article Open access 05 February 2020

Towards a unified open access dataset of molecular interactions

Article Open access 01 December 2020

ARCTIC-3D: automatic retrieval and clustering of interfaces in complexes from 3D structural information

Article Open access 06 January 2024

References

Watson, J. D. & Crick, F. H. Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature 171, 737–738 (1953).
Article CAS PubMed Google Scholar
Kendrew, J. C. et al. Structure of myoglobin: a three-dimensional Fourier synthesis at 2Å resolution. Nature 185, 422–427 (1960).
Article CAS PubMed Google Scholar
Perutz, M. F. et al. Structure of haemoglobin: a three-dimensional Fourier synthesis at 5.5-Å resolution, obtained by X-ray analysis. Nature 185, 416–422 (1960).
Article CAS PubMed Google Scholar
Shi, Y. A glimpse of structural biology through X-ray crystallography. Cell 159, 995–1014 (2014).
Article CAS PubMed Google Scholar
Wüthrich, K. The way to NMR structures of proteins. Nat. Struct. Biol. 8, 923–925 (2001).
Article PubMed Google Scholar
Cheng, Y. Single-particle cryo-EM at crystallographic resolution. Cell 161, 450–457 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ollis, W. D. Models and molecules. Proc. R. Inst. G. B. 45, 1–31 (1972).
CAS Google Scholar
Perutz, M. F. The hemoglobin molecule. Sci. Am. 211, 64–76 (1964).
Article CAS PubMed Google Scholar
Baldwin, J. & Chothia, C. Haemoglobin: the structural changes related to ligand binding and its allosteric mechanism. J. Mol. Biol. 129, 175–220 (1979).
Article CAS PubMed Google Scholar
Pauling, L., Corey, R. B. & Branson, H. R. The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. Proc. Natl. Acad. Sci. USA 37, 205–211 (1951).
Article CAS PubMed PubMed Central Google Scholar
Richardson, J. S. β-Sheet topology and the relatedness of proteins. Nature 268, 495–500 (1977).
Article CAS PubMed Google Scholar
Ramachandran, G. N., Ramakrishnan, C. & Sasisekharan, V. Stereochemistry of polypeptide chain configurations. J. Mol. Biol. 7, 95–99 (1963).
Article CAS PubMed Google Scholar
Richardson, J. S. Early ribbon drawings of proteins. Nat. Struct. Biol. 7, 624–625 (2000).
Article CAS PubMed Google Scholar
Levitt, M. & Chothia, C. Structural patterns in globular proteins. Nature 261, 552–558 (1976).
Article CAS PubMed Google Scholar
Murzin, A. G., Brenner, S. E., Hubbard, T. & Chothia, C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247, 536–540 (1995).
CAS PubMed Google Scholar
Orengo, C. A. et al. CATH—a hierarchic classification of protein domain structures. Structure 5, 1093–1108 (1997).
Article CAS PubMed Google Scholar
Nishikawa, K., Ooi, T., Isogai, Y. & Saitô, N. Tertiary structure of proteins. I. Representation and computation of the conformations. J. Phys. Soc. Jpn. 32, 1331–1337 (1972).
Article CAS Google Scholar
Lesk, A. M. & Chothia, C. How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. J. Mol. Biol. 136, 225–270 (1980).
Article CAS PubMed Google Scholar
Chakrabarty, B. & Parekh, N. NAPS: Network Analysis of Protein Structures. Nucl. Acids Res. 44 W1, W375–W382 (2016).
Article Google Scholar
Seeber, M., Felline, A., Raimondi, F., Mariani, S. & Fanelli, F. WebPSN: a web server for high-throughput investigation of structural communication in biomacromolecules. Bioinformatics 31, 779–781 (2015).
Article CAS PubMed Google Scholar
Jubb, H. C. et al. Arpeggio: a web server for calculating and visualising interatomic interactions in protein structures. J. Mol. Biol. 429, 365–371 (2017).
Article CAS PubMed PubMed Central Google Scholar
Doncheva, N. T., Assenov, Y., Domingues, F. S. & Albrecht, M. Topological analysis and interactive visualization of biological networks and protein structures. Nat. Protoc. 7, 670–685 (2012).
Article CAS PubMed Google Scholar
Piovesan, D., Minervini, G. & Tosatto, S. C. The RING 2.0 web server for high quality residue interaction networks. Nucleic Acids Res. 44 W1, W367–W374 (2016).
Article Google Scholar
Vishveshwara, S., Brinda, K. V. & Kannan, N. Protein structure: insights from graph theory. J. Theor. Comp. Chem. 1, 187–211 (2002).
Article CAS Google Scholar
Süel, G. M., Lockless, S. W., Wall, M. A. & Ranganathan, R. Evolutionarily conserved networks of residues mediate allosteric communication in proteins. Nat. Struct. Biol. 10, 59–69 (2003).
Article PubMed Google Scholar
del Sol, A., Fujihashi, H., Amoros, D. & Nussinov, R. Residues crucial for maintaining short paths in network communication mediate signaling in proteins. Mol. Syst. Biol. 2, 0019 (2006).
PubMed Google Scholar
Kornev, A. P., Haste, N. M., Taylor, S. S. & Eyck, L. F. Surface comparison of active and inactive protein kinases identifies a conserved activation mechanism. Proc. Natl. Acad. Sci. USA 103, 17783–17788 (2006).
Article CAS PubMed PubMed Central Google Scholar
Vishveshwara, S., Ghosh, A. & Hansia, P. Intra- and inter-molecular communications through protein structure network. Curr. Protein Pept. Sci. 10, 146–160 (2009).
Article CAS PubMed Google Scholar
Fanelli, F., Felline, A. & Raimondi, F. Network analysis to uncover the structural communication in GPCRs. Methods Cell. Biol. 117, 43–61 (2013).
Article CAS PubMed Google Scholar
Bhattacharyya, M., Ghosh, S. & Vishveshwara, S. Protein structure and function: looking through the network of side-chain interactions. Curr. Protein Pept. Sci. 17, 4–25 (2016).
Article CAS PubMed Google Scholar
Fanelli, F., Felline, A., Raimondi, F. & Seeber, M. Structure network analysis to gain insights into GPCR function. Biochem. Soc. Trans. 44, 613–618 (2016).
Article CAS PubMed Google Scholar
Ahnert, S. E., Marsh, A. J., Hernández, H., Robinson, C. V. & Teichmann, S. A. Principles of assembly reveal a periodic table of protein complexes. Science 350, aaa2245 (2015).
Article PubMed Google Scholar
Levy, E. D., Pereira-Leal, J. B., Chothia, C. & Teichmann, S. A. 3D complex: a structural classification of protein complexes. PLoS Comput. Biol. 2, e155 (2006).
Article PubMed PubMed Central Google Scholar
Greene, L. H. & Higman, V. A. Uncovering network systems within protein structures. J. Mol. Biol. 334, 781–791 (2003).
Article CAS PubMed Google Scholar
Venkatakrishnan, A. J. et al. Molecular signatures of G-protein-coupled receptors. Nature 494, 185–194 (2013).
Article CAS PubMed Google Scholar
Flock, T. et al. Universal allosteric mechanism for Gα activation by GPCRs. Nature 524, 173–179 (2015).
Article CAS PubMed PubMed Central Google Scholar
Venkatakrishnan, A. J. et al. Diverse activation pathways in class A GPCRs converge near the G-protein-coupling region. Nature 536, 484–487 (2016).
Article CAS PubMed PubMed Central Google Scholar
Flock, T. et al. Selectivity determinants of GPCR-G-protein binding. Nature 545, 317–322 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hauser, A. S. et al Pharmacogenomics of GPCR drug targets. Cell, https://doi.org/10.1016/j.cell.2017.11.033 (2017).
Google Scholar
Doncheva, N. T., Klein, K., Domingues, F. S. & Albrecht, M. Analyzing and visualizing residue networks of protein structures. Trends Biochem. Sci. 36, 179–182 (2011).
Article CAS PubMed Google Scholar
Martin, A. J. et al. RING: networking interacting residues, evolutionary information and energetics in protein structures. Bioinformatics 27, 2003–2005 (2011).
Article CAS PubMed Google Scholar
Zhang, X., Perica, T. & Teichmann, S. A. Evolution of protein structures and interactions from the perspective of residue contact networks. Curr. Opin. Struct. Biol. 23, 954–963 (2013).
Article CAS PubMed Google Scholar
Rose, P. W. et al. The RCSB protein data bank: integrative view of protein, gene and 3D structural information. Nucl. Acids Res. 45, D271–D281 (2017).
Article CAS PubMed Google Scholar
Tsai, J., Taylor, R., Chothia, C. & Gerstein, M. The packing density in proteins: standard radii and volumes. J. Mol. Biol. 290, 253–266 (1999).
Article CAS PubMed Google Scholar
Carpenter, B., Nehmé, R., Warne, T., Leslie, A. G. & Tate, C. G. Structure of the adenosine A(2A) receptor bound to an engineered G protein. Nature 536, 104–107 (2016).
Article CAS PubMed PubMed Central Google Scholar
Emamzadah, S., Tropia, L., Vincenti, I., Falquet, B. & Halazonetis, T. D. Reversal of the DNA-binding-induced loop L1 conformational switch in an engineered human p53 protein. J. Mol. Biol. 426, 936–944 (2014).
Article CAS PubMed Google Scholar
Laskowski, R. A. & Swindells, M. B. LigPlot+: multiple ligand-protein interaction diagrams for drug discovery. J. Chem. Inf. Model. 51, 2778–2786 (2011).
Article CAS PubMed Google Scholar
Cherezov, V. et al. High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor. Science 318, 1258–1265 (2007).
Article CAS PubMed PubMed Central Google Scholar
Mendes, H. F., van der Spuy, J., Chapple, J. P. & Cheetham, M. E. Mechanisms of cell death in rhodopsin retinitis pigmentosa: implications for therapy. Trends Mol. Med. 11, 177–185 (2005).
Article CAS PubMed Google Scholar
del Sol, A., Fujihashi, H., Amoros, D. & Nussinov, R. Residue centrality, functionally important residues, and active site shape: analysis of enzyme and non-enzyme families. Protein Sci. 15, 2120–2128 (2006).
Article PubMed PubMed Central Google Scholar
Soundararajan, V., Raman, R., Raguram, S., Sasisekharan, V. & Sasisekharan, R. Atomic interaction networks in the core of protein domains and their native folds. PLoS ONE 5, e9391 (2010).
Article PubMed PubMed Central Google Scholar
Isberg, V. et al. Generic GPCR residue numbers—aligning topology maps while minding the gaps. Trends Pharmacol. Sci. 36, 22–31 (2015).
Article CAS PubMed Google Scholar
Isberg, V. et al. GPCRdb: an information system for G protein-coupled receptors. Nucleic Acids Res. 44 D1, D356–D364 (2016).
Article Google Scholar
Hildebrand, P. W. et al. A ligand channel through the G protein coupled receptor opsin. PLoS ONE 4, e4382 (2009).
Article PubMed PubMed Central Google Scholar
Deupi, X. et al. Stabilized G protein binding site in the structure of constitutively active metarhodopsin-II. Proc. Natl. Acad. Sci. USA 109, 119–124 (2012).
Article CAS PubMed Google Scholar
O’Donoghue, S. I. et al. Visualizing biological data-now and in the future. Nat. Methods 7 (Suppl.), S2–S4 (2010).
Article Google Scholar
Velankar, S. et al. PDBe: improved accessibility of macromolecular structure data from PDB and EMDB. Nucl. Acids Res. 44, D385–D395 (2016).
Article CAS PubMed Google Scholar
Kabsch, W. & Sander, C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983).
Article CAS PubMed Google Scholar
Touw, W. G. et al. A series of PDB-related databanks for everyday needs. Nucl. Acids Res. 43, D364–D368 (2015).
Article CAS PubMed Google Scholar
Cavallo, L., Kleinjung, J. & Fraternali, F. POPS: a fast algorithm for solvent accessible surface areas at atomic and residue level. Nucl. Acids Res. 31, 3364–3366 (2003).
Article CAS PubMed PubMed Central Google Scholar
Kannan, N. & Vishveshwara, S. Identification of side-chain clusters in protein structures by a graph spectral method. J. Mol. Biol. 292, 441–464 (1999).
Article CAS PubMed Google Scholar
Costa, L. F., Rodrigues, F. A., Travieso, G. & Villas Boas, P. R. Characterization of complex networks: a survey of measurements. Adv. Phys. 56, 167–242 (2007).
Article Google Scholar
Freeman, L. C. A set of measures of centrality based on betweenness. Sociometry 40, 35–41 (1977).
Article Google Scholar
Yoon, J., Blumer, A. & Lee, K. An algorithm for modularity analysis of directed and weighted biological networks based on edge-betweenness centrality. Bioinformatics 22, 3106–3108 (2006).
Article CAS PubMed Google Scholar
Bavelas, A. Communication patterns in task-oriented groups. J. Acoust. Soc. Am. 22, 725–730 (1950).
Article Google Scholar
Sabidussi, G. The centrality of a graph. Psychometrika 31, 581–603 (1966).
Article CAS PubMed Google Scholar
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank A. Lesk, C. Chothia, S. Balaji, H. Harbrecht, I. Huppertz, M. Ouédraogo, G. Chalancon, A. Morgunov, A. Murzin, A. Andreeva, G. de Baets, R. Peer, S. Chavali, A. Sente, N. S. Latysheva, A. Gunnarsson, and A. Hauser for their comments on this work. We thank T. Nakane for his inputs on structure visualization using WebGLMol. This work was supported by the Medical Research Council (MC_U105185859; M.M.B., M.K., C.N.J.R., T.F., A.J.V., and J.S.-B.), the LMB-Cambridge scholarship (A.J.V.), the St. John's College Benefactor scholarship (A.J.V.), the AFR scholarship from the Luxembourg National Research Fund (C.N.J.R.), and the Boehringer Ingelheim Fond (T.F.). T.F. is a Research Fellow of Fitzwilliam College, University of Cambridge, UK. M.M.B. is a Lister Institute Research Prize Fellow.

Author information

Melis Kayikci
Present address: Genomics England, London, UK
A. J. Venkatakrishnan
Present address: Department of Molecular and Cellular Physiology, Department of Computer Science, and Institute for Computational and Mathematical Engineering, Stanford University, Stanford, CA, USA
James Scott-Brown
Present address: University of Oxford, Oxford, UK
Tilman Flock
Present address: Paul Scherrer Institute, Villigen, Switzerland
Melis Kayikci and A.J. Venkatakrishnan contributed equally to this work.

Authors and Affiliations

MRC Laboratory of Molecular Biology, Cambridge, UK
Melis Kayikci, A. J. Venkatakrishnan, James Scott-Brown, Charles N. J. Ravarani, Tilman Flock & M. Madan Babu
Fitzwilliam College, University of Cambridge, Cambridge, UK
Tilman Flock

Authors

Melis Kayikci
View author publications
You can also search for this author in PubMed Google Scholar
A. J. Venkatakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
James Scott-Brown
View author publications
You can also search for this author in PubMed Google Scholar
Charles N. J. Ravarani
View author publications
You can also search for this author in PubMed Google Scholar
Tilman Flock
View author publications
You can also search for this author in PubMed Google Scholar
M. Madan Babu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K. collected the data, developed the computational pipeline, and built the web server. A.J.V. designed the prototype of the representations with M.M.B. A.J.V., M.K., J.S.-B., and M.M.B. optimized the representations, and M.K. and J.S.-B. implemented the representations. M.K. and A.J.V. performed the GPCR analyses. J.S.-B. made the prototype of the web server. C.N.J.R. and T.F. helped with the web server and analyzing examples. M.K., C.N.J.R., and A.J.V. independently wrote separate drafts of the manuscript. M.K., A.J.V., and M.M.B. wrote the final manuscript with critical inputs from C.N.J.R., J.S.-B., and T.F.; M.K., A.J.V., and C.N.J.R. prepared the figures. A.J.V. and M.M.B. conceived and planned the project. M.K., A.J.V., and M.M.B. executed the project. M.M.B. supervised the project.

Corresponding authors

Correspondence to Melis Kayikci, A. J. Venkatakrishnan or M. Madan Babu.

Ethics declarations

Competing interests

The authors declare no competing financial interest.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Integrated supplementary information

Supplementary Figure 1 Plot of all PDB structures analyzed

a, The number of residues (x-axis) vs the number of non-covalent residue contacts (y-axis) in each PDB. b, The number of atoms (x-axis) vs the number of non-covalent atomic contacts (y-axis) in each PDB. Each dot represents a PDB structure. On average, each of these structures contains 5,677 atoms, 45,042 atomic contacts, 684 residues, 3,043 residue contacts, 3 chains, 2,441 atomic contacts between chains, and 125 residue contacts between chains.

Supplementary Figure 2 Visualization and analysis page

The biomolecular complex network panel, sequence panel, 3D structure panel and contacts panel are shown for the β2 adrenergic receptor-G protein complex structure (PDB ID: 3SN6). The sequence panel shows the secondary structure elements (SSEs) and the amino acid sequence. In the sequence panel, some positions are highlighted with a line on the top, using bold typeface or a red color. A line on top of a residue denotes availability of disease mutation data via Uniprot, which can be accessed by moving the mouse over the residue. Bold letters denote residues that were previously hovered over. Red colored letters denote residues currently selected. Helices are represented as rectangles and sheets are represented by arrows. The top line shows the entire SSEs of the chain and the line below shows the zoomed in version with the loops as a thin black line. The size of the rectangles and arrows are proportional to the length of the SSEs. The contacts panel shows the secondary structures view in which the chord plot can be seen on the top left corner with the selected secondary structures (Helix 38 in blue and Helix 40 in orange). In the middle, the residue contact matrix is shown with all the contacting residues of the selected SSEs. The residues belonging to each SSE are highlighted in their respective colors. The number of atomic contacts is shown within the matrix. On the right is the 3D structure panel, which shows the 3D view of the receptor (grey cartoon view).

Supplementary Figure 3 Mapping disease mutations in rhodopsin

a, Selected residues of rhodopsin (1GZM; shown in the text field) are highlighted on the scatter plot in red. b, 3D structure view of rhodopsin (1GZM, chain A) is shown in network view (grey) with contacts highlighted in blue. The thickness of the residue contact denotes the number of atomic contacts made by the interacting residues.

Supplementary Figure 4 Mapping external information for residue-level analysis of structures

a, Scatter plot of residues in the G protein after mapping their stability data (experimentally inferred binding affinity) using a color spectrum (min: cyan; max: magenta). Stability was measured as ΔTm = Tm of Gαi1(Ala) - Tm of Gαi1(WT) with 1mM GDP. Residue F336^G.H5.8 is shown in red and the rest of the residues in cyan and magenta according to their ΔTm values. b, Network view and asteroid plot of F336^G.H5.8 shows the extensive nature of the contacts this residue mediates in the GDP bound structure of Gαi1 (1GDD).

Supplementary Figure 5 Average of the maximum number of atomic contacts made by a residue across all non-redundant crystal structures

These values are used to normalize the contacts with respect to the size of the amino acid (Methods).

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–5 and Supplementary Notes 1 and 2

Life Sciences Reporting Summary

Supplementary Data Set 1

Sample downloadable statistics for PDB 3SN6.

Supplementary Data Set 2

Sample downloadable list of contacts for PDB 3SN6.

Supplementary Data Set 3

Sample downloadable structure report for PDB 5C1M.

Supplementary Data Set 4

Raw data used for analysis of Rhodopsin conformational cycle.

Supplementary Data Set 5

Contact statistics for all structures in the PDB.

Supplementary Data Set 6

Source Data, Supplementary Fig. 5a,d.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kayikci, M., Venkatakrishnan, A.J., Scott-Brown, J. et al. Visualization and analysis of non-covalent contacts using the Protein Contacts Atlas. Nat Struct Mol Biol 25, 185–194 (2018). https://doi.org/10.1038/s41594-017-0019-z

Download citation

Received: 23 February 2017
Accepted: 11 December 2017
Published: 15 January 2018
Issue Date: February 2018
DOI: https://doi.org/10.1038/s41594-017-0019-z

This article is cited by

Illuminating the mechanism and allosteric behavior of NanoLuc luciferase
- Michal Nemergut
- Daniel Pluskal
- Martin Marek
Nature Communications (2023)
The implication of holocytochrome c synthase mutation in Korean familial hypoplastic amelogenesis imperfecta
- Hyejin Choi
- Kwanghwan Lee
- Jae Hoon Lee
Clinical Oral Investigations (2022)
Cryo-EM structures of human RNA polymerase III in its unbound and transcribing states
- Mathias Girbig
- Agata D. Misiaszek
- Christoph W. Müller
Nature Structural & Molecular Biology (2021)
An online GPCR structure analysis platform
- Albert J. Kooistra
- Christian Munk
- David E. Gloriam
Nature Structural & Molecular Biology (2021)
Molecular determinants underlying functional innovations of TBP and their impact on transcription initiation
- Charles N. J. Ravarani
- Tilman Flock
- Santhanam Balaji
Nature Communications (2020)