Residues crucial for maintaining short paths in network communication mediate signaling in proteins
Antonio del Sol1, Hirotomo Fujihashi1, Dolors Amoros1 & Ruth Nussinov2,3
- Bioinformatics Research Unit, Research and Development Division, Fujirebio Inc., Hachioji-shi, Tokyo, Japan
- Basic Research Program, SAIC-Frederick Inc., Center for Cancer Research, Nanobiology Program, National Cancer Institute, Frederick, MD, USA
- Department of Human Genetics and Molecular Medicine, Sackler Institute of Molecular Medicine, Tel Aviv University, Tel Aviv, Israel
Correspondence to: Antonio del Sol1 Bioinformatics Research Unit, Research and Development Division, Fujirebio Inc., 51 Komiya-cho, Hachioji-shi, Tokyo 192-0031, Japan. Tel.: +81 426 45 4740; Fax: +81 426 46 8325; E-mail: Email: ao-mesa@fujirebio.co.jp
Received 19 September 2005; Accepted 15 March 2006; Published online 2 May 2006
Article highlights
- Protein structures are represented as residue interacting networks to identify key residues generating the network's small-world character.
- Fold centrally conserved residues are key in maintaining short path lengths and correspond to residues experimentally shown to mediate signaling.
- Residues whose removal increases the characteristic path length relate to system fragility.
- Study of seven allosteric protein families and identification of key residues for allosteric communications as the conserved interconnectivity determinants for the family fold.
Synopsis
Evolution of protein fold is determined by the constraints imposed by its function. An important characteristic for maintaining function is the robustness of protein structures to mutagenesis allowing a level of sequence plasticity. This robustness is accompanied by an extreme sensitivity to mutations at some sites. It has been shown that protein structures can be represented as small-world networks of interactions between amino acids, with residues corresponding to vertices and contacts between them representing the edges (Greene and Higman, 2003). These networks are usually highly clustered with a few links connecting any pair of nodes (Watts and Strogatz, 1998). Consequently, there are relatively few residues interconnecting all residues in the structure.
Although protein structures are robust complex systems, they are also fragile to perturbations at key positions (Taverna and Goldstein, 2002). Experimental studies show that a significant number of single-site mutations have little effect on the protein function, whereas perturbations of key amino acids can abolish protein activity or folding. This robustness is expected to be an intrinsic characteristic of the protein fold. Viewing protein structures as information processing networks, where the communicated information can be transmitted in a physical (or chemical) form, it would be reasonable to assume that certain amino acids are crucial for network communications. Residues receiving and propagating information are expected to be central in the interaction network, lying on the shortest pathways between most residue pairs in the protein. Although the propagation of the information in protein structures is poorly understood, a number of theoretical results have suggested the crucial role of the central residues (Dokholyan et al, 2002; Vendruscolo et al, 2002; Amitai et al, 2004; del Sol and O'Meara, 2004).
Allostery is based on communication and transmission of information from one functional site to another. Using our network representation of protein structures, removal of most vertices (amino acids) with their corresponding edges does not affect substantially the network's interconnectedness expressed by the average of the shortest path distance between all pairs of vertices. On the other hand, removal of fold centrally conserved residues (including their links) affects significantly the network's interconnectedness, suggesting that these residues are crucial in preserving short path lengths. We termed these key amino acids 'interconnectivity determinants' (ICD).
We studied seven allosteric protein families with experimental information on key residues in allosteric communications (myoglobins, G-protein-coupled receptors, the trypsin class of serine proteases, hemoglobins, oligosaccharide phosphorylases, nuclear receptor ligand-binding domains and retroviral proteases). In each case, based on the protein family structural alignment, we determined the ICDs in the structures of most family members (we termed these positions 'conserved interconnectivity determinants' or CICD residues; Figure 2).
Figure 2
Schematic representation of the analysis for determining the conserved central positions based on an example of a protein family comprising four proteins. The position shown in red in the family structural alignment is central in the network representation of each family member. In the family member structures, this same position is represented in blue.
Full figure and legend (188K)Figures & Tables indexOur results revealed a general correspondence between the CICDs and experimentally annotated key residues for allosteric communications. Interestingly, some of the CICD residues in four of the analyzed examples (G-protein-coupled receptors, the trypsin class of serine proteases, hemoglobins and nuclear receptor ligand-binding domains) were found to be amino acids involved in the networks of statistically coupled residues as predicted by Ranganathan and co-workers (Süel et al, 2002). Thus, our findings show that CICD residues, that is, centrally conserved residues crucial for maintaining shorter path lengths in the protein network, mediate the signaling process in protein families, illustrating that topology plays an important role in network communication. The myoglobin family deserves special attention owing to the recent findings on the allosteric nature of myoglobin. This protein illustrates that certain characteristics of a protein design may be involved in new functions. Interestingly, all the key residues whose removal significantly elongates the path length in the network correspond to either residues binding the heme group, amino acids lining three of the main xenon cavities and thus likely to be important for the myoglobin allostery or to redox-active residues, which act in a cooperative way for optimal protein function. The HIV-1 protease is also another interesting example, where our predictions could shed light on some non-active site residues, which could be involved in the communications between the non-active site residues and the active site. Further experiments are required to test our predictions.
Acknowledgements
This project has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under contract number NO1-CO-12400. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products or organizations imply endorsement by the US Government. This research was supported (in part) by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research. One of the authors (AS) thanks Tara C Marshall for her help in editing of this manuscript.
References
- Amitai G, Shemesh A, Sitbon E, Shklar M, Netanely D, Venger I, Pietrokovski S (2004) Network analysis of protein structures identifies functional residues. J Mol Biol 344: 1135–1146 | Article | PubMed | ISI | ChemPort |
- del Sol A, O'Meara P (2004) Small-world network approach to identify key residues in protein–protein interaction. Proteins 58: 672–682 | ISI |
- Dokholyan NV, Li L, Ding F, Shakhnovich EI (2002) Topological determinants of protein folding. Proc Natl Acad Sci USA 99: 8637–8641 | Article | PubMed | ChemPort |
- Greene L, Higman V (2003) Uncovering network systems within protein structures. J Mol Biol 334: 781–791 | Article | PubMed | ISI | ChemPort |
- Süel GM, Lockless SW, Wall MA, Ranganathan R (2002) Evolutionary conserved networks of residues mediate allosteric communication in proteins. Nat Struct Biol 10: 59–69
- Taverna DM, Goldstein RA (2002) Why are proteins so robust to site mutations? J Mol Biol 315: 479–484 | Article | PubMed | ISI | ChemPort |
- Vendruscolo M, Dokholyan NV, Paci E, Karplus M (2002) Small-world view of the amino acids that play a key role in protein fold. Phys Rev E 65: 0619101–0619104 | Article |
- Watts DJ, Strogatz SH (1998) Collective dynamics of small-world networks. Nature 393: 440–442 | Article | PubMed | ISI | ChemPort |


