Synopsis

Subject Categories: Metabolic and regulatory networks

Molecular Systems Biology 2 Article number: 66  doi:10.1038/msb4100103
Published online: 28 November 2006
Citation: Molecular Systems Biology 2:66

Biological context networks: a mosaic view of the interactome

John Rachlin1,2, Dikla Dotan Cohen2, Charles Cantor3,4,5 & Simon Kasif2,3,6

  1. Department of Computer Science, Boston University, Boston, MA, USA
  2. Center for Advanced Genomic Technologies, Boston University, Boston, MA, USA
  3. Department of Biomedical Engineering, Boston University, Boston, MA, USA
  4. Center for Advanced Biotechnology, Boston University, Boston, MA, USA
  5. SEQUENOM Inc., San Diego, CA, USA
  6. Children's Hospital Boston, Boston, MA, USA

Correspondence to: John Rachlin1,2 Department of Computer Science, Boston University, 111 Cummington Ave, Boston, MA 02215, USA. Tel.: +1 617 921 9669; Fax: +1 617 353 4814; E-mail: Email: rachlin@bu.edu

Received 30 January 2006; Accepted 22 September 2006; Published online 28 November 2006

Top

Article highlights

  • The scale-free topology of a protein-protein interaction network can be reconstituted by the aggregation of small context-specific sub-networks.
  • Biological context networks provide a model for context-specific protein-protein interactions, leading to novel measures such as 'interactive promiscuity' that characterizes the extent to which interacting neighbors change from one context to another.
  • Over 70% of the top 2% of proteins ranked by context sensitive measures described in the paper are found to be essential, compared to about 53% of the top 2% ranked by node degree.

Top

Synopsis

The machinery of the cell involves a complex network of biochemical interactions. The full complement of known interactions (the so-called interactome) is not a static construct. Rather, specific interactions are activated or deactivated as part of cell-specific processes involving metabolism, cell cycle regulation, signal transduction, and hundreds of other characteristic processes. In this paper, we present a parsimonious network model (a 'biological context network') in which the nodes of the network, representing proteins, are deployed in one of several biological contexts, affecting whether or not connecting edges are active. The specification or the biological program provided by the model only articulates which biological contexts are associated with each protein and whether two proteins, each active in a particular context, interact. We study two types of context specification, GO biological processes or GO protein localization, but the formalism naturally extends to modeling contexts in the form of environmental stimuli, pathological conditions, tissues, or even organisms. The model elucidates interactions that are either conserved or variable from one context to another.

Graphs and their variants are the foundation for modeling complex biological systems. Graph topology reveals the basic properties of connectivity, robustness, modularity, hierarchical structure, and other properties, enabling identification of protein complexes or functional modules (Segal et al, 2003; Spirin and Mirny, 2003), and serves to aid whole-genome functional annotation efforts (Marcotte et al, 1999; Letovsky and Kasif, 2003). Biological networks are also of commercial interest as an aid to drug target discovery (Gardner et al, 2003) or for predicting toxic side effects, and are at the heart of pharmaceutical initiatives focused on integrating and mining pathway data sets (Hood et al, 2004).

Protein interaction networks are often obtained by high-throughput detection assays (Rual et al, 2005) or inferred from literature surveys (Mishra et al, 2006). As a result, they represent a high-level integrated summary of a large number of interactions inferred from many biological contexts. However, representing the interactome as a static biological network is akin to a long-exposure photograph that can mask context-specific patterns of activation across multiple processes, cellular locations, and time. Conclusions drawn from the full network's topology may be compromised by these inherent limitations. A central goal of systems biology research is to elucidate the underlying patterns of interaction in an effort to obtain more realistic and predictive models of the cell (Ideker et al, 2001). This has prompted the development of a broad range of graphical representations coupled with mathematical equations intended to model cellular dynamics. By contrast, protein–protein interaction networks are typically represented as a standard undirected graph where vertices correspond to individual proteins and edges connect pairs of interacting proteins. Biological context networks provide an intermediate-level formalism, in which we label proteins with contextual information about the protein and activate protein interactions as specified by the succinct biological program associated with the network. In its simplest form, the program activates an edge whenever two interacting proteins are in a shared contextual state, and otherwise assumes that the interaction has been inactivated. The biological context network model enables one to view the interactome as a mosaic of overlapping sub-networks each associated with specific contexts or conditions and to further characterize changes in topology from one context to another. For example, in Figure 1, we show the context-specific sub-networks in the local neighborhood of the protein Sec13, highlighting its association with both the nuclear pore complex and the endoplasmic reticulum (Enninga et al, 2003).

Figure 1
Figure 1 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

The local context networks for Sec13 with respect to two of its current GO biological process annotations—GO:0006888, nuclear pore organization and biogenesis, and GO:0006999, ER to Golgi transport—highlighting Sec13's association with both the nuclear pore complex and the ER. Sec13 is an example of a protein whose interacting partners vary from one process context to another. We characterize such proteins as 'interactively promiscuous.' The shuttling of Sec13 between the nucleus and the cytoplasm is believed to play a cross-functional regulatory role (Enninga et al, 2003).

Full figure and legend (260K)Figures & Tables index

It has been widely observed that a broad range of social, technological, and biological networks are scale-free, characterized by a power-law degree distribution where a few 'hub' proteins have many interacting partners, whereas most proteins have very few (Barabasi and Oltvai, 2004). Furthermore, high-degree 'hubs' in protein–protein interaction networks are more likely to be essential for the viability of the organism. In this paper, we provide some evidence that a power-law distribution, while clearly evident in the aggregate experimental protein–protein interaction data, is plausibly an artifact of the aggregation of interactions across multiple process-specific contexts. This observation suggests that paths connecting disparate protein pairs may be substantially impacted by intervening contextual differences. We show, for example, that aggregating approximately 100 small (<14 edges) GO biological process 'leaf-term' sub-networks (i.e., sub-networks corresponding to the most highly specific GO terms) is sufficient to reconstitute a scale-free network (R2=0.88).

Context-specific sub-networks derived using the biological context network model provide the basis for characterizing proteins with respect to several context-specific measures. These include the following

  • Context degree: The degree of a node (number of interacting partners), considering only those partners that share at least one context (annotation).
  • Context mutual information: Measures the degree to which the annotations of neighboring proteins are correlated.
  • Interactive promiscuity: Measures the variability of annotations (contexts) among a protein's interacting partners in an effort to identify those proteins likely to play a cross-contextual 'linking' role.

The context mutual information and interactive promiscuity measures are based on patterns of annotation among a protein's interacting partners, as shown, for example, in Figure 5.

Figure 5
Figure 5 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

(A) DIP yeast sub-network of all 991 essential proteins. (B) Essential proteins having context degree >1. Node coloring is according to the degree of the protein in the full DIP network: 1–4 neighbors (blue), 5–9 neighbors (green), 10–14 neighbors (yellow), 15+ neighbors (red). Many of the essential proteins aggregate into clusters of essential protein complexes that are typically related to cell-cycle regulation and mRNA processing. As a result of the network's improved specificity, context degree is a better predictor for knockout lethality, although applicable only to annotated nodes.

Full figure and legend (179K)Figures & Tables index

Interestingly, we find that the top-ranked proteins with respect to each of these context-specific measures are highly enriched in essential proteins and these measures provide a significantly improved predictor for knockout lethality than the static measure of degree computed from the original 'context-free' network. For example, more than 70% of the top 2% of proteins ranked by either context mutual information or interactive promiscuity are essential, compared to just 53% of the top 2% of proteins ranked by degree.

The biological context network formalism provides insight into the statistical topological characteristics of the network within specific contexts, including hubs, dense-sub-graphs, connectivity, and centrality, but quantified with respect to particular contexts. This formalism also suggests an explanation for the emergence of scale-free properties in protein–protein interaction networks, and offers a measure for the interactive promiscuity of a protein, highlighting those proteins that are either intrinsically multi-functional or are a subunit of a multi-functional complex. Future applications of context-specific functional modules and networks include the modeling of cross-context connectivity, directional and stochastic models of context networks, and the context-specific effects of perturbations on biological function, and enabling improved selection of drug targets by way of more reliable models of toxicity.

Top

Acknowledgements

We thank Noga Alon, Vera Asodi, John Byers, Jim Collins, Tim Gardner, Daniel Hanisch, Sid Redner, and Richard J Roberts for a wide range of helpful suggestions and insights. We also thank our three anonymous reviewers for their extremely helpful comments. This work was supported in part by NSF grants DBI-0239435 and ITR-048715, NHGRI grant #1R33HG002850-01A1, and NIH grant U54 LM008748.

Top

References

  1. Barabasi AL, Oltvai ZN (2004) Network biology: understanding the cell's functional organization. Nat Rev Genet 5: 101–113 | Article | PubMed | ISI | ChemPort |
  2. Enninga J, Levay A, Fontoura BM (2003) Sec13 shuttles between the nucleus and the cytoplasm and stably interacts with Nup96 at the nuclear pore complex. Mol Cell Biol 23: 7271–7284 | Article | PubMed | ISI | ChemPort |
  3. Gardner TS, di Bernardo D, Lorenz D, Collins JJ (2003) Inferring genetic networks and identifying compound mode of action via expression profiling. Science 301: 102–105 | Article | PubMed | ISI | ChemPort |
  4. Hood L, Heath JR, Phelps ME, Lin B (2004) Systems biology and new technologies enable predictive and preventative medicine. Science 306: 640–643 | Article | PubMed | ISI | ChemPort |
  5. Ideker T, Galitski T, Hood L (2001) A new approach to decoding life: systems biology. Annu Rev Genomics Hum Genet 2: 343–372 | Article | PubMed | ISI | ChemPort |
  6. Letovsky S, Kasif S (2003) Predicting protein function from protein/protein interaction data: a probabilistic approach. Bioinformatics 19 (Suppl 1): i197–i204 | Article |
  7. Marcotte EM, Pellegrini M, Thompson MJ, Yeates TO, Eisenberg D (1999) A combined algorithm for genome-wide prediction of protein function. Nature 402: 83–86 | Article | PubMed | ISI | ChemPort |
  8. Mishra GR, Suresh M, Kumaran K, Kannabiran N, Suresh S, Bala P, Shivakumar K, Anuradha N, Reddy R, Raghavan TM, Menon S, Hanumanthu G, Gupta M, Upendran S, Gupta S, Mahesh M, Jacob B, Mathew P, Chatterjee P, Arun KS, Sharma S, Chandrika KN, Deshpande N, Palvankar K, Raghavnath R, Krishnakanth R, Karathia H, Rekha B, Nayak R, Vishnupriya G, Kumar HG, Nagini M, Kumar GS, Jose R, Deepthi P, Mohan SS, Gandhi TK, Harsha HC, Deshpande KS, Sarker M, Prasad TS, Pandey A (2006) Human protein reference database—2006 update. Nucleic Acids Res 34 (Database issue): D411–D414 | Article |
  9. Rual JF, Venkatesan K, Hao T, Hirozane-Kishikawa T, Dricot A, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, Klitgord N, Simon C, Boxem M, Milstein S, Rosenberg J, Goldberg DS, Zhang LV, Wong SL, Franklin G, Li S, Albala JS, Lim J, Fraughton C, Llamosas E, Cevik S, Bex C, Lamesch P, Sikorski RS, Vandenhaute J, Zoghbi HY, Smolyar A, Bosak S, Sequerra R, Doucette-Stamm L, Cusick ME, Hill DE, Roth FP, Vidal M (2005) Towards a proteome-scale map of the human protein–protein interaction network. Nature 437: 1173–1178 | Article | PubMed | ISI | ChemPort |
  10. Segal E, Shapira M, Regev A, Pe'er D, Botstein D, Koller D, Friedman N (2003) Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data. Nat Genet 34: 166–176 | Article | PubMed | ISI | ChemPort |
  11. Spirin V, Mirny LA (2003) Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci USA 100: 12123–12128 | Article | PubMed | ChemPort |

MORE ARTICLES LIKE THIS

These links to content published by NPG are automatically generated.

Extra navigation

.
ADVERTISEMENT