Key Points
-
The integration of genetic and physical maps was a defining feature of the Human Genome Project. Mapping of the cell's regulatory and signalling networks is now proceeding along highly analogous lines.
-
A first step in sequencing the human genome was to assign quality scores to each sequenced nucleotide. In the case of physical and genetic interactions, the method of choice for improving quality is integration of data across a wide variety of measurement types.
-
Genome assembly was the process of putting sequence reads together to form contigs. In the context of molecular interactions, assembly refers to the integration of individual interactions into larger network structures that represent pathways, protein complexes and other components of cellular machinery.
-
Network assembly is aided by a classification system for molecular interactions. Towards this goal, recent studies have begun to place interactions into various categories beyond the initial division into genetic and physical.
-
Categories of interactions include ordered versus unordered, transient versus stable, between- versus within-pathway, alleviating versus aggravating, and interactions of the first versus second degree.
-
These types are being combined with one another in various combinations to assemble integrated network models. Examples include integration of protein–protein interactions with aggravating, alleviating or ordered genetic interactions, as well as integration of eQTLs with protein–DNA transcriptional interactions.
-
A final step is network annotation: inference of additional details such as interaction dynamics, strengths and condition-specificity onto the static network.
-
Integration of genetic and physical interaction mapping data will be particularly important to the current wave of genome-wide association studies, in which many genetic interactions are apparent with little physical or mechanistic explanation.
Abstract
Physical and genetic mapping data have become as important to network biology as they once were to the Human Genome Project. Integrating physical and genetic networks currently faces several challenges: increasing the coverage of each type of network; establishing methods to assemble individual interaction measurements into contiguous pathway models; and annotating these pathways with detailed functional information. A particular challenge involves reconciling the wide variety of interaction types that are currently available. For this purpose, recent studies have sought to classify genetic and physical interactions along several complementary dimensions, such as ordered versus unordered, alleviating versus aggravating, and first versus second degree.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Yu, A. et al. Comparison of human genetic and sequence-based physical maps. Nature 409, 951–953 (2001).
Sturtevant, A. H. The linear arrangement of six sex-linked factors in Drosophila, as shown by their mode of association. J. Exp. Zool. 14, 43–59 (1913).
Goss, S. J. & Harris, H. New method for mapping genes in human chromosomes. Nature 255, 680–684 (1975).
Cox, D. R., Burmeister, M., Price, E. R., Kim, S. & Myers, R. M. Radiation hybrid mapping: a somatic cell genetic method for constructing high-resolution maps of mammalian chromosomes. Science 250, 245–250 (1990).
Fauth, C. & Speicher, M. R. Classifying by colors: FISH-based genome analysis. Cytogenet. Cell Genet. 93, 1–10 (2001).
Rowen, L., Mahairas, G. & Hood, L. Sequencing the human genome. Science 278, 605–607 (1997).
Green, P. Whole-genome disassembly. Proc. Natl Acad. Sci. USA 99, 4143–4144 (2002).
Twyman, R. M. & Primrose, S. B. Techniques patents for SNP genotyping. Pharmacogenomics 4, 67–79 (2003).
Stein, L. Genome annotation: from sequence to biology. Nature Rev. Genet. 2, 493–503 (2001).
Sharan, R. & Ideker, T. Modeling cellular machinery through biological network comparison. Nature Biotechnol. 24, 427–433 (2006).
Fields, S. High-throughput two-hybrid analysis. The promise and the peril. FEBS J. 272, 5391–5399 (2005).
Tong, A. H. et al. Global mapping of the yeast genetic interaction network. Science 303, 808–813 (2004). A landmark paper that explores a large genetic interaction network in yeast, and introduces the idea of genetic congruence — a second-degree genetic interaction.
Greenwald, I. in WormBook (ed. The C. elegans Research Community) [online], 4 August 2005 (doi/10.1895/wormbook.1.10.1).
Botstein, D. et al. in The Molecular and Cellular Biology of the Yeast Saccharomyces: Cell Cycle and Cell Biology (eds Pringle, J., Broach, J. & Jones, E.) (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1997).
Boone, C., Bussey, H. & Andrews, B. J. Exploring genetic interactions and networks with yeast. Nature Rev. Genet. 8, 437–449 (2007). A review of theory and approaches to mapping genetic interaction networks.
Bork, P. et al. Protein interaction networks from yeast to human. Curr. Opin. Struct. Biol. 14, 292–299 (2004).
Ewing, B., Hillier, L., Wendl, M. C. & Green, P. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185 (1998).
Jansen, R. C. Studying complex biological systems using multifactorial perturbation. Nature Rev. Genet. 4, 145–151 (2003).
Sprinzak, E., Altuvia, Y. & Margalit, H. Characterization and prediction of protein–protein interactions within and between complexes. Proc. Natl Acad. Sci. USA 103, 14718–14723 (2006).
Suthram, S., Shlomi, T., Ruppin, E., Sharan, R. & Ideker, T. A direct comparison of protein interaction confidence assignment schemes. BMC Bioinformatics 7, 360 (2006).
Lee, I., Date, S. V., Adai, A. T. & Marcotte, E. M. A probabilistic functional network of yeast genes. Science 306, 1555–1558 (2004).
Rhodes, D. R. et al. Probabilistic model of the human protein–protein interaction network. Nature Biotechnol. 23, 951–959 (2005).
Beyer, A. et al. Integrated assessment and prediction of transcription factor binding. PLoS Comput. Biol. 2, e70 (2006).
Hollunder, J., Beyer, A. & Wilhelm, T. Identification and characterization of protein subcomplexes in yeast. Proteomics 5, 2082–2089 (2005).
Collins, S. R. et al. Towards a comprehensive atlas of the physical interactome of Saccharomyces cerevisiae. Mol. Cell. Proteomics 6, 439–450 (2007).
von Mering, C. et al. Comparative assessment of large-scale data sets of protein–protein interactions. Nature 417, 399–403 (2002). The first comparison of the quality of various high-throughput physical interaction data sets.
Pellegrini, M., Marcotte, E. M., Thompson, M. J., Eisenberg, D. & Yeates, T. O. Assigning protein functions by comparative genome analysis: protein phylogenetic profiles. Proc. Natl Acad. Sci. USA 96, 4285–4288 (1999).
Stuart, J. M., Segal, E., Koller, D. & Kim, S. K. A gene-coexpression network for global discovery of conserved genetic modules. Science 302, 249–255 (2003).
Collins, S. R., Schuldiner, M., Krogan, N. J. & Weissman, J. S. A strategy for extracting and analyzing large-scale quantitative epistatic interaction data. Genome Biol. 7, R63 (2006).
Downard, K. M. Ions of the interactome: the role of MS in the study of protein interactions in proteomics and structural biology. Proteomics 6, 5374–5384 (2006).
Legrain, P., Wojcik, J. & Gauthier, J. M. Protein–protein interaction maps: a lead towards cellular functions. Trends Genet. 17, 346–352 (2001).
Myers, C. L., Barrett, D. R., Hibbs, M. A., Huttenhower, C. & Troyanskaya, O. G. Finding function: evaluation methods for functional genomic data. BMC Genomics 7, 187 (2006).
Estojak, J., Brent, R. & Golemis, E. A. Correlation of two-hybrid affinity data with in vitro measurements. Mol. Cell. Biol. 15, 5820–5829 (1995).
Gunsalus, K. C. et al. Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis. Nature 436, 861–865 (2005).
Avery, L. & Wasserman, S. Ordering gene function: the interpretation of epistasis in regulatory hierarchies. Trends Genet. 8, 312–316 (1992).
Ptacek, J. et al. Global analysis of protein phosphorylation in yeast. Nature 438, 679–684 (2005).
Kelley, R. & Ideker, T. Systematic interpretation of genetic interactions using protein networks. Nature Biotechnol. 23, 561–566 (2005). The first large-scale identification of genetic interactions within and between pathways.
Ulitsky, I. & Shamir, R. Pathway redundancy and protein essentiality revealed in the Saccharomyces cerevisiae interaction networks. Mol. Syst. Biol. 3, 104 (2007).
Collins, S. R. et al. Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map. Nature 446, 806–810 (2007). A large-scale identification of alleviating and aggravating genetic interactions and an interpretation of these interactions in the context of protein complexes.
Drees, B. L. et al. Derivation of genetic interaction networks from quantitative phenotype data. Genome Biol. 6, R38 (2005).
St Onge, R. P. et al. Systematic pathway analysis using high-resolution fitness profiling of combinatorial gene deletions. Nature Genet. 39, 199–206 (2007). An example of using genetic interactions to order pathways involved in DNA damage.
Schuldiner, M. et al. Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell 123, 507–519 (2005).
Jana, S. Simulation of quantitative characters from qualitatively acting genes. Theor. Appl. Genet. 42, 119–124 (1972).
Punnett, R. C. Mendelism (Macmillan, New York, 1913).
Tong, A. H. et al. Systematic genetic analysis with ordered arrays of yeast deletion mutants. Science 294, 2364–2368 (2001).
Ye, P. et al. Gene function prediction from congruent synthetic lethal interactions in yeast. Mol. Syst. Biol. 1, 2005.0026 (2005).
Bader, G. D. & Hogue, C. W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 4, 2 (2003).
Yu, H., Paccanaro, A., Trifonov, V. & Gerstein, M. Predicting interactions in protein networks by completing defective cliques. Bioinformatics 22, 823–829 (2006).
Gavin, A. C. et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141–147 (2002).
Goldberg, D. S. & Roth, F. P. Assessing experimentally derived interactions in a small world. Proc. Natl Acad. Sci. USA 100, 4372–4376 (2003).
Harbison, C. T. et al. Transcriptional regulatory code of a eukaryotic genome. Nature 431, 99–104 (2004). A large-scale analysis of the DNA binding patterns of most yeast transcription factors using ChIP–chip.
Workman, C. T. et al. A systems approach to mapping DNA damage response pathways. Science 312, 1054–1059 (2006). An example of the integration of physical ChIP–chip data with genetic knockout gene expression data to explore pathways involved in DNA damage.
Iyer, V. R. et al. Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature 409, 533–538 (2001).
Chiu, R. et al. The c-Fos protein interacts with c-Jun/AP-1 to stimulate transcription of AP-1 responsive genes. Cell 54, 541–552 (1988).
Vermeirssen, V. et al. Transcription factor modularity in a gene-centered C. elegans core neuronal protein–DNA interaction network. Genome Res. 17, 1061–1071 (2007).
Ideker, T. et al. Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. Science 292, 929–934 (2001).
Yeang, C. H. et al. Validation and refinement of gene-regulatory pathways on a network of physical interactions. Genome Biol. 6, R62 (2005).
Hughes, T. R. et al. Functional discovery via a compendium of expression profiles. Cell 102, 109–126 (2000).
Hu, Z., Killion, P. J. & Iyer, V. R. Genetic reconstruction of a functional transcriptional regulatory network. Nature Genet. 39, 683–687 (2007).
Deplancke, B. et al. A gene-centered C. elegans protein–DNA interaction network. Cell 125, 1193–1205 (2006).
Tu, Z., Wang, L., Arbeitman, M. N., Chen, T. & Sun, F. An integrative approach for causal gene identification and gene regulatory pathway inference. Bioinformatics 22, e489–e496 (2006).
Ott, J. Analysis of Human Genetic Linkage (Johns Hopkins Univ. Press, Baltimore, 1999).
Zhao, R. et al. Navigating the chaperone network: an integrative map of physical and genetic interactions mediated by the Hsp90 chaperone. Cell 120, 715–727 (2005).
Tewari, M. et al. Systematic interactome mapping and genetic perturbation analysis of a C. elegans TGFβ signaling network. Mol. Cell 13, 469–482 (2004).
Pan, X. et al. A DNA integrity network in the yeast Saccharomyces cerevisiae. Cell 124, 1069–1081 (2006).
Zhang, L. V. et al. Motifs, themes and thematic maps of an integrated Saccharomyces cerevisiae interaction network. J. Biol. 4, 6 (2005).
Nguyen, D. H. & D'Haeseleer, P. Deciphering principles of transcription regulation in eukaryotic genomes. Mol. Syst. Biol. 2, 2006.0012 (2006).
Yeang, C. H., Ideker, T. & Jaakkola, T. Physical network models. J. Comput. Biol. 11, 243–262 (2004).
Klipp, E. & Liebermeister, W. Mathematical modeling of intracellular signaling pathways. BMC Neurosci. 7, S10 (2006).
Klipp, E., Nordlander, B., Kruger, R., Gennemark, P. & Hohmann, S. Integrative model of the response of yeast to osmotic shock. Nature Biotechnol. 23, 975–982 (2005).
Roberts, L., Davenport, R. J., Pennisi, E. & Marshall, E. A history of the Human Genome Project. Science 291, 1195 (2001).
Schadt, E. E. & Lum, P. Y. Thematic review series: systems biology approaches to metabolic and cardiovascular disorders. Reverse engineering gene networks to identify key drivers of complex disease phenotypes. J. Lipid Res. 47, 2601–2613 (2006).
Lage, K. et al. A human phenome-interactome network of protein complexes implicated in genetic disorders. Nature Biotechnol. 25, 309–316 (2007). The first study to explain disease phenotypes by genome-wide mapping of genetic loci onto a human interaction network.
Bourgain, C., Genin, E., Cox, N. & Clerget-Darpoux, F. Are genome-wide association studies all that we need to dissect the genetic component of complex human diseases? Eur. J. Hum. Genet. 15, 260–263 (2007).
Williams, S. M. et al. Problems with genome-wide association studies. Science 316, 1840–1842 (2007).
Mathews, C. K. The cell: bag of enzymes or network of channels? J. Bacteriol. 175, 6377–6381 (1993).
Srere, P. A. Complexes of sequential metabolic enzymes. Annu. Rev. Biochem. 56, 89–124 (1987).
Pollack, G. Cells, Gels and the Engines of Life (Ebner & Sons, Seattle, 2001).
Pinney, J. W., Westhead, D. R. & McConkey, G. A. Petri Net representations in systems biology. Biochem. Soc. Trans. 31, 1513–1515 (2003).
Ito, T. et al. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl Acad. Sci. USA 98, 4569–4574 (2001).
Li, S. et al. A map of the interactome network of the metazoan C. elegans. Science 303, 540–543 (2004).
Rual, J. F. et al. Towards a proteome-scale map of the human protein–protein interaction network. Nature 437, 1173–1178 (2005).
Stelzl, U. et al. A human protein–protein interaction network: a resource for annotating the proteome. Cell 122, 957–968 (2005).
Giot, L. et al. A protein interaction map of Drosophila melanogaster. Science 302, 1727–1736 (2003).
Suzuki, H. et al. Protein–protein interaction panel using mouse full-length cDNAs. Genome Res. 11, 1758–1765 (2001).
Uetz, P. et al. A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae. Nature 403, 623–627 (2000).
Gavin, A. C. et al. Proteome survey reveals modularity of the yeast cell machinery. Nature 440, 631–636 (2006).
Krogan, N. J. et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440, 637–643 (2006).
Pokholok, D. K. et al. Genome-wide map of nucleosome acetylation and methylation in yeast. Cell 122, 517–527 (2005).
Ren, B. et al. Genome-wide location and function of DNA binding proteins. Science 290, 2306–2309 (2000).
Loh, Y. H. et al. The OCT4 and NANOG transcription network regulates pluripotency in mouse embryonic stem cells. Nature Genet. 38, 431–440 (2006).
Wei, C. L. et al. A global map of p53 transcription-factor binding sites in the human genome. Cell 124, 207–219 (2006).
van Steensel, B. & Henikoff, S. Identification of in vivo DNA targets of chromatin proteins using tethered dam methyltransferase. Nature Biotechnol. 18, 424–428 (2000).
Deplancke, B., Dupuy, D., Vidal, M. & Walhout, A. J. A gateway-compatible yeast one-hybrid system. Genome Res. 14, 2093–2101 (2004).
Walhout, A. J. Unraveling transcription regulatory networks by protein–DNA and protein–protein interaction mapping. Genome Res. 16, 1445–1454 (2006).
Berger, M. F. et al. Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities. Nature Biotechnol. 24, 1429–1435 (2006).
Ooi, S. L., Shoemaker, D. D. & Boeke, J. D. DNA helicase gene interaction network defined using synthetic lethality analyzed by microarray. Nature Genet. 35, 277–286 (2003).
Lehner, B., Crombie, C., Tischler, J., Fortunato, A. & Fraser, A. G. Systematic mapping of genetic interactions in Caenorhabditis elegans identifies common modifiers of diverse signaling pathways. Nature Genet. 38, 896–903 (2006).
Lehner, B., Tischler, J. & Fraser, A. G. RNAi screens in Caenorhabditis elegans in a 96-well liquid format and their application to the systematic identification of genetic interactions. Nature Protoc. 1, 1617–1620 (2006).
Sahin, O. et al. Combinatorial RNAi for quantitative protein network analysis. Proc. Natl Acad. Sci. USA 104, 6579–6584 (2007).
Brem, R. B., Storey, J. D., Whittle, J. & Kruglyak, L. Genetic interactions between polymorphisms that affect gene expression in yeast. Nature 436, 701–703 (2005). A pioneering eQTL paper linking genetic variation in yeast to gene expression as a quantitative trait.
Bao, L. et al. Combining gene expression QTL mapping and phenotypic spectrum analysis to uncover gene regulatory relationships. Mamm. Genome 17, 575–583 (2006).
Chesler, E. J., Lu, L., Wang, J., Williams, R. W. & Manly, K. F. WebQTL: rapid exploratory analysis of gene expression and genetic networks for brain and behavior. Nature Neurosci. 7, 485–486 (2004).
Petretto, E. et al. Heritability and tissue specificity of expression quantitative trait loci. PLoS Genet. 2, e172 (2006).
Schadt, E. E. et al. Genetics of gene expression surveyed in maize, mouse and man. Nature 422, 297–302 (2003).
Phillips, P. C. The language of gene interaction. Genetics 149, 1167–1171 (1998).
Phillips, P. C., Otto, S. P. & Whitlock, M. C. Beyond the Average: the Evolutionary Importance of Gene Interactions and Variability of Epistatic Effects in Epistasis and the Evolutionary Process (Oxford Univ. Press, New York, 2000).
Segre, D., Deluna, A., Church, G. M. & Kishony, R. Modular epistasis in yeast metabolism. Nature Genet. 37, 77–83 (2005).
Tan, K., Shlomi, T., Feizi, H., Ideker, T. & Sharan, R. Transcriptional regulation of protein complexes within and across species. Proc. Natl Acad. Sci. USA 104, 1283–1288 (2007).
Carter, G. W. et al. Prediction of phenotype and gene expression for combinations of mutations. Mol. Syst. Biol. 3, 96 (2007).
Carter, G. W., Rupp, S., Fink, G. R. & Galitski, T. Disentangling information flow in the Ras-cAMP signaling network. Genome Res. 16, 520–526 (2006).
Keene, J. D., Komisarow, J. M. & Friedersdorf, M. B. RIP–Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts. Nature Protoc. 1, 302–307 (2006).
Barrios-Rodiles, M. et al. High-throughput mapping of a dynamic signaling network in mammalian cells. Science 307, 1621–1625 (2005).
Liu, T., Lin, Y., Wen, X., Jorissen, R. N. & Gilson, M. K. BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities. Nucleic Acids Res. 35, D198–D201 (2007).
Sethupathy, P., Megraw, M. & Hatzigeorgiou, A. G. A guide through present computational approaches for the identification of mammalian microRNA targets. Nature Methods 3, 881–886 (2006).
Schadt, E. E. et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nature Genet. 37, 710–717 (2005).
Eisen, M. B., Spellman, P. T., Brown, P. O. & Botstein, D. Cluster analysis and display of genome-wide expression patterns. Proc. Natl Acad. Sci. USA 95, 14863–14868 (1998).
Dekker, J., Rippe, K., Dekker, M. & Kleckner, N. Capturing chromosome conformation. Science 295, 1306–1311 (2002).
Dostie, J. et al. Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. Genome Res. 16, 1299–1309 (2006).
Giaever, G. et al. Chemogenomic profiling: identifying the functional interactions of small molecules in yeast. Proc. Natl Acad. Sci. USA 101, 793–798 (2004).
Acknowledgements
This work was supported by the US National Institutes of Environmental Health Sciences grant ES014811. T.I. is a David and Lucille Packard Fellow.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Related links
Glossary
- Radiation hybrid mapping
-
High-resolution mapping of human markers using X-ray exposure to fragment human chromosomes and fusing the irradiated cells with rodent cells. The frequency of co-occurrence of markers on the same fragment relates to their genomic distance.
- Fluorescence in situ hybridization
-
Fluorescently labelled DNA probes are hybridized to chromosomal DNA. This allows genes (probes) to be assigned to chromosomes and provides a rough estimate of the chromosomal position of the cloned fragment.
- Reverse-genetic screening
-
Identifying the mutant phenotype(s) associated with a known genetic mutation or a panel of known mutations, such as a gene-deletion library. This term contrasts with forward-genetic screening, which involves identifying the mutations that affect a given phenotype.
- Regression
-
A statistical method for predicting a dependent variable on the basis of one or more independent variables.
- Likelihood function
-
A statistical method for predicting the likelihood of an outcome that is conditional (dependent) on other evidence.
- Petri network
-
A modelling approach that depicts a process on a bipartite graph. Nodes are either places or transitions that are connected by directed arcs. Tokens are transmitted from places to transitions or from transitions to places.
Rights and permissions
About this article
Cite this article
Beyer, A., Bandyopadhyay, S. & Ideker, T. Integrating physical and genetic maps: from genomes to interaction networks. Nat Rev Genet 8, 699–710 (2007). https://doi.org/10.1038/nrg2144
Issue Date:
DOI: https://doi.org/10.1038/nrg2144
This article is cited by
-
BRCA1 and S phase DNA repair pathways restrict LINE-1 retrotransposition in human cells
Nature Structural & Molecular Biology (2020)
-
PanoromiX: a time-course network medicine platform integrating molecular assays and pathophenotypic data
BMC Bioinformatics (2018)
-
Integrative network analyses of wilt transcriptome in chickpea reveal genotype dependent regulatory hubs in immunity and susceptibility
Scientific Reports (2018)
-
A Systems Perspective of Signalling Networks in Host–Pathogen Interactions
Journal of the Indian Institute of Science (2017)
-
Integrative analysis of cancer genes in a functional interactome
Scientific Reports (2016)