Abstract
Biological networks can be used to functionally annotate genes on the basis of interaction-profile similarities. Metrics known as association indices can be used to quantify interaction-profile similarity. We provide an overview of commonly used association indices, including the Jaccard index and the Pearson correlation coefficient, and compare their performance in different types of analyses of biological networks. We introduce the Guide for Association Index for Networks (GAIN), a web tool for calculating and comparing interaction-profile similarities and defining modules of genes with similar profiles.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Change history
27 January 2014
In the version of this article initially published, the formula describing the connection specificity index (CSI) in Box 2 was incorrect. The denominator in the fraction of the CSI equation originally read "ny"; the correct denominator is "# of X-type nodes in the network." The error has been corrected in the HTML and PDF versions of the article.
References
Walhout, A.J.M. et al. Protein interaction mapping in C. elegans using proteins involved in vulval development. Science 287, 116–122 (2000).
Schwikowski, B., Uetz, P. & Fields, S. A network of protein-protein interactions in yeast. Nat. Biotechnol. 18, 1257–1261 (2000).
Costanzo, M. et al. The genetic landscape of a cell. Science 327, 425–431 (2010).
Harbison, C.T. et al. Transcriptional regulatory code of a eukaryotic genome. Nature 431, 99–104 (2004).
Walhout, A.J.M. Unraveling transcription regulatory networks by protein-DNA and protein-protein interaction mapping. Genome Res. 16, 1445–1454 (2006).
Reece-Hoyes, J.S. et al. Extensive rewiring and complex evolutionary dynamics in a C. elegans multiparameter transcription factor network. Mol. Cell 51, 116–127 (2013).
Bordbar, A. & Palsson, B.O. Using the reconstructed genome-scale human metabolic network to study physiology and pathology. J. Intern. Med. 271, 131–141 (2012).
Watson, E., MacNeil, L.T., Arda, H.E., Zhu, L.J. & Walhout, A.J.M. Integration of metabolic and gene regulatory networks modulates the C. elegans dietary response. Cell 153, 253–266 (2013).
Green, R.A. et al. A high-resolution C. elegans essential gene network based on phenotypic profiling of a complex tissue. Cell 145, 470–482 (2011).
Su, A.I. et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl. Acad. Sci. USA 101, 6062–6067 (2004).
Fowlkes, C.C. et al. A quantitative spatiotemporal atlas of gene expression in the Drosophila blastoderm. Cell 133, 364–374 (2008).
Martinez, N.J., Ow, M.C., Reece-Hoyes, J., Ambros, V. & Walhout, A.J. Genome-scale spatiotemporal analysis of Caenorhabditis elegans microRNA promoter activity. Genome Res. 18, 2005–2015 (2008).
Grove, C.A. et al. A multiparameter network reveals extensive divergence between C. elegans bHLH transcription factors. Cell 138, 314–327 (2009).
Ritter, A.D. et al. Complex expression dynamics and robustness in C. elegans insulin networks. Genome Res. 23, 954–965 (2013).
Lee, I., Blom, U.M., Wang, P.I., Shim, J.E. & Marcotte, E.M. Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Res. 21, 1109–1121 (2011).
Ravasz, E., Somera, A.L., Mongru, D.A., Oltvai, Z.N. & Barabasi, A.L. Hierarchical organization of modularity in metabolic networks. Science 297, 1551–1555 (2002).
Spirin, V. & Mirny, L.A. Protein complexes and functional modules in molecular networks. Proc. Natl. Acad. Sci. USA 100, 12123–12128 (2003).
Hayek, L.-A.C. in Measuring and Monitoring Biological Diversity: Standard Methods for Amphibians. (ed. Heyer, W.R.) Ch. 9, 207–269 (Smithsonian Institution, Washington, DC, 1994).
Goldberg, D.S. & Roth, F.P. Assessing experimentally derived interactions in a small world. Proc. Natl. Acad. Sci. USA 100, 4372–4376 (2003).
Gunsalus, K.C. et al. Predictive models of molecular machines involved in Caenorhabditis elegans early embryogenesis. Nature 436, 861–865 (2005).
Langfelder, P., Luo, R., Oldham, M.C. & Horvath, S. Is my network module preserved and reproducible? PLoS Comput. Biol. 7, e1001057 (2011).
Huttenhower, C., Hibbs, M., Myers, C. & Troyanskaya, O.G. A scalable method for integration and functional analysis of multiple microarray datasets. Bioinformatics 22, 2890–2897 (2006).
Acknowledgements
We thank members of A.J.M.W.'s lab, R. McCord and B. Lajoie for discussions and critical reading of the manuscript. We thank J.C. Bare (Institute of Systems Biology) for helpful advice in the development of GAIN. This work was supported by the US National Institutes of Health grants DK068429 and GM082971 to A.J.M.W. J.I.F.B. is partially supported by a postdoctoral fellowship from the Pew Latin American Fellows Program. J.N. and C.L.M. are partially supported by grant DBI-0953881 from the US National Science Foundation.
Author information
Authors and Affiliations
Contributions
J.I.F.B. and A.J.M.W. conceived the project; J.I.F.B. performed the data analysis with the assistance of A.D., J.N. and C.L.M.; A.D. and J.I.F.B. developed the GAIN web tool in collaboration with J.N., C.L.M. and J.M.S.; J.I.F.B. and A.J.M.W. wrote the paper.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–3, Supplementary Table 1 and Supplementary Methods (PDF 948 kb)
Rights and permissions
About this article
Cite this article
Bass, J., Diallo, A., Nelson, J. et al. Using networks to measure similarity between genes: association index selection. Nat Methods 10, 1169–1176 (2013). https://doi.org/10.1038/nmeth.2728
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.2728
This article is cited by
-
Integrated modeling framework reveals co-regulation of transcription factors, miRNAs and lncRNAs on cardiac developmental dynamics
Stem Cell Research & Therapy (2023)
-
Single-cell RNA sequencing analysis of the temporomandibular joint condyle in 3 and 4-month-old human embryos
Cell & Bioscience (2023)
-
Constructing gene similarity networks using co-occurrence probabilities
BMC Genomics (2023)
-
Three-dimensional molecular architecture of mouse organogenesis
Nature Communications (2023)
-
Stage-specific coexpression network analysis of Myc in cohorts of renal cancer
Scientific Reports (2023)