Synopsis

Subject Categories: Molecular Biology of Disease

Molecular Systems Biology 5 Article number: 321  doi:10.1038/msb.2009.80
Published online: 3 November 2009
Citation: Molecular Systems Biology 5:321

Edgetic perturbation models of human inherited disorders

There is a News and Views associated with this document.

Quan Zhong1,2,a, Nicolas Simonis1,2,a, Qian-Ru Li1,2,a, Benoit Charloteaux1,2,3,a, Fabien Heuze1,2,3,a, Niels Klitgord1,2,a, Stanley Tam1,2, Haiyuan Yu1,2, Kavitha Venkatesan1,2, Danny Mou1,2, Venus Swearingen1,2, Muhammed A Yildirim1,2, Han Yan1,2, Amélie Dricot1,2, David Szeto1,2, Chenwei Lin1,2, Tong Hao1,2, Changyu Fan1,2, Stuart Milstein1,2, Denis Dupuy1,2, Robert Brasseur3, David E Hill1,2, Michael E Cusick1,2 & Marc Vidal1,2

  1. Center for Cancer Systems Biology (CCSB) and Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA
  2. Department of Genetics, Harvard Medical School, Boston, MA, USA
  3. Centre de Biophysique Moléculaire Numérique, Faculté Universitaire des Sciences Agronomiques de Gembloux, Gembloux, Wallonia, Belgium

Correspondence to: Marc Vidal1,2 Center for Cancer Systems Biology, Department of Cancer Biology, 44 Binney Street, Smith 858, Boston, MA 02115, USA. Tel.: +1 617 632 5180; Fax: +1 617 632 5739; Email: marc_vidal@dfci.harvard.edu

Received 5 August 2009; Accepted 2 October 2009; Published online 3 November 2009

aThese authors contributed equally to this work

Top

Article highlights

  • We present alternative models to explain molecular dysfunctions underlying human inherited disorders based on interaction-specific or "edgetic" perturbations rather than complete loss of gene products.
  • We find that a substantial fraction of known genetic variants in human Mendelian disorders likely cause edgetic perturbations.
  • We find frequent situations where edgetic perturbation models can explain how different mutations in a single gene can cause distinct disorders.
  • Edgetic perturbation models should provide alternative explanations to complex genotype-to-phenotype relationships

Top

Synopsis

Genotype-to-phenotype relationships in human genetic disease are often modeled as: 'mutation in gene X leads to loss of gene product X, which leads to disease A'. However, single 'gene-loss' models cannot explain the increasingly appreciated prevalence of complex genotype-to-phenotype relationships, particularly with instances of allelic or locus hetrogeneity (Goh et al, 2007).

Genes and gene products function not in isolation but as components of complex networks of macromolecules (DNA, RNA, or proteins) and metabolites linked through biochemical or physical interactions, often represented in 'interactome' network models as 'nodes' and 'edges', respectively. Here we use network perturbation models to explain molecular dysfunctions underlying human disease in addition to the gene-loss model.

We hypothesize that different mutations leading to different molecular defects to proteins may cause distinct perturbations of cellular networks, giving rise to distinct phenotypic outcomes (Figure 1). For example, truncations close to the start of an open-reading frame, or mutations that grossly destabilize a protein structure, can be modeled as removing a protein node from the network ('node removal'). Alternatively, single amino-acid substitutions that affect specific binding sites, or truncations that preserve certain domains of a protein, may give rise to partially functional gene products with specific changes in distinct molecular interaction(s) (edge-specific or 'edgetic' perturbations) (Figure 1B).

Figure 1
Figure 1 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

Node removal versus edgetic perturbation models of network changes underlying phenotypic alterations. (A) Schematic illustration of pleiotropic phenotypic outcomes resulting from distinct network perturbations upon complete loss of gene product (node removal, blue box) versus perturbation of specific molecular interactions (edgetic perturbation, red box). Solid lines between two nodes represent preserved interactions and dashed lines represent perturbed interactions. Edges are generally biophysical interactions, but could also be biochemical interactions. (B) Schematic illustration of distinct 'truncating' versus 'in-frame' mutations causing distinct molecular defects in proteins leading to distinct node removal versus edgetic perturbation.

Full figure and legend (251K)Figures & Tables index

Taking advantage of the large number of known disease-causing allelic variations in human Mendelian disorders, we investigated how disease-associated mutations may cause complete loss of gene products or, alternatively, may cause specific loss or gain of individual molecular interaction(s). We examined approx50 000 Mendelian disease-causing alleles, affecting over 1900 protein-coding genes, altogether associated with more than 2000 human disorders available in the Human Gene Mutation Database (HGMD) (Stenson et al, 2003), that can be subdivided into two subsets: truncating' alleles (truncations or frameshifts caused by stop codons, out-of-frame insertions or deletions, or defective splicing) versus 'in-frame' alleles (missense mutations and in-frame insertions or deletions). Over 50% (27 919/52 491) of Mendelian alleles in HGMD correspond to 'in-frame' mutations. Our hypothesis is that, 'in-frame' alleles may affect specific interactions of a given gene product while leaving most other interactions unperturbed.

Although exceptions may apply, our hypothesis has several predictions. First, 'truncating' versus 'in-frame' alleles may distribute differently among autosomal dominant and autosomal recessive disease, given that dominant mutations are more likely to be edgetic than recessive ones. Indeed, autosomal dominant and autosomal recessive traits annotated in the Online Mendelian Inheritance in Man (OMIM) database (Hamosh et al, 2005) show a clear separation with respect to the associated 'in-frame' versus 'truncating' mutations. Among genes affected solely by 'in-frame' mutations, the proportion of dominant diseases is approx10-fold higher than that of recessive ones, supporting 'in-frame' mutations causing distinct molecular defects as opposed to 'truncating' mutations.

A proof-of-principle characterization of binary protein interaction defects of mutant alleles associated with five genetic disorders supports our hypothesis that 'in-frame' alleles indeed produce mostly functional proteins, preserving many specific protein interactions. As grossly disruptive mutations versus mutations leading to loss or gain of specific interaction(s) probably distribute differently on protein structures, we examined available three-dimensional structures of all disease proteins. Mutated residues in autosomal dominant disease are significantly more exposed to the surface of the structure than those in autosomal recessive disease, consistent with the idea that disease with distinct modes of inheritance probably involves distinct network perturbations.

A second testable prediction of our edgetic perturbation model is that edgetic perturbation versus gene loss for a given gene product might in some cases cause different diseases. We examined 142 genes associated with two or more distinct diseases in which at least five distinct alleles have been reported for each disease. We found approx30% of the cases for which distribution of 'in-frame' versus 'truncating' mutations is significantly different between the two diseases linked to the same gene (P<0.05). Hence, when affecting the same gene, node removal versus edgetic perturbation can confer strikingly different phenotypes.

A third testable prediction is that different edgetic perturbations for a given gene product might cause phenotypically distinguishable diseases (Figure 6). We used predicted Pfam domains (Finn et al, 2006) as surrogates for functional interaction domains, assuming that 'in-frame' mutations located in distinct Pfam domain-encoding sequences probably alter distinct interactions. Among 169 genes associated with two or more diseases and encoding proteins containing at least two Pfam domains, nine proteins have at least two Pfam domains significantly enriched with 'in-frame' mutations (P<0.05). For each of the nine proteins, we found a striking pattern of near mutual exclusivity, whereby different Pfam domains seem to be specifically affected in distinct disorders (Figure 6B).

Figure 6
Figure 6 :  Unfortunately we are unable to provide accessible alternative text for this. If you require assistance to access this image, or to obtain a text description, please contact npg@nature.com

Distinct edgetic perturbations underlying pleiotropy. (A) Schematic illustration of distinct 'in-frame' alleles in a single gene product causing distinct network perturbations giving rise to distinct disorders. (B) Enrichment of 'in-frame' mutations causing different disorders in different Pfam domains. Color intensity of Pfam domains represents fold enrichment of each disease associated 'in-frame' mutations (P<0.05). Vertical lines below corresponding Pfam domains mark disease-causing 'in-frame' mutations in TP63.

Full figure and legend (453K)Figures & Tables index

We conclude that edgetic alleles probably underlie many complex genotype-to-phenotype relationships in human disease, such as incomplete penetrance or variable expressivity, as well as allele-specific phenotypic variations among patients. Edgetic perturbation of human inherited disorders might help explain how seemingly devastating alleles have appeared and persevered in human populations.

Top

Acknowledgements

We thank all members of the Vidal Lab and the Center for Cancer Systems Biology (CCSB), Dr Patricia K Donahoe and Dr Roseann Mulloy for helpful suggestions; Ines M Pinto for help with experiments. This study was supported by the Ellison Foundation and the WM Keck Foundation (MV), NIH grants R01-HG001715 from NHGRI (MV and F Roth), U01-CA105423 (PI, S Orkin, project leader, MV), U54-CA112952 (PI, J Nevins, subcontract, MV) and R33-CA132073 (MV) from NCI, and by Institute Sponsored Research funds from the Dana-Farber Cancer Institute Strategic Initiative awarded to CCSB. KV was supported by an NIH NRSA training grant fellowship (T32-CA09361). BC was supported by the Belgian Program on Interuniversity Attraction Poles initiated by the Federal Office for Scientific, Technical and Cultural Affairs (IAP P6/19 PROFUSA). MV and RB are 'Honorary Research Associate' and 'Research Director' from the Fonds de la Recherche Scientifique (FRS-FNRS, French Community of Belgium), respectively.

Top

References

  1. Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, Eddy SR, Sonnhammer EL, Bateman A (2006) Pfam: clans, web tools and services. Nucleic Acids Res 34: D247–D251 | Article | PubMed | ISI | ChemPort |
  2. Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabasi AL (2007) The human disease network. Proc Natl Acad Sci USA 104: 8685–8690 | Article | PubMed | ADS | ChemPort |
  3. Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA (2005) Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res 33: D514–D517 | Article | PubMed | ISI | ChemPort |
  4. Stenson PD, Ball EV, Mort M, Phillips AD, Shiel JA, Thomas NS, Abeysinghe S, Krawczak M, Cooper DN (2003) Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat 21: 577–581 | Article | PubMed | ISI | ChemPort |

Extra navigation

.
ADVERTISEMENT