Estimating dispensable content in the human interactome

Ghadie, Mohamed; Xia, Yu

doi:10.1038/s41467-019-11180-2

Download PDF

Article
Open access
Published: 19 July 2019

Estimating dispensable content in the human interactome

Nature Communications volume 10, Article number: 3205 (2019) Cite this article

3351 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Protein-protein interaction (PPI) networks (interactome networks) have successfully advanced our knowledge of molecular function, disease and evolution. While much progress has been made in quantifying errors and biases in experimental PPI datasets, it remains unknown what fraction of the error-free PPIs in the cell are completely dispensable, i.e., effectively neutral upon disruption. Here, we estimate dispensable content in the human interactome by calculating the fractions of PPIs disrupted by neutral and non-neutral mutations. Starting with the human reference interactome determined by experiments, we construct a human structural interactome by building homology-based three-dimensional structural models for PPIs. Next, we map common mutations from healthy individuals as well as Mendelian disease-causing mutations onto the human structural interactome, and perform structure-based calculations of how these mutations perturb the interactome. Using our predicted as well as experimentally-determined interactome perturbation patterns by common and disease mutations, we estimate that <~20% of the human interactome is completely dispensable.

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

An open source knowledge graph ecosystem for the life sciences

Article Open access 11 April 2024

Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations

Article Open access 09 April 2024

Introduction

Protein–protein interactions (PPIs) are a central type of molecular interactions in the cell which collectively form the interactome network. Significant progress has been made toward mapping interactome networks for several species including human^1,2. These networks have been highly successful in providing insights into molecular function^3,4, disease^1,5,6,7,8, and evolution^{9,10,11,12,13}. While much work has been done in quantifying errors and biases in experimental PPI datasets^14,15,16, it remains unknown what fraction of the error-free PPIs in the cell are completely dispensable, i.e., effectively neutral upon disruption. Unlike erroneous PPIs which are experimental false positives, completely dispensable PPIs are true physical interactions which may or may not be associated with well-defined molecular functions. Furthermore, we draw a clear distinction between completely dispensable PPIs and non-specific PPIs. Non-specific PPIs have been used in the literature to describe non-stereospecific interactions or transient interactions that may well be crucial to cellular function^17,18, interactions that involve promiscuous binding of a protein to many partners¹⁹, or interactions that may have no function²⁰. While the last definition of non-specific PPIs comes close to our definition of completely dispensable PPIs, the first two definitions of non-specific PPIs are very different from our definition of completely dispensable PPIs. The unique defining feature of completely dispensable PPIs is that they do not measurably affect organismal fitness upon disruption.

The question of dispensable content in interactome networks is of utmost importance to cell systems biology, with widely diverging opinions in the literature. On the one hand, current systems biology studies using interactome networks to understand human disease depend crucially on the assumption that completely dispensable PPIs do not dominate the human interactome^1,21,22. On the other hand, the existence of completely dispensable PPIs is well-anticipated by molecular evolution and population genetics theory^23,24, as well as strongly supported by empirical analysis of genome-wide data^25,26,27. PPIs that are completely dispensable are introduced into and maintained in the interactome by non-adaptive processes, when purifying selection is not strong enough to maintain the perfect interactome in the presence of mutation and genetic drift, especially in species with small population sizes^23,24,27. Completely dispensable PPIs can lead to increasing robustness of the interactome network against mutations in that their elimination by mutations does not induce any measurable change in organismal fitness^8,13,28,29. This type of interactome robustness against mutations, which is unique to completely dispensable PPIs, is distinct from another type of interactome robustness against mutations, where PPIs in the interactome network are preserved in the presence of mutations at the binding interface^11,13,30. Because completely dispensable PPIs are effectively neutral upon disruption, they tend to evolve more quickly than other PPIs²⁵. Given the estimate that much of the human genome may be “junk” DNA under neutral evolution^31,32, it is possible that a large fraction of the human interactome may be “junk” interactions as well^16,23,24,25.

Here, in an effort to resolve the long-standing debate over completely dispensable contents in interactome networks, we estimate the overall fraction of PPIs in the human interactome that are effectively neutral upon disruption by mutation. Starting with a high-quality, experimentally determined human reference interactome, we construct a human structural interactome by building three-dimensional (3D) structural models for known human PPIs and annotating PPI interfaces at the residue level using template-based homology modeling. Similar structural network biology approaches have been previously used to provide insights into protein function, disease, and evolution^{33,34,35,36,37,38,39,40,41}. Next, we map common mutations from healthy individuals as well as Mendelian disease-causing mutations onto the human structural interactome, and perform structure-based prediction of the edgotype⁴² for each mutation, i.e., the precise pattern of interactome perturbation as the result of each mutation. We integrate these results to calculate the probabilities for common mutations (assumed to be neutral) and disease-causing mutations (assumed to be mildly deleterious) to disrupt human PPIs, and then apply Bayes’ theorem to calculate the probabilities for human PPIs to be neutral or non-neutral upon disruption¹³. Our calculations reveal that overall <~20% of the human interactome is completely dispensable, i.e., effectively neutral upon disruption. Finally, instead of using computationally predicted edgotypes for mutations, we repeat our calculations using experimentally determined edgotypes for mutations⁸. Our dispensable PPI estimate remains broadly consistent despite minimal overlap in protein space covered by computational and experimental edgotyping data.

Results

Construction of the human structural interactome

We started with two high-quality, experimentally determined human reference interactomes: the HI-II-14 interactome⁴³ consisting of PPIs identified in yeast two-hybrid (Y2H) screens, and the IntAct interactome consisting of PPIs reported in the IntAct database⁴⁴ by at least two independent experiments in the literature. From each of the two human reference interactomes, we constructed a human structural interactome by building 3D structural models for known PPIs via homology modeling, using experimentally determined PPI structural templates in the Protein Data Bank (PDB)⁴⁵ (Fig. 1). As a result, we obtained two high-resolution human structural interactomes: the HI-II-14 structural interactome (Y2H-SI) consisting of 486 PPIs among 573 proteins with their binding interfaces resolved at the residue level (Supplementary Data 1a and 2a), and the IntAct structural interactome (IntAct-SI) consisting of 3333 PPIs among 2654 proteins with their binding interfaces resolved at the residue level (Supplementary Data 1b and 2b). The high quality of our structurally annotated interactomes is confirmed by high functional similarity and tissue co-expression among interacting proteins (Supplementary Fig. 1a–d). Our structural interactomes are modeled from a diverse set of PDB structures (Supplementary Fig. 1e), with >40 residues on average mapped to an interface per protein (Supplementary Fig. 1f).

Geometry-based prediction of mutation edgotypes

We mapped 3705 Mendelian disease-causing missense mutations from ClinVar⁴⁶ and 28,788 common missense mutations not associated with disease from dbSNP⁴⁷ onto our two structural interactomes: Y2H-SI and IntAct-SI (Fig. 1). Overall, Y2H-SI carries 145 disease mutations and 376 non-disease mutations (Supplementary Data 3a, b), and IntAct-SI carries 908 disease mutations and 2394 non-disease mutations (Supplementary Data 3c, d). These mutations span a significant part of the human structural interactome, covering ~32% of proteins in Y2H-SI and ~41% of proteins in IntAct-SI.

Next, we used the structural interactome to perform geometry-based prediction of the edgotype for each mutation, i.e., the precise pattern of interactome perturbation as the result of each mutation. Mutations can be either edgetic (i.e., disrupt specific PPIs by disrupting binding interfaces), quasi-null (i.e., disrupting all PPIs by disrupting overall protein stability), or quasi-wildtype (i.e., do not disrupt any PPIs)⁸. We predict that a mutation edgetically disrupts a PPI if and only if the mutation occurs on the interface mediating that PPI (Figs. 1 and 2a). In Y2H-SI, we predicted 5.1% (19 out of 376) of non-disease mutations to be edgetic and 18.6% (27 out of 145) of disease mutations to be edgetic (Fig. 2b; Supplementary Data 3a, b). In IntAct-SI, we predicted 6.9% (164 out of 2394) of non-disease mutations to be edgetic and 15.4% (140 out of 908) of disease mutations to be edgetic (Fig. 2b; Supplementary Data 3c, d). In comparison, in the experimental study of Sahni et al.⁸, it was found that 4.3% (2 out of 47) of non-disease mutations are edgetic and 31.5% (62 out of 197) of disease mutations are edgetic (Fig. 2b). Thus, our computational results are consistent with experimental results in that disease mutations are significantly more likely to be edgetic than non-disease mutations (p < 10⁻⁴ for all cases, two-sided Fisher’s exact test).

Geometry-based calculation of dispensable PPI content

We used the mutation edgotypes predicted above to estimate the fraction of PPIs in the human interactome that are completely dispensable, i.e, effectively neutral upon disruption, following the procedure we had previously developed¹³. We assume that mutations are either effectively neutral (similar to synonymous mutations), mildly deleterious, or strongly detrimental (similar to nonsense mutations that introduce premature stop codons). In addition, we assume that common mutations from healthy individuals are effectively neutral, that Mendelian disease-causing mutations are mildly deleterious on average, and that strongly detrimental mutations are quasi-null (i.e., disrupt overall protein stability) rather than edgetic.

Using our predicted mutation edgotypes in Y2H-SI from the previous section, we obtained the probabilities for effectively neutral (N), mildly deleterious (M), and strongly detrimental (S) mutations to be edgetic (E): P(E|N) = 5.1%, P(E|M) = 18.6%, P(E|S) = 0 (Fig. 2b). Furthermore, we obtained from Kryukov et al.⁴⁸ the probabilities for new missense mutations to be effectively neutral (N), mildly deleterious (M), or strongly detrimental (S): P(N) = 27%, P(M) = 53%, P(S) = 20%. We then integrated these numbers to calculate the probability for new missense mutations to be edgetic (E): P(E) = P(E|N)P(N)+P(E|M)P(M) + P(E|S)P(S) = 11.2%. Finally, using Bayes’ theorem P(A|B) = P(B|A)P(A)/P(B), we calculated the probability for edgetic mutations (E) to be effectively neutral (N): P(N|E) = P(E|N)P(N)/P(E) = 12.1%. Thus, given that most (54%) edgetic mutations disrupt one PPI in Y2H-SI, we estimated that ~12.1% of the human interactome is completely dispensable, i.e., effectively neutral upon disruption, with a 95% confidence interval of 7.4–19.4% (Fig. 3).

Next, we repeated the same calculation using our predicted mutation edgotypes in IntAct-SI from the previous section (Fig. 2b), and estimated that ~18.5% of the human interactome is completely dispensable, with a 95% confidence interval of 15.5–21.9% (Fig. 3). Finally, we repeated the same calculation using the experimental mutation edgotype data from Sahni et al.⁸ (Fig. 2b), and estimated that ~6.4% of the human interactome is completely dispensable, with a 95% confidence interval of 1.7–21.4% (Fig. 3). These three dispensable PPI content estimates obtained from predicted and experimental mutation edgotypes are broadly consistent with each other.

Physics-based calculation of dispensable PPI content

Our geometry-based mutation edgotype predictions described above assume that both mildly deleterious disease mutations (M) and effectively neutral non-disease mutations (N) located at a PPI interface disrupt that PPI with the same probability γ_M = γ_N = 100%. This assumption is inaccurate, because disease mutations and non-disease mutations impact PPI stability differently due to their different physicochemical properties on average. Indeed, when we calculated the substitution scores for all 28,788 non-disease missense mutations and 3705 disease missense mutations in human using the PAM30 substitution matrix, we found that disease mutations tend to have a lower substitution score than non-disease mutations (p < 10⁻⁶, two-sided bootstrap test with 1,000,000 resamplings; Fig. 4a), indicating that disease mutations tend to be more radical than non-disease mutations.

Hence, we performed physics-based calculation of γ_M and γ_N, the probabilities for disease and non-disease interfacial mutations to disrupt the corresponding PPI. We first focused on Y2H-SI. For each interfacial mutation, we calculated the change in binding free energy (ΔΔG) caused by that mutation from the PPI structural model using BindProfX⁴⁹, which has been shown to accurately reproduce experimental ΔΔG measurements⁴⁹. The PPI is considered disrupted by the mutation if and only if ΔΔG >0.5 kcal mol⁻¹. We performed this physics-based calculation on all interfacial mutations to obtain γ_M = 66% and γ_N = 60% (Fig. 4b; Supplementary Data 4a, b). Using these physics-based PPI perturbation predictions, we found 3.2% (12 out of 374) of non-disease mutations to be edgetic and 11.4% (16 out of 140) of disease mutations to be edgetic (Fig. 4d; Supplementary Data 5a, b), and we estimated that ~12.5% of the human interactome is completely dispensable, with a 95% confidence interval of 6.5–22.8% (Fig. 4e).

Next, we repeated the same physics-based calculation on IntAct-SI. We obtained γ_M = 77% and γ_N = 65% (Fig. 4c; Supplementary Data 4c, d). Using these physics-based PPI perturbation predictions, we found 4.4% (103 out of 2360) of non-disease mutations to be edgetic and 11.6% (104 out of 894) of disease mutations to be edgetic (Fig. 4d; Supplementary Data 5c, d), and we estimated that ~16% of the human interactome is completely dispensable, with a 95% confidence interval of 12.8–19.9% (Fig. 4e). These adjusted mutation edgotype predictions and corresponding dispensable PPI content estimates remain consistent with those obtained from mutation edgotype experiments⁸.

Discussion

Our estimates of dispensable PPI content were derived from PPI perturbation patterns (edgotypes) in diverse human interactome datasets (HI-II-14 and IntAct). These PPI perturbation patterns were obtained by computation as well as by experiment. Our computational predictions complement experimental data as they probe different subsets of the human protein space, with <7% of computational edgotyping data covered by experiments. Despite such minimal overlap in protein coverage, our dispensable PPI estimates are broadly consistent with one another (~13% from Y2H-SI, ~16% from IntAct-SI, and ~6% from experiment). Indeed, the 95% confidence intervals for all three estimates overlap below ~20%. Furthermore, the dispensable PPI content obtained using physics-based calculations on the combined network of Y2H-SI and IntAct-SI remains below ~20% (15.8% with a 95% confidence interval of 12.6–19.6%). Taking these results together, we conclude that up to ~20% of the human interactome is completely dispensable, i.e., effectively neutral upon disruption.

PPI datasets are known to contain experimental false positives (erroneous PPIs)^14,15,16. These include, among others, non-reproducible experimental artifacts, in vitro physical interactions that do not occur in vivo (more likely to occur in Y2H experiments), and pairs of proteins from the same complex that do not directly interact with each other (more likely to occur in affinity capture experiments). Our goal here is to focus on real PPIs that are free from these errors, and estimate the fraction of these error-free PPIs that are effectively neutral upon disruption. We used several methods to minimize such false positive errors. First, we started from experimentally determined PPIs, rather than computationally predicted PPIs. Second, we used the HI-II-14 dataset, which was subjected to multiple Y2H screens and other quality control measures, and is similar in quality to a gold-standard dataset of literature-derived PPIs^8,43. Third, for the IntAct dataset, we only considered high-quality PPIs reported by at least two independent experiments in the literature. Fourth, we further reduced false positive errors by focusing on those PPIs for which we can build homology models using experimentally determined 3D structural templates of interacting proteins in PDB.

Despite these efforts, it remains a possibility that the false positive rates of our structural interactome datasets are non-negligible. These erroneous PPIs do not physically occur in the cell with detectable phenotypic consequences, and hence they are typically unable to distinguish deleterious mutations from neutral mutations. Consequently, the error-free portion of the PPI dataset must distinguish deleterious mutations from neutral mutations better than the average performance of the entire PPI dataset. Since in our case, higher predictive power of PPIs for deleterious mutations leads to lower estimate of dispensable PPI content, the fraction of error-free PPIs that are completely dispensable will be even lower than calculated from the entire PPI dataset. Thus, our calculated ~20% completely dispensable content in the human interactome represents an upper bound in the presence of errors in PPI datasets.

Our structure-based mutation edgotype computations contain several potential biases and approximations. First, literature-derived PPIs are biased toward interactions with functional and disease importance. We address this bias by additionally examining systematic PPI datasets such as HI-II-14. Second, experimentally determined 3D structures of interacting proteins are biased toward PPIs with functional and disease importance. We partially address this bias by using homology models in addition to experimental 3D structures of PPIs. Third, our mutation edgotype predictions involve numerous approximations. We address this issue by complementing geometry-based calculations with physics-based calculations, and by using the well-known BindProfX⁴⁹ method that has been shown to accurately reproduce experimental measurements of binding free energy change upon mutation. In addition to the BindProfX method, we also repeated our physics-based calculations of dispensable PPI content using ΔΔG values calculated by another well-known method FoldX⁵⁰ (Supplementary Data 6a–d), which produces high-quality ΔΔG values when benchmarked using the gold-standard dataset of SKEMPI⁵¹ (Pearson correlation coefficient between predicted versus experimental ΔΔG is 0.50 for co-crystal structures, and 0.42 for homology models). In Y2H-SI, we found that 11.1% of the human interactome is completely dispensable, with a 95% confidence interval of 4.7–24.1%. In IntAct-SI, we found that 13% of the human interactome is completely dispensable, with a 95% confidence interval of 9.5–17.5%. These FoldX-based estimates of dispensable PPI content remain in broad agreement with our BindProfX-based estimates (12.5% in Y2H-SI, and 16% in IntAct-SI). Fourth, we compare our mutation edgotype computations with experiments. The experimental mutation edgoptyping data, while not perfect (low coverage, possible false positives, false negatives), are nonetheless not affected by any of the aforementioned biases and approximations present in our predictions. The broad agreement between computation and experiment indicates that our estimates are robust against these biases and approximations.

Our calculations of dispensable PPI content make the reasonable assumption that strongly detrimental mutations are quasi-null rather than edgetic. While it is difficult to calculate the precise probability for strongly detrimental mutations to be edgetic in the absence of genome-wide data, including such probability in our calculations will only further decrease our estimate of dispensable PPI content. This is because the fraction of PPIs effectively neutral upon disruption is inversely proportional to the overall fraction of missense mutations that are edgetic. Hence, including some strongly detrimental mutations as edgetic in our calculations will increase the overall fraction of missense mutations that are edgetic, resulting in a smaller estimate of dispensable PPI content.

Our calculations of dispensable PPI content assume that each edgetic mutation disrupts one PPI, which is true for most mutations in Y2H-SI (61%) and IntAct-SI (63%). We further repeated our calculations using physics-based mutation edgotype predictions in Y2H-SI and IntAct-SI, this time replacing the fractions of mutations that are edgetic for both disease and non-disease mutations by the fractions of mutations that are mono-edgetic, i.e., those that disrupt only one PPI. Applying our modified calculation to Y2H-SI, we estimated that ~14.5% of the human interactome is completely dispensable with a 95% confidence interval of 6.3–30.1%. Applying the same modified calculation to IntAct-SI, we estimated that ~21.3% of the human interactome is completely dispensable with a 95% confidence interval of 16.2–27.6%. These estimates remain very close to our previous estimates. A similar calculation on the experimental dataset of Sahni et al.⁸ is not possible, as there are only two non-disease mutations in the dataset that are edgetic, both of which disrupt multiple PPIs and none of which are mono-edgetic.

The most accurate way of calculating dispensable PPI content is to measure the fitness change of the cell by systematically deleting PPIs one at a time. In the absence of such experiments, our calculations offer the next best solution by examining phenotypic consequences of edgetic mutations that disrupt as few as one PPI at a time, while maintaining all other aspects of protein biophysics and cell biology (e.g., protein stability, protein expression, and other protein interactions). Our calculations clearly distinguish edgetic mutations from quasi-null mutations, which, by disrupting overall protein stability, cause complex cellular and phenotypic changes beyond those explainable by simple PPI disruptions. Our structure-based predictions offer a clear definition of edgetic mutations, where mutations at interfacial sites are considered edgetic if they disrupt at least one PPI. On the other hand, the definition of edgetic mutations is less straightforward in the experimental dataset of Sahni et al.⁸ due to lack of structural information. There, mutations are considered edgetic if they disrupt at least one PPI but not all PPIs associated with the protein, and mutations that disrupt all PPIs are considered quasi-null. This definition is not completely accurate because some edgetic mutations may disrupt all PPIs by disrupting the binding interface without affecting protein stability, and they will be misclassified as quasi-null mutations. To test the effect of such potential misclassification on our experiment-based dispensable PPI content estimate, we repeated our calculations using the experimental dataset of Sahni et al.⁸, this time treating all quasi-null mutations as edgetic mutations. Using this modified calculation, we estimated that ~7% of the human interactome is completely dispensable with a 95% confidence interval of 2.9–16.3%. This estimate remains very close to our previous estimate obtained from experiments.

Our estimate of dispensable content in the human reference interactome is robust to the presence of gain-of-function mutations. Gain-of-function mutations are capable of driving diverse disease phenotypes by creating new molecular interactions^29,52. A classic example is sickle cell anemia, where a mutation on the surface of the hemoglobin molecule can cause it to bind to other hemoglobin molecules²⁹. Many other examples of gain-of-function mutations have been identified as important in cancer^53,54, neurodegenerative diseases⁵⁵, as well as other diseases⁵⁶. Such gain-of-function mutations are challenging to detect systematically, either by experiment or by computation. A recent genome-wide screen suggests that gain-of-interaction mutations are ~30 times less likely to occur in human disease than edgetic loss-of-interaction mutations⁸. Our definition of completely dispensable interactions only refers to pre-existing PPIs in the reference interactome which are neutral upon elimination by mutation, and is independent of the extent of gain-of-function mutations. Furthermore, our Bayesian formulation for estimating dispensable PPI content is robust to gain-of-function mutations. The three prior probabilities P(N), P(M), and P(S), for new missense mutations to be neutral (N), mildly deleterious (M), and strongly detrimental (S), are obtained from the literature using procedures that are robust to gain-of-function mutations⁴⁸. In addition, the other three conditional probabilities in our Bayesian framework P(E|N), P(E|M), and P(E|S), for neutral (N), mildly deleterious (M), and strongly detrimental (S) mutations to edgetically eliminate PPIs (E), are also independent of the extent of gain-of-function mutations. While gain-of-function mutations are beyond the scope of our current study and do not affect our estimate of completely dispensable content among pre-existing PPIs in the reference interactome, our Bayesian framework can be extended in the future to the calculation of completely dispensable content in de novo PPIs newly created by gain-of-function mutations.

The existence of completely dispensable interactions is confirmed by in vitro experiments based on yeast two-hybrid assays⁸. In addition, genome-wide analysis suggests widespread occurrence of completely dispensable interactions in protein phosphorylation^26,27. Using the PANTHER⁵⁷ webtool for Gene Ontology analysis of dispensable interactions identified by both experiments and predictions, we found that none of the Gene Ontology terms are significantly enriched in completely dispensable interactions (false discovery rate <0.05), consistent with the expectation that completely dispensable interactions tend to be non-functional or not well-studied in the literature.

Fitness measurements under laboratory conditions do not accurately reflect selective pressures in natural environments over evolutionary timescales^58,59,60. Hence, instead of using fitness measurements under laboratory conditions, we use population genetic datasets to accurately measure selective pressures and fitness effects of mutations. Another important factor to consider is macromolecular crowding, which is known to modulate protein–protein interactions in vivo⁶¹. In this study, we make the reasonable assumption that macromolecular crowding exerts similar thermodynamic effects on each binary protein–protein interaction before and after mutation. Crowding effects can be modulated by several factors, including protein shape⁶¹. The effects of protein shape on crowding at the interactome scale remains to be investigated in future work.

In summary, we estimate that up to ~20% of the overall human interactome is completely dispensable. This estimate represents an average over the entire human interactome, likely with significant variations within the interactome. Indeed, dispensable PPI content may be much larger in certain subsets of the interactome, specifically transient PPIs mediated by motif-domain interactions^25,27. Our study suggests that the majority of the human interactome is under strong purifying selection, enabling the maintenance of a somewhat close-to-streamlined interactome (where non-dispensable interactions outnumber completely dispensable interactions) in the presence of mutation and genetic drift. Furthermore, our study provides a solid justification for the utility of interactome networks in elucidating the phenotypic consequences of genetic mutations. These insights are enabled by systematic determination of precise interactome perturbation patterns induced by mutations, and they illustrate the power and utility of complementing high-resolution mutation edgotyping experiments with structural systems biology computations.

Methods

Construction of the human structural interactome

Three-dimensional (3D) protein structures at atomic resolution were retrieved in October 2017 from the Protein Data Bank (PDB)⁴⁵. For structures containing more than one model, the first model was selected. Gene names and gene Entrez IDs in the HI-II-14 reference interactome were mapped to protein UniProt IDs and corresponding amino acid sequences using the ID mapping table provided by UniProt⁶². For proteins in the IntAct reference interactome, UniProt IDs provided by the IntAct database were used to obtain corresponding amino acid sequences. Next, we used BLAST⁶³ to perform sequence alignment on all protein sequences against all PDB chain sequences found in PDB’s SEQRES records, with an E-value cut-off of 10⁻¹⁰. For each pair of protein sequence and PDB chain, the alignment with the smallest E-value was retained, and the remaining alignments were discarded. A PPI was annotated with a pair of interacting chains in the same PDB structure (with at least one interface residue mediating the interaction) if (i) one of the proteins in the PPI has a sequence alignment with one of the chains in the chain pair, with ≥50% of interface residues mapped onto the protein; and (ii) the other protein in the PPI has a sequence alignment with the other chain in the chain pair, with ≥50% of interface residues mapped onto the protein. PPIs without any PDB chain-pair annotations were discarded. For each structurally annotated PPI, up to five PDB chain-pair annotations with the smallest joint alignment E-values were used to identify interface residues, and the rest chain-pair annotations were discarded.

Identifying binding interface residues for two chains in a PDB structure

3D coordinates at atomic resolution for each chain were loaded from the PDB structure using the Biopython library⁶⁴, and amino acid residues associated with these coordinates were verified with the chain’s backbone sequence provided by the SEQRES records of PDB. Residues that are not part of the chain’s backbone sequence were discarded. Next, we calculated the Euclidean distance between each residue of one chain and all residues of the other chain. The distance between two residues was calculated as the minimum distance between all atoms of the first residue and all atoms of the second residue. If the residue of one chain is within a distance of 5 Å from any residue in the other chain, that residue was labeled as an interface residue.

Mapping disease mutations onto the human structural interactome

Germline mutations in human with associated phenotypic consequences were retrieved in February 2019 from the ClinVar database⁴⁶ (genome assembly GRCh38). We selected missense mutations that are strictly labeled as pathogenic only, with supporting evidence (i.e., with at least one star), and with no conflicting phenotypic interpretations. To map mutations onto proteins in the human structural interactome, we searched the protein’s RefSeq transcript provided by ClinVar for the mutation flanking sequence, defined as either the first 10 amino acid residues or all amino acid residues, whichever one is shorter, on both sides of the mutation. Then we searched the protein’s sequence designated by UniProt for the mutation flanking sequence obtained from the RefSeq transcript. If the flanking sequence was found on the protein sequence at the same position reported by ClinVar, the mutation was retained for further analysis, otherwise the mutation was discarded. For multiple mutations mapping onto the same position, only one mutation was retained for further analysis.

Mapping non-disease mutations onto the human structural interactome

Single-nucleotide polymorphism (SNP) mutations in human were retrieved in October 2017 from the Single Nucleotide Polymorphism Database (dbSNP)⁴⁷ (build 150 GRCh38p7). First, we selected only missense SNPs that are labeled as validated and not withdrawn, and are assigned a location on the RefSeq transcript of a protein. Next, we discarded all mutations labeled with disease assertions (e.g., pathogenic, likely pathogenic, drug-response, uncertain significance or other). Then we selected mutations whose minor allele frequencies are higher than 1%, as common mutations with high frequencies are unlikely to be associated with a disease. To map mutations onto proteins in the human structural interactome, we searched the protein’s RefSeq transcript provided by dbSNP for the mutation flanking sequence, defined as either the first 10 amino acid residues or all amino acid residues, whichever one is shorter, on both sides of the mutation. Then we searched the protein’s sequence designated by UniProt for the mutation flanking sequence obtained from the RefSeq transcript. If the flanking sequence was found on the protein sequence at the same position reported by dbSNP, the mutation was retained for further analysis, otherwise the mutation was discarded. Finally, mutations overlapping in position with disease mutations were also discarded.

Calculating functional similarity between two proteins

Gene Ontology (GO) associations were retrieved in March 2019 from the Gene Ontology Consortium^65,66, which provides a set of controlled hierarchical GO terms distributed among three root categories: ~29,600 biological process terms, ~11,100 molecular function terms, and ~4200 cellular component terms. Functional similarity between two proteins was then calculated using the SimGIC⁶⁷ semantic similarity measure implemented in the Fastsemsim python library.

Calculating tissue co-expression for two proteins

Gene tissue expression data were retrieved from four databases: the Illumina Body Map 2.0 project⁶⁸ with RNA-seq data in 16 normal human body tissues (log2 transformed), the Genotype-Tissue Expression (GTEx) project⁶⁹ with normalized RNA-seq data in 48 normal human body tissues, the Human Protein Atlas (HPA)⁷⁰ with protein immunohistochemistry microarray data in 44 normal human body tissues, and the Fantom5 project⁷¹ with CAGE (Cap Analysis of Gene Expression) peaks (tags per million) for gene promoters in 183 normal human body tissue samples. For GTEx data, gene expression levels in each tissue were averaged over all samples. For HPA data, gene expression levels were mapped from the four symbolic values {not detected, low, medium, high} to numeric values {0, 1, 2, 3}, respectively. For Fantom5 data, promoter CAGE peaks were mapped to genes using the associated HGNC IDs. For genes with multiple CAGE peaks, the average over all peaks was considered. Tissue co-expression for two proteins was then calculated using Pearson’s correlation coefficient for their tissue expression profiles. Only protein pairs whose expression levels are defined together in at least five tissues were considered.

Calculating the 95% confidence interval of the fraction of completely dispensable PPIs

Each mutation can be either edgetic (E) or not edgetic. In addition, the fitness effect of a mutation can be either neutral (N), mildly deleterious (M), or severely detrimental (S). The fraction of PPIs effectively neutral upon edgetic disruption P(N|E) was calculated using Bayes’ theorem: P(N|E) = P(E|N)P(N)/P(E), where P(E) = P(E|N)P(N) + P(E|M)P(M) + P(E|S)P(S) = P(E|N)P(N) + P(E|M)P(M), assuming that P(E|S) = 0. Since the probabilities P(N) and P(M) are constants, it is easy to see that P(N|E) only depends on P(E|M)/P(E|N) in the following way: 1/P(N|E) = {P(E|M)/P(E|N)} × {P(M)/P(N)} + 1. The 95% confidence interval for the ratio of two proportions P(E|M)/P(E|N) was calculated according to Bland⁷², which was then used to calculate the 95% confidence interval for P(N|E) using the above equation.

Data availability

The human structural interactomes (Y2H-SI and IntAct-SI) and genetic mutations analyzed in this study are included in this article and its Source Data files. The HI-II-14 reference interactome is available at The Human Reference Protein Interactome Mapping Project (http://interactome.baderlab.org). The IntAct reference interactome is available at the IntAct Molecular Interaction Database (http://www.ebi.ac.uk/intact). Protein sequences are available at the UniProt database (https://www.uniprot.org). Three-dimensional structural templates used for the modeling of protein–protein interactions are available at the Protein Data Bank (https://www.wwpdb.org). Non-disease missense mutations are available at the dbSNP database (https://www.ncbi.nlm.nih.gov/snp). Disease-causing missense mutations are available at the ClinVar database (https://www.ncbi.nlm.nih.gov/clinvar). Gene ontology association data underlying supplementary figures are available at the Gene Ontology database (http://geneontology.org). Gene tissue expression data underlying supplementary figures are available at the Illumina Body Map 2.0 project (https://www.ebi.ac.uk/gxa/experiments/E-MTAB-513), the Genotype-Tissue Expression (GTEx) project (https://gtexportal.org/home/datasets), the Human Protein Atlas (HPA) project (https://www.proteinatlas.org/about/download) and the Functional Annotation Of The Mammalian Genome (FANTOM5) project (http://fantom.gsc.riken.jp/5/datafiles/reprocessed/hg38_latest/extra). The source data underlying Figs. 2, 3, and 4b–e are provided as Supplementary Data files.

Code availability

Software code used for data analyses and calculations is available at https://github.com/MohamedGhadie/dispensable_ppi_content.

References

Vidal, M., Cusick, M. E. & Barabási, A. L. Interactome networks and human disease. Cell 144, 986–998 (2011).
Article CAS Google Scholar
Cafarelli, T. M. et al. Mapping, modeling, and characterization of protein–protein interactions on a proteomic scale. Curr. Opin. Struct. Biol. 44, 201–210 (2017).
Article CAS Google Scholar
Sharan, R., Ulitsky, I. & Shamir, R. Network-based prediction of protein function. Mol. Sys. Biol. 3, 88 (2007).
Google Scholar
Yang, X. et al. Widespread expansion of protein interaction capabilities by alternative splicing. Cell 164, 805–817 (2016).
Article CAS Google Scholar
Goh, K. I. et al. The human disease network. Proc. Natl Acad. Sci. USA 104, 8685–8690 (2007).
Article ADS CAS Google Scholar
Zhou, X., Menche, J., Barabási, A. L. & Sharma, A. Human symptoms–disease network. Nat. Comm. 5, 4212 (2014).
Article ADS CAS Google Scholar
Menche, J. et al. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 1257601 (2015).
Article Google Scholar
Sahni, N. et al. Widespread macromolecular interaction perturbations in human genetic disorders. Cell 161, 647–660 (2015).
Article CAS Google Scholar
Qian, W., He, X., Chan, E., Xu, H. & Zhang, J. Measuring the evolutionary rate of protein–protein interaction. Proc. Natl Acad. Sci. USA 108, 8725–8730 (2011).
Article ADS CAS Google Scholar
Das, J. et al. Cross-species protein interactome mapping reveals species-specific wiring of stress-response pathways. Sci. Signal. 6, ra38 (2013).
Article Google Scholar
Vo, T. V. et al. A proteome-wide fission yeast interactome reveals network evolution principles from yeasts to human. Cell 164, 310–323 (2016).
Article CAS Google Scholar
Zhong, Q. et al. An inter‐species protein-protein interaction network across vast evolutionary distance. Mol. Syst. Biol. 12, 865 (2016).
Article Google Scholar
Ghadie, M., Coulombe-Huntington, J. & Xia, Y. Interactome evolution: insights from genome-wide analyses of protein-protein interactions. Curr. Opin. Struct. Biol. 50, 42–48 (2018).
Article CAS Google Scholar
Von Mering, C. et al. Comparative assessment of large-scale data sets of protein–protein interactions. Nature 417, 399–403 (2002).
Article ADS Google Scholar
Wodak, S. J., Vlasblom, J., Turinsky, A. L. & Pu, S. Protein–protein interaction networks: the puzzling riches. Curr. Opin. Struct. Biol. 23, 941–953 (2013).
Article CAS Google Scholar
Landry, C. R., Levy, E. D., Rabbo, D. A., Tarassov, K. & Michnick, S. W. Extracting insight from noisy cellular networks. Cell 155, 983–989 (2013).
Article CAS Google Scholar
Blundell, T. L. & Fernández-Recio, J. Cell biology: brief encounters bolster contacts. Nature 444, 279–280 (2006).
Article ADS CAS Google Scholar
Tang, C., Iwahara, J. & Clore, G. M. Visualization of transient encounter complexes in protein–protein association. Nature 444, 383–386 (2006).
Article ADS CAS Google Scholar
Schreiber, G. & Keating, A. E. Protein binding specificity versus promiscuity. Curr. Opin. Struct. Biol. 21, 50–61 (2011).
Article CAS Google Scholar
Kanshin, E., Bergeron-Sandoval, L. P., Isik, S. S., Thibault, P. & Michnick, S. W. A cell-signaling network temporally resolves specific versus promiscuous phosphorylation. Cell Rep. 10, 1202–1214 (2015).
Article CAS Google Scholar
Caldera, M., Buphamalai, P., Müller, F. & Menche, J. Interactome-based approaches to human disease. Curr. Opin. Syst. Biol. 3, 88–94 (2017).
Article Google Scholar
Cowen, L., Ideker, T., Raphael, B. J. & Sharan, R. Network propagation: a universal amplifier of genetic associations. Nat. Rev. Genet. 18, 551–562 (2017).
Article CAS Google Scholar
Lynch, M. The evolution of genetic networks by non-adaptive processes. Nat. Rev. Genet. 8, 803–813 (2007).
Article CAS Google Scholar
Levy, E. D., Landry, C. R. & Michnick, S. W. How perfect can protein interactomes be? Sci. Signal. 2, e11 (2009).
Article Google Scholar
Landry, C. R., Levy, E. D. & Michnick, S. W. Weak functional constraints on phosphoproteomes. Trends Genet. 25, 193–197 (2009).
Article CAS Google Scholar
Levy, E. D., Michnick, S. W. & Landry, C. R. Protein abundance is key to distinguish promiscuous from functional phosphorylation based on evolutionary information. Philos. Trans. R. Soc. B 367, 2594–2606 (2012).
Article CAS Google Scholar
Studer, R. A. et al. Evolution of protein phosphorylation across 18 fungal species. Science 354, 229–232 (2016).
Article ADS CAS Google Scholar
Jubb, H. C. et al. Mutations at protein-protein interfaces: small changes over big surfaces have large impacts on human health. Prog. Biophys. Mol. Biol. 128, 3–13 (2017).
Article CAS Google Scholar
Yates, C. M. & Sternberg, M. J. The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein–protein interactions. J. Mol. Biol. 425, 3949–3963 (2013).
Article CAS Google Scholar
Leducq, J. B. et al. Evidence for the robustness of protein complexes to inter-species hybridization. PLoS Genet. 8, e1003161 (2012).
Article CAS Google Scholar
Ohno, S. So much “junk” DNA in our genome. Brookhaven Symp. Biol. 23, 366–370 (1972).
CAS PubMed Google Scholar
Graur, D. An upper limit on the functional fraction of the human genome. Genome Biol. Evol. 9, 1880–1885 (2017).
Article Google Scholar
Kim, P. M., Lu, L. J., Xia, Y. & Gerstein, M. B. Relating three-dimensional structures to protein networks provides evolutionary insights. Science 314, 1938–1941 (2006).
Article ADS CAS Google Scholar
Franzosa, E. A. & Xia, Y. Structural principles within the human-virus protein-protein interaction network. Proc. Natl Acad. Sci. USA 108, 10538–10543 (2011).
Article ADS CAS Google Scholar
Wang, X. et al. Three-dimensional reconstruction of protein networks provides insight into human genetic disease. Nat. Biotechnol. 30, 159–164 (2012).
Article CAS Google Scholar
Garamszegi, S., Franzosa, E. A. & Xia, Y. Signatures of pleiotropy, economy and convergent evolution in a domain-resolved map of human–virus protein–protein interaction networks. PLoS Pathog. 9, e1003778 (2013).
Article Google Scholar
Guo, Y. et al. Dissecting disease inheritance modes in a three-dimensional protein network challenges the “guilt-by-association” principle. Am. J. Hum. Genet. 93, 78–89 (2013).
Article CAS Google Scholar
Ghadie, M., Lambourne, L., Vidal, M. & Xia, Y. Domain-based prediction of the human isoform interactome provides insights into the functional impact of alternative splicing. PLoS Comput. Biol. 13, e1005717 (2017).
Article ADS Google Scholar
Mosca, R., Céol, A. & Aloy, P. Interactome3D: adding structural details to protein networks. Nat. Methods 10, 47–53 (2013).
Article CAS Google Scholar
Meyer, M. J., Das, J., Wang, X. & Yu, H. INstruct: a database of high-quality 3D structurally resolved protein interactome networks. Bioinformatics 29, 1577–1579 (2013).
Article CAS Google Scholar
Mosca, R. et al. dSysMap: exploring the edgetic role of disease mutations. Nat. Methods 12, 167–168 (2015).
Article CAS Google Scholar
Sahni, N. et al. Edgotype: a fundamental link between genotype and phenotype. Curr. Opin. Genet. Dev. 23, 649–657 (2013).
Article CAS Google Scholar
Rolland, T. et al. A proteome-scale map of the human interactome network. Cell 159, 1212–1226 (2014).
Article CAS Google Scholar
Orchard, S. et al. The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 42, D358–D363 (2014).
Article CAS Google Scholar
Berman, H., Henrick, K. & Nakamura, H. Announcing the worldwide Protein Data Bank. Nat. Struct. Mol. Biol. 10, 980 (2003).
Article CAS Google Scholar
Landrum, M. J. et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 44, D862–D868 (2015).
Article Google Scholar
Sherry, S. T. et al. dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 29, 308–311 (2001).
Article CAS Google Scholar
Kryukov, G. V., Pennacchio, L. A. & Sunyaev, S. R. Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. Am. J. Hum. Genet. 80, 727–739 (2007).
Article CAS Google Scholar
Xiong, P., Zhang, C., Zheng, W. & Zhang, Y. BindProfX: assessing mutation-induced binding affinity change by protein interface profiles with pseudo counts. J. Mol. Biol. 429, 426–434 (2017).
Article CAS Google Scholar
Schymkowitz, J. et al. The FoldX web server: an online force field. Nucleic Acids Res. 33, W382–W388 (2005).
Article CAS Google Scholar
Jankauskaitė, J., Jiménez-García, B., Dapkūnas, J., Fernández-Recio, J. & Moal, I. H. SKEMPI 2.0: an updated benchmark of changes in protein–protein binding energy, kinetics and thermodynamics upon mutation. Bioinformatics 35, 462–469 (2018).
Article Google Scholar
Li, X. H. & Babu, M. M. Human diseases from gain-of-function mutations in disordered protein regions. Cell 175, 40–42 (2018).
Article CAS Google Scholar
Van Oijen, M. G. & Slootweg, P. J. Gain-of-function mutations in the tumor suppressor gene p53. Clin. Cancer Res. 6, 2138–2145 (2000).
PubMed Google Scholar
Kakiuchi, M. et al. Recurrent gain-of-function mutations of RHOA in diffuse-type gastric carcinoma. Nat. Genet. 46, 583–587 (2014).
Article CAS Google Scholar
Lashuel, H. A., Wurth, C., Woo, L. & Kelly, J. W. The most pathogenic transthyretin variant, L55P, forms amyloid fibrils under acidic conditions and protofilaments under physiological conditions. Biochemistry 38, 13560–13573 (1999).
Article CAS Google Scholar
Meyer, K. et al. Mutations in disordered regions can cause disease by creating dileucine motifs. Cell 175, 239–253 (2018).
Article CAS Google Scholar
Mi, H., Muruganujan, A. & Thomas, P. D. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 41, D377–D386 (2013).
Article CAS Google Scholar
Roscoe, B. P., Thayer, K. M., Zeldovich, K. B., Fushman, D. & Bolon, D. N. Analyses of the effects of all ubiquitin point mutants on yeast growth rate. Jour. Mol. Biol. 425, 1363–1377 (2013).
Article CAS Google Scholar
Mavor, D. et al. Determination of ubiquitin fitness landscapes under different chemical stresses in a classroom setting. Elife 5, e15802 (2016).
Article Google Scholar
Mavor, D. et al. Extending chemical perturbations of the ubiquitin fitness landscape in a classroom setting reveals new constraints on sequence tolerance. Biol. Open 7, bio036103 (2018).
Article Google Scholar
Guseman, A. J., Goncalves, G. M., Speer, S. L., Young, G. B. & Pielak, G. J. Protein shape modulates crowding effects. Proc. Natl Acad. Sci. USA 115, 10965–10970 (2018).
Article CAS Google Scholar
The UniProt Consortium. Activities at the universal protein resource (UniProt). Nucleic Acids Res. 42, D191–D198 (2014).
Article Google Scholar
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS Google Scholar
Cock, P. A. et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics 25, 1422–1423 (2009).
Article CAS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Article CAS Google Scholar
Gene Ontology Consortium. The Gene Ontology resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2018).
Article Google Scholar
Pesquita, C. et al. Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinforma. 9, S4 (2008).
Article Google Scholar
Yates, A. et al. Ensembl 2016. Nucleic Acids Res. 44, D710–D716 (2016).
Article CAS Google Scholar
Lonsdale, J. et al. The genotype-tissue expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article CAS Google Scholar
Uhlén, M. et al. Tissue-based map of the human proteome. Science 347, 1260419 (2015).
Article Google Scholar
The FANTOM Consortium and the RIKEN PMI and CLST (DGT). A promoter-level mammalian expression atlas. Nature 507, 462–470 (2014).
Article ADS Google Scholar
Bland, M. An Introduction to Medical Statistics (Oxford University Press, Oxford, 2015).

Download references

Acknowledgements

This work was supported by Natural Sciences and Engineering Research Council of Canada grants RGPIN-2019-05952 and RGPAS-2019-00012, Canada Foundation for Innovation grants JELF-33732 and IF-33122, and Canada Research Chairs program to Y.X., and McGill Engineering Doctoral Awards program to M.G. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the paper.

Author information

Authors and Affiliations

Department of Bioengineering, McGill University, Montreal, QC, Canada
Mohamed Ghadie & Yu Xia

Authors

Mohamed Ghadie
View author publications
You can also search for this author in PubMed Google Scholar
Yu Xia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.X. conceived and oversaw all aspects of the project. M.G. and Y.X. designed experiments. M.G. performed experiments and analyzed data. Y.X. supervised research. M.G. and Y.X. wrote the paper.

Corresponding author

Correspondence to Yu Xia.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information: Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Supplementary Data 4

Supplementary Data 5

Supplementary Data 6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ghadie, M., Xia, Y. Estimating dispensable content in the human interactome. Nat Commun 10, 3205 (2019). https://doi.org/10.1038/s41467-019-11180-2

Download citation

Received: 03 December 2018
Accepted: 21 June 2019
Published: 19 July 2019
DOI: https://doi.org/10.1038/s41467-019-11180-2

This article is cited by

In silico analysis of differentially expressed genesets in metastatic breast cancer identifies potential prognostic biomarkers
- Jongchan Kim
World Journal of Surgical Oncology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Construction of the human structural interactome

Geometry-based prediction of mutation edgotypes

Geometry-based calculation of dispensable PPI content

Physics-based calculation of dispensable PPI content

Discussion

Methods

Construction of the human structural interactome

Identifying binding interface residues for two chains in a PDB structure

Mapping disease mutations onto the human structural interactome

Mapping non-disease mutations onto the human structural interactome

Calculating functional similarity between two proteins

Calculating tissue co-expression for two proteins

Calculating the 95% confidence interval of the fraction of completely dispensable PPIs

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links