Integration of Molecular Interactome and Targeted Interaction Analysis to Identify a COPD Disease Network Module

Sharma, Amitabh; Kitsak, Maksim; Cho, Michael H.; Ameli, Asher; Zhou, Xiaobo; Jiang, Zhiqiang; Crapo, James D.; Beaty, Terri H.; Menche, Jörg; Bakke, Per S.; Santolini, Marc; Silverman, Edwin K.

doi:10.1038/s41598-018-32173-z

Download PDF

Article
Open access
Published: 27 September 2018

Integration of Molecular Interactome and Targeted Interaction Analysis to Identify a COPD Disease Network Module

Amitabh Sharma^1,3,4,5,
Maksim Kitsak⁴,
Michael H. Cho ORCID: orcid.org/0000-0002-4907-1657^1,2,3,
Asher Ameli^1,10,
Xiaobo Zhou^1,3,
Zhiqiang Jiang¹,
James D. Crapo⁶,
Terri H. Beaty ORCID: orcid.org/0000-0002-7644-1705⁷,
Jörg Menche⁸,
Per S. Bakke⁹,
Marc Santolini ORCID: orcid.org/0000-0003-1491-0120^1,4,5 &
…
Edwin K. Silverman^1,2,3

Scientific Reports volume 8, Article number: 14439 (2018) Cite this article

3378 Accesses
26 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The polygenic nature of complex diseases offers potential opportunities to utilize network-based approaches that leverage the comprehensive set of protein-protein interactions (the human interactome) to identify new genes of interest and relevant biological pathways. However, the incompleteness of the current human interactome prevents it from reaching its full potential to extract network-based knowledge from gene discovery efforts, such as genome-wide association studies, for complex diseases like chronic obstructive pulmonary disease (COPD). Here, we provide a framework that integrates the existing human interactome information with experimental protein-protein interaction data for FAM13A, one of the most highly associated genetic loci to COPD, to find a more comprehensive disease network module. We identified an initial disease network neighborhood by applying a random-walk method. Next, we developed a network-based closeness approach (C_AB) that revealed 9 out of 96 FAM13A interacting partners identified by affinity purification assays were significantly close to the initial network neighborhood. Moreover, compared to a similar method (local radiality), the C_AB approach predicts low-degree genes as potential candidates. The candidates identified by the network-based closeness approach were combined with the initial network neighborhood to build a comprehensive disease network module (163 genes) that was enriched with genes differentially expressed between controls and COPD subjects in alveolar macrophages, lung tissue, sputum, blood, and bronchial brushing datasets. Overall, we demonstrate an approach to find disease-related network components using new laboratory data to overcome incompleteness of the current interactome.

Integrated transcriptomic correlation network analysis identifies COPD molecular determinants

Article Open access 25 February 2020

Discovering the genes mediating the interactions between chronic respiratory diseases in the human interactome

Article Open access 10 February 2020

A network-based approach to uncover microRNA-mediated disease comorbidities and potential pathobiological implications

Article Open access 13 November 2019

Introduction

Chronic obstructive pulmonary disease (COPD) is the third leading cause of death worldwide, and recently it was estimated that COPD cases in developed countries would increase by more than 150% from 2010 to 2030^1,2,3,4. Furthermore, similar to other complex diseases, it has been challenging to identify systematically the likely multiple genetic risk factors for COPD. Genome-wide association studies (GWAS) can identify specific genetic loci consistently associated with disease in an unbiased manner and have reported hundreds of associations between complex diseases and traits^5,6,7. However, for the vast majority of such genome-wide “hits”, specific causal mechanisms remain uncertain. There is increasing evidence supporting the hypothesis that the onset and progression of complex diseases like COPD arise from the interplay between a number of interconnected causative genes in a manner compounding the effects of any one variant^8,9,10. Indeed, integrating GWAS data with molecular interaction networks and gene expression information facilitates a better understanding of disease pathogenetic mechanisms^{10,11,12,13,14}. A variety of approaches have been developed to infer relationships between genes showing genome-wide significant evidence of association within the human interactome—the comprehensive set of molecular relationships between cellular proteins^14,15,16,17. For example, we showed that a disease network module is enriched for disease susceptibility variants in asthma¹⁰. A GWAS of inflammatory bowel disease used DAPPLE, which is based on the observation that truly causal genes tend to link to each other in the human interactome, to prioritize potential disease candidates¹⁸. Since combinations of genetic alterations associated with a disease might affect a common component of the cellular system, module-centric approaches might be helpful in finding the disease-related components in the interactome^13,19. Yet, the output of these approaches can be strongly influenced by (i) the incompleteness of the pre-specified interactome (false-negative results), and (ii) false-positive errors in the interactome. The impact of the incompleteness could result in failure to identify network relationships for genes implicated by GWAS. Thus, integrating the module-centric approach with targeted interaction analysis (e.g., pull-down assays) of GWAS genes might be helpful in discovering the functional relationships of these genes with a disease of interest. In this work we combine new experimental protein-protein interaction data with the existing human interactome to enhance our understanding of the genes involved in COPD. The objective relies on the “local impact hypothesis,” which assumes that if a few disease components are identified, other components are likely to be found in their vicinity of the human interactome^10,12. Moreover, if a disease gene is not mapped in the interactome, it is possible that its neighbors detected by targeted interaction analysis might indicate its biological function. Hence, we first identify the disease-related network neighborhood including known COPD disease genes (seed genes) in the interactome by applying a degree-adjusted random-walk algorithm²⁰ (DADA), which is a guilt-by-association approach. Next, we test whether experimentally determined links (pull-down assay) for a single, consistently associated COPD gene (FAM13A) not mapped on the human interactome could enhance our knowledge about functional implications of FAM13A in COPD pathogenesis. The approach first aggregates the network neighborhood around the COPD ‘seed’ disease genes using DADA²⁰. Further, to define a boundary of the disease network neighborhood, we use the sub-genome-wide significant association signals from the COPD GWAS (Fig. 1). This step helps to find enrichment of moderate p–value signals associated with those neighboring genes that are in the proximity of the seed genes. We hypothesized that combining experimental interaction data with the existing human interactome would develop a more comprehensive disease network module for COPD. To test this hypothesis, we derive a novel network-based closeness approach (C_AB) to predict FAM13A partners significantly close to the initial COPD localized neighborhood. Overall, our approach enhances our understanding about the COPD disease network module and predicts new candidate genes and pathways influencing COPD pathogenesis.

Results

Building an initial COPD network neighborhood using the Degree-Aware Disease Gene Prioritization approach (DADA)

The disease module hypothesis postulates that disease susceptibility genes should form one or a few large connected components in a well-defined neighborhood of the human interactome^10,12. Selection of the seed genes strongly influences the interpretation of such a module-centric approach, and therefore we restricted our analysis to only high-confidence COPD disease genes from GWAS and Mendelian syndromes (Fig. 1B). To avoid bias toward including highly connected genes in the network neighborhood, we implemented the random walk-based DADA approach, which provides statistical adjustment models to remove the bias with respect to degree of the genes²⁰. Since DADA provides ranking to all of the genes in the human interactome, we defined the boundary of the disease network neighborhood by integrating additional genetic signals from COPD GWAS (not reaching traditional p-value thresholds for genome-wide significance) (Supplementary Figure 1). We first generated a single genetic association p-value for each gene in the interactome using VEGAS with the default all snps test²¹, and then plotted p-values of the added DADA genes vs. the background p-value distribution (Fig. 2A). After the addition of 150 genes, the genetic association p-value of added genes reached a plateau (Fig. 2A) and the connected components among the 150 genes were defined as the ‘initial network neighborhood’. At this threshold, we found eight seed genes in the largest connected component (LCC) of size 129 genes, and the other two seed genes were part of two small components of sizes 17 and 4, respectively (Fig. 2C). Indeed, the LCC of 129 genes was found to be significant compared to the largest connected component that would emerge by chance if the 129 genes were placed randomly (10,000 times) in the human interactome (Z-score = 27, p = <0.00001, Fig. 2B). Overall, these three components constitute the COPD localized neighborhood with 140 DADA genes plus 10 original high-confidence COPD seed genes. We compared our results with the Disease Module Detection (DIAMOnD) algorithm, which identifies the disease neighborhood around a set of known disease proteins based on the connectivity significance²². Interestingly, we found a significant overlap between DADA and DIAMoND output (Supplementary Figure 2), indicating that the results are consistent using a different network-based approach.

The 10 COPD seed genes that were part of the initial network neighborhood included: IREB2, SERPINA1, MMP12, HHIP, RIN3, ELN, FBLN5, CHRNA3, CHRNA5, and TGFB2 (Fig. 2C). Since one of the key genes identified by COPD GWAS, FAM13A, was not mapped in the human interactome, we tested whether specific interacting partners of FAM13A could reveal new knowledge regarding this particular gene in COPD.

FAM13A pull down assay

FAM13A contains a Rho GTPase-activating protein-binding domain; it inhibits signal transduction and responds to hypoxia. Recent work by our research group indicates that FAM13A is involved in WNT/beta catenin pathway signaling²³. FAM13A was not mapped in the edge-weighted human interactome (ConsensuspathDB) and moreover, no edges were reported in Rolland et al.²⁴ high-quality human binary protein-protein interactions and BioGRID interaction data (2014)²⁵. Thus, we performed a pull-down assay using affinity purification-mass spectrometry, which identified 96 interacting partners of FAM13A²³. We measured the likelihood of having a protein with at least 96 interacting proteins in the interactome. Among 14,280 genes in the interactome, 581 genes had a degree of 96 or greater $(P(k\ge 96)=0.04)$, suggesting that FAM13A is a relatively highly connected protein in the interactome (Supplementary Figure 3A). Further, we tested whether the FAM13A interacting partners are closer to each other within the interactome than a same-sized set of randomly selected proteins. Based on 10,000 simulations, we observed significant closeness (Zscore = −9.685) among FAM13A partners (Supplementary Figure 3B). This indicates that even if FAM13A partners are not directly interacting, they might be involved in a similar biological process because of their close proximity to each other. We found that none of the 96 FAM13A interacting partners were among the COPD localized neighborhood that we had created with DADA.

Topological distance between the COPD neighborhood proteins and FAM13A interacting proteins in the interactome

Given the substantial incompleteness of the current human interactome¹², it is difficult to conclusively determine whether the COPD disease network neighborhood would directly connect to interacting partners of FAM13A, as a single missing link might have disconnected FAM13A from the COPD localized neighborhood. Hence, we computed a network-based closeness metric (C_AB) that compares the weighted distance between FAM13A partners (A) and proteins in the COPD localized network neighborhood (B) to random expectation in order to compute the Z-score (see methods and Fig. 3A). With a Z-score significance threshold of −1.6 (p < 0.05), we found 9 genes significantly close to the COPD localized neighborhood in the human interactome and 87 genes that were not significant (Fig. 3B). The 9 genes with significant closeness to the COPD localized neighborhood were: GPC4 (Z = −4.04), ESF1 (Z = −3.46), OSBPL8 (Z = −2.97), KIAA1430 (−2.93), ZNF768 (Z = −2.68), AP3D1 (Z = −2.00), ANKRD17 (Z = −1.96), NIP7 (-Z = 1.79) and RBM34 (Z = −1.77).

Comparison with the Local Radiality (LR) method

We compared the C_AB results with the Local Radiality (LR) method that utilizes topological information (i.e., shortest path distance) to predict the proximity of dysregulated genes to corresponding drug targets²⁶. In our case, we measured the closeness of FAM13A partners (96 genes) with the COPD disease neighborhood (150 genes) by applying the LR method. In C_AB the confidence scores of the edges play an important role to either shorten or increase the distances. Thus, to carefully claim that a gene is close to the COPD network neighborhood, we not only ensured that the gene is topologically close to the neighborhood but also considered the strength of each interaction based on different sources of evidence for the existence of such a path. As compared to top C_AB genes, the nine highest score genes by LR were enriched in hubs. As a consequence, the average degrees <k> between these two methods were significantly different (P = 0.0004, Mann–Whitney U test) (Supplementary Figure 4). The hubness criterion helped us discriminate between the results from these two approaches. This seems pragmatic, as the low degree genes might be more likely to be involved in a local biological process than those high degree genes representing global molecular pathways. Furthermore, it has been proposed that highly connected superhubs perform the most basic biological functions (evolutionarily early), with the more specialized functions (evolutionarily late) being performed by the peripheral genes. Thus, C_AB helps to predict the FAM13A partners that might be involved in more specialized biological functions (low degree genes) related to COPD pathogenesis. Furthermore, it has also been observed that changes in gene expression predominantly occur in the genes (nodes) with low connectivity, but not in the superhubs²⁷.

COPD disease module with all eleven COPD seed genes

C_AB considers all of the possible paths between $\langle {{\rm{C}}}_{{\rm{A}}}\rangle $ and $\langle {{\rm{C}}}_{{\rm{B}}}\rangle $ genes to calculate the statistical significance; hence, we applied a greedy strategy (Steiner) to find the optimal paths among all of the paths connecting the COPD network neighborhood and C_AB genes²⁸. We observed a single network module consisting of C_AB genes and COPD network neighborhood genes with only four intermediate genes (ELAVL1, CSNK2A2, BARD1 and SIRT7). Of interest, including these linker genes provided connections to the network module for the two COPD seed genes, RIN3 and HHIP, that were not part of the original largest connected component of 129 genes. Our resulting expanded set of 163 connected genes, including all of the 11 seed genes (Supplementary Table 1), is referred to as the ‘COPD disease network module’ (Fig. 4A).

Validation of COPD disease network module in COPD specific gene-expression data

We tested the relevance of the COPD disease network module by evaluating fold change of differentially expressed module genes in COPD-specific gene expression data sets. We compared the fold change (absolute value of logarithm of fold change) of differentially expressed module genes to all other differentially expressed genes with unadj.p < 0.05 in eight COPD-specific gene expression data sets (Table 2). We observed a significantly higher fold-change in the COPD disease network module compared to other differentially expressed genes in seven datasets (Fig. 4B). As shown in Table 2, even after removing the seed genes, the significance was retained in six datasets (Supplementary Figure 5). Further, by considering all of the genes tested for differential expression, we still find that COPD disease network module genes were significantly enriched in four COPD gene-expression datasets (sputum, lung tissue, peripheral blood and alveolar macrophages) (Supplementary Figure 6). These results suggest the ability of our network-based approach to identify new genes relevant to COPD. Additionally, to correct for connectivity as a potential selection bias in the comparison of module and non-module genes, we selected 10 random genes either from the disease network module or from all differentially expressed genes (filtered at p < 0.05). For the latter, we made sure that all selected genes were connected using an iterative procedure: the first gene was selected at random, the second gene was selected in the neighborhood of the first gene, the third gene was selected in the neighborhood of the two first genes and so on. As compared to our previous observation in Supplementary Figure 5, we observed that the selection of a connected subset increases the significance of the differences in gene expression between the COPD disease module genes and randomly selected genes (*p < 0.05, **p < 0.01, ***p < 0.001, Supplementary Figure 7). This seems to be due to the fact that high fold change genes selected at random when looking at all differentially expressed genes tend to not be connected to other differentially expressed genes. Overall, these results indicate that the differentially expressed genes were heavily localized in the gene set added by our approach, and not influenced by the p-value criteria, thus supporting our method’s ability to identify candidate genes relevant to COPD.

Potential candidate genes for COPD

With an adjusted p-value < 0.05 (limma), we found 36 COPD disease module genes differentially expressed in different COPD-related datasets. For example, AP3D1 (adj.p-0.038) and IL32 (adj.p-0.001) were up-regulated and MMP12 (adj.p = 0.042) was down-regulated in non-smoking controls vs. COPD subjects in alveolar macrophages²⁹ (Alveolar macrophage I). In lung tissue, we found TGFB2 (adj.p = 0.014) and CAT (adj.p = 0.03) were down-regulated in control vs. COPD subjects³⁰ (Lung I). Twenty COPD disease module genes were differentially expressed in GOLD stage II vs. GOLD stage IV subjects in ECLIPSE induced sputum data³¹. CTGF (adj.p = 0.047), GSDMB (adj.p = 0.044) and CHRNA7 (adj.p = 0.043) were up-regulated between current smokers with no COPD vs. current smokers with COPD in bronchial brushing samples³² (Table 1). These results support the ability of our approach to localize candidate genes of potential relevance in COPD-related tissue types. Moreover, all of the 9 C_AB genes were differentially expressed in at least one of the gene expression datasets (Z = 2.2, p = 0.016) (Supplementary Figure 8).

Table 1 Differentially expressed COPD disease network module genes in four datasets with adjusted p-values < 0.05.

Full size table

Table 2 Enrichment of COPD disease module genes in different tissue gene expression data sets with and without seed genes.

Full size table

Biological pathway enrichment in the COPD disease module

Among the biological pathways most significantly enriched in the COPD disease network module were inflammatory response, collagen catabolic process, regulation of TGFB-receptor signaling pathway, and extracellular matrix organization pathway (Table 3). Alterations of extracellular matrix components (ECM), including elastin, are known in patients with COPD, and they contribute to airflow obstruction³³. In the COPD network module, 34 genes representing the ECM pathway were connected to each other (Fig. 5A). Moreover, we found support from the medical literature for 23 module genes from the total of 41 genes representing the ECM pathway in COPD pathogenesis (Supplementary Table 2). C_AB genes were part of: Glycosaminoglycan/aminoglycan catabolic process (GPC4), negative regulation of muscle cell differentiation (ANKRD17), negative regulation of cell migration (OSBPL8), regulation of alpha-beta T cell activation (AP3D1) and response to decrease in oxygen levels (AP3D1). Gene expression analyses in cell lines from several tissues have demonstrated an increase in FAM13A levels in response to decrease in oxygen levels³⁴. It has been suggested that lower oxygen tension might modulate FAM13A activity³⁵, however, the exact mechanism has not been explained. In the COPD disease network module, AP3D1 (C_AB gene) interacts with FAM13A and is an immediate neighbor of the CTGF gene, which is part of the hypoxia pathway (decrease in oxygen levels). Thus, the connection of FAM13A to CTGF reveals a potential mechanism by which FAM13A could contribute to the hypoxia response (Fig. 5B).

Table 3 Biological pathways significantly enriched in the COPD disease network module.

Full size table

We observed a small overlap (37 genes, 23%; vs 14% background, p-value = 0.0013) of the COPD disease network module with the Inflammasome (see methods)³⁶ (Supplementary Table 1). This suggests that the COPD disease network module was enriched for inflammation-related genes, which is consistent with the known role of inflammation in COPD³⁷. Overall, the COPD disease network module not only contains the inflammation component, but also other functional components like extracellular matrix organization, hypoxia response, and WNT/beta catenin signaling pathways²³.

Discussion

The purpose of this work was to determine whether a network-based approach could enhance our understanding of the genes involved in the pathogenesis of a complex disease (COPD) by combining new experimental protein-protein interaction data with the existing human interactome. Identifying causal genes for complex diseases like COPD, which are likely influenced by many genetic factors of modest effect size, is a major bottleneck in understanding the biological mechanisms leading to these diseases. A complete and accurate map of the human interactome could have tremendous impact on our ability to understand the molecular underpinnings of human disease. Yet, such a map is far from completion, which makes it currently impossible to evaluate precisely how far a given disease network module is from completion. Here, we showed that despite its incompleteness, a systematic network-based approach could help us to understand the connectivity of disease genes in COPD. Our initial analysis provided a set of 140 potential candidate genes that were part of three connected components in a disease network neighborhood. Interestingly, the largest connected component of this set of genes included 8 of the seed genes, which showed substantial network coherence and localization. Some of these 140 candidate genes have been previously implicated in COPD. For example, OLFM2 was among the genes within the COPD protein-protein interaction network built with a greedy search algorithm³⁸. SOD3 is known to attenuate emphysema and reduces oxidative fragmentation of ECM in mouse lung³⁹. In addition, TGFB1 and its pathway members have been frequently implicated in COPD pathogenesis⁴⁰. The novel C_AB measure assisted in constructing a more comprehensive COPD disease network module including FAM13A and its relevant partners, eventually connecting all of the 11 COPD seed genes into a single connected component comprising 163 genes/proteins (Fig. 4A). Overall, the COPD module genes showed significant differences in gene expression levels from lung tissue, alveolar macrophage, blood, and sputum samples. For example, Tumor necrosis factor alpha (TNFα)-induced protein 1 (TNFAIP1) was upregulated in smokers with COPD and directly interacts with RIN3, a COPD GWAS gene⁴¹. TNFAIP1 has been reported to be crucial for the induction of apoptosis⁴², indicating a potential role of RIN3 in apoptosis. Furthermore, AP3D1 was upregulated in COPD subjects²⁹ (Alveolar macrophage I) and directly interacts with FAM13A in our pull-down assay. The new alliance of FAM13A to the COPD disease network module via AP3D1 connects it to the hypoxia pathway (Fig. 5B), which reveals the potential molecular mechanism by which FAM13A influences hypoxia in epithelial and endothelial cells. Although other lung (e.g., idiopathic pulmonary fibrosis) and heart (e.g., congestive heart failure) diseases can cause hypoxia, hypoxia is a common and important complication of advanced COPD. Thus, we found it interesting that pathway analysis of the COPD network module identified the hypoxia pathway. This evidence suggests the potential of the C_AB measure to reveal new disease biology that might have been missed due to the incomplete human interactome.

Our approach adds a new dimension to the current causal gene identification approaches in complex diseases using the human interactome. Moreover, we were able to localize the network neighborhood of COPD and try to address (at least in part) the shortcomings of interactome incompleteness by providing new experimentally derived interactions for FAM13A, a key COPD gene not present in the current human interactome. We were able to connect FAM13A individual interactions to a localized network neighborhood by developing a new metric of network closeness, C_AB. With the current thrust to understand GWAS genes with the help of incomplete protein interaction networks, our approach provides an alternative to connect targeted interaction and interactome data to identify a disease network module.

We focused on only a small set of seed genes for COPD, and that could be one of the limitations of the work. Moreover, since the disease-related gene within each COPD GWAS locus has not been definitively proven, we selected those genes that had the most compelling evidence for a role in COPD pathogenesis. For example, murine models of emphysema have demonstrated a smoking-related phenotypic effect for genes in four of the COPD GWAS loci that we included: 1) HHIP⁴³; 2) FAM13A⁴⁴; 3) IREB2⁴⁵; and 4) MMP12⁴⁶. In addition, several other COPD GWAS loci have strong candidate genes, such as the nicotinic acetylcholine receptor genes that have been related to nicotine addiction (CHRNA3 and CHRNA5) and TGFB2 (part of the TGFBeta pathway). Thus, we contend that most of our selected seed genes are likely related to COPD pathogenesis. We also acknowledge that protein-protein interactions observed during in vitro experiments like yeast two-hybrid or affinity purification assays may not actually occur due to the absence of cellular co-localization or gene expression in the tissue of interest. COPD is a heterogeneous disease, and it is possible that different subtypes of COPD patients could have different disease network modules. Since linker genes connected the three COPD disease components in the COPD network neighborhood into a single disease network module, it could be possible that these are really three different COPD network modules. Thus, future research to identify network modules related to specific COPD subtypes is warranted.

Overall, the disease network module approach that we applied is generic and can be applied to other diseases; this type of approach may be of broad use in disease gene identification in complex diseases in the coming era of network medicine.

Materials and Methods

Selection of high confidence COPD-associated genes

Starting with previous GWAS for COPD susceptibility, and with specific genes implicated by eQTL or functional studies within GWAS regions, we identified a set of well-established genes associated with COPD: HHIP, CHRNA3/CHRNA5/IREB2, and FAM13A. We added recently described genome-wide significant associations to moderate-to-severe COPD or severe COPD, including RIN3, MMP12, and TGFB2^{41,47,48,49,50,51}. We also considered genes causing Mendelian syndromes which include emphysema as part of their syndrome constellation: alpha-1 antitrypsin deficiency (SERPINA1) and cutis laxa (ELN and FBLN5)^52,53. These 11 genes, in toto, were subsequently used as seed genes for network analyses. We included several genes from the chromosome 15q25 locus, since previous work from our group has suggested that there are likely at least two COPD genetic determinants in this region—both related to nicotine addiction (nicotinic acetylcholine receptor genes CHRNA3 and CHRNA5) and unrelated to nicotine addiction (IREB2)⁵⁴. Of note, HHIP, FAM13A, and IREB2 are also supported by animal models of emphysema. In addition to these five COPD GWAS genes, we added MMP12, which was associated with COPD before it was discovered by GWAS⁵⁰ and which is also supported by an animal model of emphysema, as well as TGFB2 and RIN3. TGFB2 and RIN3 (as well as HHIP, FAM13A, and the chromosome 15q25 region) were also strongly supported by the recent International COPD Genetics Consortium GWAS⁷.

Human protein interaction network: Interactome

We compiled the physical protein-protein interactions from the ConsensusPathDB database⁵⁵. Physical protein interactions were assigned a confidence score between 0 and 1 using the interaction confidence-scoring tool (IntScore)⁵⁶. We relied only on physical interaction data in ConsensusPathDB, obtaining M = 150,168 links between N = 14,280 genes encoding these proteins with mean degree <k> of 21.03 and average clustering coefficient <C> of 0.141.

Localization of COPD network neighborhood in the human interactome

The concept that proteins located close to one another in the human interactome may cause similar diseases is becoming an increasingly important factor in the search for complex disease genes. Different approaches tackle this problem of predicting complex disease susceptibility genes using different kinds of integrative data, but all of them involve superimposing a set of candidate genes alongside a set of known disease genes in some physical or functional network^13,17,57,58. However, many existing methods are likely to favor highly connected genes, making prioritization sensitive to the skewed degree distribution of protein-protein interaction (PPI) networks, as well as ascertainment bias in available interaction and disease association data. To enhance our understanding regarding the local neighborhood of seed genes in the network, we applied the degree aware algorithm (DADA)²⁰ to compute the proximity of the selected COPD seed genes to their neighbors by exploiting the global structure of the network. Several studies^17,59 have shown global approaches like random walk outperform other local approaches like shortest path distances, and therefore we focused on the global method. The final ranking for 14,280 genes encoding proteins included in the network was achieved by merging the random walk restarts output and statistical adjustment models. We used the results from a COPD GWAS of 6,633 cases and 5,704 controls from 4 cohorts to define a boundary for the most promising DADA-ranked genes⁴¹. We assigned significant SNPs to genes using 50 kb boundaries, and generated gene-based p-values using VEGAS²¹.

FAM13A pull-down assay

The FAM13A gene resides at a locus associated with COPD and with lung function in the general population by GWAS^41,51,60,61. FAM13A contains a Rho GTPase-activating protein-binding domain, inhibits signal transduction, and responds to hypoxia; however, its primary function in the lung remains to be determined. The pull-down assay using affinity purification-mass spectrometry was performed previously²³ and resulted in 96 interacting proteins, establishing 96 edges for FAM13A in the interactome.

Proximity of the targeted interactions to the COPD neighborhood - Cab

To quantify the network-based separation between the identified FAM13A interactions and the COPD disease network neighborhood, we introduce the Cab minimum weighted distance, which we define as follows:

For any two nodes l and m we define the Cab distance as:

$${d}_{lm}=\,\min \{{\sum }_{u,v\in path(l,m)}ln(p/{w}_{uv})\},$$

(1)

where ${w}_{uv}\in [0,1]$ is the edge confidence score and $p\in (1,\infty )$ is the parameter of the model. Note that distance d_lm depends both on the total number of network-based edges one needs to traverse from node l to node m and also the confidence scores of these weights, while parameter p tunes the relative contribution of these two factors.

In particular, in the p = 1 case, d_lm depends only on confidence scores of edges connecting two nodes:

$${d}_{lm}=\,\min \{-ln({\prod }_{u,v\in path(l,m)}{w}_{uv})\}$$

(2)

If confidence scores w_uv are regarded as independent probabilities for the edges to be present in the network, then the product in Eq. (2) is simply the probability that given path from l to m exists. The larger this probability, the smaller distance d_lm is.

On the other hand, if p is large, then d_lm is independent of confidence scores:

$${d}_{lm}\approx Lln(p)$$

(3)

where L is the smallest number of edges that need to be traversed from l to m.

We then use d_lmto define distance from l to a set of nodes M as the sum of distances from l to all nodes in M:

$${d}_{M}(l)\,\sum _{k\in M}{d}_{lk}$$

(4)

We evaluated distances from nodes to the neighborhood for a set of parameters $p=\{{e}^{0},{e}^{1},{e}^{2},{e}^{10}\}$. In the following, the values of the parameter are indexed with power of the exponent (0, 1, 2, 10). To quantify the significance of the observed distribution of distances $P\,({d}_{lm})$ from target proteins to the COPD localized neighborhood we used the Mann–Whitney U test with significance cutoff of P < 0.05. Specifically, we calculated the distribution of distances between targeted proteins to the module $P\,({d}_{lm})$ and a random distribution of distance from target proteins to all proteins in the network $P\,({d}_{lm})$. To measure how much the two distributions are different, we calculate the Z-score:

$$Z-score=\frac{TMc-(TR{c}^{rand})}{\sigma (TR{c}^{rand})}$$

(5)

where $TR{c}^{rand}$ and $\sigma (TR{c}^{rand})$ denote the mean value and standard deviation of the random expectation ${p}^{rand}(TRc)$. Assuming normality of ${p}^{rand}(TRc)$, we can analytically calculate a corresponding p-value for each z-score, yielding a threshold of z-score ≤ −1.6 for the distance to be smaller than expected by chance with significant p-value ≤ 0.05.

Local Radiality (LR) method for target prediction

The LR method quantifies the proximity of a node from a set of genes of interest.

The LR score of node n in the network G is calculated as follows²⁶:

$${\rm{LR}}({\rm{n}})=\frac{{\sum }_{{\rm{g}}\in {\rm{M}}}|{\rm{sp}}({\rm{n}},{\rm{g}},{\rm{G}})|}{|{\rm{M}}|}$$

where sp calculates the length of shortest path between nodes n and g, and $|{\rm{M}}|$ is the size of the community of interest (150 genes).

In other words, LR calculates the average shortest paths from node n to the module M.

COPD network module overlap with inflammasome genes

Since clinical COPD is influenced by inflammation⁶², we looked for the potential overlap between the COPD disease network module and recognized genes relevant to inflammatory response or the ‘inflammasome’ genes. These inflammasome signature genes were compiled from 11 disease models (asthma, COPD, fibrosis, atherosclerosis, diabetes (adipose), diabetes (islet), obesity, stroke, neuropathic pain, inflammation pain and sarcopenia)³⁶. We used the total of 2,483 inflammatory signature genes previously reported from mouse models and converted them to their human orthologs, obtaining 2,331 genes in our analysis. Mouse to human orthologs were extracted from the Mouse Genome Informatics (MGI) database (http://www.informatics.jax.org).

Validation of COPD disease module in COPD- specific gene-expression data

Our disease network module approach selects genes based on their topological closeness to the COPD seed genes. To evaluate COPD-specific relevance of genes localized around the seed genes, we extracted significantly differentially expressed genes (p-value < 0.05) from eight publicly available COPD-specific gene-expression datasets and assessed for each case the fold change difference between genes present in the COPD disease module compared to non-module differentially expressed genes. We used the limma R package (ver 3.10.1) for differential expression analysis. The 8 datasets are as follows:

1.
Singh 2014: Peripheral blood gene expression samples from 171 subjects from the Evaluation of COPD Longitudinally to Identify Predictive Surrogate Endpoints (ECLIPSE) study (GSE54837). Differential expression analysis was performed between control (n = 6) (healthy nonsmokers) vs. severe COPD (n = 13)⁶³.
2.
Singh 2011: Induced sputum gene expression from 148 COPD subjects in the ECLIPSE study, with 69 Global Initiative for Chronic Obstructive Lung Disease (GOLD) stage 2, and 71 GOLD stage 3 & 4 subjects (GSE22148). Gene expression differences between GOLD 2 and GOLD 3&4 were analyzed³¹.
3.
Shaykhiev 2009: Transcriptional profiling of alveolar macrophages obtained by bronchoalveolar lavage of 24 healthy nonsmokers and 12 COPD smokers (GSE13896)²⁹.
4.
Bahr 2013: Expression data from peripheral blood mononuclear cells (PBMC) generated from 136 subjects from the COPDGene study (GSE42057), which consisted of 42 ex-smoking control subjects and 94 subjects with varying severity of COPD⁶⁴.
5.
Tedrow 2013: Microarray data from whole lung homogenates of subjects undergoing thoracic surgery from the Lung Tissue Research Consortium (LTRC). These subjects were diagnosed as being controls or having COPD as determined by clinical history, chest CT scan, and surgical pathology. We considered 220 COPD subjects and 108 controls with no chronic lung disease by CT or pathology. These subjects went for surgery typically to investigate a pulmonary nodule and normal lung tissue was obtained for differential expression analysis (GSE47460)³⁰.
6.
Bhattacharya 2009: Gene expression patterns in lung tissue samples derived from 56 subjects (GSE8581). Cases (n = 15) were defined as subjects with FEV1 < 70% predicted and FEV1/FVC < 0.7 and Controls (n = 18) as subjects with FEV1 > 80% predicted and FEV1/FVC > 0.7⁶⁵.
7.
Poliska 2011: Gene expression data from alveolar macrophage samples from 26 COPD and 20 healthy control subjects (GSE16972)⁶⁶.
8.
Steiling 2013: Bronchial brushings obtained from current and former smokers with and without COPD (GSE37147). Data from 238 subjects was used in the analysis to determine the association of gene expression with COPD-related phenotypes³².

References

Minino, A. M., Heron, M. P., Murphy, S. L. & Kochanek, K. D. Deaths: final data for 2004. Natl Vital Stat Rep 55, 1–119 (2007).
PubMed Google Scholar
Minino, A. M., Xu, J. & Kochanek, K. D. Deaths: preliminary data for 2008. Natl Vital Stat Rep 59, 1–52 (2010).
PubMed Google Scholar
Vestbo, J. et al. Global strategy for the diagnosis, management, and prevention of chronic obstructive pulmonary disease: GOLD executive summary. Am J Respir Crit Care Med 187, 347–365, https://doi.org/10.1164/rccm.201204-0596PP (2013).
Article CAS PubMed Google Scholar
Khakban, A. et al. The Projected Epidemic of Chronic Obstructive Pulmonary Disease Hospitalizations over the Next 15 Years. A Population-based Perspective. Am J Respir Crit Care Med 195, 287–291, https://doi.org/10.1164/rccm.201606-1162PP (2017).
Article PubMed Google Scholar
Ramos, E. M. et al. Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources. Eur J Hum Genet 22, 144–147, https://doi.org/10.1038/ejhg.2013.96 (2014).
Article CAS PubMed Google Scholar
Huang, J. et al. WikiGWA: an open platform for collecting and using genome-wide association results. Eur J Hum Genet 21, 471–473, https://doi.org/10.1038/ejhg.2012.187 (2013).
Article CAS PubMed Google Scholar
Hobbs, B. D. et al. Genetic loci associated with chronic obstructive pulmonary disease overlap with loci for lung function and pulmonary fibrosis. Nat Genet 49, 426–432, https://doi.org/10.1038/ng.3752 (2017).
Article CAS PubMed PubMed Central Google Scholar
Goh, K. I. et al. The human disease network. Proc Natl Acad Sci USA 104, 8685–8690, https://doi.org/10.1073/pnas.0701361104 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Barabasi, A. L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nat Rev Genet 12, 56–68, https://doi.org/10.1038/nrg2918 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sharma, A. et al. A disease module in the interactome explains disease heterogeneity, drug response and captures novel pathways and genes in asthma. Hum Mol Genet 24, 3005–3020, https://doi.org/10.1093/hmg/ddv001 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lage, K. Protein-protein interactions and genetic diseases: The interactome. Biochim Biophys Acta 1842, 1971–1980, https://doi.org/10.1016/j.bbadis.2014.05.028 (2014).
Article CAS PubMed PubMed Central Google Scholar
Menche, J. et al. Disease networks. Uncovering disease-disease relationships through the incomplete interactome. Science 347, 1257601, https://doi.org/10.1126/science.1257601 (2015).
Article CAS PubMed PubMed Central Google Scholar
Oti, M., Snel, B., Huynen, M. A. & Brunner, H. G. Predicting disease genes using protein-protein interactions. J Med Genet 43, 691–698, https://doi.org/10.1136/jmg.2006.041376 (2006).
Article CAS PubMed PubMed Central Google Scholar
Sharma, A. et al. Network-based analysis of genome wide association data provides novel candidate genes for lipid and lipoprotein traits. Mol Cell Proteomics 12, 3398–3408, https://doi.org/10.1074/mcp.M112.024851 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hutz, J. E., Kraja, A. T., McLeod, H. L. & Province, M. A. CANDID: a flexible method for prioritizing candidate genes for complex human traits. Genet Epidemiol 32, 779–790, https://doi.org/10.1002/gepi.20346 (2008).
Article PubMed PubMed Central Google Scholar
Moreau, Y. & Tranchevent, L. C. Computational tools for prioritizing candidate genes: boosting disease gene discovery. Nature reviews. Genetics 13, 523–536, https://doi.org/10.1038/nrg3253 (2012).
Article CAS PubMed Google Scholar
Vanunu, O., Magger, O., Ruppin, E., Shlomi, T. & Sharan, R. Associating genes and protein complexes with disease via network propagation. PLoS Comput Biol 6, e1000641, https://doi.org/10.1371/journal.pcbi.1000641 (2010).
Article ADS MathSciNet CAS PubMed PubMed Central Google Scholar
Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124, https://doi.org/10.1038/nature11582 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cho, D. Y., Kim, Y. A. & Przytycka, T. M. Chapter 5: Network biology approach to complex diseases. PLoS Comput Biol 8, e1002820, https://doi.org/10.1371/journal.pcbi.1002820 (2012).
Article CAS PubMed PubMed Central Google Scholar
Erten, S., Bebek, G., Ewing, R. M. & Koyuturk, M. DADA: Degree-Aware Algorithms for Network-Based Disease Gene Prioritization. BioData Min 4, 19, https://doi.org/10.1186/1756-0381-4-19 (2011).
Article PubMed PubMed Central Google Scholar
Liu, J. Z. et al. A versatile gene-based test for genome-wide association studies. Am J Hum Genet 87, 139–145, https://doi.org/10.1016/j.ajhg.2010.06.009 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ghiassian, S. D., Menche, J. & Barabasi, A. L. A DIseAse MOdule Detection (DIAMOnD) algorithm derived from a systematic analysis of connectivity patterns of disease proteins in the human interactome. PLoS Comput Biol 11, e1004120, https://doi.org/10.1371/journal.pcbi.1004120 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Jiang, Z. et al. A Chronic Obstructive Pulmonary Disease Susceptibility Gene, FAM13A, Regulates Protein Stability of beta-catenin. Am J Respir Crit Care Med, https://doi.org/10.1164/rccm.201505-0999OC (2016).
Rolland, T. et al. A proteome-scale map of the human interactome network. Cell 159, 1212–1226, https://doi.org/10.1016/j.cell.2014.10.050 (2014).
Article CAS PubMed PubMed Central Google Scholar
Chatr-Aryamontri, A. et al. The BioGRID interaction database: 2015 update. Nucleic Acids Res 43, D470–478, https://doi.org/10.1093/nar/gku1204 (2015).
Article CAS PubMed Google Scholar
Isik, Z., Baldow, C., Cannistraci, C. V. & Schroeder, M. Drug target prioritization by perturbed gene expression and network information. Scientific reports 5, 17417, https://doi.org/10.1038/srep17417 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Lu, X., Jain, V. V., Finn, P. W. & Perkins, D. L. Hubs in biological interaction networks exhibit low changes in expression in experimental asthma. Mol Syst Biol 3, 98, https://doi.org/10.1038/msb4100138 (2007).
Article PubMed PubMed Central Google Scholar
Zheng, S. & Zhao, Z. GenRev: exploring functional relevance of genes in molecular networks. Genomics 99, 183–188, https://doi.org/10.1016/j.ygeno.2011.12.005 (2012).
Article CAS PubMed Google Scholar
Shaykhiev, R. et al. Smoking-dependent reprogramming of alveolar macrophage polarization: implication for pathogenesis of chronic obstructive pulmonary disease. J Immunol 183, 2867–2883, https://doi.org/10.4049/jimmunol.0900473 (2009).
Article CAS PubMed Google Scholar
Peng, X. et al. Plexin C1 deficiency permits synaptotagmin 7-mediated macrophage migration and enhances mammalian lung fibrosis. FASEB J 30, 4056–4070, https://doi.org/10.1096/fj.201600373R (2016).
Article CAS PubMed PubMed Central Google Scholar
Singh, D. et al. Induced sputum genes associated with spirometric and radiological disease severity in COPD ex-smokers. Thorax 66, 489–495, https://doi.org/10.1136/thx.2010.153767 (2011).
Article PubMed Google Scholar
Steiling, K. et al. A dynamic bronchial airway gene expression signature of chronic obstructive pulmonary disease and lung function impairment. Am J Respir Crit Care Med 187, 933–942, https://doi.org/10.1164/rccm.201208-1449OC (2013).
Article CAS PubMed PubMed Central Google Scholar
Annoni, R. et al. Extracellular matrix composition in COPD. Eur Respir J 40, 1362–1373, https://doi.org/10.1183/09031936.00192611 (2012).
Article PubMed Google Scholar
Chi, J. T. et al. Gene expression programs in response to hypoxia: cell type specificity and prognostic significance in human cancers. PLoS Med 3, e47, https://doi.org/10.1371/journal.pmed.0030047 (2006).
Article CAS PubMed PubMed Central Google Scholar
Ziolkowska-Suchanek, I. et al. FAM13A as a Novel Hypoxia-Induced Gene in Non-Small Cell Lung Cancer. J Cancer 8, 3933–3938, https://doi.org/10.7150/jca.20342 (2017).
Article PubMed PubMed Central Google Scholar
Wang, I. M. et al. Systems analysis of eleven rodent disease models reveals an inflammatome signature and key drivers. Mol Syst Biol 8, 594, https://doi.org/10.1038/msb.2012.24 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hogg, J. C. et al. The nature of small-airway obstruction in chronic obstructive pulmonary disease. N Engl J Med 350, 2645–2653, https://doi.org/10.1056/NEJMoa032158 (2004).
Article CAS PubMed Google Scholar
McDonald, M. L. et al. Beyond GWAS in COPD: probing the landscape between gene-set associations, genome-wide associations and protein-protein interaction networks. Hum Hered 78, 131–139, https://doi.org/10.1159/000365589 (2014).
Article CAS PubMed Google Scholar
Yao, H. et al. Extracellular superoxide dismutase protects against pulmonary emphysema by attenuating oxidative fragmentation of ECM. Proceedings of the National Academy of Sciences of the United States of America 107, 15571–15576, https://doi.org/10.1073/pnas.1007625107 (2010).
Article ADS PubMed PubMed Central Google Scholar
Aschner, Y. & Downey, G. P. Transforming Growth Factor-beta: Master Regulator of the Respiratory System in Health and Disease. Am J Respir Cell Mol Biol, https://doi.org/10.1165/rcmb.2015-0391TR (2016).
Cho, M. H. et al. Risk loci for chronic obstructive pulmonary disease: a genome-wide association study and meta-analysis. Lancet Respir Med 2, 214–225, https://doi.org/10.1016/S2213-2600(14)70002-5 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kim, D. M. et al. RhoB induces apoptosis via direct interaction with TNFAIP1 in HeLa cells. International journal of cancer. Journal international du cancer 125, 2520–2527, https://doi.org/10.1002/ijc.24617 (2009).
Article CAS PubMed Google Scholar
Lao, T. et al. Haploinsufficiency of Hedgehog interacting protein causes increased emphysema induced by cigarette smoke through network rewiring. Genome medicine 7, 12, https://doi.org/10.1186/s13073-015-0137-3 (2015).
Article CAS PubMed PubMed Central Google Scholar
Jiang, Z. et al. A Chronic Obstructive Pulmonary Disease Susceptibility Gene, FAM13A, Regulates Protein Stability of beta-Catenin. Am J Respir Crit Care Med 194, 185–197, https://doi.org/10.1164/rccm.201505-0999OC (2016).
Article CAS PubMed PubMed Central Google Scholar
Cloonan, S. M. et al. Mitochondrial iron chelation ameliorates cigarette smoke-induced bronchitis and emphysema in mice. Nature medicine 22, 163–174, https://doi.org/10.1038/nm.4021 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hautamaki, R. D., Kobayashi, D. K., Senior, R. M. & Shapiro, S. D. Requirement for macrophage elastase for cigarette smoke-induced emphysema in mice. Science 277, 2002–2004 (1997).
Article CAS PubMed Google Scholar
Zhou, X. et al. Identification of a chronic obstructive pulmonary disease genetic determinant that regulates HHIP. Human Molecular Genetics 21, 1325–1335, https://doi.org/10.1093/hmg/ddr569 (2012).
Article CAS PubMed Google Scholar
Pillai, S. G. et al. A genome-wide association study in chronic obstructive pulmonary disease (COPD): identification of two major susceptibility loci. PLoS Genet 5, e1000421, https://doi.org/10.1371/journal.pgen.1000421 (2009).
Article CAS PubMed PubMed Central Google Scholar
Wilk, J. B. et al. A genome-wide association study of pulmonary function measures in the Framingham Heart Study. PLoS Genet 5, e1000429, https://doi.org/10.1371/journal.pgen.1000429 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hunninghake, G. M. et al. MMP12, lung function, and COPD in high-risk populations. N Engl J Med 361, 2599–2608, https://doi.org/10.1056/NEJMoa0904006 (2009).
Article CAS PubMed PubMed Central Google Scholar
Cho, M. H. et al. Variants in FAM13A are associated with chronic obstructive pulmonary disease. Nat Genet 42, 200–202, https://doi.org/10.1038/ng.535 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cho, M. H. et al. Analysis of exonic elastin variants in severe, early-onset chronic obstructive pulmonary disease. Am J Respir Cell Mol Biol 40, 751–755, https://doi.org/10.1165/rcmb.2008-0340OC (2009).
Article CAS PubMed Google Scholar
Loeys, B. et al. Homozygosity for a missense mutation in fibulin-5 (FBLN5) results in a severe form of cutis laxa. Human Molecular Genetics 11, 2113–2118 (2002).
Article CAS PubMed Google Scholar
Siedlinski, M. et al. Dissecting direct and indirect genetic effects on chronic obstructive pulmonary disease (COPD) susceptibility. Hum Genet 132, 431–441, https://doi.org/10.1007/s00439-012-1262-3 (2013).
Article PubMed PubMed Central Google Scholar
Kamburov, A., Stelzl, U., Lehrach, H. & Herwig, R. The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Res 41, D793–800, https://doi.org/10.1093/nar/gks1055 (2013).
Article CAS PubMed Google Scholar
Kamburov, A., Stelzl, U. & Herwig, R. IntScore: a web tool for confidence scoring of biological interactions. Nucleic Acids Res 40, W140–146, https://doi.org/10.1093/nar/gks492 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kohler, S., Bauer, S., Horn, D. & Robinson, P. N. Walking the interactome for prioritization of candidate disease genes. Am J Hum Genet 82, 949–958, https://doi.org/10.1016/j.ajhg.2008.02.013 (2008).
Article CAS PubMed PubMed Central Google Scholar
Wu, X., Jiang, R., Zhang, M. Q. & Li, S. Network-based global inference of human disease genes. Mol Syst Biol 4, 189, https://doi.org/10.1038/msb.2008.27 (2008).
Article CAS PubMed PubMed Central Google Scholar
Navlakha, S. & Kingsford, C. The power of protein interaction networks for associating genes with diseases. Bioinformatics 26, 1057–1063, https://doi.org/10.1093/bioinformatics/btq076 (2010).
Article CAS PubMed PubMed Central Google Scholar
Cho, M. H. et al. A genome-wide association study of COPD identifies a susceptibility locus on chromosome 19q13. Human Molecular Genetics 21, 947–957, https://doi.org/10.1093/hmg/ddr524 (2012).
Article CAS PubMed Google Scholar
Soler Artigas, M. et al. Genome-wide association and large-scale follow up identifies 16 new loci influencing lung function. Nat Genet 43, 1082–1090, https://doi.org/10.1038/ng.941 (2011).
Article CAS PubMed Google Scholar
Agusti, A. et al. Persistent systemic inflammation is associated with poor clinical outcomes in COPD: a novel phenotype. PloS one 7, e37483, https://doi.org/10.1371/journal.pone.0037483 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Singh, D. et al. Altered gene expression in blood and sputum in COPD frequent exacerbators in the ECLIPSE cohort. PloS one 9, e107381, https://doi.org/10.1371/journal.pone.0107381 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Bahr, T. M. et al. Peripheral blood mononuclear cell gene expression in chronic obstructive pulmonary disease. Am J Respir Cell Mol Biol 49, 316–323, https://doi.org/10.1165/rcmb.2012-0230OC (2013).
Article CAS PubMed PubMed Central Google Scholar
Bhattacharya, S. et al. Molecular biomarkers for quantitative and discrete COPD phenotypes. Am J Respir Cell Mol Biol 40, 359–367, https://doi.org/10.1165/rcmb.2008-0114OC (2009).
Article CAS PubMed Google Scholar
Poliska, S. et al. Chronic obstructive pulmonary disease-specific gene expression signatures of alveolar macrophages as well as peripheral blood monocytes overlap and correlate with lung function. Respiration; international review of thoracic diseases 81, 499–510, https://doi.org/10.1159/000324297 (2011).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank Craig Hersh, Dawn Demeo, and Jarrett Morrow for the discussion regarding the gene expression data. We would like to thank Arda Halu and Enrico Maiorino for his help and discussion on protein-protein interaction data analysis. Maksim Kitsak was supported by ARO grants W911NF-16-1-0391 and W911NF-17-1-0491. The work was supported by the NIH/NHLBI grants: P01 HL13285, R01 HL118455-02, P01 HL105339, R01 HL111759, P01 HL114501, U01 HL089856 (EKS); CEGS- P50 HG004233-06, R01 HL118455 and R37 HL066289; U01 HL089897 (JDC) and R01HL113264 (E.K.S. and M.H.C.). We thank Amund Gulsvik, Per Bakke, Augusto Litonjua, Pantel Vokonas, Ruth Tal-Singer, and the GenKOLS, NETT/NAS, ECLIPSE, and COPDGene studies for use of GWAS meta-analysis data. The COPDGene study (NCT00608764) was funded by U01 HL089856 and U01 HL089897 and also supported by the COPD Foundation through contributions made to an Industry Advisory Board comprised of AstraZeneca, Boehringer Ingelheim, GSK, Novartis, Pfizer, Siemens and Sunovion. The National Emphysema Treatment Trial was supported by the NHLBI N01HR76101, N01HR76102, N01HR76103, N01HR76104, N01HR76105, N01HR76106, N01HR76107, N01HR76108, N01HR76109, N01HR76110, N01HR76111, N01HR76112, N01HR76113, N01HR76114, N01HR76115, N01HR76116, N01HR76118 and N01HR76119, the Centers for Medicare and Medicaid Services and the Agency for Healthcare Research and Quality. The Normative Aging Study is supported by the Cooperative Studies Program/ERIC of the US Department of Veterans Affairs and is a component of the Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC). The Norway GenKOLS study (Genetics of Chronic Obstructive Lung Disease, GSK code RES11080), the ECLIPSE study (NCT00292552; GSK code SCO104960), and the ICGN study were funded by GlaxoSmithKline.

Author information

Authors and Affiliations

Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital, Boston, USA
Amitabh Sharma, Michael H. Cho, Asher Ameli, Xiaobo Zhou, Zhiqiang Jiang, Marc Santolini & Edwin K. Silverman
Pulmonary and Critical Care Division, Brigham and Women’s Hospital and Harvard Medical School, Boston, USA
Michael H. Cho & Edwin K. Silverman
Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02115, USA
Amitabh Sharma, Michael H. Cho, Xiaobo Zhou & Edwin K. Silverman
Center for Complex Networks Research and Department of Physics, Northeastern University, Boston, MA, 02115, USA
Amitabh Sharma, Maksim Kitsak & Marc Santolini
Center for Cancer Systems Biology, Dana-Farber Cancer Institute, Boston, MA, 02115, USA
Amitabh Sharma & Marc Santolini
Department of Medicine, National Jewish Health, Denver, Colorado, USA
James D. Crapo
Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
Terri H. Beaty
Department of Bioinformatics, CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, A-1090, Vienna, Austria
Jörg Menche
Department of Clinical Science, University of Bergen, Bergen, Norway
Per S. Bakke
Department of Physics, Northeastern University, Boston, MA, 02115, United States
Asher Ameli

Authors

Amitabh Sharma
View author publications
You can also search for this author in PubMed Google Scholar
Maksim Kitsak
View author publications
You can also search for this author in PubMed Google Scholar
Michael H. Cho
View author publications
You can also search for this author in PubMed Google Scholar
Asher Ameli
View author publications
You can also search for this author in PubMed Google Scholar
Xiaobo Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqiang Jiang
View author publications
You can also search for this author in PubMed Google Scholar
James D. Crapo
View author publications
You can also search for this author in PubMed Google Scholar
Terri H. Beaty
View author publications
You can also search for this author in PubMed Google Scholar
Jörg Menche
View author publications
You can also search for this author in PubMed Google Scholar
Per S. Bakke
View author publications
You can also search for this author in PubMed Google Scholar
Marc Santolini
View author publications
You can also search for this author in PubMed Google Scholar
Edwin K. Silverman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.S. and M.K. conceived the idea for this study and E.K.S. and M.H.C. co-supervised the analyses. A.A., J.M., X.Z., Z.J. and M.S. performed the computational, statistical analyses and experiments. A.A., E.K.S., J.C., T.H.B., P.S.B. contributed to the interpretation of the results and in writing of the paper. All authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Amitabh Sharma or Edwin K. Silverman.

Ethics declarations

Competing Interests

The authors declare the following competing interests. In the past three years, Edwin K. Silverman received honoraria and consulting fees from Merck, grant support and consulting fees from GlaxoSmithKline, and honoraria and travel support from Novartis.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Material

Supplementary Table 1

Supplementary Table 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sharma, A., Kitsak, M., Cho, M.H. et al. Integration of Molecular Interactome and Targeted Interaction Analysis to Identify a COPD Disease Network Module. Sci Rep 8, 14439 (2018). https://doi.org/10.1038/s41598-018-32173-z

Download citation

Received: 02 November 2017
Accepted: 20 August 2018
Published: 27 September 2018
DOI: https://doi.org/10.1038/s41598-018-32173-z

Keywords

This article is cited by

Deeper insights into long-term survival heterogeneity of pancreatic ductal adenocarcinoma (PDAC) patients using integrative individual- and group-level transcriptome network analyses
- Archana Bhardwaj
- Claire Josse
- Kristel Van Steen
Scientific Reports (2022)
Exploration of the sputum methylome and omics deconvolution by quadratic programming in molecular profiling of asthma and COPD: the road to sputum omics 2.0
- Espen E. Groth
- Melanie Weber
- Torsten Goldmann
Respiratory Research (2020)
Integrated transcriptomic correlation network analysis identifies COPD molecular determinants
- Paola Paci
- Giulia Fiscon
- Lorenzo Farina
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Building an initial COPD network neighborhood using the Degree-Aware Disease Gene Prioritization approach (DADA)

FAM13A pull down assay

Topological distance between the COPD neighborhood proteins and FAM13A interacting proteins in the interactome

Comparison with the Local Radiality (LR) method

COPD disease module with all eleven COPD seed genes

Validation of COPD disease network module in COPD specific gene-expression data

Potential candidate genes for COPD

Biological pathway enrichment in the COPD disease module

Discussion

Materials and Methods

Selection of high confidence COPD-associated genes

Human protein interaction network: Interactome

Localization of COPD network neighborhood in the human interactome

FAM13A pull-down assay

Proximity of the targeted interactions to the COPD neighborhood - Cab

Local Radiality (LR) method for target prediction

COPD network module overlap with inflammasome genes

Validation of COPD disease module in COPD- specific gene-expression data

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Comments

Search

Quick links