A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Huang, Xin; Lin, Xiaohui; Zeng, Jun; Wang, Lichao; Yin, Peiyuan; Zhou, Lina; Hu, Chunxiu; Yao, Weihong

doi:10.1038/s41598-017-14682-5

Download PDF

Article
Open access
Published: 30 October 2017

A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Xin Huang¹,
Xiaohui Lin¹,
Jun Zeng²,
Lichao Wang²,
Peiyuan Yin²,
Lina Zhou²,
Chunxiu Hu² &
…
Weihong Yao¹

Scientific Reports volume 7, Article number: 14339 (2017) Cite this article

2875 Accesses
14 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Analyzing omics data from a network-based perspective can facilitate biomarker discovery. To improve disease diagnosis and identify prospective information indicating the onset of complex disease, a computational method for identifying potential biomarkers based on differential sub-networks (PB-DSN) is developed. In PB-DSN, Pearson correlation coefficient (PCC) is used to measure the relationship between feature ratios and to infer potential networks. A differential sub-network is extracted to identify crucial information for discriminating different groups and indicating the emergence of complex diseases. Subsequently, PB-DSN defines potential biomarkers based on the topological analysis of these differential sub-networks. In this study, PB-DSN is applied to handle a static genomics dataset of small, round blue cell tumors and a time-series metabolomics dataset of hepatocellular carcinoma. PB-DSN is compared with support vector machine-recursive feature elimination, multivariate empirical Bayes statistics, analyzing time-series data based on dynamic networks, molecular networks based on PCC, PinnacleZ, graph-based iterative group analysis, KeyPathwayMiner and BioNet. The better performance of PB-DSN not only demonstrates its effectiveness for the identification of discriminative features that facilitate disease classification, but also shows its potential for the identification of warning signals.

An efficient and effective method to identify significantly perturbed subnetworks in cancer

Article 14 January 2021

A network approach for low dimensional signatures from high throughput data

Article Open access 23 December 2022

Systems biology in cardiovascular disease: a multiomics approach

Article 18 December 2020

Introduction

Biomarkers can provide information on pathogenic processes and pharmacological responses for a therapeutic intervention¹. The identification of biomarkers for clinical diagnosis is one of several interesting topics in medical research². Although measurements pertaining to discriminative molecules have been traditionally applied in the clinic, identifying meaningful biomarkers for clinical diagnostics based on information-rich biological data is challenging³. To identify discriminative molecules, different approaches for feature selection, such as support vector machine-recursive feature elimination (SVM-RFE)⁴, genetic algorithms (GAs)⁵ and random forests (RFs)⁶, have been widely applied^7,8,9. These methods select features based on the feature expression values among different classes rather than the changes in the feature relationships. However, a feature is also important if it has a remarkable joint effect on others¹⁰. Since molecules interact and relate to each other, exploring changes in the relationships among molecules to obtain a comprehensive understanding of disease mechanisms has attracted increasing attention in recent years^11,12,13,14. Hence, analyzing the biological data from a network perspective could be a better strategy for discovering key biomarkers and facilitating the study of disease phenotypes.

Disease development is usually studied from two aspects: static and dynamic. In clinical studies, static and dynamic (or time-series) data are applied to meet different clinical goals. Static data are used to compare changes under different conditions and to define the discriminative information. To extract information from static data, different network construction methods and network analysis techniques have been proposed. Pearson correlation coefficient (PCC) which measures associated relationships of features has been widely applied to construct the networks^15,16,17, and the hubs are retained as key factors. Krumsiek et al.^18,19 used the partial correlation coefficient to construct networks for biological data analyses. In metabolomics, a ratio could be designated as the pathway reaction in which one metabolite is converted into another metabolite via single or multiple reaction pathways²⁰. Thus, Netzer et al.²⁰ constructed a network based on the paired biomarker identifier values of the metabolite ratios. PinnacleZ²¹ applied mutual information to calculate the discriminative ability of the network. Graph-based iterative group analysis (GiGA)²² ranked the features in the network and identifies the informative sub-network based on the p-value calculated using the ranks of the features. KeyPathwayMiner²³ applied ant colony heuristic to screen for the key sub-network. BioNet²⁴ used the integer-linear programming approach to define the informative sub-network. Other efficient network-based methods exist, including a two-step module cover²⁵ and condition-specific sub-networks (COSINE)²⁶. Additional state-of-the-art methods have been summarized in a recent review paper²⁷.

As biological processes are dynamic, the systematic exploration of the temporal responses of molecules could facilitate the extraction of potential biomarkers that indicate the onset of complex diseases²⁸. The early diagnosis of complex diseases could prevent the qualitative deterioration of patients and improve survival rates. However, extracting potential biomarkers of complex diseases based on time-series data is a notable challenge. For example, the behavior of hepatocellular carcinoma (HCC) at early disease stages shows little apparent difference from that of precancerous cirrhosis (CIR)²⁹. Therefore, to explore the dynamics of disease development and screen for early warning signals, some methods for analyzing time-series data have been proposed. Tai et al.³⁰ selected important molecules using Hotelling’s T², whereas Chen et al. calculated the composite index to identify the dynamic network biomarkers of complex diseases^31,32. We also proposed a strategy for analyzing time-series data based on dynamic networks (ATSD-DN) to define the warning signal³³.

In the present study, we propose a computational method that defines potential biomarkers based on differential sub-networks (PB-DSN). PB-DSN explores the changes in correlation between feature ratios among different groups to define differential sub-networks. Subsequently, the hub vertices are identified as key feature ratios to discriminate different group samples. PB-DSN can also assess changes in correlations during disease development along time points to define differential sub-networks and selects hub vertices as key information for disease phenotyping. Moreover, signals from the sub-network consisting of the edges associated with the hub vertex can be used to indicate the onset of a specific disease stage. Hence, PB-DSN can analyze both static biological data and time-series data. In this study, a static malignant tumor genomics dataset and a time-series metabolomics dataset from a rat model of DEN-induced HCC are used to validate the performance of PB-DSN.

Results

Application of PB-DSN in the static dataset

Many studies have explored the mechanisms of malignant tumors from the viewpoint of genomics^34,35,36. A gene can signify a disease state if its expression is suppressive or augmentative under certain clinical conditions³⁷. However, some diseases result from multifaceted gene webs that interact with each other in complex ways³⁸.

Small, round blue cell tumors (SRBCTs) include four subtypes: neuroblastoma (NB), rhabdomyosarcoma (RMS), non-Hodgkin lymphoma (NHL) and the Ewing family of tumors (EWS). The routine histological appearances of these four tumors are similar³⁹. These cancers are not distinguished well by light microscopy, and there is no single test that can precisely separate the different cancers³⁹. An accurate diagnosis of the type of SRBCT is essential for providing patients with the appropriate treatment.

Training and test subsets (see supplement information) exist for four different groups of SRBCTs, including EWS, RMS, Burkitt lymphoma (BL, a subset of NHL), and NB. Detailed information about these datasets can be found in the literature³⁹. PB-DSN is used to study genomic problems at a network level. Figure S1 shows the workflow of PB-DSN. A feature is retained if the |log(fold-change)| is greater than or equal to 3 between any two subtype groups. Eighty-one features are retained, and a total of 3240 ratios are computed to construct the networks. The network G _EWS is built based on these 3240 ratios. If PCC of two ratios is greater than or equal to 0.7 in EWS, then the two ratios are linked with a red edge in G _EWS. If PCC of two ratios is less than or equal to −0.7 in EWS, then the edge is green in G _EWS. G _RMS, G _BL and G _NB are constructed using the same method applied for constructing G _EWS.

To define the discriminative information for separating EWS from the other three groups, in this study, an edge appearing in G _EWS that has different behaviors in two of the other three networks (G _RMS, G _BL and G _NB) is regarded as a differential edge of EWS, and all differential edges of EWS constitute a differential sub-network of EWS (SG _EWS). The corresponding expression of the edges in SG _EWS in the other three groups constitutes the sub-networks SG _EWS-RMS, SG _EWS-BL and SG _EWS-NB. Expanding on this idea, let G = (V(G), E(G)) be a graph, V(G) is the vertex set of G and E(G) is the edge set of G. (1) SG _EWS-RMS = (V(SG _EWS-RMS), E(SG _EWS-RMS)), where V(SG _EWS-RMS) = V(SG _EWS) and E(SG _EWS-RMS) = E(SG _EWS) ∩ E(G _RMS); (2) SG _EWS-BL = (V(SG _EWS-BL), E(SG _EWS-BL)), where V(SG _EWS-BL) = V(SG _EWS) and E(SG _EWS-BL) = E(SG _EWS) ∩ E(G _BL); (3) SG _EWS-NB = (V(SG _EWS-NB), E(SG _EWS-NB)), where V(SG _EWS-NB) = V(SG _EWS) and E(SG _EWS-NB) = E(SG _EWS) ∩ E(G _NB). The edges in SG _EWS-RMS, SG _EWS-BL and SG _EWS-NB have the same color as the corresponding ones in G _RMS, G _BL and G _NB, respectively.

Subsequently, the vertices in SG _EWS are ranked according to their degrees in descending order. The node with the highest degree (ratio 1) is selected for further analysis. The star sub-network consisting of the edges linked to the ratio 1 in SG _EWS is defined and shown in Fig. 1(a). The star sub-networks consisting of the edges linked with ratio1 in SG _EWS-RMS, SG _EWS-BL and SG _EWS-NB are also shown in Fig. 1. These data clearly express the difference between EWS and the other three groups and reveal that the correlations of some ratios in EWS samples are significantly different from those in the other three groups. The top 5 vertices in SG _EWS (see Table S1) are retained as potential biomarkers; the statistical analysis is shown in Fig. 2. The differential expression levels of ratio 1 and ratio 2 in EWS and RMS indicate that they can separate the two malignant tumor samples well. Ratio 3 is significantly decreased in NB compared with that of EWS. As the levels of ratio 1, ratio 2, and ratio 3 show significant differences between EWS and non-EWS groups, these values could be used to separate EWS from non-EWS samples. The level of ratio 4 in EWS is remarkably lower than that in RMS, thereby contributing to the discrimination of the two tumor samples. Ratio 5 increases in EWS and could be used to distinguish EWS and non-EWS samples.

PB-DSN is compared with PinnacleZ²¹, GiGA²², KeyPathwayMiner²³, BioNet²⁴ and the popular statistical analysis method SVM-RFE⁴. We also compare PB-DSN with molecular network based on PCC (MN-PCC) which builds the networks on features instead of feature ratios and applies the same network analysis method as PB-DSN. In PinnacleZ, KeyPathwayMiner and GiGA, to reduce the irrelevant features and improve the classification performance, the upper bound of the sub-network size is $\sqrt{N}$, where N is the number of total features in the network. PinnacleZ and GiGA select the sub-network with the best discriminative ability to discriminate the different diseases. The value of parameter l in KeyPathwayMiner is set as 0. A false-discovery rate of 10⁻⁶ is used in BioNet. In SVM-RFE, the kernel function is linear and the value of penalty factor is set as 1. In MN-PCC, if |PCC| of two features is great than, or equal to 0.7, then there is an edge between the two features. In PB-DSN τ is set as 0.7. To compare the performance of these methods, the binary logistic regression is performed. The areas under the curve (AUCs) of separated EWS and non-EWS samples are listed in Table 1. The AUCs obtained using PB-DSN, MN-PCC, PinnacleZ, GiGA, KeyPathwayMiner, BioNet, and SVM-RFE are 1.000, 0.991, 0.940, 0.953, 0.939, 0.987 and 0.789, respectively, in the training set. The corresponding AUC values in the test set are 1.000, 1.000, 0.905, 1.000, 0.786, 0.917 and 0.595, respectively.

Table 1 Comparison of the ROC analysis (AUC) for different methods in separating EWS from non-EWS.

Full size table

A similar method is also used to analyze BL vs. non-BL, RMS vs. non-RMS and NB vs. non-NB, and the results are shown in Tables 2–4. Table 2 shows that PB-DSN has a higher AUC than PinnacleZ, KeyPathwayMiner and SVM-RFE for separating BL and non-BL samples in the training set and exhibits the same performance as MN-PCC, GiGA and BioNet. In the test set, seven methods have the same AUC values. In the case of discriminating samples between RMS and non-RMS groups (see Table 3), PB-DSN has a slightly lower performance than PinnacleZ, GiGA, BioNet and SVM-RFE in the training set, but has the same performance in the test set. Compared with MN-PCC and KeyPathwayMiner, PB-DSN has a remarkable advantage for separating samples between RMS and non-RMS groups in the training and test sets. For NB vs. non-NB (see Table 4), PB-DSN has a better performance than PinnacleZ, GiGA and SVM-RFE in the training set. The AUCs of PB-DSN, MN-PCC, KeyPathwayMiner and BioNet are the same in the training set. In the test set, PB-DSN, MN-PCC, PinnacleZ, GiGA, KeyPathwayMiner and BioNet can well separate the NBs from the non-NB samples, whereas the AUCs of SVM-RFE are markedly low. The performance of PB-DSN shows better potential to identify discriminative information for the improvement of disease diagnosis.

Table 2 Comparison of the ROC analysis (AUC) for different methods in separating BL from non-BL.

Full size table

Table 3 Comparison of the ROC analysis (AUC) for different methods in separating RMS from non-RMS.

Full size table

Table 4 Comparison of the ROC analysis (AUC) for different methods in separating NB from non-NB.

Full size table

Application of PB-DSN in the time-series dataset

Metabolomics, a powerful platform in systems biology used to study changes in holistic low-molecular-weight metabolites (≤1500 Da), plays a significant role in different fields of life science^40,41,42. The dynamics of metabolite concentrations reflect physiological and pathological disturbances, and studying cancer from the perspective of cell-reprogrammed metabolism can provide insights into the process of carcinogenesis^28,43. Thus, metabolomics studies have been successfully employed in some cases to screen for biomarkers of malignant tumors^44,45,46. HCC is one of the major diseases with serious effects in humans. The early and precise diagnosis of HCC is crucial for ensuring that patients receive the appropriate treatment. However, due to the rapid development and early metastasis of HCC⁴⁷, it is difficult to improve the performance of HCC diagnosis and, in particular, to distinguish small malignant HCCs from precancerous CIR samples. Although some traditional tumor markers (i.e., α-fetoprotein) are effective for HCC discrimination, the poor sensitivity of these molecules suggests that they are far from ideal^47,48. Thus, developing efficient methods for the extraction of new biomarkers that signal HCC onset is urgently needed.

The metabolomics training set used in this study contains control and model groups and has been reported in a previous study²⁸. Week 0 was defined as the starting time point of animal experiment. The collection of time-series sera set was conducted from week 8 to week 20 once every 2 weeks. The model group contains three stages: week 8 (hepatitis (H) stage, S ₁), week 10–14 (CIR stage, S ₂–S ₄) and week 16–20 (HCC stage, S ₅–S ₇). S ₁, S ₄, and S ₇ are the typical time points of the corresponding liver disease stages, whereas S ₂ and S ₅ are the first time points of the corresponding liver diseases. If a variable has missing values in a group, we replace these values with the minimum nonzero value in that group at the same time point. A |log(fold-change)| greater than or equal to 1 is used to filter the non-informative features, and seventeen features are selected based on the typical time points in three sub-problems (H vs. CIR, H vs. HCC and CIR vs. HCC). In total, 136 metabolite ratios are computed based on these 17 metabolites to construct the networks.

To screen the prospective information of HCC, PB-DSN focuses on S ₅, which is the starting time point of HCC, and extracts differential edges to infer the differential sub-network of S ₅ (SG ₅). The differential edges are those that appear in the network of S ₅ but have different behaviors in most (2/3 in this study) of other networks in this time-series dataset at S _t (1 ≤ t ≤ 4). In SG ₅, the hub vertex (N,N-dimethylglycine/threonic acid) having the largest degree and its associated nodes are selected for further analysis. Figure 3 shows the dynamics of the correlation between N,N-dimethylglycine/threonic acid and its associated nodes in disease initiation and progression. We observed that the correlations of feature ratios change with the development of liver disease. Differences in the correlations of ratios between the starting time point of HCC and the stages prior to HCC could represent the onset of HCC. Therefore, changes in the correlations between N,N-dimethylglycine/threonic acid and its associated nodes could, at the network level, be critical information signaling the emergence of disease deterioration during liver disease development. The statistical result of N,N-dimethylglycine/threonic acid, as shown in Fig. 3(h), indicates that N,N-dimethylglycine/threonic acid is significantly different between HCC and non-HCC groups; thus, N,N-dimethylglycine/threonic acid exhibits potential for distinguishing HCC samples from non-HCC samples.

The vertices in SG ₅ are ranked in descending order based on their degrees, and the top 5 ratios (see Table S2) are selected for the subsequent statistical analysis. Among the 5 ratios, the levels of 3 metabolite ratios showed significant differences between the model and age-matched groups at any time point (Table S3); thus, these metabolite ratios (N,N-dimethylglycine/mucic acid, N,N-dimethylglycine/threonic acid and betaine/mucic acid) contribute to separating the samples between control and model groups. Figure 4(a–c) show the metabolic trajectory of these 3 ratios along the time points in the training set. The significant differences of these 3 metabolite ratios are shown between HCC and non-HCC groups. Thus, we find that when the levels of these 3 metabolite ratios in non-HCC samples significantly decrease, HCC occurs. The AUCs of the 3 ratios used to discriminate HCC from non-HCC groups are 0.954, 0.923, and 0.939 in the training set, respectively (Fig. 4(d)). The detailed results of statistical analysis shown in Tables S4–S6 suggest that the levels of the 3 metabolite ratios exhibit significant differences between any time point in the HCC stage and any time point in the non-HCC stage, further indicating the ability to discriminate between HCC and non-HCC samples. Notably, for N,N-dimethylglycine/threonic acid, significant differences are also observed between the H stage and any time point in the CIR stage. The significantly decreasing level of N,N-dimethylglycine/threonic acid at different disease stages could suggest its potential for a more complete presentation of liver disease development.

In the present study, the external test set (see supplement information) contains 36 sera from 6 model rats monitored at 6 time points (i.e., S ₁–S ₆). Histological examinations to validate HCC reveal that S ₁–S ₄ are the pre-cancer stage, whereas S ₅–S ₆ are the HCC stage. The AUCs of the 3 metabolite ratios in the test set are 0.948, 0.903, and 0.865, respectively, for the separation of HCC and non-HCC samples (Fig. 4(e)).

To evaluate the performance of PB-DSN, we compared this method with multivariate empirical Bayes statistics (MEBA)³⁰, ATSD-DN³³, MN-PCC, PinnacleZ²¹, GiGA²², KeyPathwayMiner²³ and BioNet²⁴. In ATSD-DN, τ is set as 0.7, and the ratio of imidazole-4-acetic acid/trimethylamine N-oxide is selected. Trimethylamine N-oxide is selected by MN-PCC. The top ratio in PB-DSN is N,N-dimethylglycine/threonic acid. In MEBA, based on Hotelling’s T², top 3 features are selected to discriminate different diseases. PinnacleZ, GiGA, KeyPathwayMiner and BioNet select the sub-network with the best discriminative ability.

The comparison results shown in Table 5 indicate that in the training set, the performance of PB-DSN for separating disease and normal groups is better than those of ATSD-DN, MN-PCC and PinnacleZ. The AUC obtained using PB-DSN for discriminating HCC and non-HCC samples is 0.954, which is higher than the AUCs of 0.808, 0.820, 0.884, 0.815, 0.900 and 0.934 obtained by ATSD-DN, MN-PCC, PinnacleZ, GiGA, KeyPathwayMiner and BioNet, respectively. The performance of PB-DSN is only 0.002 lower than that of MEBA. In the test set, the AUC obtained using PB-DSN is better than those of other methods except ATSD-DN and KeyPathwayMiner. To discriminate H from the CIR samples, the AUCs obtained for the training set by PB-DSN, MEBA, ATSD-DN, MN-PCC, PinnacleZ, GiGA, KeyPathwayMiner and BioNet are 0.966, 1.000, 0.776, 0.619, 0.912, 0.687, 0.966 and 0.959, respectively, and the corresponding AUCs in the test set are 0.972, 0.917, 0.870, 0.741, 0.787, 0.750, 0.907 and 0.889, respectively. In our previous study²⁸, a ratio of creatine/betaine was identified. The AUCs of creatine/betaine in the training and test sets are shown in Table 5. In all cases, the AUCs of N,N-dimethylglycine/threonic acid selected by PB-DSN are superior to those of creatine/betaine. The better performance of PB-DSN for analyzing the time-series dataset illustrates the potential of this method to define prospective signals that indicates the onset of HCC, thereby improving the precise diagnoses of different liver diseases.

Table 5 Comparison of the ROC analysis (AUC) in metabolomics data.

Full size table

Discussion

The precise, early diagnosis of malignant tumors can better facilitate appropriate treatments and improve the survival rates of patients. However, due to the complex factors of individual differences, epigenetics and environmental effects, identifying efficient biomarkers remains a challenge. Molecules interact with each other in networks or pathways to implement biological functions⁴⁹. In contrast to the molecule level, deregulation at the pathway level is more critical to carcinogenesis⁵⁰. Thus, discovering biomarkers from a network-based perspective can provide a more efficient strategy to characterize disease phenotypes.

To define discriminative information for classifying different disease groups in the static dataset and to identify the warning signals of disease in the time-series dataset, PB-DSN examines changes in the relationships among ratios and extracts differential edges to infer differential sub-networks.

To identify discriminative information in the static dataset about a specific group, the differential edges with different behaviors between the network of the specific group and most of other networks are extracted. Based on these differential edges, the differential sub-network, which reflects the discriminative information between the specific group and other groups, is constructed. The vertices with large degree in the differential sub-network contain crucial information, as their relationships with many other feature ratios (i.e., adjacent vertices) have changed and, thus, the correlations of the feature ratios are much close in the specific group.

Exploring the important prospective information about the onset of a specific physiological or pathological stage is crucial. To define prospective information about the disease severity and phenotype based on the time-series dataset, PB-DSN focuses on a specific time point (e.g., the starting time point of HCC) and traces the changes in relationships of the ratios from the beginning of the assessment to the specific time point. Subsequently, PB-DSN extracts the differential edges that have different behaviors between the network of the specific time point and most other networks of the earlier time points. Based on these differential edges, a differential sub-network, which reflects the discriminative information between the specific time point and the before time points, is constructed. The vertices with the large degree in the differential sub-network contain crucial prospective information about the onset of a specific physiological or pathological phenomenon, as their relationships with many other feature ratios (i.e., adjacent vertices) have changed; thus, the correlation of feature ratios is much close at the specific time point.

Based on the comparisons in the analysis of static datasets, PB-DSN outperforms MN-PCC, PinnacleZ, GiGA, KeyPathwayMiner, BioNet and SVM-RFE. Hence, by studying the differences of the feature ratio correlations among different groups, PB-DSN can be used to mine important discriminative information.

In the time-series metabolomics data, three metabolite ratios, N,N-dimethylglycine/threonic acid, N,N-dimethylglycine/mucic acid and betaine/mucic acid, are defined. Increased N,N-dimethylglycine is considered an important indicator for a shift of homocysteine remethylation towards the betaine-homocysteine-methyltransferase reaction in liver CIR⁵¹. Betaine is an important methyl donor that plays a significant role in hepatic methyl balance²⁸. These two compounds are closely associated with homocysteine remethylation. Moreover, threonic acid is a product of ascorbic acid oxidation⁵² and is thus associated with systemic oxidative stress in patients. In the present study, mucic acid is decreased in the model group compared with controls, which likely indicates that low levels of mucic acid are associated with the onset of liver disease. The combinations of N,N-dimethylglycine/threonic acid, N,N-dimethylglycine/mucic acid and betaine/mucic acid as biomarkers may improve the diagnosis of HCC development. These three ratios can promote the discrimination of metabolic differences, as the combination patterns indicate that different physiological perspectives are considered based on different metabolic pathways rather than traditional individual features or single pathway-derived metabolites. Larger HCC patient cohorts are needed to validate these results in future studies.

PB-DSN is compared with MEBA³⁰, ATSD-DN³³, MN-PCC, PinnacleZ²¹, GiGA²², KeyPathwayMiner²³ and BioNet²⁴ in the analysis of time-series data. The performance of PB-DSN is better than those of other methods, except MEBA for discriminating between non-HCC and HCC samples in the training set. In the test set, the biomarker performance indicated by PB-DSN is also efficient. In the case of separating samples between H and CIR groups, PB-DSN only has the slightly lower performance than that of MEBA in the training set, but has the best performance in the test set. The biomarkers identified by PB-DSN are effective in discriminating diseases from normal control samples. The better performance shows that compared with other methods, PB-DSN is more advantageous for extracting prospective information to facilitate the diagnosis of HCC.

Moreover, for PB-DSN and some compared methods, we made experiments to show the influence of the parameters on the effectiveness. Different parameter settings are tested and the corresponding performances are given in Tables S7–S14.

In summary, PB-DSN analyzes biological data from networks to define discriminative information and prospective signals of complex diseases. The application of PB-DSN in the two malignant tumor datasets shows that this method has the potential to effectively define discriminative information from a static dataset and to identify the prospective signals from a time-series dataset. Moreover, the network analysis method of PB-DSN can be applied in molecular networks, which also have the effective performance in some cases.

Methods

Let $D=\{{S}_{t}|1\le t\le {N}_{s}\}$ represent the data set, where S _t is a data subset of the tth disease group in a static problem or a data subset of the tth time point in the dynamic problem and N _s represents the number of the various disease groups or the number of time points. Let $F=\{{f}_{1},\,{f}_{2},\,\ldots ,\,{f}_{m}\}$ represent the feature set, where m represents the number of features. Since a ratio can be designated as the pathway reaction in which one metabolite is converted into another metabolite via single or multiple reaction pathways, the relationship of these ratios was explored to construct the network used to analyze the metabolomics data²⁰. Furthermore, ratios of gene expression levels were also studied in genomics^53,54. Hence, PB-DSN also studies feature ratios to analyze the biological data.

In PB-DSN, the feature ratio that means the ratio of feature concentration or expression level is defined as r _ij(t) = f _it/f _jt, where f _it is f _i at S _t (1 ≤ i < j ≤ m, 1 ≤ t ≤ N _s).

Network construction

The Pearson correlation coefficient of two feature ratios r _x(t) and r _y(t) in group (or at time point) S _t (1 ≤ t ≤ N _s) is defined as

$$PCC({r}_{x}(t),{r}_{y}(t))=\frac{1}{{n}_{t}-1}\sum _{k=1}^{{n}_{t}}(\frac{{r}_{x}(t,k)-{\mu }_{{r}_{x}(t)}}{{\sigma }_{{r}_{x}(t)}})(\frac{{r}_{y}(t,k)-{\mu }_{{r}_{y}(t)}}{{\sigma }_{{r}_{y}(t)}}),$$

(1)

where r _x(t, k) and r _y(t, k) are the values of ratio r _x(t) and r _y(t) in the kth sample at S _t, ${\mu }_{{r}_{x}(t)}$ and ${\mu }_{{r}_{y}(t)}$ are the means of ratio r _x(t) and r _y(t), ${\sigma }_{{r}_{x}(t)}$ and ${\sigma }_{{r}_{y}(t)}$ are the standard deviation, and n _t is the number of the samples at S _t. The Pearson correlation coefficient describes the relationships between variables in a phenomenological form. When two variables occur adjacently in a pathway or are derived from a common precursor, the correlation coefficient is positive, and when one variable is used to directly or indirectly generate the other, the correlation coefficient is negtive⁵⁵. Large |PCC(r _x(t), r _y(t))| suggests that the two corresponding feature ratios are closely related to each other at S _t. Hence, the network G _t is built based on the Pearson correlation coefficient for depicting the relationships among feature ratios at S _t. Let each feature ratio represent a vertex in the network, and when the Pearson correlation coefficient of the two feature ratios |PCC(r _x(t), r _y(t))| ≥ τ, there is an edge between the two corresponding feature ratios r _x(t) and r _y(t) in G _t. Since the Pearson correlation coefficient, which represents the different relationships of the two ratios, may be positive or negative, the edge is colored red for PCC(r _x(t), r _y(t)) ≥ τ and green for PCC(r _x(t), r _y(t)) ≤ -τ.

Defining the differential sub-network

In a complex biological system, the relationship of the feature ratios in different physiological or pathological phenomena may be different. Thus, the correlation difference among the different sample groups or along the time points could reflect different physiological or pathological changes.

Definition 1. Let D = {S _t| 1 ≤ t ≤ N _s} represent the data set, where S _t is a data subset of the tth disease group in a static problem. Let G _t represent the network at S _t. If e ∈ V(G _t) has different behaviors (i.e., disappears or has a different color) in most of the other networks, then e is a “differential edge” at S _t. The sub-network, SG _t, consisting of all the differential edges at S _t is called the differential sub-network at S _t in the static problem.

Definition 2. Let D = {S _t| 1 ≤ t ≤ N _s} represent the data set, where S _t is a data subset of the tth time point in the dynamic problems. Let G _t represent the network at S _t. If e ∈ V(G _t) has different behaviors (i.e., disappears or has a different color) in most of the networks G _p (1 ≤ p < t), then e is a “differential signal edge” of S _t. The sub-network, SG _t, consisting of all the differential signal edges of S _t is called the differential sub-network at S _t in time-series problems.

Hence, the differential sub-network at S _t in a static problem contains information to discriminate group S _t from other groups based on the changes in the relationships between the ratios. Moreover, the differential sub-network of S _t in a dynamic problem contains information that could signal the onset of the specific physiological or pathological stage at S _t, which contains certain information that is markedly different from that of the previous time point.

PB-DSN applies topological structure analysis to select the most important ratios from the differential sub-network. This method ranks the nodes of the differential sub-network at S _t in a descending order according to their degrees, and the top k ≥ 1 nodes are selected.

The compared methods

SVM-RFE

This method selects the important feature subset by sequential backward elimination⁴. In each iteration, SVM-RFE measures the weights of the features based on the contribution to the hyperplane, and the features with the lowest weights are removed from the current feature subset. During the recursive procession, the feature subset with the best classification performance is retained as the selected feature subset.

MEBA

This method is based on multivariate empirical Bayes statistics and uses Hotelling’s T² to measure the importance of features in a systematic time dimension³⁰. To reduce the false positive and false negative results, MEBA applies the time-course mean profiles to evaluate treatment differences.

MN-PCC

PCC is applied to measure the relationship of the molecules and construct the networks. Subsequently, the same network analysis method as PB-DSN is applied to identify the key factors.

ATSD-DN

This method constructs networks based on the non-overlapping ratios of the feature ratios³³. Two network analysis techniques, dynamic concentration analysis and topological structure analysis, are performed to identify the informative feature ratios. Among the feature ratios selected by both two techniques, the ratio with the highest AUC is considered a potential biomarker.

PinnacleZ

This method uses mutual information to define the discriminative ability of a sub-network²¹. To search a sub-network, PinnacleZ starts from each node in the network and performs a greedy search. During the search, PinnacleZ iteratively adds a node that is associated with other nodes in the current sub-network and can yield a maximal score increase in the sub-network. This procedure is continued until the improvement rate of the score is less than or equal to 0.05.

GiGA

This method selects the discriminative sub-network based on p-values²². First, GiGA assigns a rank to each feature based on expression changes and identifies the local minimum (i.e., the node with a lower rank than all direct neighbors in the network). Subsequently, each local minimum is viewed as a seed and is iteratively extended. In each iteration, the neighboring node with the smallest rank is added. After n steps, a sub-network with n nodes with a maximum rank m is built. Based on p-value, the sub-network is scored as

$$p=\underset{i=0}{\overset{n-1}{{\rm{\Pi }}}}\frac{m-i}{N-i},$$

(2)

where N is the number of total nodes in the network.

KeyPathwayMiner

The goal of this method is also to define the informative sub-network²³. A strategy called Global Node Exceptions is used in KeyPathwayMiner. Moreover, ant colony heuristic is used for finding the solution to subnet problem.

BioNet

BioNet is an effective method that can compute provably optimal or sub-optimal solutions to screen for the maximal-scoring sub-network²⁴. First, BioNet calculates the maximum likelihood score for each features based on the beta uniform mixture distribution of the p-values. Subsequently, integer-linear programming is applied to define the optimal sub-network in reasonable computation time.

References

Atkinson, A. J. et al. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin. Pharmacol. Ther. 69, 89–95, https://doi.org/10.1067/mcp.2001.113989 (2001).
Article Google Scholar
Liu, R., Wang, X., Aihara, K. & Chen, L. Early diagnosis of complex diseases by molecular biomarkers, network biomarkers, and dynamical network biomarkers. Med. Res. Rev. 34, 455–478, https://doi.org/10.1002/med.21293 (2014).
Article PubMed Google Scholar
Saccenti, E., Hoefsloot, H. C. J., Smilde, A. K., Westerhuis, J. A. & Hendriks, M. M. W. B. Reflections on univariate and multivariate analysis of metabolomics data. Metabolomics 10, 361–374, https://doi.org/10.1007/s11306-013-0598-6 (2013).
Article CAS Google Scholar
Guyon, I., Weston, J., Barnhill, S. & Vapnik, V. Gene selection for cancer classification using support vector machines. MLear. 46, 389–422, https://doi.org/10.1023/a:1012487302797 (2002).
MATH Google Scholar
Goldberg, D. E. & Holland, J. H. Genetic algorithms and machine learning. MLear. 3, 95–99, https://doi.org/10.1023/A:1022602019183 (1988).
Google Scholar
Breiman, L. Random forests. MLear. 45, 5–32, https://doi.org/10.1023/A:1010933404324 (2001).
MATH Google Scholar
Tapia, E., Bulacio, P. & Angelone, L. Sparse and stable gene selection with consensus SVM-RFE. Pattern Recog. Lett. 33, 164–172, https://doi.org/10.1016/j.patrec.2011.09.031 (2012).
Article Google Scholar
Diaz-Uriarte, R. & A de Andres, S. Gene selection and classification of microarray data using random forest. BMC Bioinformatics 7, doi:https://doi.org/10.1186/1471-2105-7-3 (2006).
Li, L. et al. A robust hybrid between genetic algorithm and support vector machine for extracting an optimal feature gene subset. Genomics 85, 16–23, https://doi.org/10.1016/j.ygeno.2004.09.007 (2005).
Article CAS PubMed Google Scholar
Chen, Y., Wang, L., Li, L., Zhang, H. & Yuan, Z. Informative gene selection and the direct classification of tumors based on relative simplicity. BMC Bioinformatics 17, https://doi.org/10.1186/s12859-016-0893-0 (2016).
Long, F., Su, J. H., Liang, B., Su, L. L. & Jiang, S. J. Identification of gene biomarkers for distinguishing small-cell lung cancer from non-small-cell lung cancer using a network-based approach. Biomed. Res. Int., https://doi.org/10.1155/2015/685303 (2015).
Feng, L. et al. A network-based method for identifying prognostic gene modules in lung squamous carcinoma. Oncotarget 7, 18006–18020 (2016).
Article PubMed PubMed Central Google Scholar
Nai, W. Q. et al. Identification of novel genes and pathways in carotid atheroma using integrated bioinformatic methods. Sci. Rep. 6, https://doi.org/10.1038/srep18764 (2016).
Qin, C., Sun, Y. Q. & Dong, Y. D. A new method for identifying essential proteins based on network topology properties and protein complexes. PloS One 11, https://doi.org/10.1371/journal.pone.0161042 (2016).
Zhang, X., Yang, H., Gong, B., Jiang, C. & Yang, L. Combined gene expression and protein interaction analysis of dynamic modularity in glioma prognosis. J. Neurooncol. 107, 281–288, https://doi.org/10.1007/s11060-011-0757-4 (2012).
Article CAS PubMed Google Scholar
Xue, H. et al. A modular network model of aging. Mol. Syst. Biol. 3, doi:https://doi.org/10.1038/msb4100189 (2007).
Shao, T. et al. Identification of module biomarkers from the dysregulated ceRNA-ceRNA interaction network in lung adenocarcinoma. Mol Biosyst 11, 3048–3058, https://doi.org/10.1039/c5mb00364d (2015).
Article CAS PubMed Google Scholar
Krumsiek, J., Suhre, K., Illig, T., Adamski, J. & Theis, F. J. Gaussian graphical modeling reconstructs pathway reactions from high-throughput metabolomics data. BMC Syst. Biol. 5, https://doi.org/10.1186/1752-0509-5-21 (2011).
Castro, C. et al. A study of Caenorhabditis elegans DAF-2 mutants by metabolomics and differential correlation networks. Mol. BioSyst. 9, 1632–1642, https://doi.org/10.1039/c3mb25539e (2013).
Article CAS PubMed Google Scholar
Netzer, M. et al. Profiling the human response to physical exercise: a computational strategy for the identification and kinetic analysis of metabolic biomarkers. J. Clin. Bioinformatics 1, https://doi.org/10.1186/2043-9113-1-34 (2011).
Chuang, H., Lee, E., Liu, Y., Lee, D. & Ideker, T. Network-based classification of breast cancer metastasis. Mol. Syst. Biol. 3, https://doi.org/10.1038/msb4100180 (2007).
Breitling, R., Amtmann, A. & Herzyk, P. Graph-based iterative group analysis enhances microarray interpretation. BMC Bioinformatics 5, https://doi.org/10.1186/1471-2105-5-100 (2004).
Alcaraz, N. et al. KeyPathwayMiner 4.0: condition-specific pathway analysis by combining multiple omics studies and networks with Cytoscape. BMC Syst. Biol. 8, https://doi.org/10.1186/s12918-014-0099-x (2014).
Dittrich, M., Klau, G., Rosenwald, A., Dandekar, T. & Muller, T. Identifying functional modules in protein-protein interaction networks: an integrated exact approach. Bioinformatics 24, i223–231, https://doi.org/10.1093/bioinformatics/btn161 (2008).
Article CAS PubMed PubMed Central Google Scholar
Kim, Y., Salari, R., Wuchty, S. & Przytycka, T. Module cover - a new approach to genotype-phenotype studies. Pac. Symp. Biocomput, 135–146 (2013).
Ma, H., Schadt, E., Kaplan, L. & Zhao, H. COSINE: condition-specific sub-network identification using a global optimization method. Bioinformatics 27, 1290–1298, https://doi.org/10.1093/bioinformatics/btr136 (2011).
Article CAS PubMed PubMed Central Google Scholar
Batra, R. et al. On the performance of de novo pathway enrichment. Syst. Biol. Appl. 3, https://doi.org/10.1038/s41540-017-0007-2 (2017).
Zeng, J. et al. Metabolomics identifies biomarker pattern for early diagnosis of hepatocellular carcinoma: from diethylnitrosamine treated rats to patients. Sci. Rep. 5, https://doi.org/10.1038/srep16101 (2015).
Zhou, L. et al. Serum metabolomics reveals the deregulation of fatty acids metabolism in hepatocellular carcinoma and chronic liver diseases. Anal. Bioanal. Chem. 403, 203–213, https://doi.org/10.1007/s00216-012-5782-4 (2012).
Article CAS PubMed Google Scholar
Tai, Y. & Speed, T. A multivariate empirical Bayes statistic for replicated microarray time course data. Ann. Stat. 34, 2387–2412, https://doi.org/10.1214/009053606000000759 (2006).
Article MATH MathSciNet Google Scholar
Chen, L., Liu, R., Liu, Z. P., Li, M. & Aihara, K. Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers. Sci. Rep. 2, https://doi.org/10.1038/srep00342 (2012).
Li, M., Zeng, T., Liu, R. & Chen, L. Detecting tissue-specific early warning signals for complex diseases based on dynamical network biomarkers: study of type 2 diabetes by cross-tissue analysis. Brief Bioinform. 15, 229–243, https://doi.org/10.1093/bib/bbt027 (2014).
Article CAS PubMed Google Scholar
Huang, X. et al. A new strategy for analyzing time-series data using dynamic networks: identifying prospective biomarkers of hepatocellular carcinoma. Sci. Rep. 6, https://doi.org/10.1038/srep32448 (2016).
Konopka, T. & Nijman, S. Comparison of genetic variants in matched samples using thesaurus annotation. Bioinformatics 32, 657–663, https://doi.org/10.1093/bioinformatics/btv654 (2015).
Article PubMed PubMed Central CAS Google Scholar
Geman, D., d’Avignon, C., Naiman, D. Q. & Winslow, R. L. Classifying gene expression profiles from pairwise mRNA comparisons. Stat. Appl. Genet. Mol. Biol. 3 (2004).
Yazdani, A. & Dunson, D. B. A hybrid bayesian approach for genome-wide association studies on related individuals. Bioinformatics 31, 49–54, https://doi.org/10.1093/bioinformatics/btv496 (2015).
Google Scholar
Gibbons, G. H. et al. Genetic markers: progress and potential for cardiovascular disease. Circulation 109, 47–58, https://doi.org/10.1161/01.CIR.0000133440.86427.26 (2004).
Article Google Scholar
Rather, R. A. & Dhawan, V. Genetic markers: potential candidates for cardiovascular disease. Int. J. Cardiol. 220, 914–923, https://doi.org/10.1016/j.ijcard.2016.06.251 (2016).
Article PubMed Google Scholar
Khan, J. et al. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat. Med. 7, 673–679, https://doi.org/10.1038/89044 (2001).
Article CAS PubMed PubMed Central Google Scholar
Feng, Q. et al. Integrated metabolomics and metagenomics analysis of plasma and urine identified microbial metabolites associated with coronary heart disease. Sci. Rep. 6, https://doi.org/10.1038/srep22525 (2016).
Liu, P., Qi, C. B., Zhu, Q. F., Yuan, B. F. & Feng, Y. Q. Determination of thiol metabolites in human urine by stable isotope labeling in combination with pseudo-targeted mass spectrometry analysis. Sci. Rep. 6, https://doi.org/10.1038/srep21433 (2016).
Moreno-Navarrete, J. M. et al. Metabolomics uncovers the role of adipose tissue PDXK in adipogenesis and systemic insulin sensitivity. Diabetologia 59, 822–832, https://doi.org/10.1007/s00125-016-3863-1 (2016).
Article CAS PubMed Google Scholar
Jain, M. et al. Metabolite profiling identifies a key role for glycine in rapid cancer cell proliferation. Science 336, 1040–1044, https://doi.org/10.1126/science.1218595 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Chan, A. W. et al. 1)H-NMR urinary metabolomic profiling for diagnosis of gastric cancer. Br. J. Cancer 114, 59–62, https://doi.org/10.1038/bjc.2015.414 (2016).
Article CAS PubMed Google Scholar
Ke, C. et al. Metabolic phenotyping for monitoring ovarian cancer patients. Sci. Rep. 6, https://doi.org/10.1038/srep23334 (2016).
Lu, Y. et al. Identification of serum biomarkers associated with hepatitis B virus-related hepatocellular carcinoma and liver cirrhosis using mass-spectrometry-based metabolomics. Metabolomics 11, 1526–1538, https://doi.org/10.1007/s11306-015-0804-9 (2015).
Article CAS Google Scholar
Zeng, J. et al. Metabolomics study of hepatocellular carcinoma: discovery and validation of serum potential biomarkers by using capillary electrophoresis-mass spectrometry. J. Proteome Res. 13, 3420–3431, https://doi.org/10.1021/pr500390y (2014).
Article CAS PubMed Google Scholar
Parikh, S. & Hyman, D. Hepatocellular cancer: a guide for the internist. Am. J. Med. 120, 194–202, https://doi.org/10.1016/j.amjmed.2006.11.020 (2007).
Article PubMed Google Scholar
Barabasi, A. L. & Oltvai, Z. N. Network biology: understanding the cell’s functional organization. Nat. Rev. Genet. 5, 101–113, https://doi.org/10.1038/nrg1272 (2004).
Article CAS PubMed Google Scholar
Chopra, P., Lee, J., Kang, J. & Lee, S. Improving cancer classification accuracy using gene pairs. PloS One 5, https://doi.org/10.1371/journal.pone.0014305 (2010).
Look, M. P. et al. Is the increase in serum cystathionine levels in patients with liver cirrhosis a consequence of impaired homocysteine transsulfuration at the level of gamma-cystathionase? Scand. J. Gastroenterol 35, 866–872, https://doi.org/10.1080/003655200750023255 (2000).
Article CAS PubMed Google Scholar
Isbell, H. S. & Frush, H. L. Oxidation of L-ascorbic acid by hydrogen peroxide: preparation of L-threonic acid. Carbohydr. Res. 72, 301–304, https://doi.org/10.1016/S0008-6215(00)83954-3 (1979).
Article CAS Google Scholar
Netzer, M. et al. A coupled three-step network-based approach to identify genes associated with breast cancer. The Fourth International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies, St. Maarten, Netherlands Antilles. IARIA XPS Press. (2012, March 25–30).
Fang, X., Netzer, M., Baumgartner, C., Bai, C. & Wang, X. Genetic network and gene set enrichment analysis to identify biomarkers related to cigarette smoking and lung cancer. Cancer Treat. Rev. 39, 77–88, https://doi.org/10.1016/j.ctrv.2012.06.001 (2013).
Article CAS PubMed Google Scholar
Wang, L. et al. Reconstruction and analysis of correlation networks based on GC-MS metabolomics data for young hypertensive men. Anal. Chim. Acta. 854, 95–105, https://doi.org/10.1016/j.aca.2014.11.009 (2015).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The study has been supported by National Natural Science Foundation of China (21375011).

Author information

Authors and Affiliations

School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China
Xin Huang, Xiaohui Lin & Weihong Yao
CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China
Jun Zeng, Lichao Wang, Peiyuan Yin, Lina Zhou & Chunxiu Hu

Authors

Xin Huang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaohui Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Lichao Wang
View author publications
You can also search for this author in PubMed Google Scholar
Peiyuan Yin
View author publications
You can also search for this author in PubMed Google Scholar
Lina Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Chunxiu Hu
View author publications
You can also search for this author in PubMed Google Scholar
Weihong Yao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.L. and X.H. conceived the research. X.L., X.H. and L.W. wrote the manuscript; J.Z., L.Z., C.H., P.Y. and W.Y. analyzed and discussed the raw data; X.L. reviewed and edited the manuscript.

Corresponding author

Correspondence to Xiaohui Lin.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Huang, X., Lin, X., Zeng, J. et al. A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks. Sci Rep 7, 14339 (2017). https://doi.org/10.1038/s41598-017-14682-5

Download citation

Received: 24 March 2017
Accepted: 16 October 2017
Published: 30 October 2017
DOI: https://doi.org/10.1038/s41598-017-14682-5

This article is cited by

Multiomics characterization of fatty acid metabolism for the clinical management of hepatocellular carcinoma
- Xin Huang
- Benzhe Su
- Xinyu He
Scientific Reports (2023)
Data analysis methods for defining biomarkers from omics data
- Chao Li
- Zhenbo Gao
- Xiaohui Lin
Analytical and Bioanalytical Chemistry (2022)
Inflammatory biomarkers and pendelluft magnitude in ards patients transitioning from controlled to partial support ventilation
- Rodrigo A. Cornejo
- Daniel H. Arellano
- Nivia R. Estuardo
Scientific Reports (2022)
Network-Based Analysis of Cognitive Impairment and Memory Deficits from Transcriptome Data
- Elif Emanetci
- Tunahan Çakır
Journal of Molecular Neuroscience (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Application of PB-DSN in the static dataset

Application of PB-DSN in the time-series dataset

Discussion

Methods

Network construction

Defining the differential sub-network

The compared methods

SVM-RFE

MEBA

MN-PCC

ATSD-DN

PinnacleZ

GiGA

KeyPathwayMiner

BioNet

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links