Diversity and molecular network patterns of symptom phenotypes

Shu, Zixin; Wang, Jingjing; Sun, Hailong; Xu, Ning; Lu, Chenxia; Zhang, Runshun; Li, Xiaodong; Liu, Baoyan; Zhou, Xuezhong

doi:10.1038/s41540-021-00206-5

Download PDF

Article
Open access
Published: 30 November 2021

Diversity and molecular network patterns of symptom phenotypes

Zixin Shu¹^na1,
Jingjing Wang¹^na1,
Hailong Sun¹^na1,
Ning Xu²,
Chenxia Lu³,
Runshun Zhang⁴,
Xiaodong Li³,
Baoyan Liu⁵ &
…
Xuezhong Zhou ORCID: orcid.org/0000-0002-4713-3594¹

npj Systems Biology and Applications volume 7, Article number: 41 (2021) Cite this article

2724 Accesses
2 Citations
32 Altmetric
Metrics details

Subjects

Abstract

Symptom phenotypes have continuously been an important clinical entity for clinical diagnosis and management. However, non-specificity of symptom phenotypes for clinical diagnosis is one of the major challenges that need be addressed to advance symptom science and precision health. Network medicine has delivered a successful approach for understanding the underlying mechanisms of complex disease phenotypes, which will also be a useful tool for symptom science. Here, we extracted symptom co-occurrences from clinical textbooks to construct phenotype network of symptoms with clinical co-occurrence and incorporated high-quality symptom-gene associations and protein–protein interactions to explore the molecular network patterns of symptom phenotypes. Furthermore, we adopted established network diversity measure in network medicine to quantify both the phenotypic diversity (i.e., non-specificity) and molecular diversity of symptom phenotypes. The results showed that the clinical diversity of symptom phenotypes could partially be explained by their underlying molecular network diversity (PCC = 0.49, P-value = 2.14E-08). For example, non-specific symptoms, such as chill, vomiting, and amnesia, have both high phenotypic and molecular network diversities. Moreover, we further validated and confirmed the approach of symptom clusters to reduce the non-specificity of symptom phenotypes. Network diversity proposes a useful approach to evaluate the non-specificity of symptom phenotypes and would help elucidate the underlying molecular network mechanisms of symptom phenotypes and thus promotes the advance of symptom science for precision health.

LeMeDISCO is a computational method for large-scale prediction & molecular interpretation of disease comorbidity

Article Open access 25 August 2022

Network analysis to identify symptoms clusters and temporal interconnections in oncology patients

Article Open access 12 October 2022

A network model of depressive and anxiety symptoms: a statistical evaluation

Article 18 January 2024

Introduction

Symptom phenotypes (i.e., symptoms and signs), one of the main clinical manifestations of disease conditions, that could be obtained by human natural perception and cognition abilities, play a vital role for medical visiting, clinical diagnosis, and disease treatment. It has been well-recognized that exploring the clinical patterns and their underlying molecular mechanisms of symptom phenotypes would contribute significantly to nursing science and precision medicine^1,2. However, non-specificity (or diversity) is one of the main obstacles to fully utilize the symptom phenotypes for both diagnosis and treatment. In particular, it has been estimated that Medically Unexplained Symptoms such as tiredness, dizziness, and headache³, which are actually the first part of manifestations in early stage of disease, account for up to 49% of all general practice consultations and high healthcare cost⁴. This means there has no specified pathology to sufficiently reveal and explain the persistent bodily complaints⁵.

Furthermore, due to the network pathological mechanisms of clinical manifestations, symptoms tend to occur together clinically to form symptom clusters⁶ across different chronic disease condition⁷, which would be more specific and meaningful for diagnosis and treatment. Therefore, the assessment of symptom clusters has been recognized as a promising research task for symptom science. For example, the identification of the typical symptom clusters and their underlying mechanisms, such as depression and pain⁸, have promoted the understanding of mental disorders and better treatment. In addition, network medicine approach⁹ to investigate the interconnection of symptoms in mental disorders has emerged as one of the most popular investigation methods in the field of psychometrics¹⁰.

However, although it is vital there is no work to quantify the diversity of symptom phenotypes in the context of clinical settings and their underlying molecular networks, largely because of the lack of high-quality symptom-gene associations and clinical symptom co-occurrence data. Here, we extracted symptom co-occurrences from clinical textbooks to construct phenotype network of symptoms with clinical co-occurrence and incorporated high-quality symptom-gene associations¹¹ and protein–protein interactions to explore the molecular mechanisms of symptom phenotypes¹². Furthermore, we adopted a well-established measure in network medicine¹³ to quantify both phenotypic and molecular diversity of symptom phenotypes (Fig. 1).

**Fig. 1: Quantifying the phenotypic and molecular network diversity of symptom phenotypes.**

Results

High-quality symptom-gene associations

To obtain the high-quality symptom gene associations, we utilized the phenomenon of some “Dual Phenotypes” (DP)¹⁴, such as obesity, fever, and insomnia, which are not only regarded as diseases, but also as symptoms in clinical settings. The associated genes of symptoms can be directly derived from the disease–gene associations by filtering the disease with DP properties. In order to identify these kinds of phenotype terms, we filtered an integrated phenotype–genotype associations (PGA) dataset by limiting the semantic types of Unified Medical language System (UMLS) concepts as T184¹⁵, which resulted in 16,049 associations between 490 symptoms with concept unified identifiers (CUI) code and 4193 genes (see Methods). In fact, these concepts including syndromes (e.g., kearn sayer syndrome), signs (e.g., abnormal reflexes), laboratory tests (e.g., leukopenia) and diseases (e.g., edema lung). Therefore, we manually reviewed and removed symptoms without clear meaning under the guidance of medical to ensure the accuracy of results (Supplementary Table 2). Finally, we obtained 12,719 high-quality symptom–gene associations between 341 symptoms and 3598 genes.

Here, we found there are 37.30 related genes on an average per symptom and 3.53 related symptoms for a single gene. More specifically, 60% symptoms have less than 20 associated genes (Fig. 2a); however, there still exist several symptoms with hundreds of genes, such as obesity (560 genes) and convulsion (673 genes), which indicate the underlying complex pathophysiology and comorbidities of these symptom phenotypes^16,17,18. On the other side, over 50% genes have less than 3 associated symptoms, whereas some genes, such as PRNP, PSEN1, MAPT, GBA, and MECP2 are associated to >20 symptoms (Fig. 2b).

**Fig. 2: The basic statistics of high-quality symptom-gene associations.**

Furthermore, we mapped 341 symptoms to 14 systems or categories according to Symptom Ontology (SYMP) with the principles of the OBO Foundry¹⁹. The SYMP standard ontology (https://www.ebi.ac.uk/ols/ontologies/symp/terms) was developed in 2005 at the Institute for Genome Sciences (IGS) at the University of Maryland and contain more than 900 symptoms in 2020. Despite the limited number of our symptom terms, it covers almost all system categories, which of the large number of symptoms belong to the nervous system (Fig. 2c).

Clinical diversity of symptom phenotypes

To measure the symptom diversity in the context of network, we first constructed a symptom clinical association network (SCN) using 2381 records of symptom clusters curated from a well-recognized textbook named differential diagnosis of traditional Chinese medicine symptoms (DDTS)²⁰, which resulted in a network with 1419 nodes (symptoms) and 32,523 links. In SCN, the symptoms with higher phenotypic diversity (PD) and phenotypic degree (PE), such as neurological and physiological symptoms (e.g., dysphoria, PD: 100.32, PE: 623), respiratory system symptoms (e.g., chest distress, PD: 89.24, PE: 381), and digestive system symptoms (e.g., diarrhea, PD:84.24, PE: 230) which may involve in a various of diseases (Fig. 3). For example, for diarrhea²¹ accompanied with abdominal pain, fever, or gastrointestinal bleeding, it would suggest inflammatory diseases. For another diarrhea phenotype with symptoms of fatigue, cough, and fever, it might relate to virus infectious diseases, such as the severe acute respiratory syndrome coronavirus 2²². Other top ranked symptoms, such as night sweats (PD:89.44, PE:282) and difficulty in urination (PD:80.93, PE:177) (Table 1) would tend to occur as complications in a critical condition. However, the symptoms with low diversity, such as nail symptoms (e.g., flat nails, PD:0.95, PE: 2) and feet symptoms (e.g., digit fester, PD:3.57, PE:8), tend to be local clinical manifestations.

**Fig. 3: Construction of symptom clinical association network(SCN).**

Table 1 Quantifying the diversity of symptom phenotypes in SCN (including the top 50 symptoms sorted by the phenotypic diversity in SCN).

Full size table

Molecular network diversity of symptom phenotypes

To explore the underlying molecular mechanisms of symptom phenotypic diversity, we mapped 252 (73.90%) English terms with associated genes into 116 Chinese terms in SCN (see Methods, Supplementary Table 1), including neurological and physiological symptoms (e.g., night sweats) and general symptom (e.g., chill). 89 (26.10%) symptoms not mapped are mostly from nervous system symptoms (e.g., echo speech), head and neck symptoms (e.g., conjunctiva inflammation), and musculoskeletal system symptoms (e.g. gait ataxic) (Fig. 2d). Next, we attempt to calculate the maximum node diversity and degree of the symptom-related genes in protein–protein interactions (PPI) network²³ to represent molecular network diversity (MD) of symptom phenotypes (see Methods). The maximum gene diversity (MGD) of 116 symptoms range from 9.12 to 491.39, and ~45% of symptoms had MGDs greater than 200. The maximum gene degree (MGE) of symptoms range from 10 to 1400, and only 10% symptoms had a value greater than 600 (Fig. 4a, b) (Table 2).

**Fig. 4: Symptom network diversity analysis.**

Table 2 Quantifying the molecular network diversity of symptom phenotype in SCN (including the top 50 symptoms sorted by the molecular network diversity in SCN).

Full size table

Here, we calculated the Pearson correlation coefficient (PCC) to find the relationships of phenotypic and molecular diversity of these symptoms. The result showed that there exists a positive correlation between the two measures (PD and MGD: PCC = 0.49, P-value = 2.14E-08; PE and MGE: PCC = 0.39, P-value = 1.55E-05) (Fig. 4b). This means that symptoms occurred in more symptom clusters might tend to held higher diverse underlying molecular networks. For example, we found depression have rather high MGD (299.95), which actually is derived from the high diversity of the related gene: MAPK1 in PPI network. MAPK1 as one of the important regulated gene in the mTOR signaling pathway which plays an important role in synaptic plasticity in Alzheimer’s disease and relate to the depression disorder as well as functioning of the immune system^24,25. It is similar for obesity, which has high MGD (367.89) and is considered both as complicated chronic disease condition and symptom with a major negative impact on human health. Since one of the vital obesity genes: AKT1 has the high node diversity (367.89) in PPI network, which at molecular level not only mediated type II muscle growth and thus led to the reversible reduction of fat mass, but also have a direct role on cancer and hearing loss^26,27,28,29.

To further validate and detect the potential applications of symptom diversity for drug development, we curated 948 drugs and their 1451 drug targets from the DrugBank database³⁰ and calculated the correlations between symptom diversity to the number of drug targets located in the neighborhoods of symptom genes in the PPI network. We would expect that drugs tend to regulate symptom by directly targeting symptom genes or the neighbors of symptom genes, the similar principle of which has been used for various related studies³¹. After obtaining the related drug targets associated with 116 symptoms in the 1^st order PPI interactions, we found that there actually exists a strong positive correlation between the number of drug targets and the MGD of symptoms (PCC = 0.79, P-value = 1.93E-26, Fig. 5b). This is similar for phenotypic network diversity (PCC = 0.54, P-value = 4.55E-10, Fig. 5b). The results indicate that symptoms with higher diversity in the clinical settings may tend to have higher number of drug targets to regulate the underlying molecular mechanisms of symptoms. Symptoms with higher drug target number (DTN) also have higher phenotypic diversity, such as dysphoria (DTN: 323), insomnia (DTN: 431), and vomiting (DTN: 761). For example, about 10 categories of drugs are associated with insomnia, including antihistamine (e.g., doxylamine³²), anxiolytics (e.g., etizolam³³), and antipsychotics (e.g. melperone³⁴), which affect GABA-A, D2 dopaminergic and 5HT2A serotonergic and other receptors to treat insomnia. Thus, the symptoms with more clinical diversities would have the potential to be induced and treated by more drugs that target the related genes in their PPI neighborhoods. Furthermore, it is also interesting and important to validate whether the trend is also held for diseases. Therefore, using the integrated disease-gene associations with 179,307 records (12,563 diseases and 18,189 genes), we further investigate the correlation between disease diversities (i.e., in terms of its underlying molecular network) and the number of their drug targets by additional calculations. We found that there exactly exists a strong positive correlation between the number of drug targets and the MGD of diseases (PCC = 0.77, P-value < 4.9E-324). This is similar for the number of drugs (PCC = 0.74, P-values < 4.9E-324). These results indicate that diseases with higher diversity in the molecular network may tend to have higher number of drug targets (Supplementary Fig. 1).

**Fig. 5: Correlations of the symptom network diversity and related drug-targets diversity.**

Molecular network diversity (symptom vs disease phenotypes)

Traditional clinical diagnosis often relied on symptom manifestations, which would be more directly be observed in patients’ daily life and thus convenient for clinical management. However, similar symptom phenotypes always involved in different disease conditions, which would propose substantial obstacles for clinical diagnosis and treatment. Due to the more specific mechanisms of disease phenotypes, changing from symptom-based diagnosis to disease-based diagnosis is the main contribution of modern disease taxonomy and biomedical science^{35,36,37,38,39}. To validate the advantages of disease diagnosis, we utilized the disease–gene associations from MalaCards to similarly calculate the MD for 12,563 disease phenotypes. We found that disease phenotypes tend to have lower diversity than those of symptom phenotypes in terms of MGD (median: 75.39 vs 115.16, P-value = 9.03E-06) and MGE (median: 162 vs 277, P-value = 4.58E-13) (Fig. 4c, d). For example, the diseases, such as bronchitis (213.7), asthma (213.7), and rhinitis (153.3), have lower MGDs than those of cough (241.1), which are three typical causes of chronic cough^40,41. The lower MD of disease phenotypes could partially explain their advantages as diagnostic schema in modern biomedicine.

Clinical symptom clusters hold approach for specific molecular network mechanisms

To resolve the non-specificity of symptom phenotypes, many contemporary diagnoses owe their existence to symptom cluster which has been defined as two or more interrelated symptoms that present together and involve the similar etiology and pathophysiology, such as nephrotic syndrome, irritable bowel syndrome, and chronic fatigue syndrome^42,43,44. Particularly, those symptom clusters with specific underlying common mechanisms have been accepted in clinical practice and frequently used by clinicians today^45,46,47,48. Therefore, we would expect that the common molecular mechanisms involved in symptom clusters would propose an effective approach to reduce the high molecular diversity of a symptom phenotypes. To further validate this assumption, we obtained 1740 symptom pairs (as representations of symptom clusters) with the overlapping genes from SCN, which we found only 704 symptom pairs with symptom-gene association randomization (1740 vs 704, P-value = 3.07E-101). This means that symptom pairs in SCN tend to have shared genes. Next, we obtained the MGDs of symptom pairs in terms of maximum node diversity of their shared genes. We found that symptom pairs tend to have significant lower MGD (median: 108.30 vs 115.16, P-value = 1.8E-04) and MGE (median: 222 vs 277, P-value = 3.14E-08) than those of single symptoms. Particularly, the proportions of MGD (4.94% vs 12.68%) and MGE (41.38% vs 55.46%) in high value (i.e., >=250) are lower in symptom pairs than in single symptoms (Fig. 4d). These results confirmed the significance of symptom clusters as a feasible solution to acquire specific understanding of disease conditions.

Case study: insomnia symptom clusters

Insomnia is a typical chronic disorder and symptom phenotypes that has both diverse underlying molecular mechanisms and can cause various psychiatric and physical health problems^49,50. It has also been considered a strong risk factor of psychiatric illness, such as anxiety disorder, major depressive disorder⁵¹, and associated with many types of metabolic disease^52,53, obstructive airway disease⁵⁴, and cancer⁵⁵. To investigate the underlying molecular mechanisms of specific symptom cluster, we identified 72 insomnia symptom pairs from 1740 clusters with overlapping genes. A total of 11 systems are involved in insomnia-related symptoms, which 36.2% of symptoms related to neurological and physiological systems, such as abdominal pain, amnesia, and dysphoria (Supplementary Fig. 2). We found 19 insomnia pairs with co-occurrence > =15 in DDTS, including the pairs of (insomnia, dysphoria), (insomnia, dizzy), and (insomnia, poor appetite) (Table 3). Moreover, we obtained the overlapped enriched KEGG⁵⁶ pathways (P-value < 0.05) between these symptoms and insomnia to explore the shared molecular mechanisms of these insomnia pairs (see Methods). The number of enriched overlapped pathways of insomnia-related symptom pairs range from 1 to 49. Fever, fatigue, and amnesia have great overlapping pathways and co-occurrence with insomnia, which reflected the high diversity of these insomnia symptom pairs from both phenotype and molecular mechanisms (Table 3). For example, there are many reasons for insomnia patients with fever, such as influenza⁵⁷, tuberculosis⁵⁸, pneumonia⁵⁹, tumors⁶⁰, and neurological disorders⁶¹, which would be involved in various molecular pathways, including the immune system pathway (e.g., intestinal immune network for IgA production and intestinal immune network for IgA production), signal transduction pathway (e.g., cAMP signaling pathway and AMPK signaling pathway), and infectious disease pathway (e.g., Influenza A and Tuberculosis) (Fig. 6).

Table 3 The basic molecular features of insomnia symptom cluster (sorted by the co-occurrences).

Full size table

**Fig. 6: The overlapped pathways of insomnia symptom clusters.**

Particularly, using hierarchical agglomerative clustering analysis (by the cluster map function in the Python Seaborn library)⁶², we identified 54 enriched pathways of 22 pathogenesis types and 5 main symptom clusters, such as (insomnia, fever, rash), (insomnia, body pain, emaciation, fatigue), (insomnia, loose stools, poor appetite), (insomnia, night sweats, headache), and (insomnia, constipation, emotional lability) for insomnia disorder (Fig. 6). For example, the overlapped pathways of insomnia-fever-rash cluster are involved in immune and infectious disease (e.g., herpes simplex infection). The related report that sleep–wake cycles have emerged as prominent regulators of the immune system and variations in sleep duration that occur in the natural setting have the potential to impact infectious disease risk⁶³. The patient of insomnia-body pain-emaciation-fatigue cluster are associated with cancer^64,65, and the related pathways include dysregulation of cancer transcriptional regulation. Other insomnia patients often show constipation and emotional lability after taking drugs⁶⁶, and the pathways are related to the substance dependence, such as amphetamine addiction, alcoholism, and cocaine addiction.

In addition, we have extracted the PPI networks of the 5 insomnia-related symptom clusters (Fig. 7 and Supplementary Figs. 3–6) and obtained the enriched gene ontology terms of biological process (GO_BP) of the overlapping genes for each cluster (Table 4 and Supplementary Tables 3–5). We found that insomnia-fever-rash symptom cluster includes the cytokines (e.g., IL6, IL10, and IL1B) and inflammatory biomarkers (e.g., PIK3R1, STAT3, and TNF) as the hub genes in their associated PPI network and tends to be related to the inflammatory immune-related insomnia subtype involving the biological processes, such as B-cell differentiation, antigen processing and presentation, and cytokine-mediated signaling pathway (Fig. 7 and Table 4). We also found that genes in the network, such as PTGS2 and PTGS1, are targeted by a variety of nonsteroidal anti-inflammatory drugs (NSAIDs), including dexibuprofen, mefenamic acid, and bufexamac to improve symptoms of fever, rash, and insomnia^67,68,69. It is similar and biomedical meaningful for the other 4 insomnia-related symptom clusters.

**Fig. 7: Construction the PPI network of insomnia-fever-rash cluster.**

Table 4 The GO BP of overlapping genes enriched of insomnia-fever-rash cluster.

Full size table

Discussion

Symptom phenotypes are the overt manifestations of disease observed by physicians and patients. However, most symptoms are non-specific and rarely identify a disease unambiguously. In fact, numerous diseases—including some of the most common ones such as cancer, cardiovascular disease, and HIV infection—may manifest unspecific symptoms (e.g., fatigue) in the early stage which often easily be ignored to regard as the asymptomatic phenomenon⁵. Therefore, it is a vital task to elucidate the underlying molecular mechanisms of symptoms, in particular the network mechanisms of them to investigate the pathogenesis of non-specificity of symptom phenotypes. However, the biological mechanisms of symptom phenotypes have rarely been addressed in systematic approach, which might largely be owing to the lack of high-quality symptom-gene associations data.

Here, we curated high-quality symptom-gene associations and quantitatively evaluated the network diversity of symptom phenotypes using a well-established network measure (i.e., node diversity). The results showed that the degree of un-specificity of symptoms could be represented by node diversity and we further found that the clinical diversity of symptom phenotypes could be partially explained by the molecular network diversity of symptom phenotypes (significant positive correlation between MGD and PD was detected; PCC = 0.49, P-value = 2.14E-08). Furthermore, we evaluated the molecular diversity of diseases and found it is lower than those of symptom phenotypes. These results validate the advantages of disease diagnosis and the reliability of MGD for evaluating the diversity of symptom phenotypes. Overall, our work proposes a feasible approach to evaluate the diversity of symptom phenotypes and it could further be used for “symptom subtyping” as recent literature for establishing the new disease taxonomy⁷⁰.

Particularly, as a recent hot research topic that has been intensively investigated in nursing science⁷¹. Various studies have identified significant symptom clusters (e.g., fatigue, depressive symptoms, and anxiety⁷²) of the typical diseases during the nursing process, such as psychiatric diseases (e.g., depression and anxiety)⁷³, cancer diseases (e.g., breast cancer, gastrointestinal cancer, lung cancer)⁷⁴, and chronic diseases (e.g., chronic kidney disease, chronic obstructive pulmonary disease, type 2 diabetes)^75,76,77. For example, related study found that patients with heart failure (HF) would manifest distinct symptom clusters, the weary (lack of energy, lack of appetite, and difficulty sleeping) and the dyspneic symptom clusters (shortness of breath, difficulty breathing when lying flat, and waking up breathless at night). Each one unit increase in mean distress score in the dyspneic symptom cluster doubled the risk for cardiac death and the risk of cardiac rehospitalization increased by 1.5 times for each one unit increase in mean distress score in the weary symptom cluster⁷⁸. Therefore, it is a promising clinical analysis task to find significant symptom clusters involved in various disease conditions. It also emphasizes the importance of investigating and monitoring of symptom clusters which can help improve the capability of clinical diagnosis, treatment and predict the outcomes in patients rather than individual symptoms. Altogether, symptom clusters have proposed an effective approach for symptom subtyping, which would deliver population stratification with higher specificity than single symptom phenotype. In our study, using the molecular diversity measurement of symptom phenotypes, we further investigate the underlying network mechanisms of symptom clusters and why their clinical specificities could be obtained, which would finally be helpful to detect and understand various symptom subtypes involved in different disease conditions.

There still have several limitations for our work. First, the number of symptom-gene associations is limited, which is mainly owing to the focus of PGA on congenital hereditary diseases. In our study, most of the symptoms with gene associations belong to the nervous system, which would be result in certain deviations. However, the 341 symptoms in our work have covered 180 (46.63%) of symptoms in Medical Subject Heading vocabulary⁶⁷ which was created and updated annually by the NLM since 1960s. This means that our results would deliver some kinds of reliable and useful knowledge for understanding the network mechanisms of the whole spectrum of symptom phenotypes. Second, the disparity of clinical and biomedical terminologies on symptom phenotypes is another obstacle to perform the translational medicine studies as our work. We found that clinical terminologies in clinical settings would tend to be in more specific granularities and the terms in biomedical data would be in higher levels. Therefore, the semantic mapping between different terminologies is a vital task for our study. This is further challenged by the cross-language translation difficulty involving Chinese and English languages. Actually, we have used the symptom cluster data in Chinese to construct the SCN, which would have the constraints of specific language (i.e., Chinese). In addition, the recordings of symptom clusters in Chinese and Chinese population would possibly influence the generalization of our results for other populations. Notwithstanding these plenty of challenges, we are convinced that advances in the field of symptom science will eventually enable us to substantially expand the data sources and thus promote the understanding of symptom phenotypes in the postgenomic era. In the future, we hope to identify novel and effective drug targets for symptom subtypes by incorporating the underlying network mechanisms of symptom diversity, so as to better serve the individualized diagnosis and treatment.

Methods

Basic datasets and preprocessing

We curated both clinical and molecular related data on symptom phenotypes to perform our study, which includes (i) clinical symptom manifestations from textbook, (ii) phenotype-genotype associations, (iii) protein interactome data, and (iiii) drug–targets associations.

Clinical symptom manifestations

We curated the data related to clinical symptoms derived from a well-recognized textbook named DDTS for clinicians in China, which contain 431 investigated symptoms and their symptom clusters (with 988 additional symptoms) in traditional Chinese medicine (TCM) clinical settings. This book is an important part of TCM syndrome differentiation and treatment, which reflects the use of TCM basic theory syndrome differentiation method for subtype analysis of symptoms. The characteristics of the same symptom in different clusters reflect the diversity and complexity of symptom in clinical settings. Therefore, the book could have served as a data source for exploring the diversity of symptoms.

Phenotype–genotype associations

We used an integrated PGA from DisGeNet⁷⁹ and MalaCards⁸⁰, which contains 110,407 associations with 11,362 unique diseases represented by UMLS CUI code and 13,271 unique genes.

Protein–protein interactions

The PPI were filtered from the human subset of STRING V11²³ by the score threshold > =700, which include 17,185 distinct proteins and 420,534 high-quality interactions.

Drug–targets associations

The drug–targets associations obtained from the DrugBank database³⁰, which is a comprehensive online database containing information on drugs and drug targets. Finally, we obtained 948 unique drugs and their 1451 targets for correlation analysis.

Construction of symptom association network

In the DDTS, several established symptom clusters would be associated for each chief symptom. We considered symptom cluster as one record and constructed the SCN by symptom co-occurrence in symptom clusters and visualized by Gephi 0.9.2 software. To connect phenotypic and genetic data of symptoms in SCN, we manually mapped Chinese terms of symptoms in clinical data to English terms of symptoms in PGA by the trained medical researchers (e.g., Zixin Shu, Ning Xu, Chenxia Lu, Runshun Zhang) in our author list, thereby ensuring highly accurate terminological mappings. 252 (73.90%) English symptom terms with associated genes mapped to 116 Chinese symptom terms in SCN. Therefore, there is a phenomenon of multiple CUI code merging corresponding to one TCM symptom, for example, C0035021 and C0015967 were both mapped to发热 (i.e., fever). Finally, we obtained the genetic information of 116 symptoms in SCN by merging the genetic associations of the CUI code symptoms (Supplementary Table 1).

Measuring the phenotypic diversity

We used node diversity¹³ to characterize the diversity of symptom phenotypes in the context of network, which have been successfully used for measuring disease diversity in recent studies^12,70. The diversity ϕ of node j is based on the node bridging coefficient⁸¹ and defined by

$$\phi (j) = \mathop {\sum }\limits_{i\; \in N(i)} \frac{{\delta (i)}}{{k\left( i \right) - 1}}$$

where k (i) is the degree of node i, N (i) denotes its neighborhood, that is, the set of all its direct neighborhood and δ (i) is the total number of links leaving that neighborhood. The diversity ϕ is large for nodes with many neighbors that have out-going links themselves.

To evaluate the MD of phenotypes, we assume the molecular diversity of symptom phenotypes would largely lie on the related genes in the context of molecular network. For example, to quantify the MD (in terms of node diversity) of amnesia, we calculated all the node diversity values for the amnesia-related genes, such as MAPK1, EP300, and APP. Finally, we considered the MD of amnesia as 299.95 since we found that MAPK1 has the maximum node diversity of 299.95 among those genes. Furthermore, it is intuitively that node degree also could be considered as additional measure for molecular diversity.

Shortest paths length between drug targets and symptom genes

Shortest paths are an important topological measurement for the analysis of social and biological networks¹². Here, we utilize Dijkstra’s algorithm⁸² to find all shortest path lengths between drug targets and genes of symptom in the PPI network to help obtain 1-order drug targets and their related drugs for a given symptom phenotypes.

Enrichment analysis

In order to identify molecular pathways and biological processes that could be impacted by the gene variations of each symptom cluster we used enrichment analysis. Pathway analysis offers the great power for discovering the biological functions underlying genes and proteins. The KEGG PATHWAY database is the main database in Kyoto Encyclopedia of Genes and Genomes (KEGG), and it consists of manually drawn reference pathway maps together with organism specific pathway maps⁵⁶. Gene set enrichment analysis is a method of identifying classes of genes or proteins that are over-represented in a large set of genes or proteins and may be associated with disease phenotypes. We obtained the enriched KEGG pathways and gene ontology terms of biological process using the database for annotation, visualization, and integrated discovery (DAVID)⁸³, which is a web-based online bioinformatics resource that aims to provide tools for the functional interpretation of large lists of genes/proteins.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All the relevant data supporting the findings of this study are included in the paper and its Supplementary material files.

Code availability

The source codes and data are available at: https://github.com/shuzixin9212/symptom-diversity. The codes including the construction of the SCN and the calculation of node diversity. Node diversity algorithm was implemented using Java JDK 1.8. The other data analysis tasks were implemented using Python 3.7. In addition, we also provide two types of source files, (1) the clinical data used to construct SCN, the mapping data between Chinese and English terms of symptom phenotypes; (2) the symptom-gene associations and protein–protein interaction network data.

References

Hickey, K. T., Bakken, S., Byrne, M. W., Bailey, D. & Grady, P. A. Corrigendum to precision health: advancing symptom and self-management science. Nurs. Outlook 68, 139–140 (2020).
Article CAS PubMed Google Scholar
Ca Shion, A. K., Gill, J., Hawes, R., Henderson, W. A. & Saligan, L. National institutes of health symptom science model sheds light on patient symptoms. Nurs. Outlook 64, 499–506 (2016).
Article Google Scholar
Yon, K., Nettleton, S., Walters, K., Lamahewa, K. & Buszewicz, M. Junior doctors’ experiences of managing patients with medically unexplained symptoms: a qualitative study. Bmj Open 5, e009593 (2015).
Article PubMed PubMed Central Google Scholar
Haller, H., Cramer, H., Lauche, R. & Dobos, G. Somatoform disorders and medically unexplained symptoms in primary care. Dtsch. Ärzteblatt Int. 112, 279–287 (2015).
Google Scholar
Chew-Graham, C. A. Medically unexplained symptoms: continuing challenges for primary care. Br. J. Gen. Pract. 67, 106–107 (2017).
Article PubMed PubMed Central Google Scholar
Dodd, M. J., Miaskowski, C. & Lee, K. A. Occurrence of symptom clusters. JNCI Monogr. 32, 76–78 (2004).
Article Google Scholar
Lu, K. Z. et al. Integrated network analysis of symptom clusters across disease conditions. J. Biomed. Inform. 107, 103482 (2020).
Article PubMed Google Scholar
Torta, R. & Munari, J. Symptom cluster: depression and pain. Surg. Oncol. 19, 155–159 (2010).
Article PubMed Google Scholar
Maron, B. A. et al. A global network for network medicine. npj Syst. Biol. Appl. 6, 1–3 (2020).
Article Google Scholar
Jones, P. J., Alexandre, H. & Mcnally, R. J. Commentary: a network theory of mental disorders. Front. Psychol. 8, 1305 (2017).
Article PubMed PubMed Central Google Scholar
Wu, Y. et al. Symmap: an integrative database of traditional chinese medicine enhanced by symptom mapping. Nucleic Acids Res. 47, D1110–D1117 (2018).
Article PubMed Central Google Scholar
Zhou, X. Z., Menche, J., Barabási, A. & Sharma, A. Human symptoms–disease network. Nat. Commun. 5, 1–10 (2014).
Article Google Scholar
Kitagawa, H., Ishikawa, Y., Li, W. j. & Watanabe, C. Database systems for advanced applications: 15th international conference, dasfaa 2010, Tsukuba, Japan, April 1–4, 2010, Proceedings, part I (Springer,Tsukuba, 2010).
Yang, K. Heterogeneous network embedding for identifying symptom candidate genes. J. Am. Med. Inform. Assoc. 25, 1452–1459 (2018).
Article PubMed PubMed Central Google Scholar
Bodenreider, O. The unified medical language system (umls): integrating biomedical terminology. Nucleic Acids Res. 32, D267–D270 (2004).
Article CAS PubMed PubMed Central Google Scholar
Lee, H., Kwon, A., Kim, H.-S. & Lee, J.-S. Fructose-1,6-bisphosphatase deficiency presented with complex febrile convulsion. Neuro Endocrinol. Lett. 39, 533–536 (2019).
PubMed Google Scholar
Yu, S., Xing, L., Du, Z., Tian, Y. & Li, C. Prevalence of obesity and associated risk factors and cardiometabolic comorbidities in rural northeast china. BioMed. Res. Int. 2019, 1–9 (2019).
Google Scholar
Font-Clos, F., Zapperi, S. & Porta, C. L. Integrative analysis of pathway deregulation in obesity. Npj Syst. Biol. Appl. 3, 18 (2017).
Article PubMed PubMed Central Google Scholar
Whetzel, P. L. et al. Bioportal: enhanced functionality via new web services from the national center for biomedical ontology to access and use ontologies in software applications. Nucleic Acids Res. 39, W541–W545 (2011).
Article CAS PubMed PubMed Central Google Scholar
Yao, N. Differential Diagnosis of tcm Syndromes 2nd edn (People’s Medical Publishing House, 2002).
Schiller, L. R., Pardi, D. S. & Sellin, J. H. Chronic diarrhea: diagnosis and management. Clin. Gastroenterol. Hepatol. 15, 182–193 (2017).
Article PubMed Google Scholar
D’Amico, F., Baumgart, D. C., Danese, S. & Peyrin-Biroulet, L. Diarrhea during covid-19 infection: pathogenesis, epidemiology, prevention and management. Clin. Gastroenterol. Hepatol. 18, 1663–1672 (2020).
Article PubMed PubMed Central Google Scholar
Damian, S. et al. String v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2018).
Google Scholar
Gerschütz, A., Heinsen, H., Grünblatt, E., Wagner, A. K. & Monoranu, C. M. Neuron-specific alterations in signal transduction pathways associated with alzheimer’s disease. J. Alzheimers Dis. Jad. 40, 135–142 (2014).
Article PubMed Google Scholar
Chang, K. W., Zong, H. F., Rizvi, M. Y., Ma, K. G. & Qian, Y. H. Modulation of the mapks pathways affects aβ-induced cognitive deficits in alzheimer’s disease via activation of α7nachr. Neurobiol. Learn. Mem. 168, 107154 (2020).
Article CAS PubMed Google Scholar
Kopaliani, I., Egorov, D., Tugtekin, S. M., Matschke, K. & Deussen, A. The endothelial angiotensin ii type 1 receptor/akt1 axis mediates vascular remodeling during hypertension. FASEB J. 34, 1–1 (2020).
Article Google Scholar
Izumiya, Y. et al. Fast/glycolytic muscle fiber growth reduces fat mass and improves metabolic parameters in obese mice. Cell Metab. 7, 159–172 (2008).
Article CAS PubMed PubMed Central Google Scholar
Yves, B. et al. All akt isoforms (akt1, akt2, akt3) are involved in normal hearing, but only akt2 and akt3 are involved in auditory hair cell survival in the mammalian inner ear. PLoS ONE 10, e0121599 (2015).
Article Google Scholar
Kabraji, S. et al. Akt1low quiescent cancer cells in ductal carcinoma in situ of the breast. npj Breast Cancer 5 (2019).
Wishart, D. S. et al. Drugbank 5.0: a major update to the drugbank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018).
Article CAS PubMed Google Scholar
Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
Article CAS PubMed Google Scholar
Melnikov, A. Y. et al. Effectiveness of reslip (doxylamine) in short-term insomnia: multicenter comparative randomized study. Zh . nevrologii i psikhiatrii Im. SS Korsakova 117, 56–59 (2017).
Article Google Scholar
McElroy, H. et al. Comparison of the effect of lemborexant and other insomnia treatments on driving performance: a systematic review and meta-analysis. Sleep. Adv. 2, zpab010 (2021).
Article Google Scholar
Frase, L., Nissen, C., Riemann, D. & Spiegelhalder, K. Making sleep easier: pharmacological interventions for insomnia. Expert Opin. Pharmacother. 19, 1465–1473 (2018).
Article CAS PubMed Google Scholar
Peyrin-Biroulet, L., Loftus, E. V., Colombel, J. F. & Sandborn, W. J. The natural history of adult crohn’s disease in population-based cohorts. Am. J. Gastroenterol. 105, 289–297 (2010).
Article PubMed Google Scholar
Torruellas, C., French, S. W. & Medici, V. Diagnosis of alcoholic liver disease. World J. Gastroenterol. 20, 11684–11699 (2014).
Article PubMed PubMed Central Google Scholar
Vasileios, M. & Athanasios, A. Biomarkers for alzheimer’s disease diagnosis. Curr. Alzheimer Res. 14, 1149–1154 (2017).
Google Scholar
Zhang, H., Zhen, Z., Arjudeb, M. & Shen, B. Molecular diagnosis and classification of inflammatory bowel disease. Expert Rev. Mol. Diagn. 18, 867–886 (2018).
Article CAS PubMed Google Scholar
William et al. The 2015 world health organization classification of lung tumors: impact of genetic, clinical and radiologic advances since the 2004 classification. J. Thorac. Oncol. 10, 1243–1260 (2015).
Article Google Scholar
Satia, I., Ba Dri, H., Woodhead, M., O’Byrne, P. & Smith, J. The interaction between bronchoconstriction and cough in asthma. Thorax 48, PA4192 (2016).
Google Scholar
Filippo, P. D., Scaparrotta, A., Petrosino, M. I., Attanasi, M. & Mohn, A. An underestimated cause of chronic cough: the protracted bacterial bronchitis. Ann. Thorac. Med. 13, 7–13 (2018).
Kang, H. G. & Cheong, H. I. Nephrotic syndrome: what’s new, what’s hot? Korean J. Pediatr. 58, 275–282 (2015).
Article CAS PubMed PubMed Central Google Scholar
Rodiño-Janeiro, B. K., Vicario, M., Alonso-Cotoner, C., Pascua-García, R. & Santos, J. A review of microbiota and irritable bowel syndrome: future in therapies. Adv. Ther. 35, 289–310 (2018).
Article PubMed PubMed Central Google Scholar
Naviaux, R. K., Naviaux, J. C., Li, K., Bright, A. T. & Gordon, E. Metabolic features of chronic fatigue syndrome. Proc. Natl Acad. Sci. USA. 113, E5472 (2016).
CAS PubMed PubMed Central Google Scholar
Tang, Y. A. et al. Symptom cluster of icu nurses treating covid-19 pneumonia patients in wuhan, china. J. Pain. Symptom Manag. 60, e48–e53 (2020).
Article Google Scholar
Kristine, K. et al. Randomized controlled trial of a brief cognitive-behavioral strategies intervention for the pain, fatigue, and sleep disturbance symptom cluster in advanced cancer. Psycho Oncol. 27, 2761–2769 (2018).
Article Google Scholar
Bjerkeset, E., Rhrl, K. & Schou-Bredal, I. Symptom cluster of pain, fatigue, and psychological distress in breast cancer survivors: prevalence and characteristics. Breast Cancer Res. Treat. 180, 63–71 (2020).
Article CAS PubMed PubMed Central Google Scholar
Aronowitz & Robert, A. When do symptoms become a disease? Ann. Intern. Med. 134, 803 (2001).
Article CAS PubMed Google Scholar
Levenson, J. C., Kay, D. B. & Buysse, D. J. The pathophysiology of insomnia. Chest 147, 1179–1192 (2015).
Article PubMed PubMed Central Google Scholar
Burman, D. Sleep disorders: insomnia. FP Essent. 460, 22–28 (2017).
PubMed Google Scholar
Blake, M. J., Trinder, J. A. & Allen, N. B. Mechanisms underlying the association between insomnia, anxiety, and depression in adolescence: implications for behavioral sleep interventions. Clin. Psychol. Rev. 63, 25 (2018).
Article PubMed Google Scholar
Chapman, J. L. et al. Is metabolic rate increased in insomnia disorder? A systematic review. Front. Endocrinol. 9, 374 (2018).
Article Google Scholar
Nakamura, M. & Nagamine, T. Neuroendocrine, autonomic, and metabolic responses to an orexin antagonist, suvorexant, in psychiatric patients with insomnia. Innov. Clin. Neuroence 14, 30–37 (2017).
Google Scholar
Alexandra, N., Nadja, B. D., Hans-Hartmut, P. & Susanne, W. Psychophysiological insomnia and respiratory tract infections: results of an infection-diary-based cohort study. Sleep. 42, zsz098 (2019).
Ge, L., Guyatt, G., Tian, J., Pan, B. & Yang, K. Insomnia and risk of mortality from all-cause, cardiovascular disease, and cancer: systematic review and meta-analysis of prospective cohort studies. Sleep. Med. Rev. 48, 101215 (2019).
Article PubMed Google Scholar
Minoru, K., Yoko, S., Masayuki, K., Miho, F. & Mao, T. Kegg as a reference resource for gene and protein annotation. Nucleic Acids Res. 44, D457–D462 (2016).
Article Google Scholar
Javanian, M. et al. A brief review of influenza virus infection. J. Med. Virol. 93, 4638–4646 (2021).
Article CAS PubMed Google Scholar
Storla, D. G., Yimer, S. & Bjune, G. A. A systematic review of delay in the diagnosis and treatment of tuberculosis. BMC Public Health 8, 1–9 (2008).
Article Google Scholar
Shu, Z. et al. Add-on chinese medicine for coronavirus disease 2019 (accord): a retrospective cohort study of hospital registries. Am. J. Chin. Med. 49, 543–575 (2021).
Article CAS PubMed Google Scholar
Fiorentino, L. & Ancoli-Israel, S. Sleep dysfunction in patients with cancer. Curr. Treat. Options Neurol. 9, 337–346 (2007).
Article PubMed PubMed Central Google Scholar
Provini, F., Lombardi, C. & Lugaresi, E. Insomnia in Neurological Diseases and Disorders (Humana, 2010).
Emberti Gialloreti, L., Enea, R., Di Micco, V., Di Giovanni, D. & Curatolo, P. Clustering analysis supports the detection of biological processes related to autism spectrum disorder. Genes 11, 1476 (2020).
Article PubMed Central Google Scholar
Irwin, M. R. Sleep and infectious disease risk. Sleep 35, 1025–1026 (2012).
Article PubMed PubMed Central Google Scholar
Rambod, M., Pasyar, N. & Shamsedini, M. The effect of reflexology on fatigue, pain, and sleep quality in lymphoma patients: a clinical trial. Eur. J. Oncol. Nurs. 43, 101678 (2019).
Article PubMed Google Scholar
Nishiura, M., Tamura, A., Nagai, H. & Matsushima, E. Assessment of sleep disturbance in lung cancer patients: relationship between sleep disturbance and pain, fatigue, quality of life, and psychological distress. Palliat. Supportive Care 13, 575–581 (2015).
Article Google Scholar
Yayla, E. M., Yavuz, E., Bilge, U., Keskin, A. & Binen, E. Drugs with anticholinergic side-effects in primary care. Niger. J. Clin. Pract. 18, 18–21 (2015).
CAS PubMed Google Scholar
Kim, C. K. et al. Dexibuprofen for fever in children with upper respiratory tract infection. Pediatr. Int. 55, 443–449 (2013).
Article CAS PubMed Google Scholar
Sailaja, A. K. & Lola, V. S. Formulation of mefenamic acid loaded polymeric nanoparticles for the treatment of rheumatoid arthritis. J. Bionanoscience 12, 177–183 (2018).
Article Google Scholar
Zhu, X. J., Gang, X. V. & Yu, K. M. Efficacy of combination of bufexamac cream with hydrocortisone butyrate cream in the treatment of eczema. Chin. J. Dermatovenereol. 9 (2010).
Zhou, X. et al. A systems approach to refine disease taxonomy by integrating phenotypic and molecular networks. Ebiomedicine 31, 79–91 (2018).
Article PubMed PubMed Central Google Scholar
National Research Council (U.S.). Committee on A Framework for Developing a New Taxonomy of Disease. Toward Precision Medicine: Building a Knowledge Network for Biomedical Research and a new Taxonomy of Disease (National Academies Press, 2011).
Fiorentino, L., Rissling, M., Liu, L. & Ancoli-Israel, S. The symptom cluster of sleep, fatigue and depressive symptoms in breast cancer patients: severity of the problem and treatment options. Drug Discov. Today Dis. Models 8, 167–173 (2012).
Article Google Scholar
Aktas, A., Walsh, D. & Rybicki, L. Symptom clusters: myth or reality? Palliat. Med. 24, 373–385 (2010).
Kwekkeboom, K. L. Cancer symptom cluster management. Semin. Oncol. Nurs. 32, 373–382 (2016).
Article PubMed PubMed Central Google Scholar
Jhamb, M. et al. Comparison of fatigue, pain, and depression in patients with advanced kidney disease and cancer—symptom burden and clusters. J. Pain Symptom Manage. 57 (2019).
Bradlee et al. Symptom clusters in chronic obstructive pulmonary disease: a systematic review - sciencedirect. Appl. Nurs. Res. 45, 23–29 (2019).
Article Google Scholar
Ann, J. Z., Bose, E., Park, J., Danet, M. L.-B. & Alexandra, A. G. Diabetes changes symptoms cluster patterns in persons living with hiv. J. Assoc. Nurses AIDS Care 28, 888–896 (2017).
Article Google Scholar
Song, E. K., Moser, D. K., Rayens, M. K. & Lennie, T. A. Symptom clusters predict event-free survival in patients with heart failure. J. Cardiovasc. Nurs. 25, 284–291 (2010).
Article PubMed PubMed Central Google Scholar
Piero, J., Ramírez-Anguita, J., Saüch-Pitarch, J., Ronzano, F. & Furlong, L. I. The disgenet knowledge platform for disease genomics: 2019 update. Nucleic Acids Res. 48 (2019).
Noa, R. et al. Malacards: an amalgamated human disease compendium with diverse clinical and genetic annotation and structured search. Nucleic Acids Res. 45, D877–D887 (2017).
Hwang, W., Kim, T., Ramanathan, M. & Zhang, A. Bridging centrality: graph mining from element level to group level. Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, 336–344.
Cormen, T. H., Leiserson, C. E., Rivest, R. L. & Stein, C. Introduction to Algorithms (MIT press, 2009).
Huang, D. W. et al. David bioinformatics resources: expanded annotation database and novel algorithms to better extract biology from large gene lists. Nucleic Acids Res. 35, W169–W175 (2007).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work is partially supported by the National Key Research and Development Program (Nos. 2017YFC1703506, 2017YFC1703505, 2017YFC1703502, 2020YFC0841600, and 2020YFC0845000-4), the Natural Science Foundation of Beijing (Nos. M21012), the Fundamental Research Funds for the Central Public Welfare Research Institutes (Nos. ZZ10-005 and 2018JBZ006), the National Major Scientific and Technological Special Project (2017ZX09301018) and Key Technologies R & D Program of China Academy of Chinese Medical Sciences (CI2021A03808).

Author information

These authors contributed equally: Zixin Shu, Jingjing Wang, Hailong Sun.

Authors and Affiliations

Institute of Medical Intelligence, School of Computer and Information Technology, Beijing Jiaotong University, Beijing, 100063, China
Zixin Shu, Jingjing Wang, Hailong Sun & Xuezhong Zhou
The First Affiliated Hospital of Henan University of Chinese Medicine (Co-construction Collaborative Innovation Center for Chinese Medicine and Respiratory Diseases by Henan, Henan University of Chinese Medicine), Zhengzhou, 450046, China
Ning Xu
Hubei Provincial Hospital of Traditional Chinese Medicine (Affiliated Hospital of Hubei University of Traditional Chinese Medicine, Hubei Academy of Traditional Chinese Medicine), Wuhan, 430061, China
Chenxia Lu & Xiaodong Li
Guang’anmen Hospital, China Academy of Chinese Medical Sciences, Beijing, 100053, China
Runshun Zhang
China Academy of Chinese Medical Sciences, Beijing, 100700, China
Baoyan Liu

Authors

Zixin Shu
View author publications
You can also search for this author in PubMed Google Scholar
Jingjing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hailong Sun
View author publications
You can also search for this author in PubMed Google Scholar
Ning Xu
View author publications
You can also search for this author in PubMed Google Scholar
Chenxia Lu
View author publications
You can also search for this author in PubMed Google Scholar
Runshun Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaodong Li
View author publications
You can also search for this author in PubMed Google Scholar
Baoyan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xuezhong Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X Zhou, X Li, R Zhang, and B Liu conceived the study. Z Shu, N Xu, and C Lu collected and processed the data. Z Shu, J Wang, and H Sun analyzed the data. Z Shu, J Wang, and X Zhou drafted and revised the manuscript. All authors have proofread the manuscript. Z shu, J Wang, and H Sun are considered “co-first author”.

Corresponding author

Correspondence to Xuezhong Zhou.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shu, Z., Wang, J., Sun, H. et al. Diversity and molecular network patterns of symptom phenotypes. npj Syst Biol Appl 7, 41 (2021). https://doi.org/10.1038/s41540-021-00206-5

Download citation

Received: 16 April 2021
Accepted: 01 November 2021
Published: 30 November 2021
DOI: https://doi.org/10.1038/s41540-021-00206-5

Subjects

Abstract

Similar content being viewed by others

LeMeDISCO is a computational method for large-scale prediction & molecular interpretation of disease comorbidity

Network analysis to identify symptoms clusters and temporal interconnections in oncology patients

A network model of depressive and anxiety symptoms: a statistical evaluation

Introduction

Results

High-quality symptom-gene associations

Clinical diversity of symptom phenotypes

Molecular network diversity of symptom phenotypes

Molecular network diversity (symptom vs disease phenotypes)

Clinical symptom clusters hold approach for specific molecular network mechanisms

Case study: insomnia symptom clusters

Discussion

Methods

Basic datasets and preprocessing

Clinical symptom manifestations

Phenotype–genotype associations

Protein–protein interactions

Drug–targets associations

Construction of symptom association network

Measuring the phenotypic diversity

Shortest paths length between drug targets and symptom genes

Enrichment analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links