Disease clusters subsequent to anxiety and stress-related disorders and their genetic determinants

Han, Xin; Shen, Qing; Hou, Can; Yang, Huazhen; Chen, Wenwen; Zeng, Yu; Qu, Yuanyuan; Suo, Chen; Ye, Weimin; Fang, Fang; Valdimarsdóttir, Unnur A.; Song, Huan

doi:10.1038/s41467-024-45445-2

Article
Open access
Published: 08 February 2024

Nature Communications volume 15, Article number: 1209 (2024)

Abstract

Anxiety/stress-related disorders have been associated with multiple diseases, whereas a comprehensive assessment of the structure and interplay of subsequent associated diseases and their genetic underpinnings is lacking. Here, we first identify 136, out of 454 tested, medical conditions associated with incident anxiety/stress-related disorders attended in specialized care using a population-based cohort from the nationwide Swedish Patient Register, comprising 70,026 patients with anxiety/stress-related disorders and 1:10 birth year- and sex-matched unaffected individuals. By combining findings from the comorbidity network and disease trajectory analyses, we identify five robust disease clusters to be associated with a prior diagnosis of anxiety/stress-related disorders, featured by predominance of psychiatric disorders, eye diseases, ear diseases, cardiovascular diseases, and skin and genitourinary diseases. These five clusters and their featured diseases are largely validated in the UK Biobank. GWAS analyses based on the UK Biobank identify 3, 33, 40, 4, and 16 significantly independent single nucleotide polymorphisms for the link to the five disease clusters, respectively, which are mapped to several distinct risk genes and biological pathways. These findings motivate further mechanistic explorations and aid early risk assessment for cluster-based disease prevention among patients with newly diagnosed anxiety/stress-related disorders in specialized care.

Introduction

Anxiety and stress-related disorders are among the most common mental disorders, with a regional variation in prevalence globally (i.e., from 5.3% to 10.4%)¹ and a pooled lifetime prevalence of ~12.9%². With shared clinical symptoms and neurobiological features, anxiety and stress-related disorders are considered highly correlated: they have historically been placed in the same diagnostic category³, and similarities in their genetic architecture have been demonstrated in familial coaggregation^4,5 and genome-wide association studies (GWAS)⁶.

Individuals with anxiety and stress-related disorders usually have intermittent, recurring symptom episodes throughout life and, as a result, experience impaired mental and physical functioning, increased rates of disability, and higher-than-expected mortality^7,8. The causes of the excess disability and mortality include a considerable range of medical conditions, such as other psychiatric disorders (e.g., depression)⁹, metabolic diseases^10,11, cardiovascular disease^12,13,14, autoimmune disease^15,16, and infections¹⁷. However, previous studies have mostly focused on specific groups of diseases and, to the best of our knowledge, no comprehensive assessment of disease clusters arising subsequent to anxiety and stress-related disorders has been conducted.

Although the literature is still scant, it suggests a role of stress-related genetic loci in the development of cardiovascular disease¹⁸ and mortality¹⁹ after exposure to terrorist attacks or social adversity. Likewise, a recent GWAS study revealed that both anxiety and stress-related disorders are genetically correlated with multiple obesity-related phenotypes⁶, promoting studies on the genetic basis of adverse health consequences after anxiety and stress-related disorders. However, as a wide range of medical conditions have been associated with a prior diagnosis of anxiety and stress-related disorders, exploration of patients’ genetic susceptibilities to a single disease has limited clinical implications. The recent advances in human disease network methodology, e.g., disease trajectory²⁰ and comorbidity network²¹, provide new means to comprehensively summarize the possible sets of diseases (i.e., disease clusters with close temporal or non-temporal relationships) following a predetermined phenotype^22,23. Furthermore, with the notion that diseases located in the same cluster should have shared or linked biological mechanisms, the identification of cluster-specific, instead of disease-specific, genetic variants has the potential to help prevent a general further health decline among patients with anxiety or stress-related disorders.

Therefore, taking advantage of the nationwide population and health registers in Sweden as well as the community-based health records and genetic information available in the UK Biobank, we aimed to identify major clusters of subsequent medical conditions after a diagnosis of anxiety and stress-related disorders. We further aimed to elucidate the underlying genetic determinants associated with those identified disease clusters.

Results

Baseline characteristics

Based on the specialized diagnoses from the nationwide Swedish Patient Register, we first included a Swedish cohort comprising 70,026 patients at first diagnosis of anxiety or stress-related disorders from 2001 to 2016 and ten randomly selected birth year- and sex-matched unaffected individuals per patient using incidence density sampling (N = 700,260), with no history of other psychiatric disorders or severe somatic diseases, as the exploratory dataset (Fig. 1, Supplementary Fig. 1, Supplementary Data 1). To validate the identified disease clusters in the Swedish cohort, we constructed a UK cohort (i.e., the validation dataset) based on the UK Biobank, including 23,365 patients diagnosed with anxiety or stress-related disorders between 1997 and 2019 from inpatient and primary care, and 233,596 unaffected participants individually matched by sex and year of birth (Supplementary Fig. 2). Females were overrepresented in both the Swedish and the UK cohorts (62.7% and 67.3%, respectively), and the median age at diagnosis of anxiety and stress-related disorders was similar in the two cohorts (Table 1). The median follow-up time was 7.1 and 13.3 years in the Swedish cohort and UK cohort, respectively. Patients in both cohorts had lower educational and income levels than their matched unaffected individuals.

Table 1 Baseline characteristics of study participants from Swedish exploration cohort and UK validation cohort

Identification of associated disease clusters

In the Swedish cohort, 183 medical conditions (among 454 tested) had a prevalence of ≥0.5% and 136 were positively associated with a prior diagnosis of anxiety or stress-related disorders (Supplementary Data 2). The top HRs were noted for other psychiatric disorders, including personality and behavior disorder (HR [95% confidence intervals]: 16.7 [15.0–18.5]), sedatives or hypnotics abuse (16.0 [14.1–18.2]), and other mood disorder (13.4 [12.1–14.9]). Somatic medical conditions with the highest HRs were other headache syndromes (2.8 [2.6–3.0]), irritable bowel syndrome (2.6 [2.3–2.8]), and other functional intestinal disorders (2.3 [2.2–2.4]). In addition, one negative association was noted between anxiety/stress-related disorders and varicose veins of lower extremities (0.93 [0.89–0.98]).

Subsequently, we identified 433 and 97 disease pairs to construct a comorbidity network and disease trajectory after anxiety or stress-related disorders, respectively (details of the disease pair identification in Supplementary Fig. 3 and results in Supplementary Data 3-4). The analysis of the comorbidity network identified seven modules, characterized by their predominant components related to psychiatric disorders, eye diseases, ear diseases, cardiovascular disease, genitourinary diseases, musculoskeletal diseases, and cerebrovascular diseases (Fig. 2A). Figure 2B shows an overview of the disease trajectories after anxiety or stress-related disorders. The medical conditions that were listed immediately after anxiety or stress-related disorders in chronological order (i.e., D1) included disorders of sense organs, genitourinary diseases, cardiovascular disease, and psychiatric disorders.

**Fig. 2: Comorbidity network and disease trajectory after diagnosis of anxiety and stress-related disorders.**

We determined stable disease clusters by merging the findings from the comorbidity network and disease trajectory analyses together (Fig. 3), which consequently led to five clusters including 31 medical conditions (Fig. 4A, B). Cluster 1 denoted a link from anxiety or stress-related disorders to depression and alcohol abuse, and further to obesity and several other psychiatric disorders. Cluster 2 and Cluster 3 denoted a link to eye and ear diseases. Cluster 4 included mainly cardiovascular disease, with a direct link to hypertensive disorders, ischemic heart diseases, and angina pectoris. Cluster 5 was predominated by genitourinary diseases as the first affected medical conditions, and further linked to skin diseases.

**Fig. 3: Identification of disease cluster featured by psychiatric disorders by combination of disease trajectory and comorbidity network.**

**Fig. 4: Disease clusters and disease list among individuals with anxiety and stress-related disorders.**

Among the 31 medical conditions identified from the Swedish cohort, 19 medical conditions were validated in the UK cohort (i.e., statistically significantly associated with a prior diagnosis of anxiety or stress-related disorders, Supplementary Data 5). When validating the disease clusters, we identified 170 possible disease pairs and selected 55 disease pairs to be included in the comorbidity network analysis (Supplementary Data 6). All five disease clusters were replicated in the UK cohort, although the clusters predominated by psychiatric disorders and by ear diseases in the Swedish cohort were merged into one cluster in the UK cohort (Supplementary Table 1).

Genetic determinants of associated disease clusters

To identify the potential genetic determinants for each disease cluster among patients with anxiety or stress-related disorders, we first calculated five cluster-specific quantitative scores as an index of each individual’s susceptibility to each disease cluster, and then performed GWAS analyses for the five susceptibility scores separately, among individuals from the UK cohort with eligible genotyping data (n = 27,781, Supplementary Fig. 2), using mixed linear models (MLMs). GWAS analyses identified 3, 33, 40, 4, and 16 independent single nucleotide polymorphisms (SNPs) for clusters featured by psychiatric disorders, eye diseases, ear diseases, cardiovascular diseases, and skin and genitourinary diseases, respectively (Table 2). The full lists of the SNPs, their mapped genes, and enriched biological pathways are presented in Supplementary Data 7–9. According to the genomic inflation results from linkage disequilibrium (LD) score regression, we found little indication of confounding effects in the GWAS of the five disease clusters (Supplementary Table 2). We found 20 (e.g., AP000304.12, ATP5O, MRPS6), 362 (e.g., ZFAND1, CHMP4C, SNX16), 440 (e.g., C10orf88, PSTK, TEX36), 30 (e.g., EFNA5, VAV3, SLC25A24) and 229 (e.g., BCHE, ZBBX, OR6S1) mapped genes for the five associated disease clusters, which were enriched in several biological pathways, topped by GO:0098660 inorganic ion transmembrane transport, WP5224 2q37 copy number variation syndrome, GO:0048545 response to steroid hormone, M5884 NABA CORE MATRISOME, and GO:0097484 dendrite extension, respectively. Based on information from FUMA and GeneCards, we found that several cluster-specific genes have been previously associated with individual psychiatric or somatic traits in the disease cluster (e.g., PRPF38B for angina pectoris and myocardial infarction, Supplementary Data 10).

Table 2 Genetic determinants for five disease clusters associated with anxiety and stress-related disorders

When comparing genes and pathways crossing different disease clusters, we found several common genes (e.g., AGAP1, AOAH, C8orf59 for both clusters featured by eye and ear diseases, Fig. 5A), although no common pathways. Further protein–protein interaction (PPI) analysis identified ten MCODE components (e.g., MCODE components featured by pathways of ‘Transmembrane signal transduction’ and ‘Signaling by G protein-coupled receptor (GPCR)’, Fig. 5B) shared by disease clusters of eye, ear and skin and genitourinary diseases.

**Fig. 5: Genetic overlap between five disease clusters associated with anxiety and stress-related disorders.**

In sensitivity analyses using the disease clusters and their component medical conditions that can be validated in the UK cohort alone (Table 2 and Supplementary Data 11), we found some identical genetic determinants. Specifically, for the cluster predominated by eye diseases, 11 independent SNPs (e.g., rs578045293, rs574810100, rs192296901) and 115 mapped genes (e.g., DMRTA2, FAF1, CDKN2C) were identified in both genetic analyses. For the cluster predominated by ear diseases, five independent SNPs (e.g., rs113248357, rs193072412, rs556283981), 183 mapped genes (e.g., RHCE, TMEM57, LDLRAP1), and two biological pathways (i.e., R-HSA-3700989 Transcriptional Regulation by TP53, R-HSA-1475029 Reversible hydration of carbon dioxide) were identified in both.

Furthermore, comparing the mapped genes for each disease cluster among individuals with anxiety/stress-related disorders (n = 27,781) to those obtained among individuals without anxiety/stress-related disorders (n = 452,148, Supplementary Data 12), we found only a small proportion of overlapping genes (i.e., 5 and 2 genes for clusters predominated by ear diseases and cardiovascular disease, respectively), indicating that few identified genetic hits were driven by the disease clusters only.

Subgroup analyses

We found largely similar disease clusters when using the entire Swedish cohort, although the clusters dominated by eye diseases, ear diseases, and cardiovascular disease were merged into one cluster (Supplementary Fig. 4 and Supplementary Table 3). When exploring the disease clusters following anxiety and stress-related disorders separately, we found an additional disease cluster predominated by cerebrovascular diseases for both disorders and another cluster predominated by musculoskeletal diseases for stress-related disorders (Supplementary Figs. 5, 6). We obtained similar disease clusters among females as in the main analysis, although two clusters predominated by cerebrovascular diseases and digestive diseases were also identified (Supplementary Fig. 7). The cluster predominated by ear diseases was not identified but a disease cluster predominated by musculoskeletal diseases was noted among males (Supplementary Fig. 8).

Discussion

Leveraging the nationwide health registers in Sweden and the large community-based UK Biobank, our study revealed the most comprehensive picture of subsequent disease clusters following a diagnosis of anxiety and stress-related disorders. The five distinct disease clusters, featured by psychiatric disorders, eye diseases, ear diseases, cardiovascular diseases, and skin and genitourinary diseases, were discovered in the Swedish cohort and validated in the UK cohort. Furthermore, based on individual-level genotyping data in the UK cohort, we identified several distinct genetic determinants for the five disease clusters, as well as genetic components involved in the GPCR signaling pathway that were shared between multiple disease clusters. As a novel attempt to conceptualize associated disease clusters, these findings shed light on the biological basis, both shared and cluster-specific, of the diverse health consequences that follow a diagnosis of anxiety and stress-related disorders. They could aid mechanistic explorations and facilitate risk surveillance (e.g., precise risk assessment) and management (e.g., development of targeted interventions) to prevent health decline among patients with newly diagnosed anxiety and stress-related disorders.

Over the past decade, accumulated evidence has suggested a positive link between anxiety or stress-related disorders and a number of medical conditions. Previous studies have, however, often relied on small samples²⁴ with incomplete follow-up²⁵ and focused on a single outcome/disease^11,26. Our study therefore fills these knowledge gaps through a data-driven approach that includes virtually all medical conditions after the diagnosis of anxiety and stress-related disorders. As a result, we managed to identify five key clusters of disease associated with a prior diagnosis of anxiety or stress-related disorders (with component diseases in the same system or across different systems), considering the temporal order and high intrinsic connectivity between diseases and with validation across two populations. Although evidence on the risk of these disease clusters after anxiety and stress-related disorders was scarce, our findings gain support from previous studies reporting associations between anxiety or stress-related disorders and individual diseases. For instance, anxiety and stress-related disorders have previously been reported to co-occur with other psychiatric disorders (e.g., major depressive episode, bipolar disorder, and alcohol dependence)²⁷. Additionally, a population-based cohort study covering 5.9 million people in Denmark reported increased risks of 31 somatic medical conditions, including cardiovascular disease, vision problems, and hearing problems, following a prior diagnosis of neurotic disorders (including anxiety and stress-related disorders)²⁸. In our previous population-based cohort study in Sweden, we also noted risk increases of 16 specific cardiovascular diseases among patients with stress-related disorders¹². In addition, several diseases in the disease cluster predominated by skin and genitourinary diseases have been reported to be associated with anxiety or stress-related disorders, such as urinary infection²⁹, irregular menstruation³⁰, and premenstrual syndrome³¹. We found largely similar disease clusters following a diagnosis of anxiety and stress-related disorders, with an additional cluster of musculoskeletal diseases noted among patients with stress-related disorders. Several studies have reported similar associations between post-traumatic stress disorder (PTSD) and arthritis, although using self-reported data^32,33. Furthermore, we found little role of age and sex in the identification of most disease clusters, suggesting that the risk of developing these disease clusters is largely independent of age and sex.

Prior attempts to illustrate disease networks include a recent Danish study based on national data from inpatient and outpatient care, which established a browser presenting the disease trajectories both before and after a target disease of interest³⁴. Using this browser, we found that some key diseases in our study, such as cardiovascular diseases identified subsequent to stress-related disorders, were listed prior to stress-related disorders. However, as the comparability of these two studies is limited (due to the different research purposes and study designs), the inconsistent results do not necessarily invalidate each other.

With the notion that diseases with high levels of connectivity (i.e., in a disease cluster) may share common pathological mechanisms with possibly the same affected genes and biological pathways, we consider it reasonable to focus on disease clusters, instead of each individual disease, for the purpose of genetic determinant identification and for the future development of disease prevention strategies. Despite the lack of comparable results from studies of similar design, the findings of our cluster-specific genetic analyses were in line with prior studies. For instance, some mapped genes for susceptibility to the disease cluster featured by psychiatric disorders (e.g., AMOTL1, CWC15, KDM4D) were reported to be associated with mood disorders³⁵ as well as attention deficit hyperactivity disorder (ADHD) and conduct disorder³⁶. IQCB1 and GOLGB1, the genes identified for the disease cluster dominated by eye diseases, have been associated with corneal resistance factor (a measure of the biomechanical properties of the cornea)³⁷, while CHDH and FILIP1L have been associated with ocular axial length³⁸. Additionally, some mapped genes for the disease cluster dominated by cardiovascular disease have been demonstrated as risk genes for diastolic blood pressure (COL23A1 and PHYKPL)³⁹ or stroke (COL23A1, PHYKPL, and ADAMTS2)^40,41. Nevertheless, we found several risk genes (e.g., ATP5O, POLQ, RHCE) and biological pathways not discussed in the existing literature, mainly involved in the regulation of molecular transport, cellular metabolic processes, and the structure and function of proteins. Notably, in addition to genes and pathways that were linked to a specific disease cluster, we identified biological components related to GPCR signaling that may contribute to the development of multiple disease clusters (i.e., clusters featured by eye, ear, and skin and genitourinary diseases). GPCR signaling has been widely reported to play a role in responses to stress⁴², inflammatory response⁴³, and development and drug targets for multiple diseases⁴⁴ (e.g., dry eye disease⁴⁵, allergic conjunctivitis⁴⁶, and urinary tract infection⁴⁷), which may indicate the key shared pathways and mechanisms linking anxiety and stress-related disorders to subsequent sequelae. Collectively, if verified, the findings of our study might provide additional insights into why patients diagnosed with anxiety and stress-related disorders face a general health decline, with large variations in developed disease outcomes. Regardless, our use of a susceptibility score for a disease cluster, rather than for a single disease, might limit the comparison of findings between the present study and previous studies.

Our efforts to identify disease clusters and their genetic determinants were conceptual and were largely based on accumulating evidence of the existence of disease networks and their shared biological mechanisms^20,21,48, with the potential to aid in the development of cost-effective health promotion strategies for this vulnerable population. For instance, medications indicated for the genes/pathways within each disease cluster could be further tested for effectiveness in reducing risks of further disease development among individuals with anxiety and stress-related disorders. Other major strengths of our study include the inclusion of two large population- and community-based cohorts with long and complete follow-up data collected prospectively and independently, which largely minimized information and selection biases, and enabled the ascertainment of disease clusters using data from two distinct populations. Furthermore, the combination of disease trajectory and comorbidity network analyses to ascertain disease clusters enhanced the reliability of the connectivity and temporal order between disease pairs in each disease cluster. Last, the availability of enriched phenotypic and genotypic data, together with the application of comprehensive analytic strategies, including PheWAS, disease network analysis, and genetic analysis, for the first time, led to a comprehensive illustration of health consequences in relation to anxiety and stress-related disorders from phenotypic to genetic levels. This analytic strategy could be applied as a pipeline for studying comorbidities of other phenotypes.

Several limitations should be acknowledged. First, given the lack of complete primary care data in the Swedish Patient Register and the UK Biobank, as well as the lack of outpatient care data in the UK Biobank, we might have underestimated the number of patients with anxiety and stress-related disorders as well as the number of studied medical conditions, primarily the milder forms of these diseases. Therefore, disease cluster identification based on a more comprehensive data source to validate the findings of the present study is warranted. Second, although we excluded patients with a history of other psychiatric disorders and severe somatic diseases and started the follow-up from six months after the index date, we cannot rule out the possibility that some pre-existing diseases other than anxiety and stress-related disorders might have contributed to the identified disease clusters. Third, the identification of disease clusters relied on the results of association analyses (i.e., PheWAS analysis). Although confirmed by using two methods and validated in the UK cohort, the lack of data on important confounders in the health register data (e.g., lifestyle and environmental factors) can raise the concern of residual confounding. This also applies to the noted negative association of anxiety/stress-related disorders with varicose veins of lower extremities. With few supportive data from existing literature, such a finding needs to be validated in future studies. Last, our findings may not be generalized to other populations with non-European ancestry or different healthcare systems than in Sweden and the UK.

In conclusion, based on detailed phenotypic and genetic analyses of two large-scale cohorts, we identified five distinct disease clusters subsequent to an inpatient/outpatient diagnosis of anxiety and stress-related disorders, featured by other psychiatric disorders, eye diseases, ear diseases, cardiovascular diseases, and skin and genitourinary diseases as predominant diseases in each cluster. We further identified a list of genetic variants and biological pathways linking anxiety and stress-related disorders, specifically or commonly, to those identified disease clusters, contributing to a better understanding of the underlying mechanisms.

Methods

Study design

The analytic process included two parts, namely phenotypic and genetic analyses (Fig. 1). In the phenotypic analysis, we undertook a phenome-wide association study (PheWAS), followed by both comorbidity network analysis and disease trajectory analysis to determine robust disease clusters (i.e., associated diseases with both temporal and non-temporal relationships) following a diagnosis of anxiety or stress-related disorders in the Swedish cohort (i.e., the exploratory dataset). To validate identified disease clusters and the diseases they entailed, we performed similar analyses in the UK cohort, which had a similar study design (i.e., the validation dataset). Regarding the genetic analyses based on individual-level genotyping data of the UK cohort, we first calculated a cluster-specific susceptibility score, which was designated as a quantitative index of individuals’ susceptibility to a specific disease cluster, and then performed GWAS analysis, gene mapping, and enrichment analysis to identify risk genes and biological pathways that may account for the pathogenesis of such a disease cluster after anxiety or stress-related disorders.

Swedish cohort

The Swedish Patient Register includes nearly complete health records of inpatient care since 1987 and outpatient specialist care since 2001 in Sweden⁴⁹. By cross-linkage to the Total Population Register using the unique Swedish personal identification numbers, we included all Swedish-born individuals residing from 2001 to 2016 in Sweden and excluded those with any pre-existing psychiatric disorders or history of severe somatic diseases at the time of diagnosis determined by the Charlson Comorbidity Index before 2001⁵⁰, leading to a study population of 8,456,485 (Supplementary Fig. 1). We focused on Swedish-born individuals in the present study to reduce the heterogeneity in genetic background as well as other sociodemographic factors, including differential health-seeking behaviors. Among these, we identified all individuals who received a first primary diagnosis in specialized care of anxiety or stress-related disorders from 2001 to 2016 (N = 212,767, 63.3% with anxiety disorder), and a set of ten unaffected individuals randomly selected from the study base per exposed patient, individually matched on sex and birth year using incidence density sampling (N = 2,127,670), without history of other psychiatric disorders and severe somatic diseases. The diagnostic date of anxiety or stress-related disorders was used as the index date for the start of follow-up of both the exposed patients and their matched unaffected individuals.
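The matching step described above can be illustrated with a minimal sketch of incidence density sampling. This is not the authors' code: the `Person` record, its field names, and the `sample_controls` helper are hypothetical, and the only rules taken from the text are matching on sex and birth year, sampling up to ten unaffected individuals per patient, and requiring that a matched individual be unaffected at the patient's index date.

```python
import random
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Person:
    # Hypothetical record layout, for illustration only.
    pid: int
    sex: str
    birth_year: int
    diagnosis_date: Optional[date] = None  # first anxiety/stress-related diagnosis
    exit_date: Optional[date] = None       # death or emigration, if any


def at_risk(person, when):
    """True if the person is still in the study base and undiagnosed at `when`."""
    undiagnosed = person.diagnosis_date is None or person.diagnosis_date > when
    present = person.exit_date is None or person.exit_date > when
    return undiagnosed and present


def sample_controls(case, population, k=10, rng=None):
    """Draw up to k matched individuals from the risk set at the case's index date."""
    rng = rng or random.Random(0)  # fixed seed only to make the sketch repeatable
    risk_set = [
        p for p in population
        if p.pid != case.pid
        and p.sex == case.sex
        and p.birth_year == case.birth_year
        and at_risk(p, case.diagnosis_date)
    ]
    return rng.sample(risk_set, min(k, len(risk_set)))
```

Under this scheme a person diagnosed later can still serve as an unaffected match before their own diagnosis, which is why follow-up of matched individuals is censored at their first diagnosis.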

Follow-up

To minimize the concern of reverse causality, we followed all participants of the Swedish cohort for all medical conditions from 6 months after the index date until death, first diagnosis of anxiety or stress-related disorders (for matched unexposed individuals), emigration, or the end of the study period (i.e., 31 December 2016), whichever occurred first.
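As a rough illustration of this follow-up rule, the sketch below computes one participant's follow-up window. It is an assumption-laden example rather than the study's implementation: the function name and arguments are hypothetical, and "6 months" is approximated as 183 days because the text does not say how the lag was computed.

```python
from datetime import date, timedelta

STUDY_END = date(2016, 12, 31)  # end of the Swedish study period, per the text
LAG = timedelta(days=183)       # assumption: "6 months" approximated as 183 days


def follow_up_window(index_date, death=None, emigration=None,
                     first_diagnosis=None, study_end=STUDY_END):
    """Return (start, end) of follow-up, or None if no person-time remains.

    `first_diagnosis` applies to matched unexposed individuals only: it is the
    date of their own first anxiety or stress-related diagnosis, if any.
    """
    start = index_date + LAG
    events = [d for d in (death, emigration, first_diagnosis) if d is not None]
    end = min(events + [study_end])  # whichever occurred first
    if end <= start:
        return None  # censored before the lagged start of follow-up
    return start, end
```

For the UK cohort the same logic would apply with loss to follow-up in place of emigration and 31 December 2019 as `study_end`.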

Ascertainment of anxiety and stress-related disorders and subsequent medical conditions

In the Swedish cohort, we defined anxiety or stress-related disorders as any first specialist care diagnosis in an inpatient or outpatient hospital visit, where these disorders were identified as the primary discharge diagnosis, according to the Swedish Patient Register, using codes from the Swedish version of the 10th revision of the International Classification of Diseases (ICD-10) (anxiety: F40 and F41, stress-related disorder: F43) (Supplementary Table 1). In the PheWAS, medical conditions refer to any disease or health outcomes recorded in the Patient Register comprising inpatient and outpatient diagnoses. We ascertained medical conditions through the primary diagnosis from the Patient Register, using the 3-digit ICD-10 codes (A00 to N99) (Supplementary Table 1). The diagnostic codes in the Patient Register have been validated, showing satisfactory accuracy, with positive predictive values (PPVs) of 85–95% for most common diseases⁴⁹, 81% for social anxiety disorder⁵¹, and 75–90% for PTSD⁵². We obtained the highest level of education and income in the year of the index date from the Swedish Longitudinal Integration Database for Health Insurance and Labor Market⁵³.

UK cohort

The validation dataset (UK cohort) was constructed based on the UK Biobank, using a similar design (Supplementary Fig. 2). The UK Biobank (UKB) is a community-based cohort study that enrolled half a million participants aged 40–69 at recruitment between 2006 and 2010 across England, Scotland, and Wales. Details of the study design are described elsewhere⁵⁴. The inpatient hospital data, obtained from the Hospital Episode Statistics database, the Scottish Morbidity Record, and the Patient Episode Database, cover all UK Biobank participants since 1997⁵⁵. The primary care data, provided by various general practitioner computer system suppliers, cover ~45% of participants since 1985⁵⁵. We first excluded individuals who had withdrawn from the UK Biobank (n = 108) or had conflicting information (n = 1), leaving 502,398 eligible participants (Supplementary Fig. 2). Among these, we constructed a matched cohort, including patients with a new inpatient or primary care diagnosis of anxiety or stress-related disorders between January 1, 1997 and December 31, 2019 (N = 23,365) who had no history of severe somatic diseases or other psychiatric disorders, and up to ten unaffected individuals for each patient who were randomly selected and individually matched by sex and year of birth using incidence density sampling (N = 233,596). The diagnostic date of anxiety or stress-related disorders was used as the index date for the start of follow-up of the exposed and matched unaffected individuals.

We followed all participants of the UK cohort for all medical conditions from 6 months after the index date until death, first diagnosis of anxiety or stress-related disorders (for matched unexposed individuals), loss to follow-up⁵⁶, or the end of the study period (i.e., 31 December 2019), whichever occurred first.
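The follow-up rule described above can be sketched as a small helper. This is an illustrative sketch rather than the study code: the function name, the argument names, and the approximation of "6 months" as 183 days are assumptions introduced here.

```python
from datetime import date, timedelta

STUDY_END = date(2019, 12, 31)   # end of the study period stated in the text
LAG = timedelta(days=183)        # assumption: "6 months" approximated as 183 days


def follow_up_window(index_date, death=None, exposure_dx=None, lost=None,
                     study_end=STUDY_END):
    """Return (start, end) of follow-up, or None if no person-time is contributed.

    exposure_dx is the first anxiety/stress-related diagnosis of a matched
    unexposed individual; it is left as None for exposed patients.
    """
    start = index_date + LAG
    # Follow-up ends at the earliest of the censoring events that occurred.
    end = min(d for d in (death, exposure_dx, lost, study_end) if d is not None)
    if end <= start:
        return None
    return start, end
```

For example, `follow_up_window(date(2010, 1, 1), death=date(2015, 5, 1))` returns a window that ends on the date of death, whereas an individual who dies before the 6-month lag has elapsed contributes no follow-up.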

In the UK cohort, we defined a new diagnosis of anxiety or stress-related disorder as a first primary diagnosis based on the inpatient and primary care data, using the ICD-10 codes for the inpatient hospital data (Supplementary Data 1) and the version 2 and version 3 Read codes (i.e., Read v2 and Read v3) for the primary care data (Supplementary Table 4). Medical conditions were ascertained from the primary and secondary diagnoses in the inpatient hospital data (Supplementary Data 1). Information on the highest educational level and the Townsend Deprivation Index (a proxy for socioeconomic status, with a higher index score indicating a higher degree of deprivation)⁵⁷ was collected at recruitment through questionnaires.

Statistical analyses

The age distribution differed between the Swedish cohort and the UK cohort (median age at diagnosis 32 versus 52 years). To facilitate validation between the two cohorts, we selected a sub-cohort of the Swedish cohort, namely participants whose age exceeded the second tertile (median age at index date = 51, N = 70,026, Table 1), and used this sub-cohort as the exploration dataset throughout the main analyses. A sensitivity analysis was performed using the entire Swedish cohort with all age groups.

Exploration of associated disease clusters in the Swedish cohort

In the exploration dataset (Swedish cohort), we identified a total of 454 medical conditions diagnosed at least six months after the diagnosis of anxiety or stress-related disorders. To ensure statistical power, we included only medical conditions with a prevalence ≥0.5% among patients with anxiety or stress-related disorders. We performed a PheWAS to investigate the associations between anxiety or stress-related disorders and each medical condition, using Cox regression models stratified by the matching variables (i.e., sex and birth year) and adjusted for highest education and income. When estimating the association with a given medical condition, individuals with a prior diagnosis of that condition were excluded. Only medical conditions with statistically significant positive associations after adjustment for multiple testing (hazard ratio [HR] > 1 and false discovery rate [FDR]-adjusted p value [i.e., q value] < 0.05) were included in the following analyses.
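The selection rule (HR > 1 and q value < 0.05) can be illustrated with a short sketch. The text does not name the FDR procedure, so the Benjamini-Hochberg step-up method is assumed here; the function names and the toy input are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg q values in the same order as the input p values."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotone q values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


def select_conditions(results, alpha=0.05):
    """Keep codes with HR > 1 and q < alpha.

    results: list of (code, hazard_ratio, p_value) tuples (hypothetical layout).
    """
    q_values = benjamini_hochberg([p for _, _, p in results])
    return [code for (code, hr, _), q in zip(results, q_values)
            if hr > 1 and q < alpha]
```

A condition with a significant inverse association (HR < 1) is dropped by the same rule, because only positive associations are carried forward.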

Among the medical conditions identified in the PheWAS, we constructed all possible pairs of disease 1 (D1) and disease 2 (D2) and analyzed only disease pairs that co-occurred with a prevalence ≥0.25% among patients with anxiety or stress-related disorders. To quantify comorbidity strength, we calculated the relative risk (RR) and Pearson’s correlation for binary variables (Φ-correlation) for each disease pair. For each disease pair, a sub-cohort was formed by excluding patients with a history of D1 or D2 before their index date (i.e., the diagnosis date of anxiety or stress-related disorders). RR and Φ-correlation were calculated using the following formulas:

$$RR_{ij}=\frac{C_{ij}N_{ij}}{C_{i}C_{j}}$$

$$\Phi_{ij}=\frac{C_{ij}N_{ij}-C_{i}C_{j}}{\sqrt{C_{i}C_{j}(N_{ij}-C_{i})(N_{ij}-C_{j})}}$$

where $C_{ij}$ is the number of patients affected by both D1 and D2, $N_{ij}$ is the number of individuals in the sub-cohort, and $C_{i}$ and $C_{j}$ are the numbers of patients affected by D1 and D2, respectively. For both measures, significance against the null hypotheses RR = 1 and Φ = 0 can be assessed with a z-test (given the large sample size in our study). The corresponding z-scores for RR and Φ-correlation were calculated using the following formulas²¹,⁵⁸:

$$z_{ij}^{\mathrm{RR}}=\frac{\ln(RR_{ij})}{\sqrt{\frac{1}{C_{ij}}-\frac{1}{N_{ij}}+\frac{1}{C_{i}C_{j}/N_{ij}}-\frac{1}{N_{ij}}}}$$

$$z_{ij}^{\Phi}=\frac{\Phi_{ij}\sqrt{\max(C_{ij},\,C_{j})-2}}{\sqrt{1-\Phi_{ij}^{2}}}$$

P values were then calculated from the z-scores and adjusted for multiple testing. Only disease pairs with strong comorbidity strength (i.e., RR > 1, Φ-correlation > 0, and q value < 0.05) were included in the comorbidity network and disease trajectory analyses.
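The four formulas above translate directly into code. The following is a minimal sketch that implements them as written, with a two-sided normal p value derived from each z-score; the function names are introduced here for illustration and the counts in the example are invented.

```python
import math


def comorbidity_strength(c_ij, c_i, c_j, n_ij):
    """Return (RR, phi, z_RR, z_phi) for one disease pair.

    c_ij: patients with both D1 and D2; c_i, c_j: patients with D1, D2;
    n_ij: individuals in the pair-specific sub-cohort.
    """
    rr = c_ij * n_ij / (c_i * c_j)
    phi = (c_ij * n_ij - c_i * c_j) / math.sqrt(
        c_i * c_j * (n_ij - c_i) * (n_ij - c_j))
    # Standard error of ln(RR), as in the z-score formula for RR.
    se_ln_rr = math.sqrt(1 / c_ij - 1 / n_ij + 1 / (c_i * c_j / n_ij) - 1 / n_ij)
    z_rr = math.log(rr) / se_ln_rr
    z_phi = phi * math.sqrt(max(c_ij, c_j) - 2) / math.sqrt(1 - phi ** 2)
    return rr, phi, z_rr, z_phi


def two_sided_p(z):
    """Two-sided p value of a z-score under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2))
```

With invented counts of 50 co-occurrences among 200 and 300 cases in a sub-cohort of 10,000, `comorbidity_strength(50, 200, 300, 10000)` gives an RR of about 8.3 and a positive Φ; when the observed co-occurrence equals its expectation under independence, ln(RR) and Φ are both zero and the p values equal 1.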

In the comorbidity network analysis, we used logistic regression to determine the magnitude of the association within each disease pair with strong comorbidity strength (i.e., a significant non-temporal relationship). Disease pairs with a confirmed positive association (i.e., odds ratio [OR] > 1 and q value < 0.05) were selected to construct a comorbidity network. The comorbidity network was then subdivided into comorbidity modules with high intrinsic connectivity, determined by the Louvain clustering algorithm⁵⁹. For the disease trajectory analysis, binomial tests were used to assess the temporal direction (i.e., D1 → D2 or D2 → D1 among D1D2 pairs) among disease pairs with strong comorbidity strength (i.e., a significant temporal relationship). For each disease pair with a determined temporal order, we constructed a nested case-control dataset in the sub-cohort, considering D2 as the outcome and D1 as the exposure. For each patient with D2, at most two controls were matched by sex and birth year using incidence density sampling. To confirm the magnitude of the association between the disease pair, we then used conditional logistic regression, adjusting for education level and Townsend Deprivation Index. We then included the disease pairs with positive associations (OR > 1 and q value < 0.05) to construct the disease trajectory.
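The directionality step can be illustrated with an exact binomial test on the number of patients in whom D1 preceded D2 versus the reverse. This is a sketch under assumptions not stated in the text: a two-sided test against a probability of 0.5, a significance threshold of 0.05 applied to the unadjusted p value, and exclusion of patients diagnosed with both diseases on the same date. The function names are hypothetical.

```python
import math


def binomial_two_sided_p(k, n, p=0.5):
    """Exact two-sided binomial p value.

    Sums the probabilities of all outcomes that are no more likely than the
    observed count k (with a small tolerance for floating-point ties).
    """
    pmf = [math.comb(n, i) * p ** i * (1 - p) ** (n - i) for i in range(n + 1)]
    observed = pmf[k]
    return min(1.0, sum(x for x in pmf if x <= observed * (1 + 1e-7)))


def temporal_direction(n_d1_first, n_d2_first, alpha=0.05):
    """Return 'D1->D2', 'D2->D1', or None when no direction is supported."""
    n = n_d1_first + n_d2_first
    if n == 0:
        return None
    if binomial_two_sided_p(n_d1_first, n) >= alpha:
        return None
    return "D1->D2" if n_d1_first > n_d2_first else "D2->D1"
```

With 70 patients diagnosed with D1 first and 30 with D2 first, the sketch returns `"D1->D2"`; with a 52 versus 48 split it returns `None`, so the pair would not enter the trajectory.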

As disease trajectory analysis is designed to visualize sequential disease progression while comorbidity network analysis captures disease groups with high intrinsic connectivity, the combined use of these two data-driven approaches can theoretically lead to the identification of more reliable disease clusters (i.e., groups of diseases with both temporal and non-temporal relationships). Thus, based on results from the aforementioned disease trajectory and comorbidity network analyses, we defined disease clusters as the first-layer diseases (D1) and their subsequent diseases in a disease trajectory that were also located within the same comorbidity module (Fig. 3). For example, in the disease trajectories, out of all diseases in the first layer, “F32” and “F10” were located in one comorbidity module (i.e., the module predominated by psychiatric disorders) derived from the comorbidity network. We then identified the diseases that followed “F32” and “F10” in the trajectories and were also located in this comorbidity module; together, these diseases constituted a disease cluster (i.e., the disease cluster featured by psychiatric disorders, including “E66”, “F10”, “F13”, “F19”, “F20”, “F30”, “F32”, “F39”, “F60”, and “F90”).
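The cluster definition can be sketched as a graph traversal. The sketch assumes that "subsequent diseases" means every disease reachable from a first-layer disease along directed trajectory edges, and that membership is then restricted to the comorbidity module of that first-layer disease. The function name, the data layout, and the toy edges in the usage note are hypothetical and are not the study's results.

```python
from collections import defaultdict, deque


def disease_clusters(first_layer, trajectory_edges, module_of):
    """Group first-layer diseases and their downstream diseases by module.

    first_layer: iterable of first-layer disease codes
    trajectory_edges: iterable of (D1, D2) directed pairs
    module_of: dict mapping disease code -> comorbidity module label
    """
    successors = defaultdict(set)
    for d1, d2 in trajectory_edges:
        successors[d1].add(d2)

    clusters = defaultdict(set)
    for root in first_layer:
        module = module_of[root]
        clusters[module].add(root)
        seen, queue = {root}, deque([root])
        while queue:  # breadth-first walk along the trajectory
            node = queue.popleft()
            for nxt in successors[node]:
                if nxt in seen:
                    continue
                seen.add(nxt)
                queue.append(nxt)
                # Keep only diseases in the same module as the first-layer disease.
                if module_of.get(nxt) == module:
                    clusters[module].add(nxt)
    return dict(clusters)
```

In a toy trajectory where "F32" leads to "F60" and "I25", and only "F60" shares the psychiatric module with "F32", the psychiatric cluster contains "F32" and "F60" but not "I25".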

Validation of associated disease clusters in the UK cohort

To validate the identified disease clusters in the UK cohort, we used Cox models to assess the associations between anxiety and stress-related disorders and each medical condition of the identified disease cluster, comparing affected patients to their matched unaffected individuals. The models were stratified by matching variables (i.e., sex and birth year) and adjusted for highest educational level and Townsend Deprivation Index. Only medical conditions with statistically significant positive associations were included in the comorbidity network analysis (same as described above) to construct disease clusters in the UK cohort. Trajectory analysis was not performed in the UK cohort due to the lack of complete primary care and outpatient data.

Genetic determinants of associated disease clusters using data from the UK Biobank

In the UK cohort, a cluster-specific “susceptibility score” was calculated to quantify the subsequent risk of each disease cluster for each patient with anxiety or stress-related disorder. The susceptibility score was defined as the number of diseases in a given disease cluster with which an individual had been diagnosed, according to the inpatient hospital data.
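As a minimal sketch, the score is a count of distinct cluster diseases per patient. It assumes that the diagnosis codes passed in are already truncated to 3-digit ICD-10 codes and restricted to the relevant follow-up period; the function name and the example patient are hypothetical.

```python
def susceptibility_scores(diagnosed_codes, clusters):
    """Count, per cluster, how many of its diseases the patient was diagnosed with.

    diagnosed_codes: iterable of 3-digit ICD-10 codes for one patient
    clusters: dict mapping cluster name -> iterable of ICD-10 codes
    """
    diagnosed = set(diagnosed_codes)  # repeated diagnoses count once
    return {name: len(diagnosed & set(codes)) for name, codes in clusters.items()}


# Psychiatric cluster as listed in the text; other clusters are built the same way.
PSYCHIATRIC = ["E66", "F10", "F13", "F19", "F20", "F30", "F32", "F39", "F60", "F90"]
```

For a hypothetical patient with diagnoses "F32", "F10", "I10", and a repeated "F32", `susceptibility_scores(codes, {"psychiatric": PSYCHIATRIC})` yields a psychiatric score of 2.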

The quality control comprised two parts. For quality control of individuals from the UK Biobank, we removed individuals of non-European ancestry, with inconsistent sex, or with sex chromosome aneuploidy. For quality control of the individual-level genetic data, we first removed SNPs with an imputation quality score <0.8, a minor allele frequency <0.001, or deviation from Hardy-Weinberg equilibrium (p < 1 × 10⁻¹⁰)⁶⁰. We also removed SNPs in the extended major histocompatibility complex (MHC) region (chr6: 25–34 Mb), considering the long-range linkage disequilibrium (LD) and special genetic architecture of this region. After this standard GWAS quality control on the individual-level genotyping data⁶⁰, we included 22,781 patients (out of the 23,365 patients) and 13,225,429 variants for further analysis.
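The variant-level filters can be expressed as a single predicate. This is an illustrative sketch, not the PLINK or GCTA commands used in the study: the `Variant` fields are hypothetical names, and the MHC boundaries are taken as 25,000,000 to 34,000,000 bp inclusive, an assumption about how "25–34 Mb" is applied.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    rsid: str
    chrom: str     # chromosome label, e.g. "6"
    pos: int       # base-pair position
    info: float    # imputation quality score
    maf: float     # minor allele frequency
    hwe_p: float   # Hardy-Weinberg equilibrium test p value


MHC_CHROM, MHC_START, MHC_END = "6", 25_000_000, 34_000_000


def passes_qc(v):
    """Apply the thresholds stated in the text to one variant."""
    if v.info < 0.8:
        return False
    if v.maf < 0.001:
        return False
    if v.hwe_p < 1e-10:
        return False
    # Exclude the extended MHC region on chromosome 6.
    if v.chrom == MHC_CHROM and MHC_START <= v.pos <= MHC_END:
        return False
    return True
```

A well-imputed common variant at chr6:30,000,000 fails only the MHC exclusion, whereas the same variant at chr6:40,000,000 passes.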

To assess the association between SNPs and each of the five susceptibility scores (as continuous variables), we used mixed linear model (MLM)-based GWAS analyses, adjusted for sex, birth year, genotyping array, and the first ten principal components⁶¹. Independent significant SNPs with p < 5 × 10⁻⁸ were identified for each genomic locus. The inflation of GWAS analyses was tested by LD score regression⁶². Together with surrounding genomic loci that were identified based on LD structure at r² ≥ 0.6, all the SNPs were further mapped to identify potential genes using the web-based platform FUMA (http://fuma.ctglab.nl/)⁶³. The strategies for gene mapping included positional mapping, expression quantitative trait loci mapping based on the GTEx v8 Project⁶⁴, and chromatin interaction mapping⁶⁵,⁶⁶. Further, these mapped genes were included in the gene-set enrichment analyses based on Metascape (https://metascape.org/) to identify underlying biological pathways for each disease cluster with the following ontology sources: KEGG Pathway, GO Biological Processes, Reactome Gene Sets, Canonical Pathways, CORUM, WikiPathways, and PANTHER Pathway⁶⁷. To further investigate genetic overlap across disease clusters, the abovementioned mapped genes were included in the protein–protein interaction (PPI) network enrichment analysis to identify protein network components using the Molecular Complex Detection (MCODE) algorithm implemented in Metascape⁶⁸, drawing on the following interaction databases: STRING⁶⁹, BioGrid⁷⁰, OmniPath⁷¹, and InWeb_IM⁷². To test whether these identified cluster-specific genes are associated with any of the individual diseases in each disease cluster, we first conducted hypergeometric tests using the GENE2FUNC function of the FUMA web platform. As an alternative approach serving a similar purpose, we searched GeneCards, a gene database providing all annotated and predicted human genes (https://www.genecards.org/), to identify traits that have been previously associated with the top 20 identified genes of each disease cluster.
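For orientation, a hypergeometric over-representation test of the general kind mentioned above can be written in a few lines. This sketch is not FUMA's implementation; the background definition, multiple-testing handling, and argument names are assumptions made for illustration.

```python
import math


def hypergeom_enrichment_p(n_background, n_set, n_query, n_overlap):
    """Upper-tail hypergeometric p value, P(X >= n_overlap).

    n_background: genes in the background
    n_set: background genes annotated to the trait or gene set
    n_query: mapped genes submitted for one disease cluster
    n_overlap: query genes that fall in the gene set
    """
    upper = min(n_set, n_query)
    total = math.comb(n_background, n_query)
    tail = sum(
        math.comb(n_set, k) * math.comb(n_background - n_set, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total
```

With a background of 20 genes, a gene set of 5, and a query of 5 genes that all fall in the set, the p value is 1/15,504; with no overlap required, it is 1.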

In a sensitivity analysis, we repeated the genetic analysis only for the disease clusters (and their components) that were validated in the UK cohort. Additionally, to test whether the identified genes were primarily driven by the studied disease clusters, independent of the prior anxiety/stress-related disorders, we conducted additional GWAS analyses for those disease clusters among individuals without anxiety/stress-related disorders (n = 452,148).

Subgroup analyses

As we used the advanced age group (i.e., >second tertile) of the Swedish cohort in the main analysis, we repeated the phenotypic analyses in the entire Swedish cohort (N = 216,727). To assess whether the results would differ between anxiety and stress-related disorders, we constructed two independent matched cohorts for anxiety and stress-related disorders, separately, and repeated the main analyses to identify disease clusters associated with anxiety disorder and stress-related disorder exclusively. To explore the role of sex in disease clusters, we performed analyses separately for males and females.

The phenotypic analyses were conducted using SAS 9.4 (SAS Institute), R (Version 4.0.2), Python (Version 3.8), and Cytoscape (Version 3.8.0). PLINK (Version 1.9) and GCTA (Version 1.24) were used for genetic analyses. For multiple testing, a q value < 0.05 was considered statistically significant. This study was approved by the Ethical Vetting Board in Stockholm, Sweden (DNRs 2012/1814-31/4 and 2015/1062-32), the NHS National Research Ethics Service (16/NW/0274), and the Biomedical Research Ethics Committee of West China Hospital (2019-1171). The requirement of informed consent for Swedish participants is waived in register-based studies in Sweden, and all the participants in the UK Biobank provided written informed consent before data collection.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The raw data from Swedish registers are protected and not available due to Swedish law. The raw data from the UK Biobank (http://www.ukbiobank.ac.uk/) are available to all researchers upon making an application. Part of this research was conducted using the UK Biobank Resource under Application 54803. Other data or platforms are available to all researchers: FUMA (http://fuma.ctglab.nl/), Metascape (https://metascape.org/), GeneCards (https://www.genecards.org/).

Code availability

As the codes are highly specific to our curated database and may not be universally applicable to others, all codes associated with the current submission are available and can be requested by contacting the corresponding authors.

References

1. Baxter, A. J., Scott, K. M., Vos, T. & Whiteford, H. A. Global prevalence of anxiety disorders: a systematic review and meta-regression. Psychol. Med. 43, 897–910 (2013).
2. Steel, Z. et al. The global prevalence of common mental disorders: a systematic review and meta-analysis 1980-2013. Int. J. Epidemiol. 43, 476–493 (2014).
3. Williamson, J. B., Jaffee, M. S. & Jorge, R. E. Posttraumatic stress disorder and anxiety-related conditions. Continuum 27, 1738–1763 (2021).
4. Telman, L. G. E., van Steensel, F. J. A., Maric, M. & Bogels, S. M. What are the odds of anxiety disorders running in families? A family study of anxiety disorders in mothers, fathers, and siblings of children with anxiety disorders. Eur. Child Adolesc. Psychiatry 27, 615–624 (2018).
5. Hettema, J. M., Neale, M. C. & Kendler, K. S. A review and meta-analysis of the genetic epidemiology of anxiety disorders. Am. J. Psychiatry 158, 1568–1578 (2001).
6. Meier, S. M. et al. Genetic variants associated with anxiety and stress-related disorders: a genome-wide association study and mouse-model study. JAMA Psychiatry 76, 924–932 (2019).
7. Baxter, A. J., Vos, T., Scott, K. M., Ferrari, A. J. & Whiteford, H. A. The global burden of anxiety disorders in 2010. Psychol. Med. 44, 2363–2374 (2014).
8. Olatunji, B. O., Cisler, J. M. & Tolin, D. F. Quality of life in the anxiety disorders: a meta-analytic review. Clin. Psychol. Rev. 27, 572–581 (2007).
9. Choi, K. W., Kim, Y. K. & Jeon, H. J. Comorbid anxiety and depression: clinical and conceptual consideration and transdiagnostic treatment. Adv. Exp. Med. Biol. 1191, 219–235 (2020).
10. Michopoulos, V., Vester, A. & Neigh, G. Posttraumatic stress disorder: a metabolic disorder in disguise? Exp. Neurol. 284, 220–229 (2016).
11. Rosenbaum, S. et al. The prevalence and risk of metabolic syndrome and its components among people with posttraumatic stress disorder: a systematic review and meta-analysis. Metabolism 64, 926–933 (2015).
12. Song, H. et al. Stress related disorders and risk of cardiovascular disease: population-based, sibling controlled cohort study. BMJ 365, l1255 (2019).
13. Batelaan, N. M., Seldenrijk, A., Bot, M., van Balkom, A. J. & Penninx, B. W. Anxiety and new onset of cardiovascular disease: critical review and meta-analysis. Br. J. Psychiatry 208, 223–231 (2016).
14. Emdin, C. A. et al. Meta-analysis of anxiety as a risk factor for cardiovascular disease. Am. J. Cardiol. 118, 511–519 (2016).
15. Song, H. et al. Association of stress-related disorders with subsequent autoimmune disease. JAMA 319, 2388–2400 (2018).
16. Bookwalter, D. B. et al. Posttraumatic stress disorder and risk of selected autoimmune diseases among US military personnel. BMC Psychiatry 20, 23 (2020).
17. Song, H. et al. Stress related disorders and subsequent risk of life-threatening infections: population based sibling controlled cohort study. BMJ 367, l5784 (2019).
18. Holman, E. A. Acute stress and cardiovascular health: is there an ACE gene connection? J. Trauma Stress 25, 592–597 (2012).
19. Cole, S. W. et al. Computational identification of gene-social environment interaction at the human IL6 locus. Proc. Natl Acad. Sci. USA 107, 5681–5686 (2010).
20. Jensen, A. B. et al. Temporal disease trajectories condensed from population-wide registry data covering 6.2 million patients. Nat. Commun. 5, 4022 (2014).
21. Hidalgo, C. A., Blumm, N., Barabasi, A. L. & Christakis, N. A. A dynamic network approach for the study of human phenotypes. PLoS Comput. Biol. 5, e1000353 (2009).
22. Hou, C. et al. Medical conditions associated with coffee consumption: disease-trajectory and comorbidity network analyses of a prospective cohort study in UK Biobank. Am. J. Clin. Nutr. 116, 730–740 (2022).
23. Han, X. et al. Disease trajectories and mortality among individuals diagnosed with depression: a community-based cohort study in UK Biobank. Mol. Psychiatry 26, 6736–6746 (2021).
24. Kubzansky, L. D., Koenen, K. C., Jones, C. & Eaton, W. W. A prospective study of posttraumatic stress disorder symptoms and coronary heart disease in women. Health Psychol. 28, 125–130 (2009).
25. Burg, M. M. et al. Risk for incident hypertension associated with posttraumatic stress disorder in military veterans and the effect of posttraumatic stress disorder treatment. Psychosom. Med. 79, 181–188 (2017).
26. Roy, S. S., Foraker, R. E., Girton, R. A. & Mansfield, A. J. Posttraumatic stress disorder and incident heart failure among a community-based sample of US veterans. Am. J. Public Health 105, 757–763 (2015).
27. McGrath, J. J. et al. Comorbidity within mental disorders: a comprehensive analysis based on 145 990 survey respondents from 27 countries. Epidemiol. Psychiatr. Sci. 29, e153 (2020).
28. Momen, N. C. et al. Association between mental disorders and subsequent medical conditions. N. Engl. J. Med. 382, 1721–1731 (2020).
29. Jiang, T. et al. Posttraumatic stress disorder and incident infections: a nationwide cohort study. Epidemiology 30, 911–917 (2019).
30. Kim, T. et al. Associations of mental health and sleep duration with menstrual cycle irregularity: a population-based study. Arch. Womens Ment. Health 21, 619–626 (2018).
31. Jung, S. J. et al. Posttraumatic stress disorder and development of premenstrual syndrome in a longitudinal cohort of women. Arch. Womens Ment. Health 22, 535–539 (2019).
32. Weisberg, R. B. et al. Nonpsychiatric illness among primary care patients with trauma histories and posttraumatic stress disorder. Psychiatr. Serv. 53, 848–854 (2002).
33. Lauterbach, D., Vora, R. & Rakow, M. The relationship between posttraumatic stress disorder and self-reported health problems. Psychosom. Med. 67, 939–947 (2005).
34. Siggaard, T. et al. Disease trajectory browser for exploring temporal, population-wide disease progression patterns in 7.2 million Danish patients. Nat. Commun. 11, 4952 (2020).
35. Thompson, A. G. et al. Genome-wide association study of behavioural and psychiatric features in human prion disease. Transl. Psychiatry 5, e552 (2015).
36. Anney, R. J. et al. Conduct disorder and ADHD: evaluation of conduct problems as a categorical and quantitative trait in the international multicentre ADHD genetics study. Am. J. Med. Genet. B Neuropsychiatr. Genet. 147B, 1369–1378 (2008).
37. He, W. et al. Association of novel loci with keratoconus susceptibility in a multitrait genome-wide association study of the UK Biobank database and Canadian longitudinal study on aging. JAMA Ophthalmol. 140, 568–576 (2022).
38. Cheng, C. Y. et al. Nine loci for ocular axial length identified through genome-wide association studies, including shared loci with refractive error. Am. J. Hum. Genet. 93, 264–277 (2013).
39. Gouveia, M. H. et al. Trans-ethnic meta-analysis identifies new loci associated with longitudinal blood pressure traits. Sci. Rep. 11, 4075 (2021).
40. Kumar, A. et al. Association of SUMOylation pathway genes with stroke in a genome-wide association study in India. Neurology 97, e345–e356 (2021).
41. Arning, A. et al. A genome-wide association study identifies a gene network of ADAMTS genes in the predisposition to pediatric stroke. Blood 120, 5231–5236 (2012).
42. Hauger, R. L. et al. Molecular and cell signaling targets for PTSD pathophysiology and pharmacotherapy. Neuropharmacology 62, 705–714 (2012).
43. Ge, Y. J. et al. Anti-inflammatory signaling through G protein-coupled receptors. Acta Pharmacol. Sin. 41, 1531–1538 (2020).
44. Sriram, K. & Insel, P. A. G protein-coupled receptors as targets for approved drugs: how many targets and how many drugs? Mol. Pharmacol. 93, 251–258 (2018).
45. C.T. Development America, I. A study of RX-10045 in the treatment of dry eye disease. https://ClinicalTrials.gov/show/NCT01675570 (2012).
46. C.T. Development America, I. Evaluation of the onset and duration of action of RX-10045 in allergic conjunctivitis. https://ClinicalTrials.gov/show/NCT01639846 (2012).
47. Chen, P., Li, B. & Ou-Yang, L. Role of estrogen receptors in health and disease. Front. Endocrinol. 13, 839005 (2022).
48. Barabasi, A. L., Gulbahce, N. & Loscalzo, J. Network medicine: a network-based approach to human disease. Nat. Rev. Genet. 12, 56–68 (2011).
49. Ludvigsson, J. F. et al. External review and validation of the Swedish national inpatient register. BMC Public Health 11, 450 (2011).
50. Quan, H. et al. Coding algorithms for defining comorbidities in ICD-9-CM and ICD-10 administrative data. Med. Care 43, 1130–1139 (2005).
51. Vilaplana-Perez, A. et al. Validity and reliability of social anxiety disorder diagnoses in the Swedish National Patient Register. BMC Psychiatry 20, 242 (2020).
52. Hollander, A. C. et al. Validation study of randomly selected cases of PTSD diagnoses identified in a Swedish regional database compared with medical records: is the validity sufficient for epidemiological research? BMJ Open 9, e031964 (2019).
53. Longitudinell Integrationsdatabas för Sjukförsäkrings- och Arbetsmarknadsstudier (LISA). Statistiska Centralbyrån, (Statistiska Centralbyrån, 2018).
54. Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017).
55. Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
56. Carney, R. M., Freedland, K. E., Eisen, S. A., Rich, M. W. & Jaffe, A. S. Major depression and medication adherence in elderly patients with coronary artery disease. Health Psychol. 14, 88–90 (1995).
57. Townsend, P., Phillimore, P. & Beattie, A. Health and Deprivation: Inequality and the North (Routledge, 1988).
58. Katz, D., Baptista, J., Azen, S. & Pike, M. Obtaining confidence intervals for the risk ratio in cohort studies. Biometrics 34, 469–474 (1978).
59. Meo, P. D., Ferrara, E., Fiumara, G. & Provetti, A. Generalized Louvain method for community detection in large networks. In Proc. 11th International Conference on Intelligent Systems Design and Applications, 88–93 (2011).
60. Choi, S. W., Mak, T. S. & O’Reilly, P. F. Tutorial: a guide to performing polygenic risk score analyses. Nat. Protocols 15, 2759–2772 (2020).
61. Jiang, L. et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nat. Genet. 51, 1749–1755 (2019).
62. Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
63. Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
64. Consortium, G. T. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020).
65. Wang, D. et al. Comprehensive functional genomic resource and integrative model for the human brain. Science 362, eaat8464 (2018).
66. Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
67. Zhou, Y. et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10, 1523 (2019).
68. Bader, G. D. & Hogue, C. W. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 4, 2 (2003).
69. Szklarczyk, D. et al. STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47, D607–D613 (2018).
70. Stark, C. et al. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 34, D535–D539 (2006).
71. Türei, D., Korcsmáros, T. & Saez-Rodriguez, J. OmniPath: guidelines and gateway for literature-curated signaling pathway resources. Nat. Methods 13, 966–967 (2016).
72. Li, T. et al. A scored human protein-protein interaction network to catalyze genomic interpretation. Nat. Methods 14, 61–64 (2017).

Acknowledgements

We thank the team members and colleagues involved in the West China Biomedical Big Data Center- UK Biobank project for their support. This work is supported by the National Natural Science Foundation of China (No. 81971262 to H.S.), 1.3.5 project for disciplines of excellence, West China Hospital, Sichuan University (No. ZYYC21005 to H.S.), EU Horizon2020 Research and Innovation Action Grant (847776 to U.V. and F.F.), Consolidator grant from the European Research Council (726413 to U.V.), the Fundamental Research Funds for Central Universities (No. 20826041F4144 to X.H.), the Outstanding Clinical Discipline Project of Shanghai Pudong (No.PWYgy2021-02 to Q.S.) and the Fundamental Research Funds for the Central Universities (to Q.S.). This research has been conducted using the UK Biobank Resource under Application 54803. This work uses data provided by patients and collected by the NHS as part of their care and support. This research used data assets made available by National Safe Haven as part of the Data and Connectivity National Core Study, led by Health Data Research UK in partnership with the Office for National Statistics and funded by UK Research and Innovation (grant ref: MC_PC_20029 and MC_PC_20058).

Author information

These authors contributed equally: Xin Han, Qing Shen.
These authors jointly supervised this work: Unnur A Valdimarsdóttir, Huan Song.

Authors and Affiliations

Mental Health Center, West China Hospital, Sichuan University, Chengdu, China
Xin Han
West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China
Xin Han, Can Hou, Huazhen Yang, Wenwen Chen, Yu Zeng, Yuanyuan Qu & Huan Song
Med-X Center for Informatics, Sichuan University, Chengdu, China
Xin Han, Can Hou, Huazhen Yang, Wenwen Chen, Yu Zeng, Yuanyuan Qu & Huan Song
Clinical Research Center for Mental Disorders, Shanghai Pudong New Area Mental Health Center, Tongji University School of Medicine, Shanghai, China
Qing Shen
Institute for Advanced Study, Tongji University, Shanghai, China
Qing Shen
Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
Qing Shen, Fang Fang & Unnur A. Valdimarsdóttir
Department of Epidemiology & Ministry of Education Key Laboratory of Public Health Safety, School of Public Health, Fudan University, Shanghai, China
Chen Suo
Taizhou Institute of Health Sciences, Fudan University, Taizhou, China
Chen Suo
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Weimin Ye
Center of Public Health Sciences, Faculty of Medicine, University of Iceland, Reykjavík, Iceland
Unnur A. Valdimarsdóttir & Huan Song
Department of Epidemiology, Harvard T H Chan School of Public Health, Boston, MA, USA
Unnur A. Valdimarsdóttir

Authors

Xin Han
View author publications
You can also search for this author in PubMed Google Scholar
Qing Shen
View author publications
You can also search for this author in PubMed Google Scholar
Can Hou
View author publications
You can also search for this author in PubMed Google Scholar
Huazhen Yang
View author publications
You can also search for this author in PubMed Google Scholar
Wenwen Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yu Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Yuanyuan Qu
View author publications
You can also search for this author in PubMed Google Scholar
Chen Suo
View author publications
You can also search for this author in PubMed Google Scholar
Weimin Ye
View author publications
You can also search for this author in PubMed Google Scholar
Fang Fang
Unnur A. Valdimarsdóttir
Huan Song

Contributions

H.S. and U.A.V. were responsible for the study concept and design. X.H., Q.S., H.Y., W.C., Y.Z., Y.Q. and W.Y. did the data and project management. X.H., Q.S. and C.H. did the data cleaning and analysis. X.H., Q.S., C.S., H.S., U.A.V. and F.F. interpreted the data. X.H., Q.S., U.A.V., F.F. and H.S. drafted the manuscript. All authors approved the final manuscript as submitted and agree to be accountable for all aspects of the work. The corresponding author attests that all listed authors meet authorship criteria and that no others meeting the criteria have been omitted.

Corresponding author

Correspondence to Huan Song.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Amaia Calderón-Larrañaga and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1-12

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Han, X., Shen, Q., Hou, C. et al. Disease clusters subsequent to anxiety and stress-related disorders and their genetic determinants. Nat Commun 15, 1209 (2024). https://doi.org/10.1038/s41467-024-45445-2

Received: 01 May 2023
Accepted: 23 January 2024
Published: 08 February 2024
DOI: https://doi.org/10.1038/s41467-024-45445-2
