Multi-omic biomarker identification and validation for diagnosing warzone-related post-traumatic stress disorder

Dean, Kelsey R.; Hammamieh, Rasha; Mellon, Synthia H.; Abu-Amara, Duna; Flory, Janine D.; Guffanti, Guia; Wang, Kai; Daigle, Bernie J.; Gautam, Aarti; Lee, Inyoul; Yang, Ruoting; Almli, Lynn M.; Bersani, F. Saverio; Chakraborty, Nabarun; Donohue, Duncan; Kerley, Kimberly; Kim, Taek-Kyun; Laska, Eugene; Young Lee, Min; Lindqvist, Daniel; Lori, Adriana; Lu, Liangqun; Misganaw, Burook; Muhie, Seid; Newman, Jennifer; Price, Nathan D.; Qin, Shizhen; Reus, Victor I.; Siegel, Carole; Somvanshi, Pramod R.; Thakur, Gunjan S.; Zhou, Yong; Hood, Leroy; Ressler, Kerry J.; Wolkowitz, Owen M.; Yehuda, Rachel; Jett, Marti; Doyle, Francis J.; Marmar, Charles

doi:10.1038/s41380-019-0496-z

Download PDF

Article
Open access
Published: 10 September 2019

Multi-omic biomarker identification and validation for diagnosing warzone-related post-traumatic stress disorder

Kelsey R. Dean^1,2^na1,
Rasha Hammamieh³^na1,
Synthia H. Mellon⁴^na1,
Duna Abu-Amara⁵,
Janine D. Flory^6,7,
Guia Guffanti⁸,
Kai Wang⁹,
Bernie J. Daigle Jr.¹⁰,
Aarti Gautam³,
Inyoul Lee⁹,
Ruoting Yang¹¹,
Lynn M. Almli¹²,
F. Saverio Bersani ORCID: orcid.org/0000-0002-7555-8020^13,14,
Nabarun Chakraborty¹⁵,
Duncan Donohue¹⁵,
Kimberly Kerley¹²,
Taek-Kyun Kim⁹,
Eugene Laska⁵,
Min Young Lee⁹,
Daniel Lindqvist^13,16,
Adriana Lori¹²,
Liangqun Lu¹⁰,
Burook Misganaw²,
Seid Muhie¹⁵,
Jennifer Newman⁵,
Nathan D. Price ORCID: orcid.org/0000-0002-4157-0267⁹,
Shizhen Qin⁹,
Victor I. Reus¹³,
Carole Siegel⁵,
Pramod R. Somvanshi²,
Gunjan S. Thakur²,
Yong Zhou⁹,
The PTSD Systems Biology Consortium,
Leroy Hood⁹,
Kerry J. Ressler⁸,
Owen M. Wolkowitz¹³,
Rachel Yehuda^6,7,
Marti Jett³,
Francis J. Doyle III² &
…
Charles Marmar⁵

Molecular Psychiatry volume 25, pages 3337–3349 (2020)Cite this article

18k Accesses
53 Citations
195 Altmetric
Metrics details

Subjects

Abstract

Post-traumatic stress disorder (PTSD) impacts many veterans and active duty soldiers, but diagnosis can be problematic due to biases in self-disclosure of symptoms, stigma within military populations, and limitations identifying those at risk. Prior studies suggest that PTSD may be a systemic illness, affecting not just the brain, but the entire body. Therefore, disease signals likely span multiple biological domains, including genes, proteins, cells, tissues, and organism-level physiological changes. Identification of these signals could aid in diagnostics, treatment decision-making, and risk evaluation. In the search for PTSD diagnostic biomarkers, we ascertained over one million molecular, cellular, physiological, and clinical features from three cohorts of male veterans. In a discovery cohort of 83 warzone-related PTSD cases and 82 warzone-exposed controls, we identified a set of 343 candidate biomarkers. These candidate biomarkers were selected from an integrated approach using (1) data-driven methods, including Support Vector Machine with Recursive Feature Elimination and other standard or published methodologies, and (2) hypothesis-driven approaches, using previous genetic studies for polygenic risk, or other PTSD-related literature. After reassessment of ~30% of these participants, we refined this set of markers from 343 to 28, based on their performance and ability to track changes in phenotype over time. The final diagnostic panel of 28 features was validated in an independent cohort (26 cases, 26 controls) with good performance (AUC = 0.80, 81% accuracy, 85% sensitivity, and 77% specificity). The identification and validation of this diverse diagnostic panel represents a powerful and novel approach to improve accuracy and reduce bias in diagnosing combat-related PTSD.

Utilization of machine learning for identifying symptom severity military-related PTSD subtypes and their biological correlates

Article Open access 20 April 2021

Carole E. Siegel, Eugene M. Laska, … Charles R. Marmar

Pre-deployment risk factors for PTSD in active-duty personnel deployed to Afghanistan: a machine-learning approach for analyzing multivariate predictors

Article Open access 02 June 2020

Katharina Schultebraucks, Meng Qian, … Charles R. Marmar

Genetically regulated multi-omics study for symptom clusters of posttraumatic stress disorder highlights pleiotropy with hematologic and cardio-metabolic traits

Article 03 March 2022

Gita A. Pathak, Kritika Singh, … Renato Polimanti

Introduction

Combat-related post-traumatic stress disorder (PTSD) has a lifetime prevalence of between 10.1%–30.9% in U.S. veterans of the Vietnam and subsequent conflicts, including the Iraq and Afghanistan wars [1,2,3,4]. PTSD is precipitated by experiencing or witnessing actual or threatened death, serious injury, or violence, and has symptoms that include re-experiencing, avoidance, negative thoughts, or moods associated with the traumatic event and hyperarousal (DSM-5 [5]). There is limited understanding of the biological processes underlying the core features of PTSD and associated psychiatric and somatic comorbidity [6].

Limited progress in the discovery of biological markers of PTSD has hampered accurate diagnosis, early identification of cases, staging and prognosis, stratification, personalized treatment, and new drug development. Additionally, individuals meeting diagnostic criteria for PTSD represent a heterogeneous group, as evidenced by differences in symptomatology, course, and treatment response [7]. Currently, case identification is limited by heavy reliance on self-reported symptoms for a disorder in which many trauma survivors under-report symptoms because of stigma, and some over-report symptoms for financial or other gains. Personalized treatment selection is limited by errors of omission (failing to identify individuals who would likely benefit from a specific behavioral or biological treatment) and errors of commission (treating individuals who are unlikely to benefit from a specific treatment), in part because of the lack of validated diagnostic and prognostic markers.

Previous PTSD biomarker studies have primarily focused on using gene expression for predicting risk and diagnosis [8,9,10,11]. These studies have demonstrated moderate success in identifying predictive and diagnostic markers, but have been limited due to small sample sizes, as well as the focus on an individual molecular data type. In cancer, multi-site, integrated multi-omic studies have shown great promise in generating novel insights into disease mechanism, diagnostic and predictive markers, and signals of progression and stratification [12,13,14]. These studies have included high-throughput ‘omics data such as genomics, transcriptomics, proteomics, methylomics, lipidomics and metabolomics [15]. By employing a systems biology framework, multi-omic datasests provide the ability to understand the underlying disease network-associated biological processes [16]. The systems biology approach aims to characterize a large and diverse set of molecules within an illness or individual by examining entire biological systems, not just individual components, allowing the assessment of interactions among levels of cellular pathology, ranging from DNA to circulating metabolites [17,18,19]. This approach has the potential to provide a more comprehensive characterization of illnesses, to track underlying biological dysregulation before clinical symptoms develop or worsen, to lead to the identification of improved diagnostic markers, and to allow for the discovery of novel targets for treatment [20].

In 2012, the Department of Defense initiated a multi-site “PTSD Systems Biology Consortium”, which applied multiple ‘omics technologies to the same sample of combat-exposed PTSD and control participants. The goals of the PTSD Systems Biology Consortium included developing a reproducible panel of blood-based biomarkers with good sensitivity and specificity for PTSD diagnosis. Here, we present identification and validation of a set of multi-omic biomarkers for diagnosing warzone-related PTSD.

Materials and methods

Study inclusion criteria

General inclusion criteria included being an Operation Enduring Freedom (OEF) and/or Operation Iraqi Freedom (OIF) male veteran between 20 and 60 years old, being able to understand the protocol and sign written informed consent, and meeting criteria for either PTSD-positive or PTSD-negative groups. PTSD-positive participants were defined as participants who met DSM-IV PTSD criteria for current warzone-related PTSD for at least 3 months duration, as indexed by the Clinician-Administered PTSD Scale (CAPS), with a minimum total score ≥ 40, which was calculated by summing each symptom on frequency and intensity ratings. Full criteria for DSM-IV diagnosis of PTSD was also met for all PTSD-positive participants. PTSD-negative controls were combat-exposed veterans that were negative for lifetime combat or civilian PTSD and had a current CAPS total score < 20. All study participants were exposed to DSM-IV PTSD Criterion A trauma during deployment. Detailed recruitment, enrollment, and exclusion criteria are listed in the Supplemental Material and Methods.

Clinical assessment measures

The Structured Clinical Interview for DSM (SCID) was used to determine whether participants met DSM-IV diagnostic criteria for mood, anxiety, psychotic, and substance use disorders [21]. The CAPS was used to determine combat-related PTSD status, as well as the severity of current PTSD symptoms (past month is the “CAPS current”) and the severity of the most severe lifetime episode of combat-related PTSD (“CAPS lifetime”) [22].

Molecular assays

Blood samples were assayed for many molecular species, including genetics, methylomics, proteomics, metabolomics, immune cell counts, cell aging, endocrine markers, microRNAs (miRNAs), cytokines, and more. DNA methylation was quantified using two approaches: a genome-wide unbiased approach, and a targeted sequencing-based approach. The genome-wide methylation approach quantified methylation using the Illumina Infinium HumanMethylation450K BeadChip array (Illumina Inc., CA). Using targets generated from this genome-wide approach, as well as other hypotheses generated from literature, a smaller set of methylation sites were evaluated by targeted sequencing via Zymo Research (Zymo Research, CA). Plasma miRNAs were evaluated using small RNA sequencing, and processed using sRNAnalyzer [23]. Proteins were evaluated using three methods: peptide quantification using selected reaction monitoring (SRM), quantification of six neurodegenerative disease-related markers using the Human Neurodegenerative Disease Panel 1, and quantification of serum levels of BDNF using a BDNF ELISA assay. Non-targeted metabolomics analysis was conducted using three platforms: ultrahigh performance liquid chromatography/tandem mass spectrometry (UHPLC/MS/MS²) optimized for basic species, UHPLC/MS/MS² for acidic species, and gas chromatography/mass spectrometry (GC/MS). Additional data types, including routine clinical lab values and physiological measurements, were collected using standard procedures. Details on all molecular assays and blood draw information are contained in the Supplementary Materials (Table S2).

Results

Participant recruitment and multi-omic data generation

A set of three cohorts totaling 281 samples from male combat veterans from OEF/OIF conflicts were recruited as part of a larger study designed to identify biomarkers for PTSD diagnosis using a combination of clinical, genetic, endocrine, multi-omic, and imaging information (Fig. 1). Participants were recruited in three cohorts: discovery, recall, and validation (Fig. 2a and Table 1). The discovery cohort (cohort 1) consisted of 83 PTSD and 82 trauma-exposed control participants who met the inclusion and exclusion criteria (described in Materials and Methods and Supplementary Material). All participants completed clinical interviews and blood draws. After assessment of data quality, 77 PTSD and 74 trauma-exposed control samples were available with all completed blood marker assays. This discovery cohort was used to generate an initial pool of candidate biomarkers. Participants from the discovery cohort were invited back for clinical re-evaluation and a blood draw approximately three years after their initial evaluation. This cohort of recalled subjects (recall cohort, cohort 2), included 55 participants from the initial discovery cohort. Some of these participants showed PTSD symptom and status changes based on clinical assessment (Fig. 2b). In addition, some participants no longer met the original inclusion/exclusion criteria for the study; these participants had symptoms intermediate between the PTSD and control groups, in some cases meeting criteria for subthreshold PTSD. The 55 recall participants included 15 PTSD, 11 subthreshold PTSD, and 29 control participants. The third cohort, an independent group of 26 PTSD and 26 control participants, became the validation cohort (cohort 3), used for validating the final set of PTSD biomarkers.

Table 1 Summary of cohort demographics and clinical symptoms

Full size table

PTSD cohorts and multi-omic datasets

To identify a minimally invasive PTSD diagnostic panel, blood-based multi-omics and other analytes were assayed for each individual (and during both visits for recalled participants), including DNA methylation, proteomics, metabolomics, miRNAs, small molecules, endocrine markers, and routine clinical lab panels. Additionally, physiological measures were recorded and nonlinear marker combinations were computed. Using a strategy described in the next sections, a robust and diverse 28-member biomarker panel for diagnosing PTSD was identified from this pool of more than one million markers (Fig. 2c).

Three-stage biomarker identification and down-selection from exploratory set of multi-omic data

We used a “wisdom of crowds” approach to identify candidate PTSD biomarkers from the large set of measured blood analytes. Utilizing domain area expertize of multiple researchers, as well as multiple algorithms and methodologies, collective intelligence has the potential to identify successful candidate biomarkers from a large dataset, particularly when knowledge is limited. Collective intelligence and “wisdom of crowds” approaches are often used in financial modeling and predictions [24], have been evaluated in medical decision-making [25], and are the motivation for ensemble classification methods, which have been shown to outperform individual classifiers [26].

From a diverse set of data-driven, hypothesis-driven, hybrid, and other approaches (Table S3), we identified a set of candidate diagnostic panels, totaling 343 unique potential biomarkers (Step 2 from Fig. 1 and Table S4). These approaches included COMBINER [27], polygenic risk [28, 29], as well as traditional Support Vector Machine with Recursive Feature Elimination (SVM-RFE), random forest, and other classification algorithms, and feature selection approaches, including p-value, q-value, and fold-change filtering. Details of these algorithms are listed in the Supplementary Material. To filter and refine the pool of candidate biomarkers, we used data from recalled participants (recall cohort, cohort 2). Many of these returning participants experienced symptom changes over the 3.3 ± 0.9 years (mean ± sd) between the initial and follow-up evaluation. CAPS totals for recalled participants at both time points are shown in Fig. 2b. The panel was refined using the recall cohort along with a two-stage down-selection approach to select the final set of PTSD biomarkers (Steps 4–5 from Fig. 1).

The two-stage down-selection process is based on the following methodology. In the first stage, poor performing candidate biomarkers were removed one-by-one based on the largest average AUC of the remaining biomarker set (Step 4, Fig. 1). The trajectory of AUC scores in the recall cohort is shown in Supplementary Fig. 1A, showing the average AUC at each step of the one-by-one elimination. The biomarker set with the largest average AUC prior to the final performance decline was selected, resulting in 77 remaining biomarkers.

To further reduce the number of features in the panel, we implemented a second stage of down-selection, based on random forest variable importance (Fig. 1, Step 5). Using the recall cohort, the remaining 77 biomarkers were sorted based on random forest variable importance (Supplementary Fig. 1B). We retained biomarkers with importance >30% of the maximum importance score for the final biomarker panel (n = 28). The dynamics and distribution of these 28 biomarkers in the discovery and recall cohorts is shown in Supplementary Figs. 2 and 3.

Validation of a robust, multi-omic PTSD biomarker panel

After the two-stage feature reduction strategy, the final biomarker set consisted of 28 features, including methylation, metabolomics, miRNA, protein, and other data types. A random forest model trained on the combined cohorts 1 and 2 predicted PTSD status in an independent validation set (cohort 3) with an area under the ROC curve (AUC) of 0.80 (95% CI 0.66–0.93, Fig. 3a). Using the point closest to (0,1) on the ROC curve (shown in Fig. 3a), the model was validated with an accuracy of 81%, sensitivity of 85%, and specificity of 77%. The PTSD participants in the validation cohort had CAPS scores ranging from 47–114. We found that predicted PTSD scores from the random forest model for these cases were correlated with total CAPS (r = 0.59, p = 0.001), indicating the current biomarker model predicts not only disease status, but potentially PTSD symptom severity of cases (Fig. 3b). In addition, predicted PTSD scores were moderately correlated with DSM-IV re-experiencing, avoidance, and hyperarousal symptoms (r = 0.44–0.53, Supplementary Fig. 4), suggesting that the identified molecular markers are not specific to a single symptom cluster, but to overall symptoms.

Overall, the set of identified PTSD biomarkers contains many molecular data types (DNA methylation, miRNAs, proteins, metabolites, and others), with signals primarily including under-expressed proteins and miRNAs, and signatures of both DNA hyper- and hypomethylation. Of the 28 markers comprising the final panel, 16 markers had consistent fold-change directions in all three cohorts (Table 2). Five of the final 28 markers were retained during panel refinement even though the fold-change direction was inconsistent between the discovery and recall cohorts, indicating that these features may contain relevant PTSD signal that is not purely measured by group differences in mean. A post hoc analysis of the biomarker panel performance without these inconsistent features resulted in decreased validation performance (AUC = 0.74 and 0.71 when using only markers with consistent fold-change directions across the discovery and recall cohorts (23 markers), and all three cohorts (16 markers), respectively).

Table 2 Overview of biomarker signals in each of the three cohorts

Full size table

Using random forest variable importance, the top 10 biomarkers from the final 28-marker panel included five of the six molecular data types: DNA methylation, physiological, miRNAs, clinical lab measures, and metabolites (Fig. 3c). These data types contribute primarily uncorrelated signals, with only small clusters of moderate to highly correlated biomarkers from three data types: proteins, miRNAs, and DNA methylation (Fig. 3d).

Through the biomarker identification and down-selection process, two intermediate biomarker sets were identified, consisting of 343 and 77 candidate biomarkers. Trained random forest models on these biomarker sets validated with slightly lower AUCs than the final biomarker panel (AUCs of 0.74, 0.75, and 0.80 in the 343, 77, and 28 biomarker panels; Fig. 3e). The consistent validation AUC indicates robust signal in these sets of candidate biomarkers, without loss of signal during down-selection from 343 to 28 features. The final panel of 28 markers consisted of six different data types: routine clinical lab markers, metabolites, DNA methylation marks, miRNAs, proteins, and physiological measurements. The combined panel out-performed all six panels composed of each individual data type (Fig. 3e), demonstrating the power of combining different types of markers in a diverse biomarker panel, capable of capturing the complexities of PTSD.

Two biomarker features included in our final panel are computed, nonlinear metrics: Global Arginine Bioavailability Ratio (GABR, defined as arginine/[ornithine + citrulline]) and lactate/citrate. These computed ratios outperform their combined individual components in predictive performance, indicating biologically-driven nonlinear features may enhance low signals (Fig. 3e). In addition, these ratios begin to alleviate single-sample normalization issues that need to be addressed for clinical use of a biomarker panel.

Evaluation of clinical and demographic factors

The cohorts recruited for this study are diverse in terms of ethnicity, educational background, clinical symptoms, overall health, and comorbid diseases and conditions. The heterogeneity of the participants included in these three cohorts, including race, age, and clinical comorbidities, as well as PTSD severity are shown in Table 1. To evaluate the performance of this biomarker panel in the context of participant demographics and other clinical factors, we computed biomarker performance in stratified subsets of the validation cohort. While biomarker performance was highest in Hispanic participants (AUC = 0.95), we observed no statistically significant differences in AUC across ethnicities (Fig. 3f). Multiple studies have examined the increased prevalence and greater symptom severity of PTSD in Hispanic populations [47, 48], which may correspond to stronger biological signals, leading to the differences in AUC.

In the validation cohort, 35% of PTSD cases also met the criteria for major depressive disorder (MDD). Using the identified biomarker panel and model, these PTSD + /MDD + cases could be distinguished from all controls with an AUC of 0.92, while the PTSD + /MDD – could only be distinguished from controls with an AUC of 0.73 (Fig. 3f). Similarly, predicted PTSD scores were more strongly correlated with PTSD symptom severity in PTSD + /MDD + participants than in PTSD + /MDD– participants, with r = 0.64 and r = 0.37, respectively (Supplementary Fig. 5). This decrease in prediction accuracy and correlation with PTSD symptoms in the absence of comorbid MDD indicates a potential overlap of biological signals for MDD and PTSD that should be explored further.

Discussion

This study presents the identification and validation of a biomarker panel for the diagnosis of combat-related PTSD. The panel consists of 28 features that perform well in identifying PTSD cases from combat-exposed controls in a male, veteran population (81% accuracy). Some of the biomarkers have been linked to PTSD previously, including elevated heart rate [36] and decreased level of coagulation factors [10], and other included markers have been linked to MDD, anxiety, and other comorbid conditions, including platelet volume [43, 44], insulin resistance [41, 49], alterations in the SHANK2 gene [30], and PDE9A expression [31] (Table 2).

In particular, the circulating miRNAs selected in the panel reflect the diverse pathology and comorbidities present in PTSD populations, including connections to metabolic diseases and cardiovascular conditions. The miR-133-3p, a member of myomiRs that are highly abundant in muscle, including cardiac muscle, has been implicated in cardiomyocyte differentiation and proliferation [50]. The circulating miR-133-3p level has been linked to various cardiovascular disorders, including myocardial infarction, heart failure, and cardiac fibrosis [51, 52]. The miR-9-5p is enriched in brain [40] and known as a regulator for neurogenesis. It is also involved in heart development and heart hypertrophy [53]. The miR-192 is highly abundant in the liver and circulating miR-192-5p levels have been associated with various liver conditions as well metabolic diseases such as obesity and diabetes [37, 38]. The circulating miR-192 level has also been used as a biomarker for ischemic heart failure [54].

In addition to molecular markers, our approach selected heart rate as a contributor to the PTSD diagnostic panel. More than two decades ago, heart rate differences were observed between eventual PTSD cases and controls during emergency room visits and at 1-week follow-ups after trauma [36]. While these differences did not persist for longer time points in Shalev’s study, we observed significant mean group differences for heart rates in two of the three cohorts from this study, a number of years following trauma exposure (p < 0.01 for discovery and validation cohorts). Heart rate alone predicts diagnosis of PTSD in the validation cohort with 69% accuracy. Of note, removing heart rate from our biomarker panel did not result in significantly decreased model performance (molecular-only panel without heart rate still achieves 75% accuracy).

Following the heart rate analysis, we evaluated all other biomarkers contained in the panel individually. Three other markers achieved at least 60% accuracy in the validation cohort: gammaglutamyltyrosine, insulin, and cg01208318. However, using any of these markers individually resulted in greater variance in validation accuracy, based on 2000 bootstrapping iterations. Additionally, we note that the most important markers selected during model refinement (based on Random Forest Variable Importance, Supplementary Fig. 1B), were not the top-performing individual markers in the validation cohort. Without an additional validation cohort, validation performance cannot be used to hand-select top-performing individual markers. During additional rounds of panel validation and development, individual markers and smaller subsets of this biomarker panel should be evaluated.

Strengths and limitations

The cohorts recruited for this study were subject to strict inclusion and exclusion criteria, intentionally creating a pool of moderate to severe cases of combat-related PTSD to compare with asymptomatic controls among men deployed to Iraq and/or Afghanistan. To understand the clinical utility of the proposed biomarker panel, further validation is required in other PTSD populations, including active duty soldiers, populations with civilian trauma, female cohorts, and carefully phenotyped populations with and without many conditions commonly comorbid with PTSD. This study design may have allowed for the clearest and strongest signals of combat-related PTSD to emerge, but will need additional validation in cohorts of individuals with chronic PTSD (>10 years), individuals who recover from PTSD, and those with intermediate PTSD symptoms (CAPS from 20–40), where the current model performance may be decreased. Additionally, this study used DSM-IV criteria for diagnosing PTSD to ensure consistency across all cohorts. Hoge et al. [55] determined that 30% of combat veterans who meet DSM-IV diagnostic criteria for PTSD do not meet DSM-5 criteria for PTSD. The impact of using DSM-5 should be evaluated for this specific set of biomarkers in future cohorts.

Many studies have emphasized the high rates of PTSD comorbidity with other conditions, including depression [56], anxiety [57], alcoholism and substance abuse [58], cardiovascular disease [59], diabetes [60], and others. A robust PTSD biomarker panel should be (i) specific to PTSD and not any of these or other comorbidities, and (ii) able to detect PTSD in both the presence and absence of these comorbid conditions. To further identify potential confounders, additional samples including MDD without PTSD, diabetes with and without PTSD, and other conditions should be studied to evaluate the specificity of the panel further.

In an exploratory search of more than one million markers, we assayed a range of molecular data types, including DNA methylation marks, proteins, miRNAs, and metabolites. Owing to quality control and other limitations, several molecular data types were incomplete and therefore excluded from biomarker identification and refinement. These included gene expression, immune cell counts, and cytokine assays. Some of these assays were completed for the discovery cohort, and were included in early approaches for candidate biomarker selection. Any identified biomarker candidates from these assays were removed prior to down-selection and validation due to lack of data in recall and validation cohorts. The presence of these markers in the discovery phase may have influenced the selection of candidate biomarkers for some of the machine learning approaches. However, the exclusion of these datasets was not based on biomarker validation performance and therefore could not have affected the final accuracy and performance of the 28-marker panel.

In summary, we have presented a robust multi-omic panel for predicting combat-related PTSD diagnosis in male veteran populations. These 28 biomarkers include features from DNA methylation, proteins, miRNAs, metabolites, and other molecular and physiological measurements. In an independent validation cohort, we predicted PTSD diagnosis with 81% accuracy, 85% sensitivity, and 77% specificity, indicating a blood-based screening or diagnostic tool is promising for identifying PTSD, particularly in males with warzone-related PTSD.

Disclaimer

The views, opinions and/or findings contained in this report are those of the authors and should not be construed as an official Department of the Army position, policy or decision, unless so designated by other official documentation. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein.

Data availability

Molecular, clinical, and demographic datasets for all three cohorts are available through the SysBioCube [61], at https://sysbiocube-abcc.ncifcrf.gov.

References

Kulka RA, et al. Trauma and the Vietnam war generation: report of findings from the National Vietnam Veterans Readjustment Study. Philadelphia, PA: Brunner/Mazel; 1990.
Kang HK, Natelson BH, Mahan CM, Lee KY, Murphy FM. Post-traumatic stress disorder and chronic fatigue syndrome-like illness among gulf war veterans: a population-based survey of 30,000 veterans. Am J Epidemiol. 2003;157:141–8.
PubMed Google Scholar
Seal KH, Bertenthal D, Miner CR, Sen S, Marmar C. Bringing the war back home. Arch Intern Med. 2007;167:476–82.
PubMed Google Scholar
Marmar CR, et al. Course of posttraumatic stress disorder 40 years after the Vietnam war. JAMA Psychiatry. 2015;72:875.
PubMed Google Scholar
American Psychiatric Association. Diagnostic and statistical manual of mental disorders. Arlington, VA: American Psychiatric Publishing; 2013.
Mellon SH, Gautam A, Hammamieh R, Jett M, Wolkowitz OM. Metabolism, metabolomics, and inflammation in posttraumatic stress disorder. Biol Psychiatry. 2018;83:866–75.
CAS PubMed Google Scholar
Shalev A, Liberzon I, Marmar C. Post-traumatic stress disorder. N Engl J Med. 2017;376:2459–69.
PubMed Google Scholar
Glatt SJ, et al. Blood-based gene-expression predictors of PTSD risk and resilience among deployed marines: a pilot study. Am J Med Genet Part B Neuropsychiatr Genet. 2013;162:313–26.
CAS Google Scholar
Segman RH, et al. Peripheral blood mononuclear cell gene expression profiles identify emergent post-traumatic stress disorder among trauma survivors. Mol Psychiatry. 2005;10:500–13.
CAS PubMed Google Scholar
Breen MS, et al. PTSD blood transcriptome mega-analysis: shared inflammatory pathways across biological sex and modes of trauma. Neuropsychopharmacology. 2018;43:469–81.
PubMed Google Scholar
Le-Niculescu H, et al. Towards precision medicine for stress disorders: diagnostic biomarkers and targeted drugs. Mol Psychiatry. 2019. https://doi.org/10.1038/s41380-019-0370-z.
Zhang Z, et al. Molecular subtyping of serous ovarian cancer based on multi-omics data. Sci Rep. 2016;6:26001.
CAS PubMed PubMed Central Google Scholar
The Cancer Genome Atlas Research Network. Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature. 2008;455:1061–8.
Google Scholar
The Cancer Genome Atlas Research Network. Comprehensive molecular portraits of human breast tumours. Nature. 2012;490:61–70.
Google Scholar
Redón J, Monleón D. Combining -omics in the search for mechanisms in complex trait diseases. J Hypertens. 2015;33:698–9.
PubMed Google Scholar
Benson M. Clinical implications of omics and systems medicine: focus on predictive and individualized treatment. J Intern Med. 2016;279:229–40.
CAS PubMed Google Scholar
Raghavachari N, Gucek M. Pharmacogenomics, pharmacoproteomics, and pharmacometabolomics and personalized medicine: an overview. In: Barh D, Dhawan D, Ganguly NK, editors. Omics for personalized medicine. India: Springer; 2013. p. 3–18.
Thakur GS, et al. Systems biology approach to understanding post-traumatic stress disorder. Mol Biosyst. 2015;11:980–93.
CAS PubMed Google Scholar
Mitra K, Carvunis A-R, Ramesh SK, Ideker T. Integrative approaches for finding modular structure in biological networks. Nat Rev Genet. 2013;14:719.
CAS PubMed PubMed Central Google Scholar
Rantalainen M. Combining metabonomics and other -omics data. Methods Mol Biol. 2015; 147–59. https://doi.org/10.1007/978-1-4939-2377-9_12.
First M, Gibbon M, Spitzer R, Williams J, Benjamin L. Structured clinical interview for DSM-IV axis II personality disorders, (SCID-II). American Psychiatric Press, Inc.; 1997.
Blake DD. A clinical administered PTSD scale for assessing current and lifetime PTSD: the CAPS-I. Behav Ther. 1990;18:187–8.
Google Scholar
Wu X, et al. sRNAnalyzer—a flexible and customizable small RNA sequencing data analysis pipeline. Nucl Acids Res. 2017;45:12140–51.
CAS PubMed Google Scholar
Ray R. Prediction markets and the financial ‘wisdom of crowds’. J Behav Financ. 2006;7:2–4.
Google Scholar
Kurvers RHJM, Krause J, Argenziano G, Zalaudek I, Wolf M. Detection accuracy of collective intelligence assessments for skin cancer diagnosis. JAMA Dermatol. 2015;151:1346.
PubMed Google Scholar
Ow GS, Kuznetsov VA. Big genomics and clinical data analytics strategies for precision cancer prognosis. Sci Rep. 2016;6:1–13.
Google Scholar
Yang R, Daigle BJ, Jr. Petzold LR, Doyle FJ, III. Core module biomarker identification with network exploration for breast cancer metastasis. BMC Bioinformatics. 2012;13:12.
Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat Rev Genet. 2016;17:392–406.
CAS PubMed PubMed Central Google Scholar
Duncan LE, et al. Largest GWAS of PTSD (N = 20 070) yields genetic overlap with schizophrenia and sex differences in heritability. Mol Psychiatry. 2018;23:666–73.
CAS PubMed Google Scholar
Peykov S, et al. Identification and functional characterization of rare SHANK2 variants in schizophrenia. Mol Psychiatry. 2015;20:1489–98.
CAS PubMed PubMed Central Google Scholar
Luykx JJ, et al. Genome-wide association study of monoamine metabolite levels in human cerebrospinal fluid. Mol Psychiatry. 2014;19:228–34.
CAS PubMed Google Scholar
Zhang L, et al. Mitochondria-focused gene expression profile reveals common pathways and CPT1B dysregulation in both rodent stress model and human subjects with PTSD. Transl Psychiatry. 2015;5:e580.
Ali-Sisto T, et al. Global arginine bioavailability ratio is decreased in patients with major depressive disorder. J Affect Disord. 2018;229:145–51.
CAS PubMed Google Scholar
Bersani FS, et al. Global arginine bioavailability, a marker of nitric oxide synthetic capacity, is decreased in PTSD and correlated with symptom severity and markers of inflammation. Brain Behav Immun. 2016;52:153–60.
CAS PubMed Google Scholar
Zierer J, et al. Metabolomics profiling reveals novel markers for leukocyte telomere length. Aging (Albany NY). 2016;8:77–86.
CAS Google Scholar
Shalev AY, et al. A prospective study of heart rate response following trauma and the subsequent development of posttraumatic stress disorder. Arch Gen Psychiat. 1998;55:553.
CAS PubMed Google Scholar
Ghai V, et al. Genome-wide profiling of urinary extracellular vesicle microRNAs associated with diabetic nephropathy in type 1 diabetes. Kidney Int Rep. 2018;3:555–72.
PubMed Google Scholar
Ma X, Lu C, Lv C, Wu C, Wang Q. The expression of miR-192 and its significance in diabetic nephropathy patients with different urine albumin creatinine ratio. J Diabetes Res. 2016:1–6.
de Gonzalo-Calvo D, et al. Circulating inflammatory miRNA signature in response to different doses of aerobic exercise. J Appl Physiol. 2015;119:124–34.
PubMed Google Scholar
Sim S-E, et al. The brain-enriched microRNA miR-9-3p regulates synaptic plasticity and memory. J Neurosci. 2016;36:8641–52.
CAS PubMed PubMed Central Google Scholar
Heppner PS, et al. The association of posttraumatic stress disorder and metabolic syndrome: A study of increased health risk in veterans. BMC Med. 2009;7:1.
Jensen CF, et al. Behavioral and plasma cortisol responses to sodium lactate infusion in posttraumatic stress disorder. Ann N Y Acad Sci. 1997;821:444–8.
CAS PubMed Google Scholar
Kokacya MH, et al. Increased mean platelet volume in patients with panic disorder. Neuropsychiatr Dis Treat. 2015;11:2629.
CAS PubMed PubMed Central Google Scholar
Canan F, et al. Association of mean platelet volume with DSM-IV major depression in a large community-based population: The MELEN study. J Psychiatr Res. 2012;46:298–302.
PubMed Google Scholar
Wang M, et al. Effect of cyclooxygenase‑2 inhibition on the development of post‑traumatic stress disorder in rats. Mol Med Rep. 2018;17:4925–32.
CAS PubMed PubMed Central Google Scholar
Lerer B, Bleich A, Bennett ER, Ebstein RP, Balkin J. Platelet adenylate cyclase and phospholipase C activity in posttraumatic stress disorder. Biol Psychiatry. 1990;27:735–40.
CAS PubMed Google Scholar
Pole N, et al. Effects of gender and ethnicity on duty-related posttraumatic stress symptoms among urban police officers. J Nerv Ment Dis. 2001;189:442–8.
CAS PubMed Google Scholar
Marshall GN, Schell TL, Miles JNV. Ethnic differences in posttraumatic distress: hispanics’ symptoms differ in kind and degree. J Consult Clin Psychol. 2009;77:1169–78.
PubMed PubMed Central Google Scholar
Blessing EM, et al. Biological predictors of insulin resistance associated with posttraumatic stress disorder in young military veterans. Psychoneuroendocrinology. 2017;82:91–7.
CAS PubMed Google Scholar
Piubelli C, et al. MicroRNAs and cardiac cell fate. Cells. 2014;3:802–23.
PubMed PubMed Central Google Scholar
Liu N, et al. microRNA-133a regulates cardiomyocyte proliferation and suppresses smooth muscle gene expression in the heart. Genes Dev. 2008;22:3242–54.
CAS PubMed PubMed Central Google Scholar
Angelini A, Li Z, Mericskay M, Decaux J-F. Regulation of connective tissue growth factor and cardiac fibrosis by an SRF/MicroRNA-133a axis. PLoS ONE. 2015;10:e0139858.
PubMed PubMed Central Google Scholar
Wang K, Long B, Zhou J, Li P. F. miR-9 and NFATc3 regulate myocardin in cardiac hypertrophy. J Biol Chem. 2010;285:11903–12.
CAS PubMed PubMed Central Google Scholar
Emanueli C, et al. Coronary artery-bypass-graft surgery increases the plasma concentration of exosomes carrying a cargo of cardiac microRNAs: an example of exosome trafficking out of the human heart with potential for cardiac biomarker discovery. PLoS ONE. 2016;11:e0154274.
Hoge CW, Riviere LA, Wilk JE, Herrell RK, Weathers FW. The prevalence of post-traumatic stress disorder (PTSD) in US combat soldiers: a head-to-head comparison of DSM-5 versus DSM-IV-TR symptom criteria with the PTSD checklist. Lancet Psychiatry. 2014;1:269–77.
PubMed Google Scholar
Shalev AY, et al. Prospective study of posttraumatic stress disorder and depression following trauma. Am J Psychiatry. 1998;155:630–7.
CAS PubMed Google Scholar
Brown TA, Campbell LA, Lehman CL, Grisham JR, Mancill RB. Current and lifetime comorbidity of the DSM-IV anxiety and mood disorders in a large clinical sample. J Abnorm Psychol. 2001;110:585–99.
CAS PubMed Google Scholar
Brown PJ, Wolfe J. Substance abuse and post-traumatic stress disorder comorbidity. Drug Alcohol Depend. 1994;35:51–9.
CAS PubMed Google Scholar
Cohen BE, et al. Posttraumatic stress disorder and health-related quality of life in patients with coronary heart disease. Arch Gen Psychiatry. 2009;66:1214.
PubMed PubMed Central Google Scholar
Boyko EJ, et al. Risk of diabetes in US military service members in relation to combat deployment and mental health. Diabetes Care. 2010;33:1771–7.
PubMed PubMed Central Google Scholar
Chowbina S, et al. SysBioCube: A data warehouse and integrative data analysis platform facilitating systems biology studies of disorders of military relevance. AMIA Summits Transl Sci Proc. 2013:34–8.

Download references

Acknowledgements

We thank Rohini Bagrodia, Nikos Daskalakis, Charlotte Feld, Afia Genfi, Arsen Grigoryan, Roland Hart, Clare Henn-Haase, Kristin Holmes, Jenna Katz, Erin Koch, Sharon Lee, Amy Lehrner, Brie Loethen, Rebecca Lubin, Silas Mann, Nicholas Milton, Crystal Mora, Vince Passarelli, Emily Purchia, Amy Ransohoff, Alex Ropes, Ashik Siddique, Charu Sood, and Carly Walter for the assistance with data collection, management, and technical support. The High-Performance Computing Center at The University of Memphis also provided generous computing resources for this research. This work was supported by funding from the U.S. Army Research Office, through award numbers W911NF-13-1-0376, W911NF-17-2-0086, W911NF-18-2-0056, by the Army Research Laboratory under grant number W911NF-17-1-0069, and from the U.S. Department of Defense under W81XWH-10-1-0021, W81XWH-09-2-0044, and W81XWH-14-1-0043.

Additional members of The PTSD Systems Biology Consortium:

David Baxter⁹, Linda Bierer⁶, Esther Blessing⁵, Ji Hoon Cho⁹, Michelle Coy¹³, Frank Desarnaud⁶, Silvia Fossati⁵, Allison Hoke¹⁵, Raina Kumar¹¹, Meng Li⁵, Iouri Makotkine⁶, Stacy-Ann Miller¹⁵, Linda Petzold¹⁷, Laura Price⁵, Meng Qian⁵, Kelsey Scherler⁹, Seshamalini Srinivasan¹⁵, Anna Suessbrick⁵, Li Tang⁹, Xiaogang Wu⁹, Gwyneth Wu¹³, Changxin Wu⁶

Author information

These authors contributed equally: Kelsey R. Dean, Rasha Hammamieh, Synthia H. Mellon

Authors and Affiliations

Department of Systems Biology, Harvard University, Cambridge, MA, USA
Kelsey R. Dean
Harvard John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
Kelsey R. Dean, Burook Misganaw, Pramod R. Somvanshi, Gunjan S. Thakur & Francis J. Doyle III
Integrative Systems Biology, US Army Medical Research and Materiel Command, USACEHR, Fort Detrick, Frederick, MD, USA
Rasha Hammamieh, Aarti Gautam & Marti Jett
Department of Obstetrics, Gynecology, & Reproductive Sciences, University of California, San Francisco, CA, USA
Synthia H. Mellon
Department of Psychiatry, New York Langone Medical School, New York, NY, USA
Duna Abu-Amara, Eugene Laska, Jennifer Newman, Carole Siegel, Esther Blessing, Silvia Fossati, Meng Li, Laura Price, Meng Qian, Anna Suessbrick & Charles Marmar
Department of Psychiatry, James J. Peters VA Medical Center, Bronx, NY, USA
Janine D. Flory, Linda Bierer, Frank Desarnaud, Iouri Makotkine, Changxin Wu & Rachel Yehuda
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Janine D. Flory & Rachel Yehuda
Department of Psychiatry, McLean Hospital, Belmont, MA, USA
Guia Guffanti & Kerry J. Ressler
Institute for Systems Biology, Seattle, WA, USA
Kai Wang, Inyoul Lee, Taek-Kyun Kim, Min Young Lee, Nathan D. Price, Shizhen Qin, Yong Zhou, David Baxter, Ji Hoon Cho, Kelsey Scherler, Li Tang, Xiaogang Wu & Leroy Hood
Departments of Biological Sciences and Computer Science, The University of Memphis, Memphis, TN, USA
Bernie J. Daigle Jr. & Liangqun Lu
Advanced Biomedical Computing Center, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
Ruoting Yang & Raina Kumar
Department of Psychiatry and Behavioral Sciences, Emory University, Atlanta, GA, USA
Lynn M. Almli, Kimberly Kerley & Adriana Lori
Department of Psychiatry, University of California, San Francisco, CA, USA
F. Saverio Bersani, Daniel Lindqvist, Victor I. Reus, Michelle Coy, Gwyneth Wu & Owen M. Wolkowitz
Department of Human Neurosciences, Sapienza University of Rome, Rome, Italy
F. Saverio Bersani
USACEHR, The Geneva Foundation, Frederick, MD, USA
Nabarun Chakraborty, Duncan Donohue, Seid Muhie, Allison Hoke, Stacy-Ann Miller & Seshamalini Srinivasan
Lund University, Faculty of Medicine, Department of Clinical Sciences, Lund, Sweden
Daniel Lindqvist
Department of Computer Science, University of California Santa Barbara, Santa Barbara, USA
Linda Petzold

Authors

Kelsey R. Dean
View author publications
You can also search for this author in PubMed Google Scholar
Rasha Hammamieh
View author publications
You can also search for this author in PubMed Google Scholar
Synthia H. Mellon
View author publications
You can also search for this author in PubMed Google Scholar
Duna Abu-Amara
View author publications
You can also search for this author in PubMed Google Scholar
Janine D. Flory
View author publications
You can also search for this author in PubMed Google Scholar
Guia Guffanti
View author publications
You can also search for this author in PubMed Google Scholar
Kai Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bernie J. Daigle Jr.
View author publications
You can also search for this author in PubMed Google Scholar
Aarti Gautam
View author publications
You can also search for this author in PubMed Google Scholar
Inyoul Lee
View author publications
You can also search for this author in PubMed Google Scholar
Ruoting Yang
View author publications
You can also search for this author in PubMed Google Scholar
Lynn M. Almli
View author publications
You can also search for this author in PubMed Google Scholar
F. Saverio Bersani
View author publications
You can also search for this author in PubMed Google Scholar
Nabarun Chakraborty
View author publications
You can also search for this author in PubMed Google Scholar
Duncan Donohue
View author publications
You can also search for this author in PubMed Google Scholar
Kimberly Kerley
View author publications
You can also search for this author in PubMed Google Scholar
Taek-Kyun Kim
View author publications
You can also search for this author in PubMed Google Scholar
Eugene Laska
View author publications
You can also search for this author in PubMed Google Scholar
Min Young Lee
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Lindqvist
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Lori
View author publications
You can also search for this author in PubMed Google Scholar
Liangqun Lu
View author publications
You can also search for this author in PubMed Google Scholar
Burook Misganaw
View author publications
You can also search for this author in PubMed Google Scholar
Seid Muhie
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Newman
View author publications
You can also search for this author in PubMed Google Scholar
Nathan D. Price
View author publications
You can also search for this author in PubMed Google Scholar
Shizhen Qin
View author publications
You can also search for this author in PubMed Google Scholar
Victor I. Reus
View author publications
You can also search for this author in PubMed Google Scholar
Carole Siegel
View author publications
You can also search for this author in PubMed Google Scholar
Pramod R. Somvanshi
View author publications
You can also search for this author in PubMed Google Scholar
Gunjan S. Thakur
View author publications
You can also search for this author in PubMed Google Scholar
Yong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Leroy Hood
View author publications
You can also search for this author in PubMed Google Scholar
Kerry J. Ressler
View author publications
You can also search for this author in PubMed Google Scholar
Owen M. Wolkowitz
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Yehuda
View author publications
You can also search for this author in PubMed Google Scholar
Marti Jett
View author publications
You can also search for this author in PubMed Google Scholar
Francis J. Doyle III
View author publications
You can also search for this author in PubMed Google Scholar
Charles Marmar
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

The PTSD Systems Biology Consortium

David Baxter
, Linda Bierer
, Esther Blessing
, Ji Hoon Cho
, Michelle Coy
, Frank Desarnaud
, Silvia Fossati
, Allison Hoke
, Raina Kumar
, Meng Li
, Iouri Makotkine
, Stacy-Ann Miller
, Linda Petzold
, Laura Price
, Meng Qian
, Kelsey Scherler
, Seshamalini Srinivasan
, Anna Suessbrick
, Li Tang
, Xiaogang Wu
, Gwyneth Wu
& Changxin Wu

Corresponding author

Correspondence to Francis J. Doyle III.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Members of The PTSD Systems Biology Consortium are listed below Acknowledgements

Affiliations of the The PTSD Systems Biology Consortium are listed in the Supplementary Materials (Table S1)

Supplementary information

Supplemental Materials and Methods

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dean, K.R., Hammamieh, R., Mellon, S.H. et al. Multi-omic biomarker identification and validation for diagnosing warzone-related post-traumatic stress disorder. Mol Psychiatry 25, 3337–3349 (2020). https://doi.org/10.1038/s41380-019-0496-z

Download citation

Received: 18 December 2018
Revised: 15 May 2019
Accepted: 24 June 2019
Published: 10 September 2019
Issue Date: December 2020
DOI: https://doi.org/10.1038/s41380-019-0496-z

This article is cited by

Circulating cell-free mitochondrial DNA levels and glucocorticoid sensitivity in a cohort of male veterans with and without combat-related PTSD
- Zachary N. Blalock
- Gwyneth W. Y Wu
- Synthia H. Mellon
Translational Psychiatry (2024)
Prediction of disease-related miRNAs by voting with multiple classifiers
- Changlong Gu
- Xiaoying Li
BMC Bioinformatics (2023)
Screening for PTSD and TBI in Veterans using Routine Clinical Laboratory Blood Tests
- Mu Xu
- Ziqiang Lin
- Charles R. Marmar
Translational Psychiatry (2023)
Integrative Multi-omics Analysis of Childhood Aggressive Behavior
- Fiona A. Hagenbeek
- Jenny van Dongen
- Dorret I. Boomsma
Behavior Genetics (2023)
Deriving psychiatric symptom-based biomarkers from multivariate relationships between psychophysiological and biochemical measures
- Daniel M. Stout
- Alan. N. Simmons
- Dewleen G. Baker
Neuropsychopharmacology (2022)