Distinct Metabolic Endotype Mirroring Acute Respiratory Distress Syndrome (ARDS) Subphenotype and its Heterogeneous Biology

Predisposing aetiologies in Acute Respiratory Distress Syndrome (ARDS), perpetuates to heterogeneous clinical course hampering therapeutic response. Therefore, physiological variables need to be identified by stratifying ARDS subphenotypes and endotype, to target ARDS heterogeneity. The present study is stimulated by the fact that the ARDS heterogeneity arises from diverse pathophysiological changes leading to distinct ARDS endotypes characterized by perturbed biological mechanism which can be exploited in terms of metabolic profile by metabolomics. Biological endotypes using (n = 464 patients and controls), mBALF and serum samples were identified by high – resolution NMR spectroscopy from two clinically diagnosed ARDS subtypes grouped under mild, moderate and severe ARDS as subphenotype1and pulmonary and extra – pulmonary ARDS as subphenotype2. The identified mBALF endotypes (isoleucine, leucine, valine, lysine/arginine, tyrosine, threonine) and serum endotypes (proline, glutamate, phenylalanine, valine) in both subphenotypes by statistical analysis were tested for their reproducibility and robustness. By combining metabolic endotypes with clinical based mortality score (APACHE and SOFA) added to their predictive performance as ARDS mortality predictors. Thus, a comprehensive set of mBALF endotypes representing compartmentalized lung milieu and serological endotypes representing systemic markers of ARDS subtypes were validated. The interlinked biological pathway of these disease specific endotype further elucidated their role as candidate biomarker in governing ARDS heterogeneous biology.

Scientific RepoRts | (2019) 9:2108 | https://doi.org/10.1038/s41598-019-39017-4 distinct physiological differences in these ARDS subtypes [9][10][11] . ARDS subphenotypes has been grouped as mild, moderate and severe phenotype and pulmonary and extra-pulmonary phenotype based on underlying biology and severity 12 . These phenotypes have improved diagnosis but ARDS management, timely therapeutic intervention to portend progression and risk stratification requires distinct biological endotypes 13 . Differentiating the differential pathobiological mechanism characteristic of endotype is essential to reduce the inherent clinical and biological heterogeneity in ARDS 14 . The National Heart Lung and Blood Institute has also emphasized on the use of biological methodology and translational model to pave new directions in addressing ARDS heterogeneity and its resolution 15 . Therefore, need is to map biological endotype, which reflects physiological process to help stratify patient subsets as per ARDS severity/susceptibility. Endotype specific to the disease subtypes can be correlated to biomarker validation by epidemiological approaches. In this respect, small molecular weight metabolites are the endpoint of all the biochemical and physiological process involved in metabolism. Their composition and variation in different biological matrices [16][17][18][19][20] corresponding to drug, disease and toxic insult could reveal endotype of physiological relevance 21 . The miscellaneous etiology of ARDS incidence and acute diffuse onset necessitates dynamic and real time platform for timely and accurate clinical decision making under the framework of physiology besides conventional laboratory and radiological test 22 . Metabolomics provides an integrated view of metabolic fingerprint in a highly unbiased and reproducible manner 23 . Metabolomics provides the answer to the paradigm shift of identifying metabolic endotypes that best stratifies ARDS reflecting the underlying difference in biological mechanism due to pathophysiological stimuli, which ultimately results in different response to therapeutic intervention and therapies 24,25 . Dynamic changes and aberrations in response to the genetic, environmental and pathological stimuli are reflected in the metabolome which needs to be mapped by high throughput and high content analytical platform. Nuclear Magnetic Resonance (NMR) based metabolomics is a matter of choice in terms of it reproducibility, short analytical run and minimal sample preparation 26 . Thus we have employed NMR based metabolomics to identify metabolic endotypes, which can serve as biological marker mirroring the patient heterogeneity due to diverse pathophysiological process underlying ARDS subphenotype.
The aim of our present study was to identify endotype that best stratify ARDS phenotype with predictive power in determining favourable outcome and accurate and sensitive enough when associated with clinical variable. Therefore, in this study we have taken two clinically diagnosed subphenotypes of ARDS based on clinical manifestation and radiological appearances as mild, moderate and severe group categorized as subphenotype1 and pulmonary and extra-pulmonary group categorized as subphenotype2. To test the physiological difference in these two model n = 464 samples of mini Bronchoalveolar lavage fluid (mBALF) and serum biofluid were collected and subjected to metabolic profiling by NMR. Though both these biofluids vary in the mode of collection and biomarker information but together depict comprehensive view of ARDS biology by providing both circulating biomarkers of systemic origin and localized biomarkers of lung milieu. The best identified mBALF and serum endotypes of different subphenotypes were tested for their reproducibility and robustness in test set and training set correlated to each subphenotype. The characterized endotypes were substantiated in predicting outcome by their classification in survivor and non-survivor group. Their clinical relevance were verified by combining with mortality score in area under the receiver operating characteristic curve (AUROC). The underlying complex interplay of biological process was examined in model based on metabolic endotype of ARDS using integrated metabolic pathway. Thus metabolomics proves to be platform in mapping ARDS heterogeneous biology by determining the predictive endotype. Results patient characteristics. The patients baseline characteristics and clinical variables stratified for study population at the time of sampling are provided in Table 1 which includes age matched demographic characteristics, Acute Physiology and Chronic Health Evaluation-II (APACHE II) and Sequential Organ Failure Assessment (SOFA) score. The mild and moderate and severe ARDS patients were grouped and diagnosed as per the new convened Berlin definition. Pulmonary and extra pulmonary subphenotype grouping was governed by clinical symptoms and its ARDS association. The patient included in the study group is shown through the flow chart ( Fig. 1) depicting the included size of patients and some excluded subjects because of missing and confounding clinical details. ARDS serum and mBALF predictability. The NMR spectroscopy was performed on all the collected samples of both mBALF (n = 199) and serum (n = 265) for the identification and assignment of metabolites present in ARDS with respect to control. A representative NMR spectra of serum control and serum ARDS sample is shown along with mBALF control and mBALF ARDS (Fig. 2). The difference in metabolite intensity indicates difference in ARDS associated changes with respect to controls, which can be mapped by chemmometric analysis to further validate the perturbed metabolism. We initially tested the accuracy of serum as diagnostic fluid from n = 265 subjects in discriminating n = 68 control and n = 197 ARDS patients. As a result, we obtained clear separation in OPLS-DA with R2Y = 0.80 and Q2 = 0.76 and PLS-DA as shown in supplementary information (Figs SI 1c,d, 3a-e). Similarly, mBALF as a diagnostic fluid was established for n = 199 subjects in discriminating n = 40 control and n = 159 ARDS patients with R2Y = 0.92 and Q2 = 0.76 value by OPLS-DA and PLS-DA (Figs SI 1a,b and 2a-e). Small value of Q2 can be attributed to small size of control mBALF samples. The resulting separation prompted towards the first objective of our study to stratify ARDS in two subphenotypes based on clinical characteristics and its predictability in determining metabolic endotype using NMR based metabolomics regardless of clinical heterogeneity.
ARDs subphenotypes. Firstly, hypoxemic partial pressure of oxygen to the fraction of inspired oxygen (P/F) ratio was used to build subphenotype1 in both mBALF and serum categorizing patients on the basis of Gastrointestinal (15) Gastrointestinal (30) Tropical infections (4) Tropical infections (10) Others (1) Others (4) Comorbidities, (n) Diabetes mellitus (12) Diabetes mellitus (18) Hypertension (9) Hypertension (12) Diabetes + hypertension (14) Diabetes + hypertension (28) Hypothyroidism (4) Hypothyroidism (9) Chronic kidney disease (3) Chronic kidneydisease (5) Chronic artery disease (2) Chronic artery disease (3) Nil (7) Nil ( severity involving mild, moderate and severe subjects with P/F ratio between 200-300, P/F ratio between 100-200 and P/F ratio less than 100 respectively. Secondly, different respiratory mechanism of lung injury, was used as a basis to build subphenotype2 in both mBALF and serum involving pulmonary aetiologies (pneumonia, lung contusion) called direct ARDS and extra-pulmonary (pancreatitis, trauma, sepsis) contributors known as indirect ARDS.  Heat map plot shows the difference in metabolic intensity in mBALF compared to serum in terms of information pertaining to diffused ARDS lungs from pulmonary and extra -pulmonary causes (Fig. 4c,f). This is due the fact that mBALF represent mostly unambiguous lung related perturbations as taken from proximal alveoli. Whereas serum is more generalized and broad in drawing useful information correlated to spread of ARDS specific inflammation, infection or pathogenesis. So both the fluid should be evaluated for wider choice and comprehensive set of markers. The above-mentioned subphenotype2 highlights the fact that the different origin or etiology affect the final underlying lung injury in ARDS. After affirming its significance from t-test and on the basis of VIP plot with value more than 1 (Fig. 4b,e), significant metabolites were grouped.
Validation of subphenotype1 and subphenotype2. To validate the accuracy of subphenotype1 and subphenotype2, the cohort was randomly divided into training set and test set following 80/20 holdout ratio 27   test set for subphenotype2 with 0.83 and 0.90 accuracy respectively (Fig. SI 5e-h). The above analysis concluded significant metabolites same as obtained in the validation cohort establishing the reproducibility of the endotypes obtained from mBALF and serum. Consequently, the serum specific and mBALF specific metabolic profile obtained and its validation in test set and training set highlighted biological and physiological heterogeneity in ARDS which resulted in distinct subset of metabolites in both subphenotypes corresponding to different biofluids as shown in Table 2. All the metabolites were selected and considered statistically significant after log transformation and Pareto scaling based on p  mBALF and serum endotype as mortality predictors. Significant metabolites common to mBALF subphenotype 1 and subphenotype 2 representing lung specific disease pathophysiology were grouped as mBALF endotype. Similarly, metabolites common to subphenotype1 and subphenotype2 in serum representing ARDS directed serological changes were grouped as serum endotype. Endotypes are representative of distinct functional and pathobiological mechanism but how well they respond to outcome needs to be ascertained to determine their clinical applicability. Therefore, we designed the predictive model based on survivor and non-survivor groups in ARDS for both in mBALF and serum biofluids. The analysis was performed to discover the final set of markers representing ARDS mortality. The mBALF endotype, isoleucine, tyrosine, valine, leucine, lysine/arginine, threonine, glutamate/proline, serine/phenylalanine and lactate were used in the predictive model of survivor and non-survivors (Fig. 5a). Similarly, serum endotype, proline, glutamate, phenylalanine, valine, alanine, glycine, isoleucine and leucine were used to determine their role in differentiating survivor and non-survivor patients (Fig. 5d). mBALF taken from n = 124 patients included n = 51 survivors and n = 73 non survivors whereas n = 143 samples of serum included n = 51 survivors and n = 92 non-survivors ARDS subjects. Further t-test and subsequent PLS-DA analysis sorted out panel of metabolic endotype representative of ARDS heterogenous biology related to mortality (Fig. 5b,e). mBALF endotype based model resulted in R2 = 0.72, Q2 = 0.69 and accuracy 0.98 whereas serum endotype based resulted in R2 = 0.67, Q2 = 0.63 and accuracy 0.95 along with permutation test statistics and classification error rate shown in Fig. SI 8a-h. Pearson correlation based heat map showed amplified metabolite intensity in the non-survivor subjects compared to survivors implying greater metabolic dysfunction attributed by mBALF and serum endotypes as seen in non-survivors (Fig. 5c,f).

Accuracy of ARDS metabolic endotype. Box whisker plot and ROC of individual metabolic endotypes
including proline, glutamate, phenylalanine, valine of serum and isoleucine, tyrosine, valine, leucine, lysine/arginine and threonine of mBALF were plotted to validate their relative concentration with respect to outcome group along with clinical score APACHE and SOFA respectively as shown in Fig. 6(a-n). Box whisker plot showed relative high metabolite concentration in non-survivor subjects highlighting increase in metabolite aberration in patients susceptible to mortality. Accuracy was further substantiated by individual ROC values specifying their role as mortality predictors. The above sensitivity and specificity of individual serum metabolites and mBALF metabolites as resultant serum and mBALF endotype were used to determine their clinical predictability when combined with clinical based APACHE and SOFA score. The accuracy increased to AUROC 1 indicating the clinical relevance of the above determined metabolic endotype (Fig. 7a,c).
ARDs metabolic network. Pathway analysis of serum endotype and mBALF endotype predictive of mortality gave important metabolic pathway symbolic of ARDS correlated changes in metabolism. Figure 7b illustrates most symptomatic pathways of circulatory serum based endotype model characteristic of systemic changes as a consequence ARDS infection, inflammation and pathogenesis. Similarly, Fig. 7d reflects most implicated pathways of localized mBALF based endotype model representative of ARDS diffused lung compartment due to pulmonary edema and infiltration of neutrophils and other inflammatory mediators. Nitrogen metabolism, aminoacyl tRNA synthesis, valine, leucine and isoleucine synthesis, pantothenate and CoA synthesis, phenylalanine, tyrosine and tryptophan biosynthesis common to mBALF and serum endotype indicated anomalous pathobiological mechanism due to ARDS (Fig. 7b,d). Thus all the cumulative metabolic endotypes representative of ARDS pathophysiological changes and outcome were used to build an integrated metabolic pathway showing mBALF    the conclusive metabolites and their contribution in governing ARDS heterogeneous biology (Fig. 7e). As a result, we obtained distinct class of metabolic endotypes identified by metabolomics using mBALF and serum that can clearly distinguish ARDS subtypes with its strong correlation to mortality.

Discussion
ARDS is a complex syndrome where pathophysiology is interwoven with pathology and underlying lung injury is common regardless of the type of multiple insults. In case of ARDS, the critical illness leads to severe derangement of homeostasis and haemodynamic equilibrium ultimately affecting the downstream metabolic process and interlinked pathways. Metabolic dysregulation is the outcome of upstream physiological and pathophysiological changes affecting whole system corresponding to endogenous or exogenous insult. Different degrees of anatomical and physiological derangement add to clinical and biological heterogeneity which requires characterization of disease specific endotype by epidemiological approaches. Both mBALF and serum has been previously established as an archetype of lung milieu and ailing prognosticator of ARDS 19,20,[28][29][30][31] . Therefore, in our study we used both the biological matrices as diagnostic fluid in the identification disease specific metabolic endotypes which can act as biological marker significant for clinical understanding and distinguishing ARDS heterogeneity. The heterogeneity and pathogenesis hampers the tailored and directed therapies. Therefore, to make a clear interpretation and distinction responsible of same phenotypic lung injury corresponding to different aetiology and pathophysiology depends on targeting biological process or metabolic pathway for better-targeted therapies and clinical management. Most of the recruitment manoeuvres are focused on the cure but the goal should be focussed towards finding cause due to marked physiological difference between ARDS subtypes. Therefore, large sample size has been incorporated in the present study with validation in training set and test set so that results can be generalized for widespread population for clinical utility. Following the analysis, the metabolites common to markers of severity in subphenotype1 and sensitive to etiological causes of direct and indirect ARDS in subphenotype2 were grouped to determine their analogy in predicting mortality in survivor and non-survivor group. Their discriminating power in both biofluids were checked to determine the dysregulated pathobiological mechanism occurring as a consequence of ARDS. The final panel of metabolites obtained, representing the mBALF and serum endotypes reflected the perturbed biological mechanism corresponding to subphenotype. Their clinical utility and accuracy was further validated by combing with APACHE and SOFA score in governing the mortality score. Subsequently the pathway analysis in mBALF and serum resulted in determining the affected biological mechanism which was integrated to common biological pathway to aid its correlation to outcome and clinical predictability. The distinct set of metabolic endotypes grouped together as mBALF and serum endotype were found to have important role in ARDS pathogenesis as observed in its overrepresentation in pathway analysis.
The class of serum endotypes included proline, glutamate, phenylalanine and valine. Their putative role has also been observed and recognized in previous corresponding ARDS studies. Proline upregulation in the form of proline-glycine-proline have been found to be interlinked to ARDS 32 . Proline has been established as a substratum of stress and thus owing to subsequent inflammation, it is used as energy source 33 . Though glutamate is found in lesser concentration in plasma, it governs signal mediation in peripheral organs and tissues. Glutamate metabolism has been linked to immunomodulation with its prominent role as glutamate receptors in mediating T-cell mediated immunity. In a recent study by Evans et al. 34 and Rogers et al. 35 , both proline and glutamate have been associated with increased abnormality and correlated to dysfunction metabolism in ARDS. Increased concentration of phenylalanine has been determined both by NMR and GC-MS in distressed lung of asthmatic diagnosed patients 21,36 . Phenylalanine levels were also found associated in ARDS caused by H1N1 pneumonia by virtue of resultant viral and bacterial infections 37 . Protein breakdown is widely associated in sepsis and trauma which are among the major etiologies associated with ARDS in which protein involved in alveolar epithelium and surfactant dysfunction has been widely researched. Surfactant protein C is one of the widely studied marker which is mainly composed of valine, leucine and isoleucine forming the bulk of its hydrophobic residues 38 . Disproportion of amino acid in plasma is widely manifested by critically ill patients with deficit of BCA reported to unbalance the homeostasis of transferrin, nitrogen retention and lymphocyte counts. Overrepresented metabolic variation of these serum endotype is predominately influencing ARDS biochemical process through nitrogen metabolism, aminoacyl tRNA synthesis, arginine and proline metabolism, valine leucine and isoleucine synthesis, pantothenate and CoA synthesis, D-Glutamine and D-glutamate metabolism, phenylalanine, tyrosine and tryptophan biosynthesis.
The class of mBALF endotypes included lysine, arginine, tyrosine, threonine and branched chain amino acids (BCA). These individual metabolic endotypes have been previously reported in studies related to ARDS and its etiologies emphasizing their notable role in ARDS biology. Surge in the percentage of lysine and arginine has been observed in bronchoalveolar lavage fluid corresponding to dysfunction of diamino transport affecting the alveolar epithelial lining fluid and alveolar epithelial cells. This particular mutation has been proposed to govern the increase in the prevalence of ARDS 39 . Arginine and its interplay with nitric oxide (NO) pathway has also been found to have important implication and its corresponding regulation of multiple organ dysfunction syndrome in sepsis associated ARDS 40 . When considering the pathogenesis and exacerbated inflammation in ARDS, NO plays a vital role as oxidant stress whose metabolic by-products proves to be an important marker in early stage of ARDS. Accordingly, tyrosine upregulation in ARDS has been correlated to portending ARDS incidence prior to disease development for its paramount role as NO 2 Tyr in BAL 41 . Also, many growth factors preferably utilize serine threonine and tyrosine kinase receptors for repair and resolution of ARDS by recruiting tyrosine and threonine in the signalling pathway 42 . BCA includes isoleucine, leucine and valine which are vital components of immune enhancing formula with imperative role in muscle rejuvenation, nitrogen balance and protein synthesis 43 . Energy deficit associated with sepsis and tissue injury gravitates increased BCA oxidation by skeletal muscle. Hypermetabolic composition of these mBALF endotype mirrors anomalous metabolism in ARDS reflected prominently in nitrogen metabolism, aminoacyl tRNA synthesis, valine leucine and isoleucine synthesis and degradation, pantothenate and CoA synthesis, tyrosine metabolism, phenylalanine, tyrosine and tryptophan biosynthesis.
This study is one of the first attempts with an all-inclusive metabolic profile of different ARDS subtype in different biofluid. mBALF represents the biomatrix that can best represent the clinical phenotype of the distressed lung due to its close proximity to alveoli and its associated anomaly. Serum is the circulating biofluid phenotyping the immediate perturbations outspreading to from the diffused lung due to infection, inflammation and multiple pathogenesis. Thus both biofluids were used to fingerprint ARDS heterogeneity. Clinical phenotype was checked and validated in large population with its robustness cross validated by statistical analysis. The endotype determined from both subtypes were probed with respect to outcome in survivor and non-survivor subjects and it correlation to clinical based mortality score was well substantiated. Resultant mBALF based endotype model included lysine, arginine, tyrosine, threonine and branched chain amino acids and serum endotype model included proline, glutamate, phenylalanine and valine. Consequently, serum and mBALF endotypes illustrated perturbation mainly in TCA cycle intermediates, protein synthesis and its breakdown products and the corresponding nitrogenous network mirroring the complex biological process involved in ARDS through pathway analysis and its interconnected metabolism. Thus this study is the first of its kind to depict the complete metabolic endotype of ARDS heterogeneous phenotype with its correlation to mortality.
In conclusion, biological insight is imperative to understand heterogeneity for ARDS for early prediction, prevention and progression of ARDS to severity. Physiology driven endotypes are the answer to provide biomarkers which can discriminate ARDS subsets of patients at risk and target therapy accordingly. Characteristic endotypes of ARDS phenotypes will aid in enrolment of patients on the basis of stratified risk, follow up studies and future clinical trials. Thus in this study a model based on metabolic endotype (mBALF and serum endotype) was probed which holds potent prospects in addressing the underlying heterogeneity in ARDS subtypes. This study provides a comprehensive metabolic endotypes (lysine, arginine, tyrosine, threonine and branched chain amino acids proline, glutamate, phenylalanine and valine) associated with ARDS subtype, complementing clinical score and significant predictability in outcome. It can be separately validated in different ethnic groups for its validation and applicability for further bench to bedside prospects.

Materials and Methods
Ethical approval. The study was designed as a part of prospective single centre cohort performed at Intensive Care Unit (ICU) of a tertiary care medical centre, Sanjay Gandhi Post Graduate Institute of Medical Sciences (SGPGIMS), Lucknow, and Centre of Biomedical Research (CBMR), Lucknow, India. The SGPGIMS ethical body reviewed and approved the experimental methodology. A prior written informed consent was obtained from all subjects or their surrogate decision makers before the sample procurement. We confirm that all methods were performed in accordance with the relevant guidelines and regulations approved by the ethical committee of SGPGIMS.
Diagnostic criteria. All the diagnosis was carried out by expertise and trained resident doctors at ICU unit following the standard guidelines aforementioned in the Berlin definition 44 . Lung protective ventilation measures were practiced in the ICU. Exclusion criteria in the current study include patients with age less than 18 years, pregnancy, chronic obstructive pulmonary disease (COPD) patients, bronchial asthma, interstitial lung disease, and other chronic respiratory ailments. subjects and clinical design. Total n = 464 samples including n = 199 mBALF and n = 265 serum samples were collected in the study during the period from 2014-2017 from both control patient and patient diagnosed with ARDS enrolled at ICU admission of SGPGIMS. The control group comprised patients who were on mechanical lung ventilation for routine elective surgeries. Both the biofluid were collected to provide a complete overview of inflammatory markers of lung origin as well as serological markers spread through bloodstream. mBALF is a non-directed lung lavage collected from proximal alveoli which best represents the pulmonary markers of lung compartment. Standardized "catheter in catheter" technique was employed in ICU by trained medical practitioner for collection of mBALF sample from mechanically ventilated patients within 24 hours of diagnosis 45 . The mBALF sampling were performed in accordance with the approved guidelines and regulations of the institute's ethical committee. The procedure was followed under safety guidelines using soft suction, nontoxic, pyrogen free, graduated 16 French gauge/8 French gauge catheter in catheter set. Sterile aseptic precautions were followed for mBALF suctioning using 10 ml sterile distilled water in mucus trap which is then transferred to collection vial and quenched in liquid nitrogen till further processing. A total of 2.0 ml blood was collected for each sample taken from ARDS diagnosed patients and control patients in the storage vial till further NMR analysis were performed. sample preparation. Before NMR analysis, all the mBALF samples were vortexed and then centrifuged at 16000 rpm for 10 min at 4°C to remove cellular debris and bacteria. The supernatant was pipetted out and cryoprotected at −80 °C inside freezer till further NMR experiments were performed. In case of serum, all samples were incubated for 30 min at room temperature and then centrifuged at 6708 * g; 4 °C for 5 min. Golden yellow supernatant is collected and preserved at −80 °C till further NMR experiments. All the processed samples aforementioned before were suitably thawed before NMR data acquisition.
NMR spectroscopy of mBALF and serum. All  Assignment of mBALF and serum. The acquired 1D 1 H spectra was referenced to TSP signal (δ = 0.00 ppm) and manually phased and baseline corrected. The characterization and assignment of peaks resonating from small molecular metabolites in 1  Metabolite profiling. Following the identification of all the possible metabolites present in diseased samples of ARDS, the multidimensional dataset generated from NMR acquisition was subjected to chemmometric analysis to deduce the candidate markers of ARDS pathobiology which otherwise couldn't be inferred from visual inspection of NMR spectrum. Each of the NMR acquired data were subjected to normalization using log 2 transformation and Pareto scaled before statistical analysis by Metaboanlyst 46 . Firstly, the accuracy of mBALF and serum as a diagnostic biofluid in ARDS characterization was established by univariate and multivariate analysis distinguishing control patients and diagnosed ARDS patients. After validation, diseased samples of ARDS was targeted to determine the metabolic profile or panel of putative metabolites responsible for distinguishing ARDS subtypes and predictive of mortality with important implications in manifesting aberrant biological pathway. ARDS subtypes were clinically phenotyped and grouped based on severity as mild, moderate and severe ARDS as subphenotype1 and respiratory mechanics as pulmonary and extrapulmonary ARDS as subphenotype2 in both mBALF and serum biofluid to better interpret ARDS heterogeneity. Unpaired t-test and ANOVA were used to categorize significant metabolites based on p value less than 0.05 in both subphenotypes and subsequently used in substantiating metabolic endotype in outcome subgroups. Multivariate PCA and PLS-DA determined VIP plot were used to establish differential metabolites with value more than 1. R2 and Q2 value was used to determine predictability and cross validation, permutation test statistics were used to validate the model. Pearson correlation based heat map were used to determine the robustness of the model with enumeration of upregulated and downregulated metabolites in subphenotypes and resultant endotype. Individual box whisker plot and the corresponding specificity and sensitivity using AUROC to determine the credentials of endotype in outcome subgroup. Clinical score was combined with model based on endotype in AUROC to validate the accuracy of the model. To determine the endotype governing biological process in ARDS, pathway analysis was carried using Metaboanalyst based on hypergeometric test using relativeness between centrality algorithms. Significant p value and impact value was deduced from pathway enrichment analysis and pathway topology to depict overrepresented pathways interconnected to ARDS metabolism.

Data Availability
The data set used/generated/analysed in the present study will be made available by the corresponding authors on reasonable request.