Abstract
Cohort studies often provide a large array of data on study participants. The techniques of statistical learning can allow an efficient way to analyze large datasets in order to uncover previously unknown, clinically relevant predictors of morbidity or mortality. We applied a combination of elastic net penalized Cox regression and stability selection with the aim of identifying novel predictors of mortality in a cohort of prevalent hemodialysis patients. In our analysis we included 475 patients from the “rISk strAtification in end-stage Renal disease” (ISAR) study, who we split into derivation and confirmation cohorts. A wide array of examinations was available for study participants, resulting in over a hundred potential predictors. In the selection approach many of the well established predictors were retrieved in the derivation cohort. Additionally, the serum levels of IL-12p70 and AST were selected as mortality predictors and confirmed in the withheld subgroup. High IL-12p70 levels were specifically prognostic of infection-related mortality. In summary, we demonstrate an approach how statistical learning can be applied to a cohort study to derive novel hypotheses in a data-driven way. Our results suggest a novel role of IL-12p70 in infection-related mortality, while AST is a promising additional biomarker in patients undergoing hemodialysis.
Similar content being viewed by others
Introduction
End-stage renal disease (ESRD) with hemodialysis treatment continues to be associated with dramatically increased morbidity and mortality. While cardiovascular mortality represents the prevalent cause of death in these patients, death due to other causes such as infection is also more frequent than in the general population1. Several risk factors of cardiovascular and all-cause mortality have been identified and risk scores have been successfully established based on these2,3,4,5,6,7. Beyond the quantification of a personal prognosis, the identification of novel risk factors can lead to hypothesis generation for further research, novel therapeutic interventions and finally lead to improvements in patient outcomes.
Machine learning approaches for survival analysis can allow efficient selection of the most relevant risk factors even when a large number of potential predictors for only a relatively small patient population is available8,9. Among them, Cox regression using “elastic net” regularization allows selection even in settings where there are more predictors than events in the sample10. Here regularization refers to the procedure of penalizing and thereby shrinking the regression coefficients while (in the case of “elastic net”) allowing them to become zero and therefore omitting variables with less predictive power for the outcome. An advantage of this procedure compared to many other machine learning techniques is that it produces a model which is accessible to interpretation by clinician scientists, in a similar way as common multivariable Cox regression. This allows to draw conclusions from the resulting model itself such as e.g. generation of hypotheses for future therapeutic interventions.
A more recent advancement is a combination of regularization with resampling, termed “stability selection”, which consists of recalculating the regularized regression many times with random subgroups of the available patient population in order to select the variables which show most consistent association with the outcome. This reportedly results in a more reliable selection in settings with a large number of potential predictors compared to the number of patients11.
Here we report on the application of elastic net regularization combined with stability selection for the identification of relevant predictors of mortality in the “ISAR” cohort of prevalent hemodialysis patients12.
Results
Characteristics of the study population and collected data
The overall study design and rationale of the ISAR cohort trial have been described in our previous report12. Briefly summarized, prevalent hemodialysis patients were recruited in 17 centers in and around Munich, Germany and followed up for a median time of 37 months (interquartile range of 25–49 months). A total of 475 patients were included in the analysis (Fig. 1). Baseline characteristics for this population were previously reported5. Mortality events were classified into three categories: cardiovascular, infectious/sepsis and other (or unknown), with 64, 43 and 62 respective events occurring during the follow-up period (Fig. 1). Overall over 100 possible predictors were collected for the patients in the study population including clinical parameters, comorbidities and medications, blood value measurements and static retinal vessel analyses (Table S1).
We randomly separated the total cohort into a derivation and confirmation dataset (n = 317 and 158 respectively) with similar characteristics (Fig. 1). The sets were specifically similar with regard to age, gender, dialysis patient adapted Charlson comorbidity index (CCI)13, serum IL-6 and mortality (Table 1).
Application of a combination of elastic net regularization and stability selection for the identification of mortality predictors
Variable selection with consecutive stability selection in the derivation dataset resulted in a substantially reduced predictor subset compared to using only the elastic net regression (Figure S1 and Fig. 2). Most of the predictors passing selection criteria for all-cause mortality or the cause-specific events showed significant associations with mortality in the withheld confirmation cohort (Table 2). Among the comorbidities diabetes mellitus, pulmonary hypertension, atrial fibrillation (AF), history of myocardial infarction, neoplasia, atherosclerosis (other than CHD) and peripheral artery disease (PAD) were selected and confirmed. Most of these have in fact previously been shown to have a significant effect on mortality13. Other previously reported predictors were catheter use for dialysis, blood pressure, hemoglobin levels, serum creatinine, CRP2,14, use of oral anticoagulation15, serum levels of IL-6 and YKL-40, as well as IP-105. We could also observe a detrimental effect of low serum triglycerides in our population as has been reported previously16. Interestingly, we also found low levels of AST to be associated with reduced all-cause mortality, while high levels of IL-12p70 were associated with an increased infection-associated mortality (Table 2). The association of AST with mortality is a relatively novel finding in dialysis patients17. The positive association of IL-12p70 with mortality in these patients is to our knowledge a novel finding. We therefore further analyzed these predictors in our study population.
Serum AST as predictor of mortality in dialysis patients
Low serum AST levels were associated with decreased all-cause mortality (hazard ratio (HR), 0.31; 95% confidence interval (CI), 0.10–0.98, p = 0.046, in the confirmation subcohort, Table 2). In this “low” group, AST levels were in the range of 3–13 U/l and therefore on the lower end or even below the reference range for the normal population (10–50 U/l for males and 10–35 U/l for females). When analyzing mortality binned by AST subgroup in the total study population, this protective effect could be observed for all-cause and cardiovascular mortality, while increased levels of AST (top quintile) were associated with increased mortality for all-cause and “other” mortality causes (Fig. 3a). Modelling mortality using spline fitting on log transformed AST values significantly outperformed a regular Cox model (p = 0.038 for all-cause mortality in a likelihood ratio test on the total population with non-missing AST-values, n = 377, one patient was excluded due to strongly increased AST and ALT as discussed in “Methods”). We additionally observed a numerically improved predictive performance for a spline fitted model as further discussed below (Fig. 5a). In univariate and multivariable Cox regression incorporating spline transformations a similar association as described above for both a protective effect of lower than average AST as well as a detrimental effect of increased AST levels for all-cause mortality as well as cause-specific hazard for cardiovascular mortality and other causes were identified (Fig. 4). Of note, unknown mortality causes were also classified as “other” causes, we therefore expect some degree of overlap with cardiovascular mortality within this group due to e.g. sudden cardiac death.
Interestingly, despite a strong correlation of serum AST with ALT levels (Pearson coefficient: 0.62, p < 0.001), ALT levels were not significantly associated with all-cause or cardiovascular mortality (not shown) and did not show a similar protective effect of lower values when spline transformations were considered (Figure S2).
IL12p70 as a predictor of infection-associated mortality in dialysis patients
High IL-12p70 was one of the most stable predictors associated with mortality due to infection (Fig. 2c), showing a significant association also in the confirmation cohort (infection-associated HR 3.12, 95% CI 1.16–8.41, p = 0.024, Table 2). Criteria for stability selection were missed by a small margin for all-cause mortality (Fig. 2a). With the applied assay for IL-12p70, the majority of patients showed an undetectable level of this interleukin (Figure S3). When analyzing mortality binned by IL-12p70 expression (undetectable, weakly elevated, top quintile) in the total study population, we could observe a selective increase in mortality in the top quintile without according tendency in the weakly elevated group (Fig. 3b).
Added value of AST and IL-12p70 for mortality prediction
Time-dependent receiver-operating characteristics (ROC) analysis for univariate Cox models showed highest areas und the ROC curve (AUC) for AST after logarithmic transformation and spline fitting (Fig. 5a). Using the predefined threshold (top quintile) for IL-12p70 had a similar AUC for all-cause mortality as using transformed full data and when fitting a spline (Fig. 5a). Fitting a spline to transformed IL-12p70 data also did not show a significant improvement in a likelihood ratio test (p = 0.209 for all-cause and p = 0.137 for infection-associated mortality, n = 475).
The AUCs for these single predictors were comparable to those of other established predictors such as serum albumin and the use of a catheter for dialysis, though not as high as some of the top predictors for mortality, age and adapted comorbidity score (Fig. 5a).
Analysis of multivariable models showed an additive effect of the new predictors on our previously established model5 (Fig. 5b, see “Methods” for included predictors). In fact, inclusion of both AST and IL-12p70 (green line in Fig. 5b) was superior to albumin or catheter use and had a similar additive value on AUC as the strong predictors age and adapted CCI within the first year of follow up (orange and blue dashed lines in Fig. 4b left panel).
Discussion
Here, we applied techniques of statistical learning to analyze a cohort study of dialysis patients. Although similar techniques have been previously applied in this population in order to improve mortality prediction18,19,20, our approach differs in that our main goal was not to derive an overall prediction model or to address a specific hypothesis, but rather to find novel promising candidates as predictors in a setting with a large amount of available characteristics for a relatively small study population. Many of the well-known and established prognostic factors for mortality were recovered, while a relatively modest number of 317 patients (with 112 events during follow-up) were used for selection among ≈ 110 predictors. We were then able to confirm significant associations in the withheld smaller confirmation group for most of the factors obtained after stability selection using univariate Cox regression. These findings underline the overall validity of the selection process in order to identify important mortality predictors. A notable observation was thereby the identification of a previously unknown association of IL-12p70 with infection-associated mortality.
IL-12p70 is the active heterodimer form of IL-12, which consists of the IL-12p35 and IL-12p40 subunits. While IL-12p35 is thought to be specific for IL-12, IL-12p40 also associates with IL-23p19 to form active IL-23. IL-12 activity has been implicated in the immunologic response to pathogens and tumor cells, as well as in autoimmune disease21,22. Levels of IL-12p70 are known to be relatively low in the serum of healthy individuals23 and were below the detection limit for our assay in the majority of the study population. Contradictory reports concerning IL-12 activity exist for dialysis patients. One study, published shortly after the discovery of this cytokine, observed an increased level of IL-12 in dialysis patients compared to healthy controls24, a finding that could possibly be explained by increased IL-12 release from peripheral blood mononuclear cells (PBMCs) as an inflammatory response to the dialysis membrane25. This study also reported, contrary to our observation, that increased IL-12 levels were associated with improved survival in dialysis patients24. However, in a more recent report assessing specifically the levels of IL-12p40 and IL-12p70, a significant increase of IL-12p40 was observed, while no increase was reported for IL-12p70 in dialysis patients vs. healthy controls26. The discrepancy between our finding and those of Kimmel et al.24 could therefore be explained by the specificity for IL-12p70 in our assay. Alternatively, since we categorized data and compared individuals with 20% highest levels to others, the effects of IL-12 might be non-linear, in that they might be protective up to a certain level and detrimental or indicative of an additional underlying disease (e.g. autoimmune) above that level. Further studies employing more sensitive IL-12p70 and IL-12p40 measurements would be of interest to solve this question. Having high levels of IL-12p70 was one of the most stable predictors for increased infection-associated mortality in our derivation cohort. Although a causal relationship cannot be established by the applied approach, this close correlation suggests that it may be a causal factor or that it may be closely linked to a process causing mortality in hemodialysis patients. Further studies into the underlying pathophysiology would be highly interesting to elucidate this relationship and clarify if modulation of IL-12 activity may serve as a therapeutic intervention.
Another stable predictor which was obtained in our analysis is AST and we further looked into this association because until now it has received little attention in dialysis patients. AST is often used in clinical practice to assess liver damage, however unlike ALT it is less specific and is also elevated in other conditions, such as myocardial infarction or skeletal muscle damage27,28. Levels of AST have been shown to be predictive of both all-cause as well as specifically cardiovascular mortality (potentially due to an association with the metabolic syndrome) in the general population29,30,31,32. While on average a decreased AST activity in dialysis patients compared to the general population was observed long ago33,34, we could only find one previous study analyzing its association with mortality in dialysis patients17 (in this case with all-cause mortality). Our results confirm these findings and suggest a potential association with cardiovascular disease in dialysis patients. An association with liver disease may explain an increased mortality due to "other" causes in patients with high AST values, similar as was observed for increased ALT values (Figure S2). The finding that ALT levels do not show a similar association with cardiovascular mortality, however, suggests that there may be an alternative relevant pathomechanism or a source of AST elevation outside the liver (such as the heart) which is worthy of further investigation. Although our study is not sufficiently powered to further elucidate the pathophysiology behind AST variation (e.g. only 5% of participants had a known liver disease based on medical records) our results underline the prognostic value of AST and support the previously expressed need for different reference ranges in hemodialysis patients35.
When assessing the additional value of AST and IL-12p70 for the prediction of mortality in dialysis patients using time-dependent ROC analysis, we found an additive value of these predictors. Although in the multivariable model the numeric gain was relatively modest, it has to be compared to other established variables, because the model already incorporated many important predictors. We observed a similar effect of AST and IL-12p70 as e.g. for known independent predictors albumin and catheter use for dialysis. The added effect of these two predictors together was similar in the first year to some of the strongest predictors: age and adapted CCI score.
One of the limitations of the applied penalized Cox regression is that it assumes linear associations with the outcome (as does regular Cox regression). This also means that single high-leverage points, which can represent false measurements or extraordinary conditions, may have a strong effect on selection (or omission) of variables. Here we addressed these issues by “dummy coding” quantitative measurements to nominal variables which represent top or lowest quintiles for specific values. Although such thresholding may influence the selection due to loss of information, we in fact observed that most known ordinal/continuous predictors for mortality in dialysis patients passed our selection criteria (i.e. hsCRP, hemoglobin, creatinine, triglycerides, IL-6, YKL-40, AST). However, one notable exception is serum albumin, which was not retained despite a well-documented correlation in the literature. Decreased albumin levels are both observed as part of malnutrition as well as inflammation (anti-acute phase protein), also summarized as dialysis-associated inflammation-malnutrition complex36. There is also a known negative correlation between IL-6 and albumin37,38. Other retained predictors are also associated with inflammation-malnutrition in ESRD patients, such as hsCRP, creatinine or lipids36. It is therefore likely that, next to the thresholding approach, such predictors have additionally diminished the explanatory value of albumin for mortality prediction, as has been reported previously39. A similar collinearity of hsCRP with IL-640,41 may have reduced the predictive value of the former in the case of infection-associated mortality, yet it was selected as a predictor for all-cause mortality where more events were available and potentially due to its described additional role as a predictor of malnutrition42.
We strongly assume that including more patients in the derivation cohort would further improve factor selection, while in our study this dataset was relatively small (i.e. 317 patients). Having a larger cohort would also allow to include variable interactions in order to identify subgroup-specific effects, something that was not accounted for in our approach. Further limitations of our study are missing values for some of the variables and unknown cause of death for some of the patients. We expect that a relevant portion of missing values, as these were set to the reference level (see “Methods”), will reduce the selection stability of a variable and therefore typically lead to a more conservative selection. An additional conceptual limitation is that only the variables at baseline could be used in the current analysis. Yet, the evolution of biomarkers over time in dialysis patients and the course of chronic kidney disease may provide additional important insights and improve predictive performance.
In summary, we successfully applied techniques of statistical learning as an exploratory approach to high-dimensional data from a cohort of dialysis patients. Despite the relatively modest number of patients, we thereby could retrieve most of the established risk markers and additionally identify promising candidates for novel predictors. The described techniques can be considered complementary, yet not as competitors to widely-applied and state of the art hypothesis-driven approaches. Similar techniques, as they become more accessible, may gain importance in routine exploratory study analysis in order to gain additional insights into the underlying risk-determining factors.
Methods
Subjects/study population and data collection
The clinical trial was registered under ClinicalTrials.gov identifier: NCT01152892. For detailed information please refer to the study protocol12. In brief, we included stable HD patients who were at least 18 years of age, had an HD vintage of at least 90 days, and provided written and informed consent. Malignant disease with a life expectancy shorter than 2 years had been the only relevant exclusion criterion. The present study included 475 patients, for whom sufficient unthawed sera were available for cytokine quantification. The study was approved by the ethics committees of the Klinikum rechts der Isar, Technical University Munich, and by the Bavarian State Board of Physicians. It was performed in accordance with the Declaration of Helsinki.
Age, patient characteristics, dialysis modalities, comorbidities and medication were assessed from medical records. Comorbidities were recorded at inclusion and updated during the 2 years follow up as previously described5,43. An adapted CCI was calculated according to Liu et. al.13 (version without ESRD cause).
Cause of death was attributed to the following domains: cardiovascular, infection associated and other (or unknown) of which the cardiovascular domain had been prespecified in the study protocol12. For further details also see Lorenz et al.5 Medical records and interviews of attending physicians, and relatives were used by the ISAR study physician board to ascertain death and assigned each case an underlying cause of death. When no agreement on cause of death could be reached the cause of death was defined as unknown (n = 28).
Serumbiochemistry and functional studies
Serum was collected prior to a midweek dialysis session at inclusion and after the exclusion of active infection. Patients that had undergone surgery within the past 2 weeks were included at a later timepoint. Serum was centrifuged after 30 min of resting at room temperature, aliquoted and stored at – 80 °C. Routine laboratory analyses were assessed by International Organization for Standardization–certified laboratories. YKL-40 levels were measured in duplicates using a commercial enzyme-linked immunosorbent assay (ELSIA) kit (R&D Systems, Inc., Minneapolis, MN). The remaining inflammatory mediators were assessed using BD Flex-sets (Becton Dickinson, Heidelberg, Germany) on a FACS Canto II (Becton Dickinson) and BD Diva software (Becton Dickinson) according to manufacturer’s instructions. Theoretical limits of detection were indicated by the manufacturer between 0.6 and 1.6 pg/mL.
Ambulatory 24-h blood pressure (BP) and pulse wave velocity (PWV) measurements were carried out as previously described44 using a Mobil-O-Graph 24 h-PWA (I.E.M. GmbH, Stolberg, Germany). Retinal vessel analysis was performed using Static Vessel Analyzer (Imedos Systems UG, Jena, Germany). Fundus images of one eye were gathered and classified into four categories according to quality. Two images with the highest qualitative category were analyzed and means of central retinal arteriolar (CRAE) and central retinal venular equivalent (CRVE) as well as arteriolar-to-venular diameter ratio (AVR) were calculated as previously described45.
Statistical analyses
Statistical analysis was performed in R version 3.6.346.
In order to be able to test the selected predictors on patients whose data was not used for variable selection, we split the cohort into a derivation and confirmation datasets. To this end we performed 1000 random splits in the ratio of 2/3 to 1/3 and calculated for each split the p-value for difference in the matching variables (age, adapted CCI, gender, IL-6 and mortality as indicated in Table 1) using Chi-squared test for nominal and Mann–Whitney test for ordinal or continuous variables. We then selected the split where the two groups were most likely similar (based on where the lowest p-value of the tests was the highest).
For stable variable selection we applied elastic net regularized regression on the derivation subcohort on all predictors listed in Table S1 simultaneously. In order to address possible non-linearity, ordinal and continuous variables (with the exception of age for which a linear relationship can be expected and which was expressed in decades) were dummy coded as “high” and “low” group by assigning for each patient whether he falls into the top or bottom quintile for the specific variable. Ties were assigned randomly. Values not falling into top or bottom quintile and missing values were assigned 0 in both dummy variables. Missing values in nominal variables were also assigned to the reference category (meaning absence of a potential risk factor).
In a first step we performed elastic net10 Cox regression separately for all-cause mortality, cardiovascular and infection-associated mortality with 20-fold cross-validation using 15 equally spaced alpha values in the range of 0 to 1 (here alpha = 0 equals ridge regression and alpha = 1 equals lasso regression). We selected for each alpha the lambda with the lowest cross-validation error (Figure S1a). We then selected for each regression the non-zero alpha value with the lowest cross validation error (vertical dashed lines in Figure S1a). Alpha of zero would not result in any variable selection (as it equals ridge regression) and was therefore not chosen. Next, we used the chosen alpha values in a stability selection approach as described by Meinshausen et Bühlmann11 and implemented in R by Sill et al.47 For this preselection approach as a means of hypothesis generation we used 100 subsamples of the derivation cohort, a “weakness” of 0.8 and a (maximal) per-comparison error rate of 10%. All variables passing the selection criteria are reported in Table 2.
For spline fitting we used penalized splines48,49 with four degrees of freedom. For the analysis in Fig. 4 and Figure S2 one outlier with AST of > 600 U/l and ALT > 900 U/l (suggestive of acute hepatopathy) was removed.
The multivariable Cox model for all-cause mortality (Fig. 4b, Figure S2 and Fig. 5b) contained the following predictors (as described previously5): age, gender, BMI, adapted CCI, history of MI, rheumatic disease, laboratory parameters (phosphate, calcium, albumin), Kt/V, HD (not hemodiafiltration), usage of central venous catheter for dialysis, oral anticoagulation, IL-6 (ln-transformed) and YKL-40 (ln-transformed). The multivariable model for cardiovascular mortality contained: age, gender, BMI, adapted CCI, history of MI, rheumatic disease, usage of central venous catheter for dialysis, ultrafiltration per kilogram of dry weight, IL-6 (ln-transformed) and YKL-40 (ln-transformed). The model for infection-associated mortality contained age, gender, BMI, adapted CCI, history of MI, Albumin, usage of central venous catheter for dialysis, hsCRP (ln-transformed) and IL-6 (ln-transformed). For other mortality causes we used a minimal model which included: age, gender, BMI, adapted CCI and IL-6 (ln-transformed).
Time-dependent ROC analysis for Cox regression was performed by first fitting uni- (Fig. 5a) or multivariable (Fig. 5b) models to a bootstrap sample of the total study population and then calculating incident/dynamic AUCs50 on the out of bootstrap sample. The bootstrap procedure was performed 1000 times and the resulting AUCs were averaged per month. For multivariable analysis missing values were randomly sampled from the available values for the respective variable and a total of 25 thus imputed datasets were generated. AUCs for each of these datasets were calculated as described above (these are presented as dots in Fig. 5b).
References
de Jager, D. J. et al. Cardiovascular and noncardiovascular mortality among patients starting dialysis. JAMA 302, 1782–1789 (2009).
Anker, S. D. et al. Development and validation of cardiovascular risk scores for haemodialysis patients. Int. J. Cardiol. 216, 68–77 (2016).
Thamer, M. et al. Predicting early death among elderly dialysis patients: Development and validation of a risk score to assist shared decision making for dialysis initiation. Am. J. Kidney Dis. 66, 1024–1032 (2015).
Ma, L. & Zhao, S. Risk factors for mortality in patients undergoing hemodialysis: A systematic review and meta-analysis. Int. J. Cardiol. 238, 151–158 (2017).
Lorenz, G. et al. Mortality prediction in stable hemodialysis patients is re fi ned by YKL-40, a 40-kDa glycoprotein. Kidney Int. 93, 221–230 (2018).
Günthner, R. et al. Impaired retinal vessel dilation predicts mortality in end-stage renal disease. Circ. Res. 124, 1796–1807 (2019).
Floege, J. et al. Development and validation of a predictive mortality risk score from a European hemodialysis cohort. Kidney Int. 87, 996–1008 (2015).
Wang, P., Li, Y. & Reddy, C. K. Machine Learning for Survival Analysis: A Survey. arXiv:1708.04649[cs.LG] (2017) [cs.LG].
Deo, R. C. Machine learning in medicine. Circulation 132, 1920–1930 (2015).
Zou, H. & Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. Ser. B. Stat. Methodol. 67, 301–320 (2005).
Meinshausen, N. & Bühlmann, P. Stability selection. J. R. Stat. Soc. Ser. B Stat. Methodol. 72, 417–473 (2010).
Schmaderer, C. et al. Rationale and study design of the prospective, longitudinal, observational cohort study “rISk strAtification in end-stage renal disease” (ISAR) study. BMC Nephrol. 17, 1–12 (2016).
Liu, J., Huang, Z., Gilbertson, D. T., Foley, R. N. & Collins, A. J. An improved comorbidity index for outcome analyses among dialysis patients. Kidney Int. 77, 141–151 (2010).
Cohen, L. M., Ruthazer, R., Moss, A. H. & Germain, M. J. Predicting six-month mortality for patients who are on maintenance hemodialysis. Clin. J. Am. Soc. Nephrol. 5, 72–79 (2010).
Keskar, V. & Sood, M. M. Use of Oral Anticoagulation in the Management of Atrial Fibrillation in Patients with ESRD : Con. 2085–2092 (2016) https://doi.org/10.2215/CJN.03200316.
Kilpatrick, R. D. et al. Association between Serum Lipids and Survival in Hemodialysis Patients and Impact of Race. 293–303 (2004) https://doi.org/10.1681/ASN.2006070795.
Ravel, V. et al. Association of aspartate aminotransferase with mortality in hemodialysis patients. Nephrol. Dial. Transplant 31, 814–822 (2016).
Goldstein, B. A., Pencina, M. J., Montez-Rath, M. E. & Winkelmayer, W. C. Predicting mortality over different time horizons: Which data elements are needed?. J. Am. Med. Informatics Assoc. 24, 176–181 (2017).
Friedman, A. N. et al. Fatty acids and other risk factors for sudden cardiac death in patients starting hemodialysis. Am. J. Nephrol. 38, 12–18 (2013).
Goldstein, B. A., Chang, T. I., Mitani, A. A., Assimes, T. L. & Winkelmayer, W. C. Near-term prediction of sudden cardiac death in older hemodialysis patients using electronic health records. Clin. J. Am. Soc. Nephrol. 9, 82–91 (2014).
Tait Wojno, E. D., Hunter, C. A. & Stumhofer, J. S. The immunobiology of the interleukin-12 family: Room for discovery. Immunity 50, 851–870 (2019).
Liu, J. et al. Interleukin-12: An update on its immunological activities, signaling and regulation of gene expression. Curr. Immunol. Rev. 1, 119–137 (2005).
Rea, I. M., McNerlan, S. E. & Alexander, H. D. Total serum IL-12 and IL-12p40, but not IL-12p70, are increased in the serum of older subjects; Relationship to CD3+ and NK subsets. Cytokine 12, 156–159 (2000).
Kimmel, P. L. et al. Immunologic function and survival in hemodialysis patients. Kidney Int. 54, 236–244 (1998).
Memoli, B., Marzano, L., Bisesti, V., Andreucci, M. & Guida, B. Hemodialysis-related lymphomononuclear release of interleukin-12 in patients with end-stage renal disease. J. Am. Soc. Nephrol. 10, 2171–2176 (1999).
Mansouri, L., Paulsson, J. M., Moshfegh, A., Jacobson, S. H. & Lundahl, J. Leukocyte proliferation and immune modulator production in patients with chronic kidney disease. PLoS ONE 8, e73141 (2013).
Kim, W. R., Flamm, S. L., Di Bisceglie, A. M., Bodenheimer, H. C. & Public Policy Committee of the American Association for the Study of Liver Disease. Serum activity of alanine aminotransferase (ALT) as an indicator of health and disease. Hepatology 47, 1363–1370 (2008).
Rej, R. Aminotransferases in disease. Clin. Lab. Med. 9, 667–687 (1989).
Hyeon, C. K. et al. Normal serum aminotransferase concentration and risk of mortality from liver diseases: Prospective cohort study. BMJ 328, 983 (2004).
Lee, T. H., Kim, W. R., Benson, J. T., Therneau, T. M. & Melton, L. J. Serum aminotransferase activity and mortality risk in a United States community. Hepatology 47, 880–887 (2008).
Lee, H. et al. Association between change in serum aminotransferase and mortality. Medicine 95, e3158 (2016).
Oren, R. Serum liver enzymes—Should we count on them?. Liver Int. 34, 171–173 (2014).
Wolf, P. L., Williams, D., Coplon, N. & Coulson, A. S. Low aspartate transaminase activity in serum of patients undergoing chronic hemodialysis. Clin. Chem. 18, 567–568 (1972).
Sette, L. H. B. C. & de Almeida Lopes, E. P. Liver enzymes serum levels in patients with chronic kidney disease on hemodialysis: A comprehensive review. Clinics 69, 271–278 (2014).
Ray, L., Nanda, S. K., Chatterjee, A., Sarangi, R. & Ganguly, S. A comparative study of serum aminotransferases in chronic kidney disease with and without end-stage renal disease: Need for new reference ranges. Int. J. Appl. Basic Med. Res. 5, 31–35 (2015).
Kalantar-Zadeh, K., Ikizler, T. A., Block, G., Avram, M. M. & Kopple, J. D. Malnutrition-inflammation complex syndrome in dialysis patients: Causes and consequences. Am. J. Kidney Dis. 42, 864–881 (2003).
Spoto, B. et al. Association of IL-6 and a functional polymorphism in the IL-6 gene with cardiovascular events in patients with CKD. Clin. J. Am. Soc. Nephrol. 10, 232–240 (2015).
Bologa, R. M. et al. Interleukin-6 predicts hypoalbuminemia, hypocholesterolemia, and mortality in hemodialysis patients. Am. J. Kidney Dis. 32, 107–114 (1998).
Sun, J. et al. Biomarkers of cardiovascular disease and mortality risk in patients with advanced CKD. Clin. J. Am. Soc. Nephrol. 11, 1163–1172 (2016).
Panichi, V. et al. Interleukin-6 is a stronger predictor of total and cardiovascular mortality than C-reactive protein in haemodialysis patients. Nephrol. Dial. Transplant. 19, 1154–1160 (2004).
Wetmore, J. B. et al. Associations of interleukin-6, C-reactive protein and serum amyloid A with mortality in haemodialysis patients. Nephrology 13, 593–600 (2008).
Honda, H. et al. Serum albumin, C-reactive protein, interleukin 6, and fetuin a as predictors of malnutrition, cardiovascular disease, and mortality in patients with ESRD. Am. J. Kidney Dis. 47, 139–148 (2006).
Lorenz, G. et al. Worsening calcification propensity precedes all-cause and cardiovascular mortality in haemodialyzed patients. Sci. Rep. 7, 1–11 (2017).
Matschkal, J. et al. Comparison of 24-hour and office pulse wave velocity for prediction of mortality in hemodialysis patients. Am. J. Nephrol. 49, 317–327 (2019).
Seidelmann, S. B. et al. Retinal vessel calibers in predicting long-term cardiovascular outcomes. Circulation 134, 1328–1338 (2016).
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2020).
Sill, M., Hielscher, T., Becker, N. & Zucknick, M. c060: Extended inference with lasso and elastic-net regularized cox and generalized linear models. J. Stat. Softw. 62, 1–21 (2015).
Eilers, P. H. C. & Marx, B. D. Flexible smoothing with B -splines and penalties. Stat. Sci. 11, 89–121 (1996).
Therneau, T. M. & Grambsch, P. M. Modeling Survival Data: Extending the Cox Model (Springer New York, 2000) https://doi.org/10.1007/978-1-4757-3294-8.
Heagerty, P. J. & Zheng, Y. Survival model predictive accuracy and ROC curves. Biometrics 61, 92–105 (2005).
Acknowledgements
This study was supported by the Klinikum rechts der Isar of the Technical University of Munich and the Else-Kröner-Fresenius Foundation (P27/10//A33/10 to C. Schmaderer). G. Lorenz was supported by a Baxter IIR grant. S. Werfel was supported by a research scholarship from the Klinikum rechts der Isar of the Technical University of Munich.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
S.W. and C.Schmaderer designed the study. G.L. performed and supervised data collection. R.G., J.M., M.C.B., C.Schaller, P.G., S.K., P.M., U.H., J.R. and C.Schmaderer organized and performed data collection. S.W. performed the statistical and machine learning analyses, made figures and drafted the manuscript. B.H. helped to design and critically assess the statistical and machine learning analyses and draft the manuscript. C.Schmaderer, S.S.H., C.N., J.R., P.M. and U.H. critically revised the manuscript. C.Schmaderer and U.H. acquired funding. All authors gave their final approval.
Corresponding author
Ethics declarations
Competing interests
J. Reiser reports grants from National Institute of Health, Thermo BCT and Nephcure, personal fees from Biomarin, TRISAQ, Astellas, Massachusetts General Hospital, Genentech, Up to Date, Merck, Inceptionsci, GLG, Clearview, outside the submitted work; In addition, J. Reiser reports patents licensed to Miltenyi and TRISAQ, outside the submitted work. J. Reiser reports he is co-founder and share holder of TRISAQ, a bio-pharmaceutical company that develops drugs for kidney disease. All of the remaining authors have nothing to disclose.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Werfel, S., Lorenz, G., Haller, B. et al. Application of regularized regression to identify novel predictors of mortality in a cohort of hemodialysis patients. Sci Rep 11, 9287 (2021). https://doi.org/10.1038/s41598-021-88655-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-88655-0
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.