Introduction

Zinc-alpha 2-glycoprotein, AZGP1 (often also abbreviated as ZAG) is a secreted 43 kDa protein which is expressed in many epithelial tissues and in adipocytes. AZGP1 circulates at high concentrations in human blood.

Despite a multitude of proposed implications in different diseases, the true role of AZGP1, which shows inverse variations with glomerular filtration rate1,2,3, is still poorly defined. AZGP1 has been suggested as a modifier of metabolic functions, insulin sensitivity, fat mass expansion, blood pressure regulation, cancer progression, inflammatory disease, vitiligo, neurological disease and cardiovascular (CV) disease4,5,6,7,8,9. Experimentally AZGP1 exerts antifibrotic effects in kidney and heart10 and indirect clinical evidence indicates that AZGP1 can act as an autocrine/paracrine adipokine to promote lipolysis and improve insulin resistance4,11,12,13. Through these and other effects, AZGP1 might modify the risk for metabolic syndrome, CV disease and mortality14,15,16,17. However, previous studies have provided partially conflicting findings about the potential role of circulating AZGP1 on CV outcomes and mortality in different populations. While serum AZGP1 correlated with CV disease and mortality in dialysis patients, there was an inverse relationship with atherosclerosis and coronary heart disease in non-dialysis patients12,18,19.

CV disease is the leading cause of morbidity and death in individuals over the age of 65. Typically, older adults have a more heterogeneous burden of comorbidities and the pathomechanistic course of CV disease shows more variability compared to younger patients20,21. An important factor contributing to this variability are age- and health-dependent changes of renal function22,23. Improving risk prediction of CV disease has been difficult in the elderly and existing prediction modeling suboptimal24.

The aim of the present study was to build prediction models for death and the composite endpoint of death and cardiovascular events (including stroke and myocardial infarction) in old age. Therefore, we investigated the prognostic potential of several knowledge-based predictors including biomarkers for kidney function and serum AZGP1 levels for these outcomes within the Berlin Initiative Study, a population-based cohort of older adults initiated to study kidney function at older age25,26.

Results

Cohort characteristics

Baseline characteristics of the study population are presented in Table 1 (stratified by quartiles of log-transformed AZGP1 serum levels. The median duration of follow-up was 48.3 months. Of the 930 individuals analyzed, 53% were female, mean age was 82 years, and mean BMI was 28 kg/m2; 63% had a CCI ≥ 8. The mean eGFRBIS2 was 52 ml/min/1.72 m2, and median albumin-to-creatinine-ratio (ACR) was 11 mg/g. Based on health insurance data, 17% of study participants had experienced a prior MI and 16% a stroke before enrolment into the study. Mean log-transformed AZGP1 was 1.92 μg/ml (Table 1). As the 930 individuals with AZGP1 measurement (64.6%) were a subsample of the total sample (n = 1440) of the second BIS follow-up visit, we compared them to the 510 individuals (35.4%) without AZGP1 measurements. Overall, the two groups exhibited similar characteristics, with the AZGP1 study population being on average 2 years younger, having slightly more smokers, less comorbidities, and a tendency to lower ACR values (Supplementary Table S1).

Table 1 Baseline characteristics of total study sample and according to log-transformed AZGP1 levels in older adults.

Baseline variables according to AZGP1 levels

For most of the baseline variables no distinct pattern such as an increase or decrease with regard to the AZGP1 concentration could be observed (Table 1). No difference in mean AZGP1 levels was seen between women (1.91 ± 0.23) and men (1.93 ± 0.23, p = 0.13). All kidney function measures, however, demonstrated worsening kidney function (an increase in serum creatinine or cystatin C levels or a decline in eGFR) with increasing levels of AZGP1. While we found no differences in CCl between the lower AZGP1 groups, we observed that the highest quartile group was associated with higher CCI categories. As further analysis revealed that individuals in the highest CCl category were also more likely to exhibit signs of kidney impairment (decreased GFR) and kidney damage (albuminuria), we repeated the analysis excluding individuals categorized as CKD with “very high risk” regarding kidney disease prognosis according to the KDIGO guidelines (see Figure “Prognosis of CKD by GFR and albuminuria category” in 27). In the analysis excluding individuals with GFR category G3a and worse with persistent albuminuria category A3 (eGFR < 60 ml/min/1.73 m2 and ACR > 300 mg/g), GFR category G3b and worse with persistent albuminuria category A2 (eGFR < 45 ml/min/1.73 m2 and ACR ≥ 30 mg/g), and GFR category G4 and G5 (eGFR < 30 ml/min/1.73 m2) the association of AZGP1 with the highest CCI category vanished (Table 2). Also, AZGP1 correlated with none of the continuous baseline variables (Table 3).

Table 2 Indicators of CKD and AZGP1 by CCI categories in the subsample and when excluding individuals without high risk of CKD according to KDIGO guidelines*.
Table 3 Correlation coefficients (CC) of log-transformed AZGP1 levels with demographic parameters and biomarkers.

Prediction of death

In total, 234 deaths occurred in the subsample, and 218 deaths in the 882 cases with complete candidate variables (Supplementary Fig. S1). There was no difference in survival when comparing the AZGP1 quartiles in Kaplan–Meier analysis (p = 0.110, Supplementary Fig. S2). Out of the 20 candidate variables considered for the prediction of death, from 500 bootstrap-based Cox models with backward selection, AZGP1 was among 8 variables included in > 50% of all models (67.4%, Supplementary Table S2). In multivariable Cox regression analysis of the subsample, the final model incorporated the predictive factors age (HR; 95% CI: 1.60; 1.41–1.81), BMI (0.92; 0.88–0.95), smoking (1.58; 1.20–2.08), anemia (1.44; 1.05–1.97), cystatin C (1.27 per 0.3 mg/l; 1.16–1.39), CRP (1.45 per log(mg/l)-unit; 1.06–1.97), CCI (1.11; 1.06–1.15), and AZGP which demonstrated predictive value (HR per log-unit: 0.44; 0.24–0.8, p = 0.008, Fig. 1A). The c-index of this Cox model was 0.756 (0.755–0.758). The calibration test indicated good model performance (p = 0.345). The model overestimated the observed survival by 2.0–6.5% (Fig. 2A). When using LASSO for variable selection, the same eight variables with similar HR estimates as stated above and eight additional variables were selected (Supplementary Table S2). The corresponding c-index was 0.766 (0.746–0.786).

Figure 1
figure 1

Predictors for (A) death and (B) composite endpoint (endpoint of myocardial infarction, stroke or death) from the multivariable Cox model with stepwise backward selection. BMI body mass index, log natural logarithm, CRP c-reactive protein, HR hazard ratio, CI confidence interval. The figure was produced with IBM SPSS Statistics version 25.0 (https://www.ibm.com/de-de/analytics/spss-statistics-software) and R version 4.0.0 (https://www.R-project.org).

Figure 2
figure 2

Fit of the multivariable Cox models with 95% confidence interval compared to the estimates from Kaplan–Meier analysis for (A) death and (B) the composite endpoint. The p-value from a calibration test for the Cox model is shown. KM Kaplan–Meier, CI confidence interval, time duration of observation since biomarker measurement. The figure was produced with IBM SPSS Statistics version 25.0 (https://www.ibm.com/de-de/analytics/spss-statistics-software) and R version 4.0.0 (https://www.R-project.org).

Prediction of the composite endpoint of myocardial infarction, stroke or death

In total, 70 incident strokes and 38 incident MI occurred. When assessing the quartile groups for time to the composite endpoint of MI, stroke or death (n = 661 without prior events), there were again no differences between the AZGP1 groups in Kaplan–Meier analysis (p = 0.697, Supplementary Fig. S2). Of the 638 cases with complete candidate variables 178 reached the composite endpoint, by counting the first of the events of interest comprising 32 incident MI, 66 incident strokes and 80 deaths. Out of the 20 candidate variables considered for the prediction of the composite endpoint, from 500 bootstrap-based Cox models with backward selection, AZGP1 was among 8 variables included in > 50% of all models (61.0%, Supplementary Table S3). Again, when analyzing the predictive value in the multivariable Cox regression model incorporating the predictive factors (HR; 95%CI) age (1.45; 1.26–1.67), BMI (0.95; 0.91–0.99), waist-hip-ratio (1.31; 1.06–1.62), cystatin C (1.15 per 0.3 mg/l; 1.04–1.28), CRP (1.51 per log(mg/l)-unit; 1.07–2.13) and CCI (1.10; 1.05–1.15), higher AZGP1 levels predicted a lower risk for the composite endpoint (HR per log-unit = 0.43, 0.23–0.82, p = 0.010; Fig. 1B). The c-index of this Cox model was 0.681 (0.683–0.679). The calibration test indicated good model performance (p = 0.498). The model overestimated the observed frequency of the composite endpoint by 1.6–4.1% (Fig. 2B). When using LASSO for variable selection, only age, CCI and cystatin C were selected with HRs which were about half the size of the above described model (Supplementary Table S2). AZGP1 was not included in this model. The corresponding c-index was 0.749 (0.731–0.767).

Discussion

The purpose of this analysis was to develop a knowledge-based prediction model for mortality and CV events explicitly in a subsample of a large, well-characterized cohort of older adults and to evaluate the predictive potential of the biomarker AZGP1. So far, no valid prediction model for both endpoints in this age group had existed. A proposed prediction model such as the SCORE-model for older people (SCORE-OP) was proven suboptimal when externally validated in the BIS-cohort24.

Our analysis identified that, apart from AZGP1, chronological age, BMI, CCI, cystatin C and elevated CRP proved to be significant predictors of both, mortality and the composite CV endpoint. With regard to AZGP1, we found that higher AZGP1 serum levels were predictive for lower mortality and a lower risk for the composite CV endpoint. In the validation step evaluating 500 bootstrap samples AZGP1 was selected into 67% of the models for death, and into 61% of the models for the composite CV endpoint. The finally selected models built on the original subsample both overestimated the observed event-free survival, albeit to a negligible extent, as indicated by the non-significant results of the calibration test. When we performed variable selection with the LASSO method, for the endpoint mortality all 8 variables from the bootstrap-backward-selection (BBS) approach were confirmed with similar coefficients, despite the LASSO model comprising eight additional variables, indicating a small impact of these variables. Given the comparable discriminative ability of both models according to the c-index and aiming at sparse models, we consider the model derived with BBS as preferable. For the composite endpoint the LASSO selection resulted in a small model only comprising age, CCI and cystatin C, to our surprise with a better discrimination index than the BBS model. The three variables selected were also part of the BBS model with 6 variables; regression coefficients for age, CCI and cystatin C in the LASSO model were about half the size of those in the BBS model, presumably since no variables with negative coefficients were part of the LASSO selection, such as BMI and AZGP1 in the BBS model. From a medical perspective the small LASSO model seems rather questionable; to our knowledge the most commonly used risk score in Europe adapted for older people (SCORE-OP) does not perform satisfactorily in older adults24. This means there is no straight answer which variables could be missing here to predict cardiovascular risk in the general elder population. Furthermore, it is known that LASSO will select only one feature from a group of correlated features, and the selection is arbitrary in nature28. Thus, the surprisingly limited variable selection for the composite cardiovascular endpoint could also be a result of this property.

Although our understanding of the complex molecular changes of aging is increasing, there is still a lack of biomarkers which are able to distinguish between healthy and unhealthy aging. Emerging biomarkers which have been introduced in association with increased CV risk and mortality include markers of inflammation, heart failure or kidney dysfunction, e.g. CRP, Brain Natriuretic Peptide (BNP) and cystatin C29,30,31,32,33. In our cohort, we had no information on BNP, but on CRP as well as cystatin C which were both predictive for mortality and the composite endpoint.

Similar to cystatin C, AZGP1 is filtered and degraded by the kidney. Thus, circulating concentrations of AZGP1 also depend on renal function, leading to increased plasma levels in patients with kidney dysfunction2,3. In the present study all kidney function parameters (creatinine, cystatin C, eGFR) demonstrated worsening kidney function with increasing levels of AZGP1, though without significant correlation patterns. Higher AZGP1 serum levels predicted reduced outcome risks in contrast to cystatin C levels where higher levels predicted increased outcome risks. In the BIS cohort, a population-based sample of older adults investigating the decline in kidney function, we have previously demonstrated an elevated load of major CV risk factors in individuals with reduced kidney function25,34.

While there are no published data on the predictive nature of AZGP1, there are some studies investigating the association of AZGP1 and traditional CV risk factors. Kurita et al., for example, presented a cohort in which higher serum AZGP1 levels were positively associated with hypertension35. In contrast, Zhu et al. demonstrated reduced AZGP1 levels in hypertensive patients9. A recent cross-sectional study by Huang et al. in middle aged patients (mean age 58 years) indicated an inverse correlation between serum AZGP1 levels and coronary artery disease (CAD)18. The authors demonstrated an independent association between low AZGP1 levels and atherosclerosis, and provided experimental evidence indicating that AZGP1 exerts anti-inflammatory effects via the β3-adrenoceptor and JNK/AP-1 signaling in macrophages and possibly in other cells surrounding atherosclerotic plaques. Another study in middle aged patients (mean age 55 years) found a similar association between lower AZGP1 levels and premature CAD9. Since the pathophysiologic relationship between AZGP1 and CV risk factors especially in older adults is still unclear, we focused in our analysis on the predictive power of AZGP1 and found AZGP1 to be predictive for mortality and the composite endpoint including myocardial infarction and stroke.

As we investigated a population of older adults, another relevant aspect is the role of AZGP1 in frail individuals. In a previous observational study Lee et al. investigated frailty and AZGP1 levels in participants from a hospital-based comprehensive geriatric assessment program (mean age 77 years)36. In contrast to our data they found that AZGP1 levels were higher in men than in women. By multiple linear regression analysis they also observed a positive correlation between AZGP1 and frailty in women, assessed by unintentional weight loss, exhaustion, low physical activity level, slow walking speed and low grip strength. A further comparison of our data with the study by Lee et al. is difficult as we do not have information on frailty parameters. We based our assessment on the CCI which represents the concept of comorbidity and overlaps only partly with the construct of frailty37. When investigating individuals of the highest AZGP1 quartile group we found an association with a higher CCI. To minimize the potential influence of impaired kidney function we repeated the analysis without individuals with CKD and found no association between AZGP1 and CCI, suggesting that the increased AZGP-1 levels were due to the presence of CKD and not the higher CCI.

Overall, data on AZGP1 in older adults are scarce. There is only one additional report in which plasma AZGP1 levels were measured in a surgical geriatric patient cohort; Vasunilashorn et al. performed a proteomic study in a subsample of the Successful Aging after Elective Surgery (SAGES) cohort (mean age 77 years)38. Interestingly, they found that lower levels of AZGP1 were associated with a higher risk to develop post-surgical delirium in geriatric patients.

Our study has some limitations. First, we cannot draw conclusions about any causative role of AZGP1 levels. It remains unclear whether AZGP1 is actively involved in CV protective mechanisms or whether it should be regarded as a purely predictive biomarker. While there are interesting mechanistic concepts on potential protective functions, we are lacking conclusive evidence about their relevance which is why we chose a predictive and not a causal model. Secondly, AZGP1 measurements were made on stored samples of one time point measurements. This may not reflect AZGP1 dynamics over time. Furthermore, the analyzed subsample was slightly younger and seemingly healthier compared to the overall BIS cohort. Finally, assessment of the discriminatory properties for both prediction models in external data sets is required, as is the evaluation of the predictive potential of AZGP1 in established risk models or proposed risk scores especially developed for older ages. Still, we think that despite the limited model validation and the deviating results of the LASSO variable selection for the composite endpoint, our findings propose that AZGP1 might have potential as a new biomarker for cardiovascular aging. The strengths of our report are based on the size of the database given the advanced mean age of the participants and the comprehensive nature of follow-up data, which provides a well phenotyped sample of community-dwelling older adults. Besides, we applied rigorous prediction modeling methodology. Furthermore, our results contribute a piece to the puzzle of the potential role of AZGP1 in cardiovascular aging.

In conclusion, this is the first study to analyze the predictive potential of AZGP1 serum levels for mortality and CV events in a community-based subsample of older adults in Berlin. Our data revealed that in old age higher serum levels of AZGP1 were predicting a reduced mortality and CV risk within a model of other knowledge-based predictors. Given the challenges of traditional CV risk prediction in older people we suggest that AZGP1 merits further exploration as a novel CV biomarker for healthy aging.

Methods

Study design and population

The Berlin Initiative Study (BIS) is a longitudinal population-based cohort of 2069 older adults living in Berlin, who were recruited between November 2009 and June 2011. Inclusion criteria were AOK membership [AOK-Nordost—Die Gesundheitskasse; Berlin’s largest statutory health insurance fund] and age of 70 years and above. Follow-up information was collected bi-annually. A detailed description of the study design can be found elsewhere26. Additionally, insurance claims data including “International Classification of Diseases”-10 (ICD-10) codes39 were available for all study participants. The Charlson comorbidity index (CCI) was calculated based on ICD-10 codes, and the age-adjusted CCI version was used40. The study was approved by the local ethics committee, Charité, Berlin, Germany (EA2/009/08), and every participant gave written informed consent. All research was performed in accordance with relevant guidelines and regulations and the Declaration of Helsinki. The data base freeze for this analysis was December 04, 2019.

Data for the current study include a subsample of 930 out of 1440 BIS study participants who attended the 4-year follow up visit of the BIS between January 2014 and September 2015. Due to a change of the laboratory analysis platform provider during the 4-year follow up visit, reserve blood samples had been stored for the first 65% of participants until June 2015. Those 930 samples were available for the additional laboratory analyses of AZGP1 conducted in 2018 (Supplementary Fig. S1).

Laboratory methods

AZGP1 (μg/ml) was measured in frozen serum samples (− 80 C) in a subsample of 930 individuals of the 4-year follow-up visit using a previously described2 commercial enzyme-linked immunosorbent assay (Biovendor, Modrice, Czech Republic), according to the manufacturer’s instructions. Investigators were blinded to patients’ data and all measurements were performed in duplicate in 2018. The assay sensitivity was 0.673 ng/ml. The intra-assay coefficient of variation was less than 5%. Serum creatinine was analysed using the isotope dilution mass spectrometry (IDMS) traceable enzymatic method from Roche (Crea plus; Roche Diagnostics, Mannheim, Germany) on a Roche Modular-analyzer P-Modul. Cystatin C was measured using the particle enhanced turbidimetric (PETIA) Tina-quant generation 2 assay on the Roche/Hitachi Cobas S system (Cobas c 501).

The estimated glomerular filtration rate (eGFR) was calculated with the combined creatinine and cystatin C-based BIS2 equation (eGFRBIS2) developed for individuals above the age of 7041,42.

Statistical analysis

The biomarker AZGP1 was log(10)-transformed for further analyses. To recognize potential trends in relation to demographic and clinical characteristics with rising AGZP1 serum levels, the cohort was split into AGZP1 quartile groups. Correlations of AZGP1 with continuous variables were evaluated with Pearson and Spearman correlation coefficients, for categorical variables biomarker levels were compared with the t-test. We investigated the predictive value of serum AZGP1 levels for the outcome mortality and the composite endpoint comprising the outcomes stroke, myocardial infarction (MI) or death, whatever occurred first. Information on mortality, acute and prior events of MI (ICD10: I21, only prior I22, I23, I25.2) and stroke (ICD10: I61, I63, I64, only prior I69.1, I69.2, I69.3, I69.4) were derived from the claims data. Individuals with a previous event of the composite endpoint (stroke or MI) by insurance claims data before the AZGP1 measurement (n = 269) were excluded from the respective analyses (ICD-10 codes from insurance claims data reach back to 2006).

For time-to-event analysis we used Kaplan–Meier analysis with the Breslow test by quartile groups of the biomarkers, and Cox proportional hazard models with stepwise backward selection. Based on medical pre-selection, the following variables were considered for the predictive models: age, gender, body-mass-index (BMI), waist-to-hip-ratio, smoking, anemia, systolic and diastolic blood pressure, serum creatinine, serum cystatin C, hemoglobin, high density lipoprotein (HDL), low density lipoprotein (LDL) and total cholesterol, C-reactive protein (CRP, log-transformed), eGFRBIS2, albumin-to-creatinine ratio (ACR, log-transformed), age-corrected Charlson Comorbidity Index (CCI), antihypertensive treatment, and AZGP1 serum levels (log-transformed). Further comorbidities such as diabetes mellitus, myocardial infarction, stroke, cancer and coronary heart disease were included as parts of the CCI. To illustrate the fit of these models, calibration plots are shown comparing survival estimates with 95% confidence interval from the Cox models to the Kaplan–Meier survival rates; the modified D'Agostino-Nam test on calibration of the Cox model was performed43,44. To assess model discrimination, a generalized version of the c-index for survival analysis allowing for censoring was calculated45. For internal model validation, 500 bootstrap samples were drawn from the subsample and Cox regression with backward selection was applied. The mean of the differences in c-indices of the corresponding models between the bootstrap and the original subsample was calculated as indicator for the optimism in the c-index of the prediction model derived from the original subsample46. The c-indices corrected for optimism are reported. Furthermore, to check the robustness of our model results with other statistical selection techniques, we additionally performed the LASSO method for Cox regression47 (package glmnet in R48) in the original data. For comparison, we show the variable selection from LASSO when choosing λ where the cross-validation error curve hits its minimum; c-indices from the incorporated tenfold cross-validation function are reported.

Since information on death and endpoints derived from claims data are only available with a respective time lag, analyses of death were censored at 30.06.2019, and analyses for the composite endpoint were censored at 31.12.2018. We conducted complete-case analyses. IBM SPSS Statistics version 25.049 (https://www.ibm.com/de-de/analytics/spss-statistics-software) and R version 4.0.048 (https://www.R-project.org) were used for analysis. We followed the TRIPOD reporting guidelines50.