Background

Prostate cancer is the second most commonly diagnosed cancer in men worldwide after lung cancer and is a leading cause of cancer death.1 However, its well-established risk factors age, ethnicity family history and other genetic factors are not modifiable.2,3 There is evidence that higher circulating insulin-like growth factor-I (IGF-I) concentrations are related to higher overall prostate cancer risk,4,5 and obesity has been associated with a higher risk of aggressive disease.6 Moreover, men with low free testosterone concentrations may have a lower risk of overall prostate cancer.7 However, the aetiology of prostate cancer is not well understood, and there is a need to identify novel risk factors for the disease.

There is some evidence that some non-communicable diseases are associated with prostate cancer risk. Men with diabetes or kidney disease have a lower risk of being diagnosed with prostate cancer,8,9 but it is unknown whether concentrations of common biochemical factors related to other non-communicable diseases, such as cardiovascular or liver disease, are also associated with prostate cancer. Due to the high cost of measuring multiple biomarkers in samples from an entire cohort, most previous studies of potential biomarkers for prostate cancer risk have been nested case-control studies of a small number of selected putative cancer biomarkers.

The UK Biobank prospective cohort study is an important resource for the study of cancer aetiology. UK Biobank has measured concentrations of multiple biomarkers known to be related to several non-communicable diseases (i.e. cardiovascular, bone and joint, diabetes, renal and liver diseases) in blood and urine samples from every participant in the cohort. This provides a unique opportunity to investigate a wide range of biomarkers, including biomarkers that have not previously been studied in relation to prostate cancer risk. We report here the first results from this cohort on the association of concentrations of blood and urine biomarkers known to be related to other non-communicable diseases with prostate cancer incidence and mortality in a large British cohort.

Methods

Study design and population

UK Biobank is a prospective study of >500,000 people (aged 40–69 years, including 229,000 men) designed to be a resource for research into the causes of disease in middle and old age. The study protocol and information about data access are available online (http://www.ukbiobank.ac.uk/wp-content/uploads/2011/11/UK-Biobank-Protocol.pdf) and elsewhere.10 All the covariates were measured at the same time as the biological samples were collected.

Briefly, persons who lived within reasonable traveling distance (25 km) of 1 of the 22 assessment canters across England, Wales and Scotland were identified from National Health Service (NHS) registers and invited to participate in the study between 2006–2010, resulting in a participation rate of 5.5%.11 At recruitment, participants provided detailed information on a range of sociodemographic, physical, lifestyle and health-related factors via self-completed touch-screen questionnaires and a computer assisted personal interview.11 Anthropometric measurements (standing height, weight, waist and hip circumferences) were taken by trained research clinic staff at the assessment centre, while body mass index [BMI] and percentage body fat were assessed through bioimpedance measures.12 A total of 211,754 men were included in these analyses; a study flowchart showing all the exclusions can be found in Supplemental Fig. 1.

Blood and urine collection and laboratory analysis

Blood and a mid-flow urine sample (both non-fasting) were collected at recruitment from 99.7% and 96% of participants, respectively. Repeat blood and urine samples were collected in ~9000 men between August 2012 and June 2013 at the UK Biobank Co-ordinating Centre in Stockport; participants who lived within a 35 km radius were invited to attend, with an overall response rate of 21%.13,14 Blood and urine samples were shipped to the central processing laboratory at 4 °C prior to serum preparation, aliquoting and cryopreservation in a central working archive.15,16

For the current analysis, we have included 28 biochemistry markers (cardiovascular, bone, joint, diabetes, renal and liver-related biomarkers, please see Table 1 for the full biomarker list) that were measured in 229,000 men. UK Biobank did not release data on biomarkers outside of the limit of detection for each individual biomarker; therefore, extreme outliers are already not included in the dataset. Details about assay methods and quality control procedures are available online (https://biobank.ndph.ox.ac.uk/showcase/showcase/docs/serum_biochemistry.pdf and http://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/urine_assay.pdf).

Table 1 Baseline characteristics and biochemical data of all men and men who developed prostate cancer in UK Biobank.

Ascertainment of prostate cancer incidence and mortality

For prostate cancer incidence, the endpoint was first diagnosis of prostate cancer (International Classification of Diseases Tenth revision codes: C6117) or death from prostate cancer (prostate cancer mentioned on the death certificate), whichever was first. Men were followed-up for cancer incidence until the censoring dates (31 March 2016 in England and Wales and 31 October 2015 in Scotland).

For analyses of prostate cancer mortality, the endpoint was prostate cancer as the underlying cause of death recorded on the death certificate and men were followed-up until 31 January 2018 for England and Wales and 30 November 2016 for Scotland.

Cancer incidence and mortality data were provided by the NHS Digital for England and Wales and by the NHS Central Register and Information and Statistics Division for Scotland. Person-years were calculated from the date of recruitment to the date of cancer registration (first malignant neoplasm, except non-melanoma skin cancer (ICD-10 C44)), death date lost to follow-up or the censoring date, whichever occurred first.

Statistical analysis

Logarithmically transformed concentrations of the biochemistry markers were used for all analyses to approximate a normal distribution. Cox proportional hazards models were used to calculate hazard ratios (HRs) and 95% confidence intervals (CIs) for prostate cancer incidence and death, using age as the underlying time variable. Men were categorised into fifths (for incident analyses) and thirds (for death analyses, due to the more limited number of events) of biomarker concentrations based on the distribution in the cohort. We also modelled HRs per SD higher concentrations. The minimally adjusted models were stratified by geographical region of recruitment (ten UK regions) and age (<45, 45–49, 50–54, 55–59, 60–64 and ≥65 years) at recruitment. The fully adjusted model was further adjusted for Townsend deprivation score (fifths, unknown [0.1%]), ethnic group (white, mixed background, Asian, black, other and unknown [0.7%]), height (<170, 170–174.9, 175–179.9 and ≥180 cm and unknown [0.6%]), lives with a wife or partner (no, yes), body mass index (BMI) (<25, 25–<29.9, 30–34.9 and ≥35 kg/m2 and unknown [0.6%]), cigarette smoking (never, former, current 1–<15 cigarettes per day, current ≥15 cigarettes per day, current but number of cigarettes per day unknown and smoking status unknown [0.7%]), physical activity (low [0–9.9 METs/week], moderate [10–49.9 METs/week] and high [≥50 METs/week], unknown [3.7%]), alcohol consumption (non-drinkers, <1–9.9, 10–19.9 and ≥20 g ethanol/day, unknown [0.6%]) and diabetes (no, yes, and unknown [0.6%]). For the vitamin D analyses, we further adjusted the fully adjusted model by month of recruitment to allow for the influence of month of blood draw on circulating concentrations. Categories of the adjustment covariates were defined a priori based on previous analyses by our group using UK Biobank data.8 To check for violation of the proportional hazards assumption we examined time-varying covariates and Schoenfeld residuals. This check did not indicate any such violation.

Measurement error and within-person variability when using single biomarker measurements lead to under-estimation (regression dilution bias) of potential associations between biomarker concentrations and prostate cancer risk.18 Repeated biomarker measures were available from a subsample of ~9000 men who had provided a second blood sample ~4–5 years after recruitment, and these were used to correct risk analyses for regression dilution bias using the McMahon–Peto method.19 In MacMahon’s method, individuals are grouped into fifths according to their first biomarker measurement, and the mean of the biomarker is calculated for each group at each repeat. The MacMahon’s regression dilution ration (RDR) is the ratio of the range of means at the repeat to the range of means at the first measurement.

Sensitivity analyses to reduce reverse causality were performed by repeating the analyses after excluding the first 4 years of follow-up; due to the relatively small number of deaths and limited statistical power, this sensitivity analysis was not performed for prostate cancer death. We also restricted analyses to those biomarkers measured using aliquot 1, which was the vast majority (~88% of the samples), in order to assess whether the inadvertent dilution of some aliquots had an impact on the associations.20 Finally, we restricted analyses to men aged ≥50 years to exclude those with very prostate cancer risk.

All analyses were performed using Stata version 14.1 (Stata Corporation, College Station, TX, USA), all tests of significance were two-sided, and to account for multiple testing the false discovery rate was controlled to 0.05 using the Benjamini–Hochberg method.21

Results

Participants’ characteristics

During the follow-up period (mean 6.9 years, SD: 1.4 for incidence; and 8.7 years, SD: 1.2 for mortality), 5968 men were diagnosed with prostate cancer and there were 331 prostate cancer deaths. Table 1 shows the characteristics of the study participants at baseline, including means and SDs for baseline biomarker measurements. Among all participants, the mean BMI was 27.8 kg/m2, 13% reported that they were current cigarette smokers, 43.5% reported drinking at least 20 g of alcohol per day and 28.7% of men reported being physically inactive. The mean age at diagnosis was 66 years (SD, 5.4 years). Regression dilution ratios, which measure within-person variability, are reported in Supplementary Table 1, and ranged from 0.21 to 1.03, with an average of 0.62.

The associations of concentrations of blood and urine biomarkers with prostate cancer incidence, corrected for regression dilution bias, are reported in Figs. 13, and associations without correction for regression dilution bias are shown in Supplementary Tables 2, 3, 4 and 5, while both uncorrected and corrected HRs for 1-SD higher circulating concentrations of each biomarker are shown in Supplementary Table 1. The associations were generally slightly greater in magnitude after adjustment for regression dilution bias (see Supplementary Table 1); the HRs and (95% CIs) below reported in the text below are the corrected results.

Fig. 1: Hazard ratio of incident prostate cancer by fifths of usual serum cardiovascular-, bone-, joint- and diabetes-related biomarker concentrations in up to 205,529 men in UK Biobank.
figure 1

The hazard ratios above each plot are for 1-SD higher concentration of the biomarker after correction for regression dilution bias. HRs are from Cox regression analyses stratified by region and age at recruitment and adjusted for age (underlying time variable), Townsend deprivation score, ethnicity, lives with a wife or partner, BMI, smoking, physical activity, alcohol consumption and diabetes. For vitamin D analyses, the model was further adjusted for month of recruitment. Full details for each covariate are provided in the statistical section. The boxes represent the hazard ratios; the vertical lines represent the 95% CIs. The x-axis shows the mean concentrations of the repeat biomarker measurement within each fifth. The numbers above the vertical lines are point estimates for hazard ratios, and the numbers below are the number of prostate cancer diagnoses within each fifth. Multivariable-adjusted P values marked in boldface were statistically significant after allowing for multiple testing. ApoA1 apolipoprotein A1, ApoB apolipoprotein B, BMI body mass index, CI confidence intervals, HbA1c haemoglobin A1c, HDL high-density lipoprotein, HR hazard ratio, N number of prostate cancer cases.

Fig. 2: Hazard ratio of incident prostate cancer by fifths of usual serum renal-related biomarker concentrations in up to 211,754 men in UK Biobank.
figure 2

The hazard ratios above each plot are for 1-SD higher concentration of the biomarker after correction for regression dilution bias. HRs are from Cox regression analyses stratified by region and age at recruitment and adjusted for age (underlying time variable), Townsend deprivation score, ethnicity, lives with a wife or partner, BMI, smoking, physical activity, alcohol consumption and diabetes. Full details for each covariate are provided in the statistical section. The boxes represent the hazard ratios; the vertical lines represent the 95% CIs. The x-axis shows the mean concentrations of the repeat biomarker measurement within each fifth. The numbers above the vertical lines are point estimates for hazard ratios, and the numbers below are the number of prostate cancer diagnoses within each fifth. Values marked in boldface were statistically significant after allowing for multiple testing. Multivariable-adjusted P values marked in boldface were statistically significant after allowing for multiple testing. BMI body mass index, CI confidence intervals, HR hazard ratio, N number of prostate cancer cases.

Fig. 3: Hazard ratio of incident prostate cancer by fifths of usual serum liver-related biomarker concentrations in up to 204,621 men in UK Biobank.
figure 3

The hazard ratios above each plot are for 1-SD higher concentration of the biomarker after correction for regression dilution bias. HRs are from Cox regression analyses stratified by region and age at recruitment and adjusted for age (underlying time variable), Townsend deprivation score, ethnicity, lives with a wife or partner, BMI, smoking, physical activity, alcohol consumption and diabetes. Full details for each covariate are provided in the statistical section. The boxes represent the hazard ratios; the vertical lines represent the 95% CIs. The x-axis shows the mean concentrations of the repeat biomarker measurement within each fifth. The numbers above the vertical lines are point estimates for hazard ratios, and the numbers below are the number of prostate cancer diagnoses within each fifth. Values marked in boldface were statistically significant after allowing for multiple testing. Multivariable-adjusted P values marked in boldface were statistically significant after allowing for multiple testing. ALT alanine aminotransferase, AST aspartate aminotransferase, BMI body mass index, CI confidence intervals, HR hazard ratio, N number of prostate cancer cases.

Cardiovascular-related biomarkers

Figure 1 (and Supplementary table 2) shows that cardiovascular-related biomarkers [total cholesterol, low-density lipoprotein–cholesterol (LDL-c), high-density lipoprotein–cholesterol (HDL-c), triglycerides (TG), apolipoprotein A-I (ApoA-I), apolipoprotein B (ApoB), C-reactive protein (CRP) and lipoprotein a] were not related to prostate cancer diagnosis or mortality.

Bone- and joint-related biomarkers

Higher circulating vitamin D concentrations were associated with an elevated risk of prostate cancer (HR per 1 SD increase 1.11, 95% CI 1.05–1.17, P trend = 0.0004) (Fig. 1 and Supplementary table 3). There was little evidence that concentrations of the other bone- and joint-related biomarkers were associated with prostate cancer incidence or death (Fig. 1 and Table 2).

Diabetes-related biomarkers

Men with higher circulating glycated haemoglobin (HbA1c) concentration had a higher risk of dying from prostate cancer (HR per 1 SD increase 1.19, 1.02–1.39, P trend = 0.024; Table 2), but this association did not survive correction for multiple testing. Higher glucose concentration was associated with a lower prostate cancer risk (0.90, 0.84–0.97, P trend = 0.0082; Fig. 1), but was not associated with prostate cancer mortality.

Table 2 Hazard ratios (95% CI) for prostate cancer death by thirds (uncorrected for RDB) and one standard deviation increase (corrected for RDB) in each biomarker in UK Biobank.

Renal-related biomarkers

Men with higher circulating total protein concentrations had a lower prostate cancer risk (HR per 1 SD increase 0.88, 0.84–0.93, P trend = 0.0001; Fig. 2). Circulating urea and phosphate concentrations were associated with an increased prostate cancer risk (1.07, 1.02–1.12, P trend = 0.0036; and 1.13, 1.05–1.22, P trend = 0.0006, respectively). Renal-related biomarkers were not associated with prostate cancer mortality.

Liver-related biomarkers

Higher circulating aspartate aminotransferase (AST) concentration was associated with a lower prostate cancer risk (HR per 1 SD increase 0.90, 0.85–0.94, P trend ≤ 0.0001; Fig. 3). We also found that men with higher circulating total bilirubin concentration had a lower risk of prostate cancer death (0.83, 0.71–0.98, P trend = 0.028; Table 2), although this association did not survive correction for multiple testing. The other liver-related biomarkers were not related to prostate cancer diagnosis or death.

Sensitivity analyses

After excluding the first 4 years of follow-up the 95% confidence intervals were wider and only cystatin-C and phosphate were associated with prostate cancer risk after controlling for multiple testing (HR per 1 SD increase 0.93, 0.89–0.97; and 1.07, 1.05–1.29, respectively; Table 3).

Table 3 Sensitivity analyses.

The associations reported above did not vary when analyses were restricted to blood samples from aliquot 1 (Supplementary Table 6) or to men aged ≥50 years (Supplementary Table 7).

Discussion

This large British prospective study with biomarkers measured in the entire cohort found several associations with possible novel biomarkers for prostate cancer incidence and mortality. The risk of being diagnosed with prostate cancer was higher in those with higher circulating concentrations of vitamin D, urea and phosphate, while risk was lower in those with higher circulating concentrations of glucose, total protein and AST. In analyses excluding the first 4 years of follow-up, cystatin-C and phosphate were associated with prostate cancer risk. After correction for multiple testing, there was little evidence of associations with prostate cancer death.

Cardiovascular-related biomarkers

Our results are consistent with findings from a meta-analysis of 14 prospective studies that do not support an association between total cholesterol, LDL-c, HDL-c and TG concentrations and prostate cancer risk.22 This is in-line with a previous Mendelian randomisation study that did not find an association of genetic variants for these biomarkers with overall prostate cancer; however, that study did find some evidence that higher LDL-c and TG levels increase aggressive prostate cancer risk, and that a variant in HMGCR gene (that mimics the LDL lowering effect of statin drugs) reduces risk, although there was some evidence of pleiotropy.23

There are limited published prospective data on ApoA-I and ApoB in relation to prostate cancer risk. Our findings of no association of ApoA-I concentrations with risk contrast to findings from the one previous prospective analysis that found an inverse association with overall prostate cancer,24 while our null findings for ApoB are consistent with previous studies.24,25

In agreement with previous prospective studies,26,27,28 circulating CRP concentrations were not related to prostate cancer incidence or mortality in the current study.

Bone- and joint-related biomarkers

Higher circulating vitamin D concentrations were associated with a higher risk of prostate cancer diagnosis in the current study, which is consistent with a previous individual participant meta-analysis of prospective studies;29 however, this association was not significant after excluding the first 4 years of follow-up. This finding may reflect detection bias; health-conscious men may have a higher dietary intake of vitamin D and/or vitamin D supplementation and higher sun exposure, and these men may be more likely to have a prostate specific antigen (PSA) test or to seek medical attention with early symptoms. Therefore, these men may have a higher risk of being diagnosed with prostate cancer.

We did not find an association between alkaline phosphatase or calcium concentrations and prostate cancer incidence and mortality. To the best of our knowledge, there is no previous prospective evidence on the association between alkaline phosphatase and prostate cancer risk, and results from previous prospective studies on circulating calcium concentration are not conclusive.30,31,32

Diabetes-related biomarkers

In our study, higher glucose concentrations were related to a lower prostate cancer risk, although there was a suggestion of a positive association between circulating HbA1c concentration, a marker of long-term poor glycaemic control, and risk of dying from prostate cancer. However, the association with glucose was not significant in after 4 years of follow-up and the association with HbA1c did not survive correction for multiple testing. While previous prospective evidence has shown an inverse association between diabetes and prostate cancer risk,33 not many studies have assessed the association of diabetes with prostate cancer death. However, there is some prospective evidence that hyperglycaemia may be associated with an increased risk of fatal prostate cancer.34 Men with obesity are more likely to develop type II diabetes, and our findings for diabetes-related biomarkers in relation to prostate cancer risk are concordant with the current evidence on adiposity and prostate cancer, which shows an inverse association of adiposity with prostate cancer incidence but a positive association with prostate cancer death.6

Renal-related biomarkers

Although there are few prospective data on kidney disease and prostate cancer risk, a previous individual patient data meta-analysis found that men with kidney disease may have a lower prostate cancer risk.9 Men with kidney disease are at higher risk of having a very low circulating testosterone concentration,35 which in turn has been associated with a lower prostate cancer risk.7 Findings from the present study showed that higher circulating urea and phosphate concentrations were associated with an increased prostate cancer risk, while cystatin-C and total protein circulating concentrations (both markers of kidney function) were associated with a lower risk. However, only the associations of cystatin-C and phosphate with risk were significant in the last four years of follow-up. These results may appear contradictory as all of these are biomarkers of renal disease; however, these biomarkers reflect different kidney functions. For example, elevated cystatin-C is a marker of impaired glomerular filtration rate, while phosphate concentrations are regulated by intestinal absorption, kidney excretion and bone metabolism.36 To the best of our knowledge, no previous prospective study has assessed the association of circulating phosphate with prostate cancer risk; however, a previous prospective study found a positive association between phosphorous intake and risk of lethal prostate cancer risk37 and there seems to be a positive association between the consumption of foods with phosphate additives and higher circulating concentrations.38 More studies are needed to clarify the association between circulating phosphate concentrations and prostate cancer risk.

Liver-related biomarkers

This study found that higher circulating concentrations of AST were associated with a lower prostate cancer risk, although there was no association after excluding the first 4 years of follow-up. This is, to our knowledge, the first prospective study of the association between these liver-related biomarkers and prostate cancer risk and more studies with longer follow-up are needed to further investigate this possible association. Only three previous prospective studies have examined the association between non-alcoholic fatty liver disease and prostate cancer risk, and the results were inconsistent.39,40,41

Study strengths and limitations

To the best of our knowledge, this is the largest prospective study looking at a wide range of biomarker measurements and prostate cancer incidence and mortality data. The UK Biobank has measured a panel of biomarkers in blood samples from the entire cohort, including biomarkers that have not been previously studied in prostate cancer analyses. Moreover, repeated measurements of biomarkers were available for a subsample of repeat blood specimens collected 4 years after recruitment, which has made it possible for us to adjust estimates of associations between biomarkers and risk for regression dilution bias.18 In addition, this is a relatively young cohort, which permits examination of factors related to earlier onset prostate cancer. Many covariates, such as adiposity measurements or blood pressure, were assessed by trained research clinic staff instead of being self-reported.

This study also has several limitations. Although UK Biobank includes participants from multiple regions across the UK, including deprived areas, it is not a representative sample of the whole UK population11 and this cohort may have suffered from selection bias,42 although the direction of risk factor associations in the UK Biobank seem to be generalisable.43 As in every observational study, there may be some residual confounding by unmeasured risk factors. Some of the results may be influenced by detection bias, as comorbidities may affect both biomarker concentrations and prostate specific antigen test attendance and/or results. Moreover, the blood samples of the current study are non-fasting and we do not have 24-h urine samples, and therefore, some biological measurements may have been affected by time since last meal. While we performed sensitivity analyses excluding the first 4 years of follow-up some of the associations may be due to reversed causality, as the follow-up is relatively short. Moreover, due to the small number of prostate cancer deaths we had limited power to find associations. While we have hypotheses for some of the biomarkers, we do not have a hypothesis for others as they have not previously been assessed in relation to prostate cancer, and some of the analyses using these biomarkers were exploratory. Finally, as in other epidemiological studies, there may be some misclassification of the underlying cause of death; furthermore, information on tumour characteristics, such as tumour stage and Gleason grade, is currently unavailable in the UK Biobank.

In conclusion, we have reported a range of possible novel biomarkers for prostate cancer incidence and mortality in this large UK prospective study. Since some of these analyses were exploratory, these results should be interpreted carefully. Future analyses will include research by tumour characteristics, which will help to determine whether these associations may be due to differences in the likelihood of being diagnosed and/or differences in the risk of developing clinically important prostate cancer.