Introduction

The probability of being in employment is strongly affected by the number of chronic diseases. According to OECD statistics, for example, 1 in 4 people aged 50–59 with no chronic diseases were not in employment, while this was the case for half of those with 2 or more chronic conditions1. The number of age-related chronic diseases in working populations is likely to increase in the future due to population ageing. This presents work life with new challenges, including a need for better preventing work disability in working populations which include a growing number of people with a chronic disease.

To enable timely interventions, numerous studies have investigated predictors of work disability in the general and working populations as well as in groups with specific diseases, or those that have undergone specific treatment procedures2,3,4,5,6,7,8,9,10,11. Fewer studies have examined prediction in high-risk individuals across multiple common chronic conditions that increase the likelihood of work disability. For example, there are no well-validated and easily administered prediction tools available to determine the risk among employees who have depression, musculoskeletal disorders, respiratory disease, hypertension or cardiometabolic multimorbidity. This is an important limitation which may hinder optimal targeting of interventions to those who would benefit most.

The Finnish Institute of Occupational Health (FIOH) has previously formulated a risk prediction model for work disability for use in the general working population2. This model had a C-index of 0.8 indicating high discriminative ability. In the present study, we examined whether the variables used also have the capacity to accurately determine work disability risk among employees with chronic conditions. We first evaluated the predictive performance of the 8-item FIOH risk score in nine common disease groups; then developed a modified version where we re-estimated the coefficients; and, lastly, built a model with a new set of predictors selected from a large pool of additional variables and ascertained whether these new models improved risk prediction. To evaluate the relevance of the risk models in clinical decision making, we dichotomized the score to distinguish test positives from test negatives, and examined detection rate and false positive rate for this measure12.

Methods

Study population

Participants were from the Finnish Public Sector study, a prospective cohort study of public sector employees from 10 municipalities and 21 hospitals in the same geographical areas in Finland. The eligible population represented 31.4% of all municipal employees in Finland at the time of the study, including all employees with job contract in the 10 municipalities and 21 hospitals at the time of the surveys13. Participants responded to questionnaire surveys conducted in 2000–2002, 2004, 2008, or 2012. In the present study, we used data from the full study population of respondents including employees with and without chronic conditions. Using self-reports of physician-diagnosed diseases, complemented with records from the cancer registry and drug reimbursement register (for respiratory disease, hypertension, coronary heart disease, musculoskeletal disorders, or diabetes), we categorized participants into subgroups with different chronic conditions including those living with musculoskeletal disorder, depression, migraine, respiratory disease, hypertension, cancer, coronary heart disease, diabetes and comorbid depression and cardiometabolic disease (co-occurrence of mental and physical illnesses with major public health importance) at baseline. We considered the baseline for each participant with disease to be at the survey when the particular condition was first reported. Ethical approval was obtained from the ethics committee of the Helsinki-Uusimaa Hospital District Ethics Committee (HUS/1210/2016). All participants provided a written informed consent. This study was conducted according to the guidelines of the Helsinki declaration. Details of the study design and participants have been previously described14.

Potential risk predictors

We used a pool of 105 variables which included those in the 8-item risk prediction FIOH-model of work disability for the general working population: age category (< 35; 35–39; 40–44; 45–49; 50–54; 55 + years), BMI (< 18.5; 18.5- < 25; 25–30; 30 + kg/m2), socioeconomic status (SES), smoking (yes; no), number of chronic diseases (0, 1, 2, 3+), self-rated health (range 1–5), difficulty falling asleep (range from 1 (never) to 6 (almost every night)) and number of sickness absences in previous year before baseline (0, 1, 2, 3+)2. Other available variables included sex, alcohol consumption, physical inactivity, psychological distress and working conditions (job control, job demands, job strain, effort, reward and effort-reward imbalance, relational justice, procedural justice, participatory safety, support for innovation, vision, task orientation, social capital at workplace, shift work and working night shifts). We measured SES based on occupational titles categorised according to the occupational classification of Statistics Finland, which is based on the World Health Organization/International Labor Organization International Standard Classification of Occupations (ISCO-88)15 into 7 occupational groups: ‘Managers’ (ISCO class 1), ‘professionals’ (ISCO class 2, for example, physicians and teachers), ‘technicians and associate professionals’ (ISCO class 3, for example, registered nurses), ‘clerical support workers’ (ISCO class 4, for example, secretaries), ‘service workers’ (ISCO class 5, for example, cooks), ‘manual workers’ (ISCO classes 6–8, for example, maintenance workers) and ‘elementary occupations’ (ISCO class 9, for example, cleaners).

The predictors are presented in Table 1 and a detailed list of individual items is provided in Appendix (p. 2–4).

Table 1 Potential predictors.

Ascertainment of work disability

All study members were insured in some pension scheme. Records were obtained from the national register at the Finnish Centre of Pensions16, an organisation which has a statutory obligation to curate records of all pensions in Finland. Disability pension records including start date and diagnosis according to the World Health Organization International Classification of Diseases, version 10. The outcome was full-time disability pensions (temporary or permanent) which is defined as the capacity for work being impaired by at least 60% due to a disease, injury, or other disability. These records have been widely used in research context17,18,19.

Statistical methods

We imputed missing data on predictors (3.7% of all observations) as follows: we first complemented missing responses on chronic diseases using data from the cancer registry and the drug reimbursement register, then set responses on chronic diseases that were still missing as ‘no’ answers, and height as median value of all non-missing responses per individual. Other predictors were imputed using single imputation with predictive mean matching20.

The follow-up ended in case of death, retirement (age-related or early retirement on other than health grounds), work disability, or a maximum follow-up time of 10 years, whichever came first. The unadjusted bivariate associations between the individual predictor variables and work disability are shown with Manhattan plots. We performed three steps to select the best model for each disease group. In the first, we examined whether the FIOH model was valid within the disease groups. According to the FIOH model, the work disability risk is estimated as follows2:

$${\text{P(x) = }}\Phi {\text{[(ln(10) - linear prediction)/scale)]}}$$

where Φ is the standard cumulative normal distribution. We applied the model and the coefficients that were formulated by Airaksinen et al.2. Coefficients for linear predictor in the FIOH-model were fitted with lognormal distribution and are provided in the report by Airaksinen et al.2 and in the Appendix of this paper (page 5).

The second step was to examine whether the model for each disease groups could be improved by re-estimating the coefficients of the FIOH-model, using the same set of the explanatory variables as was used in the FIOH model. We call this the re-estimated FIOH model. For fitting the models, we used a similar method (including assumption of lognormal distribution) as Airaksinen et al.2.

The third step was to examine whether the prediction could be improved by selecting a completely new set of predictors for each disease group. Following the conventional approach in developing prediction algorithms for survival data, we fitted the models with Weibull distribution21,22. For each disease group we first ran a redundancy analysis to exclude variables that could be readily predicted using all other variables. We then specified a parametric survival model that included all the candidate variables as predictors (‘full model’). To obtain a more parsimonious algorithm, we derived the predicted work disability risks from the full model for each individual. We then used backward stepwise ordinary least squares regression to select eight predictors, by predicting risks derived from the full model as the outcome. The number of items in the new models was chosen to be the same as in the FIOH-model because a short questionnaire is easy to administer and quick to respond, maximising the response rate. If the selected eight predictors included any summary variables (for example job strain), we repeated the previous steps with the summary variable(s) broken down to individual items. The models achieved for each disease group as a result of the third step are referred as the new model.

The performance of each prediction model was evaluated using Harrell’s C-index, which is the concordance between predicted and observed survival20. This index gives the probability that a randomly selected individual who experienced the outcome during the follow-up, had a higher risk score than a randomly selected individual who did not experience the outcome. The C-index has a range from 0.5 (no predictive ability) to 1 (maximum predictive ability). C-index under 0.7 represents poor, 0.7–0.8 good, and > 0.8 strong discrimination ability. To examine whether the original findings from the general working population were replicable in our dataset, we computed C-statistics for the entire study population with and without chronic conditions and among those with no history of sickness absence one year prior to baseline (the low-risk population11). Calibration of the model—that is, how accurately the predicted absolute risks correspond to the observed absolute risks—was assessed using calibration plots. We additionally plotted observed and predicted events by deciles of 10-year risk for each model, excluding by age and baseline those with less than 10 years of potential follow-up time. We compared the performance of the FIOH-model with both new models using the C-statistics with 95% confidence intervals. If the confidence intervals were overlapping, the FIOH-model was chosen.

To evaluate the final model for an individual, we dichotomized the score into ‘test positive’ versus ‘test negative’ using alternative thresholds of 5; 10; 20; 30; 40; 50 and 60% for a positive test result. For positive test cases, we calculated false positive rate (the proportion of test positive cases who did not experience work disability), detection rate (the proportion of work disability cases who were test positive) and the ratio of true to false positives. The formulas were as follows:

$$\begin{aligned} & {\text{False}}\;{\text{positive}}\;{\text{rate}} = {\text{b}}/\left( {{\text{b}} + {\text{d}}} \right) \\ & {\text{Detection}}\;{\text{rate}} = {\text{a}}/\left( {{\text{a}} + {\text{c}}} \right) \\ \end{aligned}$$

Ratio of true to false positives = 1 : (b/a), where a, b, c and d represent different combinations of risk scores and work disability as defined below:

Risk score

Work disability during the follow-up

Yes

No

Test positive

a

b

Test negative

c

d

All analyses were performed using R 4.2.2 (packages: mice23, rms24, leaps25 and Hmisc26) and SAS 9.4.

Ethics approval

Ethical approval was obtained from the ethics committee of the Helsinki-Uusimaa Hospital District Ethics Committee (HUS/1210/2016).

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Results

Baseline characteristics

Figure 1 shows sample selection. The target population was municipal employees during the survey years in Finland, on average of 426,500 men and women during the survey years. The eligible population in the 10 towns and 21 hospitals which participated in the FPS study represented 31.4% of all municipal employees. Of these, 78.7% responded to at least one of the four questionnaire surveys. Linked records from national health and work disability registers were available for 85.0% of the respondents, a total of 89 543 adults. We excluded from analyses people who were on disability pension or retired, at age 65 years or older or with extreme values in BMI (< 15 or > 50) at baseline.

Figure 1
figure 1

Flow diagram for total sample and disease groups.

Table 2 shows characteristics of the resulting analytical sample of 88,521 participants. Of them, 70,805 were women, the mean age was 43.1 years, and the largest occupational groups were ‘professionals’ (for example, teachers) and ‘associate professionals’ (for example, registered nurses). 73,996 had no history of short-term work disability (no sickness absences one year before baseline) and this group was denoted as the low-risk population. The number of participants with specific, prevalent condition at baseline varied between 1162 (comorbid depression and cardiometabolic disease) and 33,601 (musculoskeletal disorders).

Table 2 Baseline characteristics of all participants and subgroups of individuals with no history of sickness absence and those with a chronic condition at baseline.

Work disability during follow-up

During a mean follow-up of 8.6 years, 6836 (7.7%) participants were granted a disability pension. The incidence of work disability was 14.1% in those with musculoskeletal disorders, 9.9% for migraine, 15.1% for hypertension, 12.2% for respiratory disease (chronic bronchitis or asthma), 16.7% for depression, 16.4% for diabetes, 14.2% for cancer, 22.6% for coronary heart disease, and 27.7% for comorbid depression and cardiometabolic disease. The most common causes of work disability were diseases of the musculoskeletal system and connective tissue (44.2%), followed by mental, behavioural and neurodevelopmental disorders (23.7%), and these proportions varied by disease group (Fig. 2, for details, see Appendix p. 6).

Figure 2
figure 2

Causes of work disability at follow-up in all participants and those with prevalent diseases.

Associations of potential risk predictors and work disability

The unadjusted bivariate associations between 105 predictor variables and work disability are shown in Fig. 3. All variables of the 8-item FIOH-risk score were associated with work disability (p < 0.0001), the strongest associations seen for age and self-rated health. Many other health-related variables were also strongly associated with work disability whereas the associations of the items related to risk behaviours, work characteristics, team climate and management were weaker. The pattern of findings was similar in all disease groups (Appendix, p. 7–15).

Figure 3
figure 3

Bivariate unadjusted associations between predictors at baseline and work disability at follow-up in all participants with and without chronic disease at baseline. Bars represent -LOG10(p-value) and are cut at the maximum value of 300.

Selection of the model for each disease group

Development of the re-estimated and new models is reported in Appendix (p. 16–17.) The predictors in the new models included 4 to 7 of the 8 predictors in the FIOH-model, depending on the participant group. Each of these models included age, socioeconomic status, self-rated health and the number of sickness absences in previous year. Individual chronic diseases were also part of the models for every disease group. Work-related items were included in the models for musculoskeletal disorders, hypertension, respiratory disease, diabetes, cancer, and comorbid depression and cardiometabolic disease.

Table 3 shows that C-statistics for the FIOH-model among all employees, that is, those with and without chronic conditions was (0.84, 95% CI 0.84 to 0.85), a similar C-statistics in the magnitude as in the original study2. This result was little changed in analysis of complete case without imputations (N = 78 479, C-statistics 0.84, 95% CI 0.84 to 0.85). Among those with no sickness absence at baseline the C-statistic was (0.82, 95% CI 0.81 to 0.83). The table also provides a comparison of the performance between the FIOH-model and the two alternatives. The FIOH-model performed well in all disease groups. The C-statistics was ≥ 0.80 in those with musculoskeletal disorders (0.80, 95% CI 0.80 to 0.81), migraine (0.83, 95% CI 0.82 to 0.84) and respiratory disease (0.82, 95% CI 0.81 to 0.83). For all other subgroups, including hypertension, depression, diabetes, cancer, coronary heart disease or comorbid depression and cardiometabolic disease, C-statistics was ≥ 0.72 but less than 0.80. The C-statistics for the re-estimated FIOH-models and the new models were virtually the same as for the FIOH-model, suggesting no improvement in predictive performance. Calibration plots for the existing model indicated a high correspondence between the predicted and the observed risk in all disease groups (Fig. 4).

Table 3 C-index for the FIOH, re-estimated and new prediction models in all participants and subgroups of individuals with no history of sickness absence and those with a chronic condition at baseline.
Figure 4
figure 4

Observed and predicted incidence of work disability by deciles of the work disability risk score.

Predictive performance of the FIOH model

Table 4 shows the detection and false positive rates for dichotomized FIOH-risk scores using various risk thresholds to dichotomise the score to indicate high versus low risk (for results of sensitivity, specificity and positive and negative predictive value, see Appendix, p. 18). With a high threshold for risk (≥ 50% predicted probability indicating high risk), the false positive rate ranged between 2.6% and 8.6% in all disease groups. Exceptions were participants with coronary heart disease (19.0%) and those with comorbid depression and cardiometabolic disease (20.1%) for whom false positive rate was markedly higher. The detection rate varied between 18.9% (participants with cancer) and 52.8% (participants with comorbid depression and cardiometabolic disease) and the ratio of true-to-false positives ranged from 1:1.0 to 1:1.4. With a low threshold (5% predicted probability indicating high risk), detection rate raised to 92.0% or 99.2% (less than 1 in 10 disability cases were missed), but with very high false positive rate (54.2% to 94.4% depending on the disease group). For both the 50% and 5% thresholds, the detection and false positive rates were slightly lower in the total working population, but the ratio of true-to-false positives was approximately the same as in the population with chronic diseases.

Table 4 Detection rate, false positive rate and the ratio true-to-false positives for the FIOH prediction model in all participants and subgroups of individuals with no history of sickness absence and those with a chronic condition at baseline.

Discussion

Our study shows that a short, self-administered survey instrument has predictive utility for work disability in people with chronic conditions and comorbidity. This survey-based 8-item risk calculator includes age, self-rated health, number of sickness absences, socioeconomic position, the number of comorbidities, sleep problems, body mass index, and smoking habit. The algorithm can be used in many settings, including members of the public who have web access, and the assessment does not require laboratory testing or other clinical measurements. The calculator might be used to identify working-age people with common chronic diseases with an elevated risk of future work disability and so might facilitate early intervention.

Approximately one third of people at age 40 have a chronic condition and the proportion rises to 75% by age 6527. Despite this high prevalence and the urgent need for new measures to prevent their work disability, we are not aware of other large-scale studies on risk stratification algorithms for work disability in employees with chronic conditions. In our study, C-index exceeded 0.72 in all disease groups and was 0.80 or greater for those with musculoskeletal disorders, migraine and respiratory disease. These results indicate good discrimination and are comparable to those reported for established risk prediction tools currently used in clinical practice. For example, the C-index is 0.72 for the Pooled Cohort Equations to predict the 10-year risk of cardiovascular disease events using 9 risk factors (age, sex, race, diabetes status, smoking status, antihypertensive medication use, total cholesterol, HDL cholesterol levels, and systolic blood pressure)28; between 0.74 and 0.77 for the FINDRISC model to predict the risk of type 2 diabetes using 8 characteristics (age, BMI, waist circumference, physical activity, diet, history of antihypertensive medication use, history of high blood glucose and family history)29,30; and 0.70 to 0.91 for QRISK3 to predict future cardiovascular disease based on a wide range of risk factors obtained from electronic health records (age, sex, ethnicity, socioeconomic deprivation, angina or heart attack in a 1st degree relative at age < 60, chronic kidney disease, migraine, corticosteroids, systemic lupus erythematosus, use of atypical antipsychotics, severe mental illness, steroid treatment, erectile dysfunction, total/ HDL cholesterol ratio, BMI, systolic blood pressure variability)31.

In clinical decision making, dichotomised predictive scores are used to distinguish patients who should receive intervention or referrals for further assessments, although few studies have reported relevant performance metrics in this regard12. While the performance of our dichotomized work disability risk score fell short of the best established clinical screening tests, such as mammography for breast cancer (detection rate 75% with a false positive rate of 8%)32,33 and faecal immunochemical test (FIT) for colon cancer (79%/6%)34, it was similar to those reported for widely used cardiovascular disease risk scores, such as QRISK2 (detection and false positive rates 40% and 13% for men and 26% and 6% for women)12 and thus appears to provide a useful tool to aid decisions of targeting preventive interventions. More specifically, our risk calculator had a relatively high false positive rate for test positive thresholds that allowed high detection rates. Conversely, the use of a threshold that provided low (approximately 5%) false positive rate, resulted in detection of only 20–25% of disability cases. This means that the score is useful in informing the targeting of interventions with no significant harm from overtreatment and in informing referrals to more detailed assessments for potential tailored preventive measures. By contrast, the score should not be used for expensive or new interventions with uncertain safety profiles as many people who will not benefit from the intervention will be targeted. The same applies to predictors of cardiovascular disease that are currently used in health care.

The present study benefits from the use of data which are from a country (Finland) where ascertainment of work disability pension was possible with linkage to comprehensive records from the national pension register with virtually full coverage for those in employment10,11. Large sample size, contributing to higher precision and lower risk of type 2 error, and the relatively high response rate are additional strengths. The age and sex distribution (mean age 43.1, 80% women) in the analytic sample corresponded to those in the eligible population of 133,966 municipal employees (mean age 45.7, 80% women)13. We defined disease groups using data from self-reported physician-diagnosed conditions. Validation studies supported the accuracy of self-reports as a measure of prevalent chronic diseases35,36,37,38,39,40,41,42,43,44.

This study has also several limitations. Although work disability is defined by impairment, receipt of a disability pension may additionally be dependent on non-medical factors, such as disability pension regulations, the work environment, the nature of the job, and the extent to which a workplace is prepared to accommodate the disability. Our study largely comprised women and all study participants were drawn from public sector workplaces, dominated by professionals, such as health care workers and teachers, deviating from the general population in which the incidence of work disability is greater11. However, among the Finnish workforce during the years 1997–2006, the yearly number of men and women with incident work disability was 111 and 100 per 10,000 persons45. This is higher but still comparable with the 89 per 1000 person-years in our total population. The generalizability of the present findings should therefore be investigated in different settings, study populations and other countries with different disability pension policies. Predictive performance may vary by country because in addition to the impairment, receipt of a disability pension is dependent on non-medical factors, such as disability pension regulations, the work environment, the nature of the job, and the extent to which the workplace is prepared to accommodate the disability46. Additionally, we followed the same parametric methodological approach as in the original FIOH study2, although a non-parametric approach (for example discrete-time models) would have avoided parametric assumptions.

Conclusion

Detection of individuals at high risk is a precondition for effective targeted interventions to prevent long-term work disability and a basis for developing cost-effective strategies to avoid early labour-market exit. Predictive performance of the simple and cost-free FIOH-work disability risk score was comparable to those observed for established widely-used risk scores for other outcomes, such as cardiovascular diseases47. These findings suggest that it is possible to predict work disability in a working population with chronic disease using a scalable internet-based tool with a reasonable accuracy and thus aid decisions of targeted interventions and referrals to detailed assessments of tailored interventions.