Introduction

Osteoporosis is a serious disease, but treatment can be instituted when early detection is made possible. Hip fractures, in particular, are associated with significant mortality and morbidity in the elderly,1 but in one study, less than one fifth (18%) of high risk people had received medical treatment for osteoporosis before the occurrence of hip fracture.2 The current failure to assess and treat patients at high risk of osteoporosis may be partly due to insufficient resources or time, but a failure of health professionals to identify risk factors and refer the patient for definitive diagnosis using dual-energy X-ray absorptiometry (DXA) is an important contributory factor.

In an attempt to improve this situation, several clinical risk 'tests' have been developed as a means of identifying subjects who would benefit from further investigation. The contribution of clinical risk factors (such as OSIRIS) to the primary prevention of osteoporotic fractures is at present under consideration by the World Health Organisation and the National Institute for Health and Clinical Excellence (NICE). OSIRIS is a weighted combination of those clinical risk factors that are known to independently predict whether a patient has osteoporosis. These indices, however, have not gained universal acceptance as a routine diagnostic test because of their poor specificity in detecting patients at increased risk of osteoporosis.3,4 Harrison and Adams5 found that clinical risk indices misclassified unacceptably large numbers of osteoporotic women, with consequent decreased cost effectiveness. Recently, we reported the use of mandibular cortical width measurements on dental panoramic radiographs (DPRs) as an alternative method of identifying patients with osteoporosis.6,7 The underlying rationale for this is the enormous number of DPRs taken in dental practice. We propose a strategy of the dentist referring individuals with a thin mandibular cortex and other clinical risk factors for further DXA investigation. Detection of a thinned cortex on DPRs using specially developed computer software8 has been found to be a good predictor of systemic osteoporosis, and because the method is automatic it is also convenient.

This study has two aims. The first was to determine the diagnostic efficacy of combining the OSIRIS clinical index with the cortical width measurement on radiographs, using multivariate statistical analysis. The second aim was to evaluate the cost effectiveness of using this combined test as a basis for further referral for central dual energy X-ray absorptiometry (DXA), using the 90% sensitivity and 90% specificity values as thresholds.

Methods

This work forms part of the OSTEODENT study, a collaborative project funded by the European Commission Fifth Framework Programme Quality of Life and Management of Living Resources'. The methodology of subject recruitment and examination has been fully described previously,7 and is summarised here. With Ethics Committee approval, female subjects (aged 45-70 years) were recruited consecutively into the study following their informed consent. Subjects were recruited from each centre using publicity material and by word-of-mouth, but this patient group may not be representative of a primary dental care population. The study included all female volunteers and patients in this age group, with subjects excluded only if they suspected that they might be pregnant. All subjects were interviewed and provided information about their age, weight, medication and fracture history. Patients were recruited from centres in Leuven (Belgium), Athens (Greece), Manchester (UK), and Malmo (Sweden). Six hundred and seventy-one subjects were recruited into the study. The bone mineral density of the total hip could not be measured in two subjects and a further eight subjects were found to be aged less than 45 years, so their data were not included in any further analysis. A further nine radiographs were either lost, digitally corrupted or of poor diagnostic quality. The remaining 652 subjects formed the study population and underwent a clinical risk assessment of osteoporosis using the OSIRIS questionnaire (OSteoporosis Index of RISk) and computer cortical width measurement. One hundred and forty subjects had osteoporosis involving at least one measurement site.

Dental radiographs

Each subject underwent a dental panoramic radiographic examination while biting on a spherical, steel ball bearing (3.175 mm diameter), used to calculate the image magnification. The Leuven (Belgium) and Malmo (Sweden) centres used a Cranex III (Soredex, IL, USA) dental panoramic radiography machine whereas Athens (Greece) and Manchester (UK) used a Planmeca (Planmeca USA, Roselle, IL, USA). In Leuven, a photostimulable phosphor plate system for image capture and digital read out was used, but other centres used a conventional film/cassette. Typical imaging parameters for panoramic radiography were 70 kV at 8 mA for 15 s. All of the radiographs were digitised using a Kodak LS85 digitiser (Eastman Kodak, Rochester, NY) at a resolution of 25.64 pixels/mm.

The mandibular cortex was automatically detected on the digitised panoramic radiographs using software based on Active Shape Model search9, which is a sophisticated computer imaging technique. Its width was measured by the method described by Allen et al.8

DXA Examination

Central DXA of the proximal femur and lumbar spine was performed at each of the four centres. The World Health Organisation (WHO) criteria were used to diagnose osteoporosis, ie using DXA to identify those with a bone mineral density T-score value 2.5 SD or more below the mean value of the young sex matched reference population at any of the lumbar spine, femoral neck or total hip measurement sites. This was used as a 'gold-standard' measure of osteoporosis.

Osteoporosis Index of Risk (OSIRIS)

The Osteoporosis Index of Risk (OSIRIS) is based on four variables:10 age, body weight, current hormone replacement therapy (HRT) use and history of previous low impact fracture. The index is calculated by adding together:

  • Age multiplied by -2 (rounded down to the nearest integer)

  • Weight in kg multiplied by 2 (rounded down to the nearest integer)

  • +2 if a current user of HRT

  • −2 if a history of low trauma fracture.

An OSIRIS score of lower than -3 indicates a high risk of low BMD, between +1 and -3 an intermediate risk and greater than +1.0 a low risk.11

Statistical analysis

Student's t-test was used to analyse the significance of the differences between OSIRIS and cortical width values in the osteoporotic and normal individuals.

Discriminant analysis, using the variables cortical width and OSIRIS score, was used to derive the probability of osteoporosis in an individual subject. The model was evaluated using a leave-one-out cross validation strategy to avoid biasing the estimates of discrimination ability. Models were calculated from the entire data set except one, which was used as the test datum. This process was repeated using each of the patients in turn as a test datum. The calculated probability of osteoporosis from each of the experiments was used to generate an ROC curve. The resulting area under the ROC curve was compared with that of OSIRIS and the cortical width measurements. The 90% sensitivity and 90% specificity thresholds were used to calculate the numbers of subjects correctly and incorrectly classified as osteoporotic.

Cost effectiveness

Two strategies were compared to determine those who should receive further investigation using dual energy X-ray absorptiometry. Criteria values for the combined cortical width and clinical analysis used either (a) 90% sensitivity or (b) 90% specificity. In calculating the cost per patient correctly diagnosed with osteoporosis, the costs used for central DXA were £50 per patient and for the OSIRIS index were £5 per patient. Both of these costs estimates have been used recently in other publications by our research group.5

Results

The mean difference in mandibular cortical width between osteoporotic and healthy patients (0.718 mm) was highly significant (t = 12.83, p <0.0001) (Table 1). The difference between osteoporotic and normal patients' OSIRIS indices (3.62) was also highly significant (t = 13.54, p <0.0001).

Table 1 Cortical width and OSIRIS data from 652 subjects, 140 of whom were osteoporotic at one of the measurement sites

Discriminant analysis was used to obtain a linear combination of weighted average of cortical width and OSIRIS variables that resulted in the best separation between those with and without osteoporosis in our sample. The resulting discriminant score was used to distinguish between the two groups. Wilk's lamba (the ratio of the within-group sum of squares to the total sum of squares) was 0.683 (χ2 = 247.6, df = 2, p <0.001). Therefore the two groups (those with and without osteoporosis) differed in their mean discriminant score, and 79% of cases were correctly assigned to the groups. Both cortical width and OSIRIS variables contributed equally to the prediction of group membership because they had similar standardised regression coefficients (OSIRIS = 0.682, cortical width = 0.634).

A model derived from a sample will usually fit it better than another sample obtained from the same population. In further leave-one-out cross validation analysis, each case was classified using all the other data to derive the function except that one. A similar but less biased estimate of the correct classification rate of 78.8% of cross-validated grouped cases was obtained.

Using this model, the correlations between the probability of osteoporosis and the BMD at the lumbar spine, femoral neck and total hip were -0.60, -0.64 and -0.61, respectively. These correlations were highly significant, p <0.01.

The area under the ROC curve (Fig. 1) recorded separately for OSIRIS was 0.84 (95% CI = 0.81 to 0.87) and for the cortical width was 0.82 (95% CI 0.79 to 0.85). The difference between both ROC curve areas was 0.021, which was not significantly different (95% CI = -0.022 to 0.064), p = 0.335. The area under the ROC curve for the predicted probability produced by the linear discriminant analysis in the cross validation experiment (cortical width and OSIRIS) was 0.90 (95% CI = 0.87 to 0.92). There was a significant improvement in the diagnostic ability of the combined OSIRIS and cortical width test over both tests applied separately (p <0.001).

Figure 1
figure 1

ROC curve of OSIRIS, cortical width measurements on radiograph, and the effect of combining both variables

Using the combined OSIRIS and cortical width data, an 'Osteodent Index' was calculated giving the risk of osteoporosis. An operating point on the ROC curve with a specificity value of 90% (95% CI = 87.1 to 92.5) and corresponding value of sensitivity of 69% (95% CI = 60.2 to 76.1) was selected. By using a high specificity value, at the expense of sensitivity, the minimum number of patients would be sent for unnecessary further investigations. Using this criterion value gave a test with a positive likelihood ratio of 6.9 and negative likelihood ratio of 0.35. The diagnostic odds ratio, the ratio of positive likelihood ratio divided by negative likelihood ratio, was 19.7.

The OSIRIS values were plotted against cortical width for the sample of 652 patients. Using the 90% specificity threshold for the combined variable, the sample was divided into those predicted as being at either high or low risk of osteoporosis. Figure 2 shows the scatterplot of OSIRIS index and cortical width, with assignment to either high or low risk of osteoporosis. Our previous work12 has shown that the optimal decision boundary of whether to further refer patients lies at a 3 mm cortical width. Figure 2 shows that in a patient with a 3 mm cortical width, only when the OSIRIS value is greater than 1.83 is referral not indicated.

Figure 2
figure 2

The variables OSIRIS and cortical width are plotted. The 90% specificity value has been used as a threshold and those who are or are not indicated for referral for central DXA are indicated (ie predicted probability of osteoporosis of either less than or greater than 34.5%)

There is some overlap of osteoporotic and non-osteoporotic OSIRIS and cortical width values (Fig. 3). Those patients with osteoporosis tend to be grouped towards the lower values of both parameters.

Figure 3: Cortical width measurements plotted against OSIRIS, with each point represented as either osteoporotic or normal according to DXA.
figure 3

While osteoporotic subjects tend to group towards low values of both parameters, there is extensive overlap between the two groups

Table 2 shows the false positive and false negative assignments arising from a referral decision at 90% sensitivity for the combined cortical width and OSIRIS data. The corresponding specificity was 68% and the diagnostic odds ratio was 18.73. While using a high sensitivity ensures that only 10% of osteoporotic patients would fail to be referred for further investigation, 56.6% of those referred would have a normal BMD. The cost of this strategy was £141 per osteoporotic patient diagnosed.

Table 2 The numbers (and % of the total sample) referred for DXA using a threshold of 90% sensitivity. This guaranteed that the majority of patients with osteoporosis would be referred for further DXA examination, but 164 (56.6%) of the referred patients would have a normal BMD. In total, 178 (or 27.3%) of the 652 patients were misclassified

An alternative decision strategy is to adopt a high specificity operating point (Table 3). Using a threshold of 90% specificity, results in a referral of only 50 (10%) of those with a normal BMD, but with the disadvantage that 44 out of the 140 osteoporotic patients (31.4%) would be missed. The cost of this strategy was £110 per osteoporotic patient diagnosed.

Table 3 The numbers (and % of the total sample) referred for DXA using a threshold of 90% specificity. Only 50 (or 10%) of patients with normal BMD would be referred for further DXA examination. In total, 94 (or 14.4%) of the 652 subjects were misclassified

Discussion

In this study, we have described a case-finding strategy, where a combination of a clinical index (OSIRIS) and automatically measured width of mandibular cortex, is a technique with good diagnostic accuracy in predicting low bone mineral density at the hip or spine. The diagnostic odds ratio, the ratio of positive likelihood ratio divided by negative likelihood ratio, measures the performance of a test and a value above 20 indicates a diagnostic test with strong evidence for efficacy.13 Our test, providing a diagnostic odds ratio of 19.7, falls into this category. Furthermore, the fact that the combined test performed better than either the clinical or the radiological test alone demonstrates that they are not providing the same information, but rather complementary information. It is therefore worthy of further clinical trial. Other case-finding strategies that have combined the information from clinical risk factors and selective use of BMD have also proven to be more successful in identifying high-risk patient groups.14 Clinical risk factors have been shown to independently predict hip fracture risk,15 but the balance between a simply administered assessment is often at odds with the requirement for a comprehensive assessment of all possible risk factors. Therefore, we plan to further assess the ability of our diagnostic strategy to predict hip fracture in our target population of young post-menopausal women.

In the United States and other industrialised nations, patients are increasingly aware of the benefits of disease prevention, as well as the long-term cost saving with healthcare.16 What then could be the clinical role for this form of testing for low bone density? Our detection strategy would be used to select those who would undergo further DXA, a case finding approach that follows the UK Royal College of Physicians Guidelines.17 Millions of dental radiographs are taken by dentists annually, with dental radiographs accounting for nearly 25% of all medical radiographic exposures.18 Using dentists to select postmenopausal women at high risk of osteoporosis has the advantage that patients are seen regularly from this age group, and that there is an increasing use of radiographs by dentists for diagnosis.19

The diagnostic efficacy of the radiographic test alone makes it clearly unsuitable as a screening test, because it performs no better than the simple clinical risk assessment; it also has greater cost and an associated X-ray exposure. We have previously suggested that dentists should refer patients for DXA opportunistically using DPRs that they have taken for the usual dental purposes. Would the combined test be justifiably used as a screening test, at least in the age group of women examined in this study? The decision partly depends on the cost effectiveness of this strategy.

With wide scale dental radiographic and clinical identification of osteoporotic patients, increased healthcare costs in the short-term would be inevitable because of increased demand for DXA services. Europe is already under-resourced for central DXA machines.20 Our study cannot predict the numbers of femoral neck fractures prevented if our methodology was introduced, but age and a history of previous fracture (which contribute to the OSIRIS index) and a low femoral neck BMD are clinical risk factors which play a role in femoral fracture risk.21,22

In the UK, the National Osteoporosis Society has recommended that postmenopausal women given peripheral X-ray absorptiometry be classified into three risk categories.23 In the first group at high risk of osteoporosis, treatment is recommended, particularly if accompanied by other risk factors. In the second group, the patients are referred for central DXA for further confirmation, and in the third group no additional action is recommended. In this context, we developed an analogous strategy using the combined cortical width and OSIRIS variables to categorise patients into three groups of differing osteoporosis risk. Two thresholds were chosen based on the 90% sensitivity (the low threshold) and the 90% specificity values (the high threshold), as described by Harrison and Adams.5 The 90% sensitivity value was the predicted probability value of osteoporosis represented by the 10th percentile and the 90% specificity value was the predicted value of non-osteoporosis represented by the 90th percentile. However, the value of this whole approach is limited by the medical side effects and the high cost of providing long-term drug therapy for those in the highest risk group when they do not have osteoporosis.

We therefore compared using either threshold values of 90% sensitivity or 90% specificity for the combined cortical width and OSIRIS variable as two strategies of referral for DXA for those patients considered 'at risk' of osteoporosis. Using a test with 90% specificity (£110 per diagnosed osteoporotic patient) provided a more cost effective option than using 90% sensitivity (£141 per osteoporotic patient). This is due to the large number of patients with normal bone mineral density that were referred unnecessarily for DXA, and therefore the comparatively low yield of osteoporotic patients. There was a cost of £233 per diagnosed osteoporotic patient if all patients in the study received central DXA. This analysis ignores the costs associated with the undiagnosed osteoporotic patients as these were difficult to define in this study. The authors hope that some of these individuals would be identified through further opportunistic testing using either our methodology on a future occasion or other techniques, such as dual energy X-ray absorptiometry or quantitative ultrasound. We also hope to use our test methodology and any subsequent patient treatment to examine the incremental cost-effectiveness ratio per quality adjusted life year. In addition, if the net benefit of our methodology is to be assessed, the distribution of risk assessment cost over the population must be calculated.

Case-finding strategies are prone to operator variability and error. For example, evidence from chest radiography taken in an emergency hospital department has shown that only 25% of patients with radiologically evident vertebral fractures received a diagnosis of osteoporosis or any treatment.24 Our own research using observer measurements of cortical width has demonstrated that the weakness lies in observer variability. One can postulate that dentists in a primary care setting would be inaccurate in making some measurements by hand.7,25 Our methodology involves a computer-measured mandibular cortical width to make the initial diagnosis of a high risk of osteoporosis, and following consultation with the patient, the dentist can either provide follow-up clinical questions, such as an OSIRIS clinical risk index, or refer to a specialist.

The cost effectiveness of our proposed methodology is dependent on the prevalence of osteoporosis in our study population (21.5%) being comparable to that of the UK population. In a study based in Hull, UK, the prevalence of osteoporosis of the hip and spine in general practice was similar (24%), but their study population consisted of women in their seventh decade.26 The mean age of our study sample was 55 years (sd = 6.1), and the age range (45-70 years) was chosen to test an asymptomatic population which was more representative of young postmenopausal women attending a general dental practice. Our osteoporotic screening method could be made more cost effective if restricted to an elderly population; fewer misdiagnoses are then likely because the incidence of vertebral and hip fractures increases exponentially with age.27

Poor radiographic technique is common in general dental practice and may limit the usefulness of our technique. For example in a study by Rushton et al.,28 the image of the lower border of the mandible was at least partially absent in 9% of panoramic radiographs. Using digital panoramic radiographs will approximately halve the reject rate of films as about half of faults are due to chemical processing of film.28 Patient positioning faults could be reduced by using better positioning aids, further training of dentists with an emphasis on quality assurance, and using only suitably qualified personnel to take radiographs.28

In conclusion, our methodology used computer software to detect and analyse the mandibular cortical width and when combined with clinical risk indices data detected patients at early, high risk of osteoporosis.