Automated bone mineral density prediction and fracture risk assessment using plain radiographs via deep learning

Hsieh, Chen-I; Zheng, Kang; Lin, Chihung; Mei, Ling; Lu, Le; Li, Weijian; Chen, Fang-Ping; Wang, Yirui; Zhou, Xiaoyun; Wang, Fakai; Xie, Guotong; Xiao, Jing; Miao, Shun; Kuo, Chang-Fu

doi:10.1038/s41467-021-25779-x

Article
Open access
Published: 16 September 2021

Nature Communications volume 12, Article number: 5472 (2021)

Abstract

Dual-energy X-ray absorptiometry (DXA) is underutilized to measure bone mineral density (BMD) and evaluate fracture risk. We present an automated tool to identify fractures, predict BMD, and evaluate fracture risk using plain radiographs. The tool performance is evaluated on 5164 and 18175 patients with pelvis/lumbar spine radiographs and Hologic DXA. The model is well calibrated with minimal bias in the hip (slope = 0.982, calibration-in-the-large = −0.003) and the lumbar spine BMD (slope = 0.978, calibration-in-the-large = 0.003). The area under the precision-recall curve and accuracy are 0.89 and 91.7% for hip osteoporosis, 0.89 and 86.2% for spine osteoporosis, 0.83 and 95.0% for high 10-year major fracture risk, and 0.96 and 90.0% for high hip fracture risk. The tool classifies 5206 (84.8%) patients with 95% positive or negative predictive value for osteoporosis, compared to 3008 DXA conducted at the same study period. This automated tool may help identify high-risk patients for osteoporosis.

Introduction

Osteoporosis is a common bone disease¹ that increases the global health burden². All major types of osteoporosis-related fragility fractures are associated with chronic pain, disability, functional dependence³, high morbidity⁴, and a two-fold to three-fold increase in mortality⁵, despite the availability of effective anti-osteoporotic drugs⁶. Dual-energy X-ray absorptiometry (DXA) is the preferred modality for the measurement of bone mineral density (BMD) in the human hip or lumbar spine, which is an essential component of the fracture risk assessment tool (FRAX) used to estimate the 10-year risk of hip or major osteoporotic fracture⁷. According to the International Osteoporosis Foundation, DXA availability falls short of the minimum service requirement of 11 units per million population⁸ in most parts of Eastern Europe and Central Asia⁹, the Middle East and Africa¹⁰, Asia Pacific¹¹, and Latin America¹², and in ten member states of the European Union¹³. The US is well-resourced, but both DXA and FRAX seem underutilized¹⁴. Among Medicare beneficiaries ≥65 years of age, only 30% of women and 4% of men were tested for BMD with DXA¹⁵. Among people with fragility fractures, only 10.2% of female¹⁶ and 6% of male patients¹⁷ had undergone BMD testing before the index event. Furthermore, DXA utilization seems to be decreasing in post-menopausal women in the US¹⁸.

Opportunistic screening for osteoporosis using imaging modalities other than DXA is a potential strategy to stratify the unscreened population into distinct risk groups of osteoporosis and fragility fractures. This approach uses images already taken for other clinical indications to screen for osteoporosis at no additional cost, time, or radiation exposure to patients. For example, several studies used computed tomography (CT)-based metrics to estimate BMD¹⁹,²⁰,²¹, classify osteoporosis²², simulate DXA T-scores²³, and predict fracture outcomes²⁴. However, the performance, radiation dose, and population coverage of CT-based screening strategies are barriers to their use in clinical settings. Compared with DXA and CT, plain radiography has greater availability, broader indications, a lower radiation dose, and lower overall costs. In addition, the spatial resolution of radiographs is excellent, allowing the visualization of fine bone texture, which is correlated with bone density²⁵ and can distinguish patients with osteoporotic fractures from controls²⁵,²⁶,²⁷. Therefore, an automated tool based on hip or spine radiographs for identifying hip fracture and vertebral compression fracture (VCF), predicting BMD, and evaluating fracture risk can help identify patients with greater fracture risk among individuals undergoing radiography of the hip or spine for other reasons.

Deep learning algorithms have achieved performance superior to traditional methods in visual recognition tasks²⁸, which is the foundation of clinical applications such as fracture detection²⁹, retinopathy grading³⁰, and lung nodule identification³¹. Therefore, this retrospective cohort study was performed to test the hypothesis that an automated deep neural network-based tool could effectively predict BMD and risk of fragility fractures using plain radiographs of the pelvis and lumbar spine. Here, we propose and validate a fully automated deep learning-based tool to (1) extract the hip and spine regions of interest (ROIs), (2) identify hip fracture, VCF, or morphological abnormalities, (3) check the radiograph quality to ensure that implants and foreign bodies are absent from the ROIs, and (4) predict BMD and estimate the probability of a fracture within the next 10 years based on the FRAX (Fig. 1). We compared the predicted BMD with the BMD measured by central DXA. We also compared the risks of 10-year hip and major osteoporotic fractures (https://www.sheffield.ac.uk/FRAX/) using only clinical parameters (FRAX-NB; age, sex, weight, and height), clinical parameters and DXA-measured BMD (FRAX-MB), or clinical parameters and predicted BMD (FRAX-PB). We also conducted a real-world test on consecutive patients to assess the clinical applicability of our tool and its impact on osteoporosis screening strategy.

Results

Data source

From 2006 to 2020, 30,958 and 86,977 patients aged 40–90 years with paired DXA-pelvis or paired DXA-lateral radiographs of the lumbar spine (18.6% and 18.2% of patients with hip or lumbar spine radiographs) were screened to identify hip and spine cohorts for analysis. Of these, 18,097 and 58,149 patients in the respective cohorts were excluded due to a DXA-radiograph interval >180 days, lack of detailed reports, inadequate image quality or positioning, or lack of analyzable ROIs. The final cohorts included 10,797 patients with Hologic DXA-hip radiograph pairs and 25,482 patients with Hologic DXA-spine radiograph pairs (Supplementary Fig. 1). No patient was included in more than one group. As Table 1 shows, the final study population included 5164 patients (3997 women [77.4%], mean age, 72.2 [standard deviation, SD, 11.2] years) in the hip testing set and 18,175 patients (14,469 women [79.6%], mean age, 67.1 [SD, 10.6] years) in the spine testing set. The median time between DXA and plain radiographs was 29 and 16 days, respectively. DXA identified 1110 patients (21.5%) in the hip cohort and 7860 patients (43.3%) in the spine cohort as osteoporotic.

Table 1 Patient characteristics of the study population.

Performance for BMD prediction

Table 2 summarizes the model performance to predict BMD. Pearson’s correlation coefficients between DXA-measured and model-predicted BMD were 0.92 for the hip and 0.90 for the lumbar spine. The linear regression model showed excellent predictive performance of predicted BMD with regard to measured BMD (hip: R² = 0.84, root mean square error [RMSE] = 0.062; spine: R² = 0.81, RMSE = 0.081). The model was well calibrated with minimal bias in the hip (slope = 0.982, calibration-in-the-large = −0.003) and the lumbar spine BMD (slope = 0.978, calibration-in-the-large = 0.003) (Fig. 2). The model performance remained robust across various age and sex strata.
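The agreement statistics reported here can be computed from paired values. The following is a minimal sketch, assuming two equal-length lists of measured and predicted BMD; the function name `agreement_metrics` and the sample numbers are illustrative and not taken from the study. R² is taken as the squared Pearson correlation (the R² of a simple linear regression), and RMSE is computed directly on the prediction error, which is an assumption about how the study defined it.

```python
import math

def agreement_metrics(measured, predicted):
    """Pearson r, R^2 and RMSE for paired BMD values (illustrative helper)."""
    n = len(measured)
    if n != len(predicted) or n < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((m - mean_m) * (p - mean_p) for m, p in zip(measured, predicted))
    var_m = sum((m - mean_m) ** 2 for m in measured)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    r = cov / math.sqrt(var_m * var_p)
    # Assumption: RMSE of the raw prediction error, not of regression residuals.
    rmse = math.sqrt(sum((p - m) ** 2 for m, p in zip(measured, predicted)) / n)
    return {"pearson_r": r, "r_squared": r * r, "rmse": rmse}

# Made-up example values in g/cm^2.
example = agreement_metrics([0.60, 0.70, 0.80, 0.90], [0.70, 0.80, 0.90, 1.00])
```

In this toy example the predictions are shifted by a constant, so the correlation is perfect while the RMSE is not zero, which is why the calibration measures are reported alongside the correlation.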

Table 2 Summary of performance metrics of predictive models for Hologic BMD.

**Fig. 2: The calibration plots for predicted-measured BMD.**

Performance for osteoporosis and fracture risk prediction

Table 3 illustrates the discriminatory performance of the model to classify hip or spine osteoporosis and to identify patients with greater 10-year risks of major osteoporotic fractures (≥20%) and hip fractures (≥3%). The algorithm provided a high degree of discrimination for osteoporosis (area under the precision-recall curve [AUPRC], 0.89 for both the hip and spine models). The overall accuracies were 91.7% for hip osteoporosis and 86.2% for lumbar spine osteoporosis. The median FRAX 10-year major fracture (8.84% vs. 8.76%, p = 0.24) and hip fracture risks (2.48% vs. 2.46%, p = 0.06) did not differ significantly whether scores were based on the predicted BMD (FRAX-PB) or measured BMD (FRAX-MB) plus clinical parameters (age, sex, height, and weight). The AUPRC values for high major osteoporotic and hip fracture risk were 0.83 and 0.96 for FRAX-PB, compared to 0.40 and 0.83 for the FRAX tool without BMD input (FRAX-NB) (Supplementary Figs. 2 and 3). Supplementary Table 1 shows robust model discriminatory performances across age and sex strata.
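As an illustration of the AUPRC metric, the sketch below computes average precision, a common step-wise estimate of the area under the precision-recall curve. It assumes binary labels (1 = osteoporosis by DXA) and a score where higher means more suspicious, here the negated predicted BMD. The function name, the toy data, and the choice of average precision (rather than whatever estimator the authors' statistical software used) are assumptions; tied scores are not treated specially.

```python
def average_precision(labels, scores):
    """Step-wise estimate of the area under the precision-recall curve."""
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("at least one positive label is required")
    # Rank cases from most to least suspicious.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    true_pos = 0
    total = 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i]:
            true_pos += 1
            total += true_pos / rank  # precision at this recall step
    return total / n_pos

# Toy data: lower predicted BMD should rank higher, so negate it.
predicted_bmd = [0.55, 0.60, 0.70, 0.65, 0.90]
osteoporosis = [1, 1, 0, 1, 0]
auprc = average_precision(osteoporosis, [-b for b in predicted_bmd])
```

Unlike AUROC, this quantity depends on the prevalence of the positive class, which matters when comparing the hip cohort with the spine cohort.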

Table 3 Discriminatory performance (%) of the predicted BMD to classify hip/lumbar vertebral osteoporosis and high-risk groups for major osteoporotic or hip fractures.

Next, we identified predicted BMD thresholds that correspond to a positive predictive value (PPV) of 95% to classify osteoporosis and a negative predictive value (NPV) of 95% to exclude it (Table 4). Overall, 88.2% of the predicted values in the hip cohort and 70.4% in the spine cohort had an excellent PPV or NPV for osteoporosis. In the hip cohort, FRAX-PB provided excellent discriminatory performance to classify high fracture risk patients. The proportion of the study population who would be referred for DXA was 11.8% in the hip cohort and 29.6% in the spine cohort.
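One way to derive such rule-in and rule-out thresholds is sketched below. It is a minimal illustration under stated assumptions, not the authors' procedure: a patient is ruled in when the predicted BMD is at or below `t_in`, ruled out when it is at or above `t_out`, and referred for DXA otherwise. All function names and the toy data are hypothetical.

```python
def rule_in_threshold(pred, is_osteo, target=0.95):
    """Largest cut-off t such that cases with pred <= t reach PPV >= target."""
    pairs = sorted(zip(pred, is_osteo))
    best, true_pos = None, 0
    for k, (value, label) in enumerate(pairs, start=1):
        true_pos += label
        if k < len(pairs) and pairs[k][0] == value:
            continue  # evaluate tied values together
        if true_pos / k >= target:
            best = value
    return best

def rule_out_threshold(pred, is_osteo, target=0.95):
    """Smallest cut-off t such that cases with pred >= t reach NPV >= target."""
    pairs = sorted(zip(pred, is_osteo), reverse=True)
    best, true_neg = None, 0
    for k, (value, label) in enumerate(pairs, start=1):
        true_neg += 1 - label
        if k < len(pairs) and pairs[k][0] == value:
            continue
        if true_neg / k >= target:
            best = value
    return best

def triage(value, t_in, t_out):
    """Three-way decision: rule in, rule out, or refer for DXA."""
    if t_in is not None and value <= t_in:
        return "osteoporosis"
    if t_out is not None and value >= t_out:
        return "not osteoporosis"
    return "refer for DXA"

# Toy data: predicted BMD and DXA-based label (1 = osteoporosis).
bmd = [0.50, 0.55, 0.60, 0.70, 0.75, 0.80, 0.90, 1.00]
label = [1, 1, 1, 0, 1, 0, 0, 0]
t_in = rule_in_threshold(bmd, label)
t_out = rule_out_threshold(bmd, label)
```

Patients falling between the two thresholds form the group that would still need a DXA examination.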

Table 4 Osteoporosis classification results at 95% PPV and 95% NPV thresholds on Hologic testing data.

External validation

We identified 2060 patients with paired GE DXA-pelvis radiographs and 3346 patients with paired GE DXA-lumbar spine radiographs (Supplementary Table 2). The GE BMD values were converted to Hologic values using the manufacturer-provided equations (Supplementary Table 3). Supplementary Table 4 summarizes the model performance by comparing model-predicted BMD and GE DXA-measured BMD, and Supplementary Table 5 summarizes the discriminatory performance. The Pearson’s correlation coefficients between GE DXA-measured and model-predicted BMD were 0.90 for the hip and 0.89 for the lumbar spine (Supplementary Fig. 4). The model remained robust with good linear correlation, calibration, and minimal bias across different age and sex strata. The discriminatory performance was also excellent, with an AUPRC of 0.87 for the hip and 0.89 for the spine model. We further tested our tool using 34 pairs of GE DXA-hip radiographs and 179 pairs of DXA-lumbar spine radiographs from the Wuhan Hospital of Traditional Chinese Medicine. The Pearson correlation coefficient was 0.93 for the hip model and 0.86 for the spine model.

Real-world experiment

Next, we implemented the tool on the central inference platform connected to the picture archiving and communication system (PACS) of the Chang Gung Memorial Hospital (CGMH, Linkou branch) to study the real-world impact of our tool on osteoporosis screening. The hospital PACS relayed all newly acquired images to the inference platform daily. In total, 2388 consecutive pelvis radiographs (1858 patients, 43.2% women) and 9741 lumbar spine radiographs (5336 patients, 40.8% women) from patients aged 40–90 years were acquired between January and May 2021. The tool excluded 816 pelvis radiographs and 1715 spine radiographs due to poor image quality, inappropriate positions, implants, and fractures that may impede BMD estimation. The percentages of images passing through the entire pipeline and successfully reporting a predicted BMD were 79.0% for pelvis radiographs and 82.3% for spine radiographs. Among these, 5206 (84.8%) patients with hip or spine radiographs were classified or excluded as osteoporotic with high PPV or NPV for osteoporosis using the thresholds reported in Table 4. Finally, only 933 (15.2%) patients were advised to undergo DXA examination (Supplementary Fig. 5). During the same period, 3008 DXA examinations were conducted at CGMH, Linkou branch.
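The reported shares follow directly from the two patient counts in this paragraph. The short check below is only a worked arithmetic example; the function name `triage_shares` is illustrative.

```python
def triage_shares(n_classified, n_referred):
    """Total patients with a usable prediction and the two percentage shares."""
    total = n_classified + n_referred
    pct_classified = round(100 * n_classified / total, 1)
    pct_referred = round(100 * n_referred / total, 1)
    return total, pct_classified, pct_referred

# Counts quoted in the text: 5206 classified, 933 advised to undergo DXA.
total, pct_classified, pct_referred = triage_shares(5206, 933)

# Ratio of patients classified by the tool to DXA examinations in the same period.
ratio_to_dxa = 5206 / 3008
```

With these inputs the denominator is 6139 patients, and the two shares match the 84.8% and 15.2% quoted above.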

Discussion

Osteoporosis is a silent disease before fragility fractures occur, leading to multiple morbidities and increased mortality in affected patients⁴. Therefore, population-based screening is imperative for identifying at-risk patients and implementing preventive measures. DXA is the preferred modality to screen for osteoporosis but is of limited availability, especially in developing countries⁸,⁹,¹⁰,¹¹,¹²,¹³, and is underutilized in well-resourced areas such as the US¹⁵. In addition to improving DXA availability and utilization, opportunistic osteoporosis screening using other imaging modalities is a potential strategy to expand screening populations. In CGMH, approximately 80% of patients aged 40–90 years with pelvis or spine radiographs had not been screened by DXA previously. Our automated tool can evaluate fracture risk using radiographs already acquired for other indications and thereby identify at-risk patients who have not been screened by DXA, without additional cost, time, or radiation.

The performance of the tool is robust with DXA as a reference and compared favorably with other opportunistic osteoporosis screening tools based on CT attenuation of the spine (area under the receiver-operator curve [AUROC], 0.83)²² and machine-learning-based T-score simulation (accuracy, 82%)²³. In comparison, our tool correlated well with gold standard Hologic or GE DXA-measured BMD in both internal and external testing sets with excellent discriminatory performance to classify hip and spine osteoporosis (AUROC, 0.97 and 0.92, respectively) and stratify patient fracture risks. Clinical testing of our automated tool in consecutive patients with pelvis or spine radiographs found that approximately 80% of them could be automatically screened for osteoporosis. Among them, our tool classified osteoporosis with excellent PPV or NPV for osteoporosis for 5206 patients who were mostly not examined by DXA; during the same period, 3008 DXA examinations were conducted. The real-world evidence demonstrated that our automated tool could expand opportunistic screening to a broader population at risk.

BMD is not the only determinant of fracture risk³². A history of osteoporotic fracture is one of the clinical risk factors in FRAX, but such fractures often go unnoticed because many patients with occult hip fractures and VCFs are asymptomatic²²,³³. We exploited the excellent spatial resolution of radiographs to identify hip implants and unsuspected fragility fractures before estimation of BMD. The tool incorporates our previously published PelviXNet³⁴ to detect hip fracture and newly developed algorithms to detect hip implants. Furthermore, we developed a VCF assessment algorithm based on a Deep Adaptive Graph network (DAG)³⁵, which determines anatomical landmarks for standard six-point vertebral morphometry that facilitates VCF detection using the widely accepted semiquantitative Genant visual method³⁶,³⁷. Therefore, our tool could evaluate fragility fracture risk based on a single radiograph. However, other patient-related clinical risk factors (e.g., comorbidity, medication, and lifestyle) require input from electronic medical records.

Opportunistic osteoporosis screening using other imaging modalities has been reported previously, but none has been clinically examined as comprehensively as in our study. The best-studied strategy is the use of abdominal CT to predict spine BMD¹⁹,²⁰,²³, classify osteoporosis based on CT attenuation²², simulated BMD¹⁹,²⁰, or T-score²³, or detect osteoporotic fractures³⁸; or the use of imaging biomarkers to predict the risk of fractures²⁴. Smets et al. reviewed machine learning solutions for osteoporosis³⁹. Among five studies using CT scans to predict BMD, the best correlation coefficient between estimated and CT-simulated spine BMD was 0.94²¹. An earlier study compared the CT Hounsfield units over a manually annotated ROI involving vertebral body trabecular bone with its paired DXA T-score; this approach for detection of osteoporosis yielded an AUROC of 0.83²². Deep learning-based models provided a better correlation between predicted and reference values, but were only validated in small datasets¹⁹,²⁰,²³. A study testing the performance of simulated T-scores on a larger dataset of 1843 CT-DXA pairs achieved an accuracy of 82% to detect osteoporosis²³. This algorithm was integrated with VCF identification and CT trabecular density as biomarkers, and its performance for the prediction of 5-year fracture risks compared favorably with the performance of FRAX-NB²⁴. Osteoporosis and fragility fracture risk have also been assessed on dental⁴⁰,⁴¹, hip⁴²,⁴³, and spine radiographs⁴¹,⁴⁴, and magnetic resonance imaging⁴⁵. However, only three were validated against standard DXA-based hip or spine BMD. The best AUROC was 0.92 for hip⁴² and 0.73 for spine osteoporosis classification using small testing sets (131 and 345 patients, respectively)⁴⁴. These studies demonstrated the feasibility of using non-DXA modalities to screen for osteoporosis, although the applicability and usability of such tools in real clinical settings are questionable.

In contrast, the present study provided a fully automated tool enabling opportunistic screening for osteoporosis and evaluating fragility fracture risk using plain radiographs of the hip and spine. Our tool utilizes ubiquitous, low-cost radiographs that involve substantially lower radiation exposure than CT-based tools, and it assesses both the hip and the spine rather than the spine alone (as CT-based tools do). Furthermore, we envision that other musculoskeletal radiographs may also be used to predict bone density and fracture risks, regardless of the original purpose of the images. This strategy uses radiographs already taken for other indications and therefore requires no additional patient time or radiation exposure and only minimal cost, yet it may substantially improve the risk profiling for fragility fractures.

This study had several limitations. First, CGMH is a medical center in which the patients tend to have more severe diseases. A large proportion of patients have fractures or implants. Our study population may not have represented the healthier population. However, because the tool was developed based on the more complex population, the ROI localization, quality check, and BMD prediction processes can presumably be readily adapted to populations with fewer complications. In addition, the performance of our tool remains robust when testing on external data. Second, the calculation of FRAX in this study did not consider past medical history, medication use, family medical history, alcohol consumption, and smoking status because this information requires input from the hospital information system. However, the performance assessment should not vary much because these parameters are identical for FRAX based on the DXA-measured or model-predicted BMD. The clinical implementation of the tool can report full FRAX results when digital data are available. Third, the tool was created using the reference BMD values reported by Hologic DXA scanners alone. However, the model’s performance remains robust in a test set of paired GE DXA and plain radiographs and external sources. The GE BMD measurements were converted to corresponding Hologic BMD values using the algorithm provided by the Hologic manufacturer. It seems the conversion has a small negative effect on model performance. A specific model for GE DXA is needed to maximize performance. Fourth, the performance of the prediction tool is influenced by the radiograph image quality. In addition to existing fractures, accurate BMD prediction may be impeded by foreign bodies, implants, bowel gas, and bone pathologies (e.g., avascular necrosis or severe osteoarthritis). The actual rate of radiographs that could be evaluated for BMD and fracture risk is around 80% in our real-world test. Depending on a patient’s specific indications, radiographs are often examined repeatedly. Therefore, the per-patient success rate will potentially increase as more radiographs become available over time.

This study demonstrated that a robust opportunistic screening tool for osteoporosis and fracture risk assessment, based on conventional radiographs obtained for various indications, provided VCF detection, BMD, and fracture risk estimation in a fully automated process. This tool leveraged state-of-the-art deep learning algorithms to provide an efficient strategy for population-based opportunistic screening with minimal additional cost. Integrating this automated tool into the hospital information system may expand osteoporosis screening to a broader population at risk.

Methods

Setting

This study was approved by the Institutional Review Board at the CGMH (approval number: 202000254B0, 202100346B0, 202101180B0) and was conducted in accordance with the tenets of the Declaration of Helsinki. The requirement for informed consent was waived because the data used in this paper were fully de-identified to protect patient confidentiality. This study was performed using data from CGMH, the largest private hospital system in Taiwan, which includes seven acute hospitals with 10,050 beds that received 8.2 million outpatient visits and 2.4 million inpatient care visits. The study was conducted in collaboration between the CGMH and PAII Inc., a research subsidiary of Ping-An Technology that focuses on state-of-the-art computer vision algorithm development. PAII Inc. used clinical images and clinical data from CGMH to create automated BMD and fracture risk estimation tools. The provided data were fully encrypted to protect patient confidentiality. Except for the training and validation image data, PAII Inc. remained blinded to other clinical and testing datasets.

The study population consisted of 184,339 patients who had at least one central DXA from January 2006 to December 2020 and were aged 40–90 years on the DXA index date. The study population was also required to have adequate radiographs of the pelvis or lumbar spine within 180 days of the index date. For patients with multiple DXA and plain film radiographs, the earliest pair was used. We performed a quality check for plain films to ensure that these images were suitable for BMD prediction; after the exclusion of inadequate plain films, model building and testing were performed based on a cohort of 10,797 patients with at least one Hologic DXA-pelvis radiograph pair and 25,482 patients with at least one Hologic DXA-lateral lumbar spine radiograph pair (Supplementary Fig. 1). The patients were randomly allocated into the training and testing sets by simple random sampling without replacement, in which each patient has an equal probability of selection. Patients with GE DXA-plain film pairs were used as separate testing sets (hip testing set, n = 2060; spine testing set, n = 3346). We also included 34 pairs of GE DXA-hip radiographs and 179 pairs of DXA-lumbar spine radiographs from the Wuhan Hospital of Traditional Chinese Medicine for external validation.
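A patient-level simple random split without replacement can be sketched as follows. This is an illustration only: the function name, the fixed seed, and the test fraction are assumptions, since the text does not state the fraction or the software used for allocation.

```python
import random

def split_patients(patient_ids, test_fraction, seed=0):
    """Allocate each patient to exactly one of the training or testing sets."""
    unique_ids = sorted(set(patient_ids))  # one entry per patient
    rng = random.Random(seed)              # fixed seed for reproducibility
    n_test = round(len(unique_ids) * test_fraction)
    # random.sample draws without replacement, each patient equally likely.
    test_ids = set(rng.sample(unique_ids, n_test))
    train_ids = [pid for pid in unique_ids if pid not in test_ids]
    return train_ids, sorted(test_ids)

# Hypothetical patient identifiers and split fraction.
train_ids, test_ids = split_patients(range(100), test_fraction=0.5)
```

Splitting on patient identifiers rather than on images keeps all radiographs of one patient on the same side of the split.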

We also tested the algorithms in a clinical setting to ascertain the number and proportions of patients with hip or spine radiographs who may benefit from the tool. The algorithms were packaged in Docker containers and implemented on the PACS-linked inference platform of CGMH, based on the NVIDIA Triton architecture. We tested the model using consecutive radiographs acquired between January 2021 and May 2021.

BMD measurement

Proximal femoral and lumbar spine DXA scans were performed using a Hologic QDR-4500A fan-beam densitometer (Bedford, MA, USA) during 2005–2010 and a Hologic Discovery model A densitometer during the period 2011–2021. The GE DXA scanner was the Lunar iDXA system (Madison, WI). The scans were analyzed following recommendations issued by the Taiwan Radiological Society⁴⁶, amended from the International Society for Clinical Densitometry, ISCD (Supplementary methods)⁴⁷. Hip T-scores were calculated using the revised NHANES III white female reference values^48,49. Because there is no international reference standard for the lumbar spine BMD, lumbar T-scores were calculated using the manufacturer’s reference values. For each patient, the lowest T-score of the hip or lumbar vertebrae was used to categorize osteoporosis or calculate FRAX risk.
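The T-score is the number of reference standard deviations by which a BMD value differs from the young-adult reference mean. The sketch below illustrates that calculation and the use of the lowest T-score per patient with the conventional −2.5 cut-off. The reference mean and SD used in the example are placeholders for illustration, not the NHANES III or manufacturer values, and the function names are hypothetical.

```python
def t_score(bmd, ref_mean, ref_sd):
    """T-score relative to a young-adult reference population."""
    return (bmd - ref_mean) / ref_sd

def is_osteoporotic(t_scores, cutoff=-2.5):
    """Use the lowest T-score across the measured sites."""
    return min(t_scores) <= cutoff

# Placeholder reference values (hypothetical, in g/cm^2).
REF_MEAN, REF_SD = 0.94, 0.12

hip_t = t_score(0.58, REF_MEAN, REF_SD)    # clearly below the cut-off
spine_t = t_score(0.88, REF_MEAN, REF_SD)  # close to the reference mean
patient_is_osteoporotic = is_osteoporotic([hip_t, spine_t])
```

In practice each skeletal site needs its own reference mean and SD; a single pair is used here only to keep the example short.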

Acquisition and preprocessing of radiographs

The radiographs were collected from the PACS and anonymized before the study procedure. Most radiographs were produced using the Canon CDXI 710C (82.5% for the hip and 86% for the spine). The peak kilovoltage (kVp) range was mainly 70–80 kV for the hip and 90–95 kV for the lumbar spine. No performance difference was observed between different machines or kVp (Supplementary Table 6). The images were converted to grayscale and resized to a resolution of 0.15 mm × 0.15 mm pixel spacing, then stored as 12-bit images. A deep adaptive graph (DAG) landmark detection method was developed to formulate the anatomical landmarks of the pelvis and spine as graphs and to robustly and accurately detect these landmarks³⁵. We detected 16 anatomical landmarks on hip radiographs, including 12 landmarks on the pelvic boundary and four landmarks on the femoral head and trochanter. We detected six anatomical landmarks for each of the lumbar vertebrae on spine radiographs from L1 to L4. Based on the detected anatomical landmarks, ROIs were extracted from the radiographs and used as input for the BMD prediction model. For hip radiographs, ROIs of the left and right hips were extracted. For the lumbar spine, ROIs were extracted for each vertebra from L1 to L4. A schematic representation of the pipeline, models, and examples of the detected anatomical landmarks and ROIs used to predict BMD is shown in Fig. 1.
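The resampling and bit-depth step can be illustrated with plain Python lists. This is a minimal sketch under stated assumptions: the image is a 2D list of grayscale values, the source pixel spacing is known (for example from the image header), nearest-neighbour interpolation is used, and intensities are linearly rescaled to the 12-bit range. The interpolation method and all function names are assumptions, since the text does not specify them.

```python
TARGET_SPACING_MM = 0.15  # target pixel spacing stated in the text

def target_shape(rows, cols, row_mm, col_mm, target_mm=TARGET_SPACING_MM):
    """Number of rows and columns after resampling to the target spacing."""
    return (max(1, round(rows * row_mm / target_mm)),
            max(1, round(cols * col_mm / target_mm)))

def resample_nearest(image, row_mm, col_mm, target_mm=TARGET_SPACING_MM):
    """Nearest-neighbour resampling of a 2D list to the target spacing."""
    rows, cols = len(image), len(image[0])
    new_rows, new_cols = target_shape(rows, cols, row_mm, col_mm, target_mm)
    out = []
    for r in range(new_rows):
        src_r = min(rows - 1, int((r + 0.5) * target_mm / row_mm))
        out.append([
            image[src_r][min(cols - 1, int((c + 0.5) * target_mm / col_mm))]
            for c in range(new_cols)
        ])
    return out

def to_12bit(image, source_max):
    """Linearly rescale intensities to the 12-bit range 0..4095."""
    scale = 4095 / source_max
    return [[min(4095, max(0, round(v * scale))) for v in row] for row in image]

# A 2x2 image at 0.30 mm spacing becomes 4x4 at 0.15 mm spacing.
resampled = resample_nearest([[1, 2], [3, 4]], 0.30, 0.30)
rescaled = to_12bit([[0, 65535]], source_max=65535)
```

A production pipeline would more likely use an imaging library with bilinear or bicubic interpolation; the list-based version is meant only to show the geometry of the operation.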

Anatomical landmark detection via deep adaptive graph (DAG)

The anatomical landmarks were detected using DAG, a method introduced in our previous publication³⁵. The details of DAG are described in the Supplementary methods. In our experiment, the hip and spine DAG models were trained using 3306 pelvic radiographs and 1076 spine radiographs with expert annotations, respectively. The radiographs used to train the DAG models were excluded from the test sets used to evaluate the BMD estimation models. The DAG models were evaluated on 876 pelvic and 290 spine radiographs and reported localization errors of 4.29 ± 3.29 mm and 1.22 ± 3.23 mm, respectively.
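A localization error of this kind is typically the Euclidean distance between a predicted and an annotated landmark, converted to millimetres and summarized as mean and standard deviation. The sketch below assumes landmark coordinates in pixels and a pixel spacing of 0.15 mm; the function names, the use of the sample standard deviation, and the toy coordinates are assumptions.

```python
import math
import statistics

def localization_errors_mm(predicted, annotated, pixel_spacing_mm=0.15):
    """Per-landmark Euclidean distance in millimetres."""
    return [
        math.hypot(p[0] - a[0], p[1] - a[1]) * pixel_spacing_mm
        for p, a in zip(predicted, annotated)
    ]

def mean_and_sd(values):
    """Mean and sample standard deviation of the errors."""
    return statistics.mean(values), statistics.stdev(values)

# Toy landmarks as (x, y) pixel coordinates.
predicted_points = [(0, 0), (10, 0), (30, 40)]
annotated_points = [(0, 20), (10, 0), (0, 0)]
errors = localization_errors_mm(predicted_points, annotated_points)
mean_error, sd_error = mean_and_sd(errors)
```

Reporting the error in millimetres rather than pixels makes results comparable across images with different resolutions.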

Automated radiograph quality assessment procedure

Some medical conditions may affect the hip and vertebra anatomy, making plain films unsuitable for BMD estimation. The most common conditions include implantation and fracture. Therefore, we conducted an automated quality assessment to exclude hips and vertebrae with implants or fractures unsuitable for BMD prediction.

Quality assessment of hip radiographs

We detected hip fractures and implants (joint prostheses, screws, plates, or cement) in the quality assessment process and excluded the affected hips from the downstream BMD estimation. An existing model, PelviXNet³⁴, was used to detect hip fracture. PelviXNet consists of a DenseNet-121 backbone neural network and a Feature Pyramid Network and was trained on 5204 pelvic radiographs that had been annotated by experienced physicians using an efficient and flexible point-based annotation scheme. In addition to detecting hip fracture, we trained another network with an identical structure to PelviXNet using 2973 pelvic radiographs to detect implants. The maximum responses of the fracture and implant detection networks in the hip ROI were calculated as the classification scores for hip fracture and implant, respectively. The fracture detection model, PelviXNet, was evaluated on 1888 pelvic radiographs covering various medical conditions (e.g., implants and periprosthetic fracture) and reported 92.4% sensitivity and 90.8% specificity. The implant detection model was evaluated on 715 randomly selected pelvic radiographs and reported 99.9% sensitivity and 99.7% specificity.
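Taking the maximum response inside the ROI as the classification score can be sketched as follows. The example assumes a probability map stored as a 2D list and a rectangular ROI given as half-open pixel bounds; the rectangular ROI, the 0.5 decision threshold, and the function names are illustrative assumptions and not values from the study.

```python
def max_response_in_roi(prob_map, roi):
    """Highest probability inside a rectangular ROI (top, left, bottom, right)."""
    top, left, bottom, right = roi
    return max(
        prob_map[r][c]
        for r in range(top, bottom)
        for c in range(left, right)
    )

def hip_passes_quality_check(fracture_map, implant_map, roi, threshold=0.5):
    """A hip is kept only if neither detector fires inside its ROI."""
    fracture_score = max_response_in_roi(fracture_map, roi)
    implant_score = max_response_in_roi(implant_map, roi)
    return fracture_score < threshold and implant_score < threshold

# Toy 3x4 probability maps; the ROI covers the two rightmost columns.
fracture_map = [[0.0, 0.1, 0.2, 0.1],
                [0.9, 0.1, 0.3, 0.2],
                [0.0, 0.0, 0.1, 0.1]]
implant_map = [[0.0] * 4 for _ in range(3)]
hip_roi = (0, 2, 3, 4)
score = max_response_in_roi(fracture_map, hip_roi)
keep_hip = hip_passes_quality_check(fracture_map, implant_map, hip_roi)
```

Note that the strong response of 0.9 lies outside the ROI in this toy map and therefore does not affect the score of this hip.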

Quality assessment of spine radiographs

The adult official positions of the ISCD advise excluding vertebrae that are abnormal and non-assessable or that differ by more than 1.0 T-score from adjacent vertebrae⁵⁰. Therefore, the automated quality assessment procedure for spine radiographs was performed in three steps: implant and VCF detection, six-point morphology analysis, and assessment of the T-score of nearby vertebrae. The implant/VCF detection model had the same architecture as PelviXNet and was trained on 1485 expert-annotated lateral spine radiographs to produce probability maps for implant and VCF. The L1 to L4 vertebrae were classified as normal, VCF, or implant by the annotator. A supervision mask was then generated by filling the vertebra polygons produced by DAG using the annotated label.

Using the predicted implant and VCF probability maps, the maximum responses in the vertebrae polygons were regarded as the classification scores. Vertebrae with a positive implant or VCF detection results were excluded, and the remaining vertebrae were analyzed by six-point morphology. Specifically, six landmarks were detected for each vertebra, including two anterior points, two posterior points, and two middle points of the top and bottom vertebral plates. Four distances were calculated from these six points: anterior height h_a, posterior height h_p, middle height h_m, and vertebra width w. The three heights were calculated as the pairwise distances between the two anterior, posterior, and middle points. The vertebra width was calculated as the mean distance between the anterior and posterior points. Three criteria were used to identify vertebrae with abnormal deformity, following the widely accepted Genant visual semiquantitative method³⁷, with modifications to facilitate automated measurement and fracture detection:

$$\frac{\min(h_{\rm a}, h_{\rm p})}{\max(h_{\rm a}, h_{\rm p})} < 0.8, \tag{1}$$

$$\frac{h_{\rm m}}{\max(h_{\rm a}, h_{\rm p})} < 0.6, \tag{2}$$

$$\frac{\max(h_{\rm a}, h_{\rm p})}{w} < 0.55. \tag{3}$$

The first criterion aimed to detect wedge and crush fractures, where the anterior or posterior height was reduced. The second criterion aimed to detect a biconcave fracture, where the middle height was reduced. The last criterion aimed to detect severe VCF cases where the overall height of the vertebra was significantly reduced. If a vertebra met any of the three criteria, it was considered abnormal and excluded from downstream processing. These criteria only detected moderate to severe compression fractures to avoid ambiguity in determining mild or borderline deformities. Vertebrae that differed from their neighbors by more than one standard deviation (i.e., more than 1.0 T-score) were also excluded from the analysis. In compliance with the ISCD positions, only patients with two or more assessable vertebrae were included for analysis⁵⁰.
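The three deformity criteria can be written as a short function of the six landmarks. The sketch below is a minimal illustration: the landmark argument order and function names are hypothetical, and the width is interpreted as the mean of the top and bottom anterior-to-posterior distances, which is one reading of the description above.

```python
import math

def _dist(p, q):
    """Euclidean distance between two (x, y) points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def vertebral_measurements(ant_top, ant_bot, mid_top, mid_bot, post_top, post_bot):
    """Anterior, posterior and middle heights plus vertebra width."""
    h_a = _dist(ant_top, ant_bot)
    h_p = _dist(post_top, post_bot)
    h_m = _dist(mid_top, mid_bot)
    # Assumption: width is the mean of the top and bottom plate lengths.
    w = (_dist(ant_top, post_top) + _dist(ant_bot, post_bot)) / 2
    return h_a, h_p, h_m, w

def is_deformed(h_a, h_p, h_m, w):
    """True if any of the three ratio criteria is met."""
    tallest = max(h_a, h_p)
    wedge_or_crush = min(h_a, h_p) / tallest < 0.8   # criterion (1)
    biconcave = h_m / tallest < 0.6                  # criterion (2)
    collapsed = tallest / w < 0.55                   # criterion (3)
    return wedge_or_crush or biconcave or collapsed

# Toy vertebra with equal anterior and posterior heights (units arbitrary).
normal = vertebral_measurements((0, 0), (0, 30), (20, 2), (20, 28), (40, 0), (40, 30))
# Same vertebra with the anterior height reduced to 20.
wedge = vertebral_measurements((0, 0), (0, 20), (20, 2), (20, 24), (40, 0), (40, 30))
```

Because all three criteria are ratios, the result does not depend on whether the landmarks are expressed in pixels or millimetres.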

To evaluate the performance of the spine radiograph QA module, we randomly selected 200 spine radiographs from the test set and manually labeled implants and VCFs. The implant and VCF detection modules reported 91.5% and 93.2% sensitivity and 99.5% and 91.5% specificity, respectively. Some mild VCFs were not detected by the VCF detection module alone.

Algorithm development and training procedure for BMD prediction

We developed a deep learning algorithm to estimate the hip/spine BMD from each corresponding ROI. The neural network employs a backbone network to encode the input ROI as a feature vector, followed by two consecutive fully connected layers with ReLU activation functions to produce the estimated BMD. We evaluated multiple backbone networks (i.e., VGG-11, VGG-16, ResNet-18, ResNet-34) in earlier experiments and empirically found that VGG-16 and ResNet-34 produced the best results for spine and hip BMD prediction, respectively. The model using only image-based features already performed strongly, and the addition of age and sex did not improve its performance. Therefore, we chose to use the VGG-16 backbone without age and sex in later model development (Supplementary Tables 7 and 8). L1–L4 vertebrae have slightly different geometries and distinct BMD statistics; therefore, the model required the vertebra index to predict the BMD accurately. In the spine model, the vertebra index (from L1 to L4), encoded as a one-hot vector of length 4, was appended to the feature vector before the last fully connected layer. During training, ROIs were augmented by random affine transformation and subsequently resized to 512 × 512 pixels. The L1 distance between the predicted BMD and the ground-truth BMD obtained from DXA was used as the training loss. A fourfold cross-validation procedure was conducted, and ensemble learning was adopted to combine the predictions of the four trained models during inference. Pseudocode for BMD estimation is provided in Supplementary Tables 9 and 10.
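The pieces of this procedure that do not depend on the network itself can be sketched in plain Python: the one-hot vertebra index, the L1 training loss, and the combination of four fold models at inference. All function names are hypothetical. Averaging the fold predictions and taking the mean absolute error over a batch are assumptions, since the text states only that the predictions were combined and that the L1 distance was the loss.

```python
LEVELS = ("L1", "L2", "L3", "L4")


def one_hot_vertebra(level):
    """Encode a lumbar level as a one-hot vector of length 4."""
    if level not in LEVELS:
        raise ValueError(f"unknown vertebra level: {level}")
    return [1.0 if level == name else 0.0 for name in LEVELS]


def append_vertebra_index(features, level):
    """Concatenate image features with the one-hot vertebra index."""
    return list(features) + one_hot_vertebra(level)


def l1_loss(predicted, measured):
    """Mean absolute difference between predicted and DXA-measured BMD."""
    return sum(abs(p - m) for p, m in zip(predicted, measured)) / len(predicted)


def ensemble_predict(models, roi_features):
    """Assumption: fold models are combined by a simple average."""
    predictions = [model(roi_features) for model in models]
    return sum(predictions) / len(predictions)


# Toy stand-ins for four trained fold models; each returns a BMD in g/cm^2.
fold_models = [lambda x, b=b: b for b in (0.8, 0.9, 1.0, 1.1)]
features = append_vertebra_index([0.5, 0.2], "L3")
print(features)
print(ensemble_predict(fold_models, features))
```

In a real network the concatenated vector would feed the last fully connected layer; the sketch shows only the data flow.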

Evaluation of BMD prediction performance

Evaluation of all performance measures was performed only on the test datasets. The Bland–Altman plot visualized the agreement between predicted and measured BMD scores, and Pearson’s correlation coefficient was calculated. The tool’s calibration was evaluated by comparing the mean risk calculated based on predicted BMD and the mean risk based on DXA-measured BMD. The following measures were calculated to evaluate the overall calibration: calibration slope and calibration-in-the-large. Osteoporosis results were considered positive when T-score ≤ −2.5. Ten-year probabilities of major fracture and hip fracture with total hip BMD were calculated for each patient using the FRAX tool with risk estimators specific to the Taiwanese population (https://www.sheffield.ac.uk/FRAX/; FRAX Desktop Multi-Patient Entry, version 4.0). The FRAX parameters used in this study include age, sex, weight, height, and BMD. FRAX risks with and without BMD were calculated separately. For each patient, the lowest BMD was used to calculate the T-score and FRAX risk. Ten-year risk scores of ≥3% for hip fracture and ≥20% for major osteoporotic fracture were considered high-risk, based on the intervention threshold established in the Taiwan Osteoporosis Practice Guidelines⁵¹ and the recommendations of the National Osteoporosis Foundation⁵². The overall discriminative abilities to discern osteoporosis and high-risk patients were evaluated using the AUROC and AUPRC. Other measures were also calculated, including sensitivity, specificity, positive predictive value, and negative predictive value. Two-sided p values were reported throughout the manuscript. Analyses were conducted using Stata software, version 16.1 (StataCorp, College Station, TX, USA).
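A minimal Python sketch of the agreement and calibration measures follows. The formulas are standard textbook definitions adopted here as assumptions: Bland–Altman limits of agreement as bias ± 1.96 SD of the differences, calibration slope as the least-squares slope of measured on predicted values, and calibration-in-the-large as mean measured minus mean predicted. The paper does not spell out its exact formulas or sign convention, the function names are hypothetical, and the numbers are toy values.

```python
from statistics import mean, stdev


def bland_altman(predicted, measured):
    """Return (bias, lower limit, upper limit) of agreement."""
    diffs = [p - m for p, m in zip(predicted, measured)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


def calibration(predicted, measured):
    """Return (slope, calibration-in-the-large) under the stated assumptions."""
    mp, mm = mean(predicted), mean(measured)
    sxy = sum((p - mp) * (m - mm) for p, m in zip(predicted, measured))
    sxx = sum((p - mp) ** 2 for p in predicted)
    return sxy / sxx, mm - mp


def is_osteoporosis(t_score):
    """Positive when the T-score is at or below -2.5."""
    return t_score <= -2.5


def high_risk_flags(hip_risk_pct, major_risk_pct):
    """Apply the 3% hip and 20% major osteoporotic fracture thresholds."""
    return {"hip": hip_risk_pct >= 3.0, "major": major_risk_pct >= 20.0}


# Toy BMD values in g/cm^2, not study data.
predicted_bmd = [0.70, 0.80, 0.90, 1.00]
measured_bmd = [0.72, 0.79, 0.91, 1.02]
print(bland_altman(predicted_bmd, measured_bmd))
print(calibration(predicted_bmd, measured_bmd))
print(is_osteoporosis(-2.5), high_risk_flags(3.0, 19.9))
```

A slope near 1 and a calibration-in-the-large near 0 indicate that predictions track the DXA measurements without systematic over- or under-estimation.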

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The sample testing imaging data generated in this study have been deposited in a public repository (https://doi.org/10.5281/zenodo.5216219). The full original imaging data are available under restricted access owing to the policy of Chang Gung Memorial Hospital and data privacy laws. Researchers who are interested in our work can request access to the de-identified raw images for academic purposes. Requests should be made to the corresponding author, and access will be granted within a month. Use of the data is limited to research purposes, and redistribution of the data is not allowed.

Code availability

The code used to train and evaluate the model is not openly available because it relies on proprietary software packages and infrastructure, including a general deep learning model training and evaluation platform that is used not only for this study but also for other projects at PAII Lab. Pseudocode is provided in Supplementary Tables 9 and 10. We provide a Gigantum project (https://gigantum.com/xraybmd/nc-bmd-cpu) to test our model. The instructions for the Gigantum project are provided at the end of the supplementary materials. The inference services are available from the corresponding author upon request. Researchers who are interested in our work can request access to the Gigantum project or inference services for academic purposes. Requests should be made to the corresponding author, and access will be granted within a month.

References

Johnell, O. & Kanis, J. A. An estimate of the worldwide prevalence and disability associated with osteoporotic fractures. Osteoporos. Int. 17, 1726–1733 (2006).
Sanchez-Riera, L. et al. The global burden attributable to low bone mineral density. Ann. Rheum. Dis. 73, 1635–1645 (2014).
Cree, M., Carriere, K. C., Soskolne, C. L. & Suarez-Almazor, M. Functional dependence after hip fracture. Am. J. Phys. Med. Rehabil. 80, 736–743 (2001).
Nazrun, A. S., Tzar, M. N., Mokhtar, S. A. & Mohamed, I. N. A systematic review of the outcomes of osteoporotic fracture patients after hospital discharge: morbidity, subsequent fractures, and mortality. Ther. Clin. Risk Manag. 10, 937–948 (2014).
Bliuc, D. et al. Mortality risk associated with low-trauma osteoporotic fracture and subsequent fracture in men and women. JAMA 301, 513–521 (2009).
Saito, T. et al. Effectiveness of anti-osteoporotic drugs to prevent secondary fragility fractures: systematic review and meta-analysis. Osteoporos. Int. 28, 3289–3300 (2017).
Kanis, J. A. et al. Development and use of FRAX in osteoporosis. Osteoporos. Int. 21, S407–S413 (2010).
Kanis, J. A. & Johnell, O. Requirements for DXA for the management of osteoporosis in Europe. Osteoporos. Int. 16, 229–238 (2005).
International Osteoporosis Foundation. The Eastern European and Central Asian Regional Audit—epidemiology, costs and burden of osteoporosis in 2010. https://www.osteoporosis.foundation/sites/iofbonehealth/files/2019-06/2010_Eastern_European_Central_Asian_Audit_English.pdf (2013).
International Osteoporosis Foundation. The Middle East and Africa Regional Audit—epidemiology, costs and burden of osteoporosis in 2011. https://www.osteoporosis.foundation/sites/iofbonehealth/files/2019-06/2011_Middle_East_Africa_Audit_English.pdf (2013).
International Osteoporosis Foundation. The Asia-Pacific Regional Audit—epidemiology, costs and burden of osteoporosis in 2013. https://www.osteoporosis.foundation/sites/iofbonehealth/files/2019-06/2013_Asia_Pacific_Audit_English.pdf (2013).
International Osteoporosis Foundation. The Latin America Regional Audit—epidemiology, costs and burden of osteoporosis in 2012. https://www.osteoporosis.foundation/sites/iofbonehealth/files/2019-06/2012_Latin_America_Audit_English.pdf (2013).
Kanis, J. A. et al. SCOPE 2021: a new scorecard for osteoporosis in Europe. Arch. Osteoporos. 16, 82 (2021).
Compston, J. E., McClung, M. R. & Leslie, W. D. Osteoporosis. Lancet 393, 364–376 (2019).
Curtis, J. R. et al. Longitudinal trends in use of bone mass measurement among older americans, 1999–2005. J. Bone Miner. Res. 23, 1061–1067 (2008).
Michael Lewiecki, A. J. S. et al. Geographic variation in prevalence of osteoporosis diagnosis and utilization of anti-osteoporosis therapies in United States female medicare fee-for-service beneficiaries with fragility fractures. In: The American Society for Bone and Mineral Research Annual Meeting (The American Society for Bone and Mineral Research, 2020).
Williams, S. D. S., Weiss, R., Wang, Y., Arora, T. & Curtis, J. Characterization of older male patients with a fragility fracture [abstract]. Arthritis Rheumatol. 72, 1082 (2020).
Overman, R. A. et al. DXA utilization between 2006 and 2012 in commercially insured younger postmenopausal women. J. Clin. Densitom. 18, 145–149 (2015).
Yasaka, K., Akai, H., Kunimatsu, A., Kiryu, S. & Abe, O. Prediction of bone mineral density from computed tomography: application of deep learning with a convolutional neural network. Eur. Radiol. 30, 3549–3557 (2020).
Fang, Y. et al. Opportunistic osteoporosis screening in multi-detector CT images using deep convolutional neural networks. Eur. Radiol. 31, 1831–1842 (2020).
Gonzalez, G., Washko, G. R. & Estepar, R. S. J. Deep learning for biomarker regression: application to osteoporosis and emphysema on chest CT scans. Proc. SPIE Int. Soc. Opt. Eng. 10574, 105741H (2018).
Pickhardt, P. J. et al. Opportunistic screening for osteoporosis using abdominal computed tomography scans obtained for other indications. Ann. Intern. Med. 158, 588–595 (2013).
Krishnaraj, A. et al. Simulating dual-energy X-ray absorptiometry in CT using deep-learning segmentation cascade. J. Am. Coll. Radiol. 16, 1473–1479 (2019).
Dagan, N. et al. Automated opportunistic osteoporotic fracture risk assessment using computed tomography scans to aid in FRAX underutilization. Nat. Med. 26, 77–82 (2020).
Benhamou, C. L. et al. Fractal analysis of radiographic trabecular bone texture and bone mineral density: two complementary parameters related to osteoporotic fractures. J. Bone Miner. Res. 16, 697–704 (2001).
Pothuaud, L. et al. Fractal analysis of trabecular bone texture on radiographs: discriminant value in postmenopausal osteoporosis. Osteoporos. Int. 8, 618–625 (1998).
Touvier, J. et al. Fracture discrimination by combined bone mineral density (BMD) and microarchitectural texture analysis. Calcif. Tissue Int. 96, 274–283 (2015).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Lindsey, R. et al. Deep neural network improves fracture detection by clinicians. Proc. Natl Acad. Sci. USA 115, 11591–11596 (2018).
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410 (2016).
Ardila, D. et al. End-to-end lung cancer screening with three-dimensional deep learning on low-dose chest computed tomography. Nat. Med. 25, 954–961 (2019).
Siris, E. S. et al. Bone mineral density thresholds for pharmacological intervention to prevent fractures. Arch. Intern. Med. 164, 1108–1112 (2004).
Siris, E. S. et al. The effect of age and bone mineral density on the absolute, excess, and relative risk of fracture in postmenopausal women aged 50–99: results from the National Osteoporosis Risk Assessment (NORA). Osteoporos. Int. 17, 565–574 (2006).
Cheng, C. T. et al. A scalable physician-level deep learning algorithm detects universal trauma on pelvic radiographs. Nat. Commun. 12, 1066 (2021).
Weijian, L. et al. Structured landmark detection via topology-adapting deep graph learning. In European Conference on Computer Vision 9, 266–283, https://arXiv.org/2004.08190 (2020).
Guglielmi, G. et al. Vertebral morphometry: current methods and recent advances. Eur. Radiol. 18, 1484–1496 (2008).
Genant, H. K., Wu, C. Y., van Kuijk, C. & Nevitt, M. C. Vertebral fracture assessment using a semiquantitative technique. J. Bone Miner. Res. 8, 1137–1148 (1993).
Tomita, N., Cheung, Y. Y. & Hassanpour, S. Deep neural networks for automatic detection of osteoporotic vertebral fractures on CT scans. Comput. Biol. Med. 98, 8–15 (2018).
Smets, J., Shevroja, E., Hugle, T., Leslie, W. D. & Hans, D. Machine learning solutions for osteoporosis—a review. J. Bone Miner. Res. 36, 833–851 (2021).
Kavitha, M. S., Asano, A., Taguchi, A., Kurita, T. & Sanada, M. Diagnosis of osteoporosis from dental panoramic radiographs using the support vector machine method in a computer-aided system. BMC Med. Imaging 12, 1 (2012).
Lee, K. S., Jung, S. K., Ryu, J. J., Shin, S. W. & Choi, J. Evaluation of transfer learning with deep convolutional neural networks for screening osteoporosis in dental panoramic radiographs. J. Clin. Med. 9, 392 (2020).
Yamamoto, N. et al. Deep learning for osteoporosis classification using hip radiographs and patient clinical covariates. Biomolecules 10, 1534 (2020).
Sapthagirivasan, V. & Anburajan, M. Diagnosis of osteoporosis by extraction of trabecular features from hip radiographs using support vector machine: an investigation panorama with DXA. Comput. Biol. Med. 43, 1910–1919 (2013).
Zhang, B. et al. Deep learning of lumbar spine X-ray for osteopenia and osteoporosis screening: a multicenter retrospective cohort study. Bone 140, 115561 (2020).
Ferizi, U. et al. Artificial intelligence applied to osteoporosis: a performance comparison of machine learning algorithms in predicting fragility fractures from MRI data. J. Magn. Reson. Imaging 49, 1029–1038 (2019).
Taiwan Radiological Society. Best practices for dual-energy X-ray absorptiometry. https://www.rsroc.org.tw/news/news_detail.asp?news_id=1426&NType=1 (2017).
Lewiecki, E. M. et al. Best practices for dual-energy X-ray absorptiometry measurement and reporting: International Society for clinical densitometry guidance. J. Clin. Densitom. 19, 127–140 (2016).
Kanis, J. A. et al. A reference standard for the description of osteoporosis. Bone 42, 467–475 (2008).
Binkley, N. et al. Recalculation of the NHANES database SD improves T-score agreement and reduces osteoporosis prevalence. J. Bone Miner. Res. 20, 195–201 (2005).
The International Society for Clinical Densitometry. The adult official positions of the ISCD as updated in 2019. https://iscd.org/learn/official-positions/adult-positions (2019).
Health Promotion Administration, Ministry of Health and Welfare, Taiwan. Taiwan Osteoporosis Practice Guidelines. https://www.hpa.gov.tw/Pages/Detail.aspx?nodeid=1053&pid=5994 (2018).
Dawson-Hughes, B. et al. Implications of absolute fracture risk assessment for osteoporosis practice guidelines in the USA. Osteoporos. Int. 19, 449–458 (2008).

Acknowledgements

The authors thank Chang Gung Memorial Hospital (CLRPG3H0012, CLRPG3H0013, CORPG3J0191) and the Ministry of Science and Technology (MOST 109-2321-B-182A-007-) for funding this research. The authors also thank Ms. Meng-Jiun Chiou and Yu-Ying Chen for data preparation and statistical analysis.

Author information

Authors and Affiliations

Division of Rheumatology, Allergy and Immunology, Chang Gung Memorial Hospital, Taoyuan, Taiwan
Chen-I Hsieh & Chang-Fu Kuo
PAII Inc., Bethesda, MD, USA
Kang Zheng, Le Lu, Weijian Li, Yirui Wang, Xiaoyun Zhou, Fakai Wang, Shun Miao & Chang-Fu Kuo
Center for Artificial Intelligence in Medicine, Chang Gung Memorial Hospital, Taoyuan, Taiwan
Chihung Lin
Wuhan Hospital of Traditional Chinese Medicine, Wuhan, China
Ling Mei
Department of Medicine, College of Medicine, Chang Gung University, Kwei-Shan, Taoyuan, Taiwan
Fang-Ping Chen & Chang-Fu Kuo
Department of Obstetrics and Gynecology, Osteoporosis Prevention and Treatment Center, Keelung Chang Gung Memorial Hospital, Keelung, Taiwan
Fang-Ping Chen
Ping An Insurance (Group) Company of China, Ltd., Shenzhen, Guangdong, China
Guotong Xie & Jing Xiao

Contributions

C.I.H., K.Z. and C.H.L. contributed equally to the manuscript. C.F.K., C.I.H., S.M. and L.L. conceived the work; C.F.K., S.M. and L.L. designed the study; C.F.K., L.L., G.T.X. and J.X. obtained funding to support the work and built the research team for joint development; C.F.K., C.I.H., C.H.L., F.P.C. and L.M. acquired the data; K.Z., W.J.L., Y.R.W., X.Y.W. and F.K.W. labeled the images for further analysis; S.M., K.Z., W.J.L., Y.R.W., X.Y.W. and F.K.W. developed the deep learning model; C.F.K., J.X., S.M. and L.L. conducted the statistical analysis; S.M. created the new software for image labeling; C.H.L. prepared the real-world inference system and conducted the experiments; C.F.K., C.I.H., C.H.L. and S.M. drafted the manuscript; C.F.K., L.L., C.H.L., G.T.X. and J.X.H. substantively revised it. All the authors made substantial contributions to this work and critically reviewed the manuscript before submission.

Corresponding authors

Correspondence to Shun Miao or Chang-Fu Kuo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Didier Hans and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hsieh, CI., Zheng, K., Lin, C. et al. Automated bone mineral density prediction and fracture risk assessment using plain radiographs via deep learning. Nat Commun 12, 5472 (2021). https://doi.org/10.1038/s41467-021-25779-x

Received: 28 March 2021
Accepted: 01 September 2021
Published: 16 September 2021
DOI: https://doi.org/10.1038/s41467-021-25779-x
