Prediction of osteoporosis from simple hip radiography using deep learning algorithm

Jang, Ryoungwoo; Choi, Jae Ho; Kim, Namkug; Chang, Jae Suk; Yoon, Pil Whan; Kim, Chul-Ho

doi:10.1038/s41598-021-99549-6

Download PDF

Article
Open access
Published: 07 October 2021

Prediction of osteoporosis from simple hip radiography using deep learning algorithm

Ryoungwoo Jang¹^na1,
Jae Ho Choi²^na1,
Namkug Kim³,
Jae Suk Chang⁴,
Pil Whan Yoon⁵ &
…
Chul-Ho Kim⁶

Scientific Reports volume 11, Article number: 19997 (2021) Cite this article

8793 Accesses
25 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Despite being the gold standard for diagnosis of osteoporosis, dual-energy X-ray absorptiometry (DXA) could not be widely used as a screening tool for osteoporosis. This study aimed to predict osteoporosis via simple hip radiography using deep learning algorithm. A total of 1001 datasets of proximal femur DXA with matched same-side cropped simple hip bone radiographic images of female patients aged ≥ 55 years were collected. Of these, 504 patients had osteoporosis (T-score ≤ − 2.5), and 497 patients did not have osteoporosis. The 1001 images were randomly divided into three sets: 800 images for the training, 100 images for the validation, and 101 images for the test. Based on VGG16 equipped with nonlocal neural network, we developed a deep neural network (DNN) model. We calculated the confusion matrix and evaluated the accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). We drew the receiver operating characteristic (ROC) curve. A gradient-based class activation map (Grad-CAM) overlapping the original image was also used to visualize the model performance. Additionally, we performed external validation using 117 datasets. Our final DNN model showed an overall accuracy of 81.2%, sensitivity of 91.1%, and specificity of 68.9%. The PPV was 78.5%, and the NPV was 86.1%. The area under the ROC curve value was 0.867, indicating a reasonable performance for screening osteoporosis by simple hip radiography. The external validation set confirmed a model performance with an overall accuracy of 71.8% and an AUC value of 0.700. All Grad-CAM results from both internal and external validation sets appropriately matched the proximal femur cortex and trabecular patterns of the radiographs. The DNN model could be considered as one of the useful screening tools for easy prediction of osteoporosis in the real-world clinical setting.

Segment anything in medical images

Article Open access 22 January 2024

Jun Ma, Yuting He, … Bo Wang

Towards a general-purpose foundation model for computational pathology

Article 19 March 2024

Richard J. Chen, Tong Ding, … Faisal Mahmood

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

Article 07 December 2020

Fabian Isensee, Paul F. Jaeger, … Klaus H. Maier-Hein

Introduction

Osteoporosis is a common condition, especially in postmenopausal women; however, it often remains undetected until after fracture occurs. Early detection of osteoporosis is greatly important in preventing osteoporotic fractures. In the United States, the incidence of osteoporosis-related fractures is more than four times higher compared to that of stroke, heart attack, and breast cancer¹, and based on the meeting report of the World Health Organization (WHO), osteoporotic fractures account for more hospital bed-days than those diseases in several high-income countries². Hip fractures, one of the major osteoporotic fractures, are associated with limitations in ambulation, chronic pain and disability, loss of independence, and decreased quality of life, and 21%–30% of patients who have hip fracture die within 1 year³.

To date, the gold standard for osteoporosis diagnosis is the estimation of bone mineral density (BMD) in the hip and lumbar spine using dual-energy X-ray absorptiometry (DXA)⁴. According to the WHO guidelines, BMD ≤ 2.5 standard deviations below the young adult mean (T-score ≤ − 2.5) indicates osteoporosis, while a T-score at any site between − 1.0 and − 2.5 indicates low bone mass or osteopenia. Moreover, the US Preventive Services Task Force has recommended screening for osteoporosis with BMD testing to prevent osteoporotic fractures in women aged ≥ 65 years³.

However, even though DXA is the gold standard of osteoporosis diagnosis, it could not be widely used as a screening tool for osteoporosis because of its high cost and limited availability in developing countries^5,6. To overcome these limitations, until now, great efforts have been made to develop a screening tool for osteoporosis. Quantitative ultrasonography is one of them, which has been developed as an alternative to DXA for screening osteoporosis. It is portable and more economical than DXA; however, it is insufficient to replace DXA as a screening tool for osteoporosis⁶. Furthermore, there are also various clinical risk assessment tools that have been developed to predict osteoporosis, including Fracture Risk Assessment Tool (FRAX), QFracture algorithm, Garvan Fracture Risk Calculator, and the Osteoporosis Self-assessment Tool^6,7. These various risk assessment tools are easily accessible and useful; particularly, the FRAX calculator is a major achievement in terms of understanding and measuring fracture risk. However, a few limitations also exist, such as the lack of consideration of racial and ethnic group difference, especially those regarding body mass index and mortality rate⁸. Therefore, an advanced screening tool for osteoporosis is still needed in clinical practice.

Recently, artificial intelligence (AI) has been used for various medical imaging interpretation fields. Moreover, several studies attempted to apply AI technology for the development of a screening tool for osteoporosis. Based on simple radiographic data, there were a few trials to predict osteoporosis using machine learning or deep learning algorithm^4,9. However, to the best of our knowledge, there were some limitations in the development of a real-world screening tool, such as inappropriate object detection results and extremely small sample size with methodological flaws.

This study aimed to predict osteoporosis from simple hip radiography by developing a deep neural network (DNN) model, which could be considered as a screening tool for osteoporosis by applying the latest techniques in the medical AI field, minimizing the previous limitations.

Results

The mean age of all 1001 female patients was 72.5 ± 12.5 years (range 55.2–100.5). Moreover, 440 right hips and 561 left hips were included. The mean BMD of the total hip was 0.715 ± 0.162 g/cm² (range 0.20–1.34), and the mean T-score of the total hip was − 2.2 ± 1.3 (range − 6.5–3.1).

Validation of the deep learning model performance

The confusion matrix of the trained neural network applying the 101 test sets is shown in Table 1. Our final DNN model showed an overall accuracy of 81.2%, sensitivity (recall) of 91.1%, and specificity of 68.9%. The positive predictive value (PPV), which indicates “precision,” was 78.5%, and the negative predictive value (NPV) was 86.1%.

Table 1 Confusion matrix of the final model.

Full size table

Furthermore, to evaluate the performance of our classification model, we drew the receiver operating characteristic (ROC) curve shown in Fig. 1. The area under curve (AUC) value was 0.867, which indicates excellent performance¹⁰.

Visualization of model performance: gradient-based class activation map (Grad-CAM) result

All Grad-CAM results from the 101 test sets were confirmed as appropriate by two orthopedic surgeons by perfect agreement (κ = 1.000). Moreover, to predict osteoporosis, we illustrated the Grad-CAM result for true positive, false positive, false negative, and true negative (Fig. 2). Not only the true positive and true negative results but also all false positive and false negative Grad-CAM results appropriately matched not only the cortex line but also the trabecular patterns of proximal femur radiographs, which indicates the validity of our classification model (see Supplementary File S1 online).

External validation

The confusion matrix of the external validation cohorts from 117 datasets is shown in Table 2. The performance of the model showed an overall accuracy of 71.8%, sensitivity (recall) of 83.7%, and specificity of 38.7%. The PPV, which indicates “precision,” was 79.1%, and the NPV was 46.2%. The AUC value was 0.700, which indicates acceptable performance¹⁰ (Fig. 1). We also confirmed Grad-CAM results from all 117 datasets to visually verify the model performance (see Supplementary File S2 online).

Table 2 Confusion matrix of the external validation.

Full size table

Discussion

DXA, which is regarded as the gold standard of osteoporosis diagnosis, uses the spectral imaging with measurement of the differences of energy levels from two X-ray beams¹¹. On the contrary, from the single hip radiographs, decreased BMD can be appreciated by decreased cortical thickness and loss of bony trabeculae in the early stages in simple radiographs (Fig. 3). Indeed, several previous studies in both orthopedic and other various clinical fields have reported that the cortical thickness or trabecular pattern could predict the BMD^9,12. Therefore, herein, we hypothesized that if we used a deep learning methodology, we could predict the presence of osteoporosis using simple hip radiography. Even though the radiation dose of simple hip radiography is higher compared to DXA, in case DXA is not available, especially in developing countries, or patients have already undergone simple hip radiography for other symptoms, the use of simple hip radiography for screening osteoporosis could be a good alternative to DXA.

In this study, we developed a DNN model that could predict osteoporosis only from a single hip radiograph, with > 80% accuracy and > 90% sensitivity with near 70% specificity based on VGG16 equipped with nonlocal neural network (NLNN) model. We believe that the current proposed model with high accuracy and sensitivity could be considered a useful screening tool for the easy diagnosis of osteoporosis in the real-world clinical setting.

Previously, there were several attempts to predict osteoporosis using simple radiography based on machine learning. In 2012, Harrar et al.¹³ compared the multilayer perceptron neural network to other artificial neural networks to predict osteoporosis defined by DXA in their study using calcaneus radiography. Moreover, in the following year, Kavitha et al.¹⁴ reported the combination method of histogram-based automatic clustering and support vector machine technique for the prediction of osteoporosis diagnosed by DXA using mandibular cortical bone in the fields of dentistry. However, in the clinical setting, the BMD from DXA is usually based on the examination of the lumbar spine and hip region, so there was a fundamental query in performing the validation between DXA on the lumbar spine/hip regions and other bones in the human body. Recently, some authors reported that their studies tried to predict osteoporosis from lumbar spine or hip radiographs. In 2020, Zhang et al.¹⁵ used a dataset of 1616 lumbar spine radiographs from 808 postmenopausal women and introduced a deep convolutional neural network (CNN) model to classify the osteoporosis, osteopenia, and normal groups. Moreover, Yamamoto et al.⁴ used the deep learning model to classify osteoporosis using 1131 simple hip radiographs, which is similar to our study. However, both studies have some limitations: The former even performed three-class osteoporosis classification with great performance of the training dataset; in the test dataset, they only showed sensitivity of 57.9–89.3% on screening osteoporosis and sensitivity of 50.0–85.3% on screening osteopenia, which showed lack of consistency. Moreover, they have several potential biases from the characteristics of spine radiography, which is more sensitive to degenerative changes compared to the hip region or several overlying structures near the spine axis. There was also a critical query in the latter study by Yamamoto et al.; even though they reported high prediction performance, such as accuracy, precision, recall, and specificity numerically, in visualization of their classification model from Grad-CAM, their heatmap result was not distributed appropriately to the proximal femoral cortex or trabecular structure, around the lesser trochanter without the proximal femur or pelvic bone as they mentioned. They indicated definitely different regions of interest of DXA, even though they diagnosed and labeled osteoporosis/nonosteoporosis in the DXA result. However, compared to these previous studies, we achieved a favorable performance from our DNNs, which could predict osteoporosis from simple hip radiographs, and the visual explanation of our result from Grad-CAM was also appropriately matched with the proximal femur structure. To the best of our knowledge, this is the first study that showed acceptable osteoporosis diagnostic efficacy using a deep learning model suggesting appropriate Grad-CAM results.

In this study, considering the neural network algorithm, CNN is an efficient algorithm for image processing. However, there are some drawbacks when using CNN architecture. The most significant part related to this study is that CNN only focuses on local features, not global features. As CNN uses a filter map for its linear transformation, CNN only sees locally. In this study, the X-ray dose is not modulated during radiography, and there might be a difference between images of soft tissue intensity divided by bone intensity. Therefore, we contemplated on how to overcome the different windowing levels automatically, and NLNN provided part of the solution. As NLNN works by generating one correlation matrix between every pixel, NLNN provides global information to the CNN network. This methodology is known as attention mechanism and used in various algorithms, such as transformer¹⁶. Without NLNN, we were unable to train the model at all. However, with NLNN, our model showed reasonable performance with high sensitivity.

This study has several limitations. First, when diagnosing osteoporosis in the clinical setting, we usually consider the BMD from DXA on both the lumbar spine and hip region, but we only considered the hip region in this study because of the complexity of the calculation criteria and various conditions of each patient of spine BMD (e.g., for interpretation of spine DXA, there were numerous exceptions that we do not routinely calculate L1 to L4 BMD). The data preprocessing from both the lumbar spine and hip region could make establishing the neural network model more difficult. However, considering the screening tool, the current model could be helpful even though it did not contain data of the lumbar spine. Second, we did not classify the dataset into three classes—“osteoporosis,” “normal,” and “osteopenia”—but only classified them into two classes, “osteoporosis” or “nonosteoporosis.” Classification can be thought of as an algorithm that makes the decision boundary between classes in data manifold. Therefore, the more subtle the decision boundary is, the more difficult the training process becomes. Especially, osteopenia can be considered the gray zone for the classification task, leading to more difficult differentiation of osteoporosis and osteopenia. Moreover, the limitation of the relatively small number of included dataset was another main reason for the difficulty in dividing the data into three classes, including the “osteopenia” group and continuous outcome predicting model, which correlates actual BMD units. However, the natural progression from normal to osteopenia to osteoporosis is a continuous process, and in the clinical setting, we adopted the T-score for diagnosis of osteoporosis, not a BMD unit, and the two-class identification model could be more intuitive in some ways in a real-world clinical setting than three-class identification, as the screening tool of the current neural network model. Third, we only included postmenopausal women in the study, even though, clinically, they have a high risk of osteoporotic fracture. Indeed, recently, there were concerns about both underdiagnosed and undertreated osteoporosis in the older male population because men are typically not part of routinely recommended screening with DXA, so, in the future study, the larger neural network model should contain the data of the male population. Forth, the fundamental question exists for the definition of osteoporosis, on how well DXA reflects the real osteoporosis. Even though the WHO guideline defined osteoporosis as a T-score ≤ − 2.5 in postmenopausal women, DXA is also one of the examination modalities for evaluating BMD, not an absolute index for defining osteoporosis. Therefore, our current neural network model worked well and showed favorable performance to predict osteoporosis; however, these predictions sometimes might be not an actual osteoporosis prediction, and there could be a query that this only means consistency with the result of the DXA. In the future, further studies are needed, showing more accurate performance of both hip and spine radiography, like central DXA, with larger study datasets. Furthermore, comparisons with other examination modalities for osteoporosis, not only BMD-DXA, are required. Lastly, potential concerns regarding model performance might exist due to the discordance of AUC values between the internal and external validation sets, even though both AUC values indicated acceptable model performance¹⁰. However, we believe this may be due to the domain generalization issue. Moreover, to the best of our knowledge, no previous study has completely resolved the domain generalization issue. A few studies have attempted to solve it from a mathematical perspective, but they also finally handled this issue on a domain-by-domain basis, that is, as a multicenter study¹⁷. In the current study, we demonstrated the possibility of predicting osteoporosis using simple hip radiographs. Therefore, we assert that, using our proposed method, anyone can implement and train an osteoporosis prediction model using their own dataset.

In conclusion, even though there were some limitations in the study, the current deep learning network model could be a useful screening tool for the easy diagnosis of osteoporosis in the real-world clinical setting with high accuracy and sensitivity.

Methods

Study design and patient selection

This study aimed to establish a deep learning algorithm that classifies osteoporosis or nonosteoporosis defined by the T-score of DXA from simple hip radiographs. This study was approved by the Ethics Committee of Institutional Review Board of Asan Medical Center, Seoul, Republic of Korea (IRB No. 2019-1489). The Asan Medical Center ethics committee waived the need for the informed consent due to the retrospective nature of the study, and the analysis used anonymous clinical data. Data cannot be shared publicly because it contains potentially identifying information of each patient. Data are available from the Asan Medical Center Institutional Data Access/Ethics Committee (contact Asan Medical Center Institutional Review Board, Convergence Innovation Bldg. 88, Olympic-ro 43-gil, Songpa-gu, Seoul, Republic of Korea. Website link, http://eirb.amc.seoul.kr/; E-mail, irb@amc.seoul.kr; Phone, + 82-2-3010-7165). Data collection was performed in accordance with the relevant guidelines and regulations of the committee.

We retrospectively reviewed data with both simple hip anterior–posterior (AP) radiographs and DXA examination results of consecutive patients who visited our hip and pelvic trauma and disease clinic based on our outpatient clinic and inpatient pool between January 2010 and February 2019. We only included the data of patients who (1) were aged ≥ 55 years, (2) were female, and (3) underwent both hip AP radiography and DXA within a month. We excluded the data of patients who (1) had hip osteoarthritis, which potentially leads to BMD overestimation; (2) had other diseases with a specific condition that could result in a bias in DXA results, such as osteonecrosis of femoral head, tumorous condition, and calcific tendinitis of the hip; (3) underwent DXA of the operated hip; (4) underwent hip radiography, which contained foreign body materials on the same side of the DXA; and (5) had either radiography or DXA records from other hospitals. All included data, such as simple hip radiographs and records of DXA, were provided by a single center (Asan Medical Center, Seoul, Republic of Korea). Digital X-ray machines (Model: GM85, Samsung Electronics, Seoul, Korea; Model: GR40CW, Samsung Electronics, Seoul, Korea; Model: Discovery XR656, GE Healthcare, WI, USA; Model: Thunder Platform, GE Healthcare, WI, USA; Model: CXDI, Canon Inc., Tokyo, Japan) at 70 to 78 kV and 207 to 500 tube current were used by an experienced X-ray imaging technologist to acquire standard hip radiographs in the study population. BMD measurement by DXA was performed using Lunar Prodigy Advance system (GE Healthcare, WI, USA). We did not exclude the data of patients who had vascular calcification or Foley catheter on X-ray and did not consider hip dysplasia, even if there was no evidence of osteoarthritis progression. Moreover, we did not consider the position of the hip joint on radiography as an exclusion criterion affecting the quality assessment of hip simple radiographs, but this was regarded as normal variation of real-world data. Finally, the data of 1012 patients were collected.

Dataset preparation

A total of 1012 simple hip radiographs were divided into two groups: the osteoporosis group, defined by a T-score ≤ − 2.5, and the nonosteoporosis group, defined by a T-score > − 2.5 in DXA. The definition of osteoporosis followed the WHO diagnostic criteria¹⁸.

Of 1012 subjects, 513 subjects were diagnosed with osteoporosis, and the other 499 subjects were classified as nonosteoporotic. Of 1012 subjects, one had two radiographs (one subject). Except for this one radiograph, one licensed medical doctor (JR) manually cropped the simple hip bone X-ray images for 1011 subjects.

All images were cropped on the same side where the matched DXA examination was performed. The images were cropped on radiography, which fully contained the proximal femur region similar to the region of interest of DXA, defined as the acetabular roof as the superior border of the cropped region, as the lower margin of the lesser trochanter as the inferior border, lateral margin of the teardrop as the medial border, and lateral of the vastus ridge as the lateral border of the cropped image (Fig. 4).

We were unable to crop 10 images for inappropriate quality of the radiograph. Finally, we have 1001 images that were used for training, validation, and test sets. We randomly selected 80% for training set from entire datasets, and the remaining 10% for validation set and 10% for test set. We used 800 images for the training set (393 osteoporosis; 407 nonosteoporosis), 100 images for the validation set (55 osteoporosis; 45 nonosteoporosis), and 101 images for the test set (56 osteoporosis; 45 nonosteoporosis).

Training details

For training, the VGG16 network was chosen¹⁹ as a deep learning model. It is a CNN model that shows high performance. To increase the performance of the network, we not only used the VGG16 network itself but also implemented NLNN²⁰ inside the VGG16 network.

When feeding images into the network, due to the nonstandardized image acquiring process of X-ray, we normalized X-ray images with Z-transform with 2-sigma, which translates mean to 0, and then divided with 2-sigma. Furthermore, we randomly mentioned 2-sigma again with 1-sigma to make the model meet various windowing levels. In the mathematical term, we can express our Z-transform as

$${z}_{ij}=\frac{{x}_{ij}-\mu }{2\sigma \pm \epsilon \cdot \sigma }$$

where $\epsilon $ is drawn from a random uniform distribution. After Z-transformation, we indicated pixel values that are bigger than 1 as 1 and smaller than − 1 as − 1.

Moreover, we used the data augmentation technique. For data augmentation, five strategies were used: original imaging, blurring, sharpening, shearing, and rotation with a small angle. As there were five strategies, we set the number of iterations per epoch to be 4,000 iterations, which are 800 images multiplied by five strategies.

Additionally, we used Keras framework of Python language, with TensorFlow backend. Loss was set to be vanilla binary cross entropy, and we trained the model for 300 epochs. We selected the optimizer to be Adam²¹ with a learning rate of ${10}^{-6}$. After every epoch, validation loss and validation accuracy were calculated, and the model with the best validation accuracy was selected for the final model.

Model evaluation

After training, we calculated the confusion matrix to validate model performance, and from this, we evaluated the accuracy, sensitivity, specificity, PPV, and NPV. Moreover, we also drew the ROC curve and calculated the AUC. Furthermore, we used a Grad-CAM²² overlapping the original image to visualize the model performance. The Grad-CAM is a tool of visual explanations of DNNs, activating the mapping that the deep learning network is seeing. Thus, Grad-CAM is a tool for explainable AI. Through Grad-CAM, one can visualize where a deep learning algorithm sees using a gradient of deep learning algorithm at the image level. All Grad-CAM results of the 101 test sets were independently reviewed by two board-certified orthopedic surgeons who are the faculty of orthopedic hip and pelvis (KCH and YPW). The adequacy of Grad-CAM were evaluated by the distribution of the heatmap, through not only the cortex line but also the trabecular patterns of the proximal femur, which is checked as binary data (yes/no) for adequacy. The agreement between reviewers was correlated with kappa values a priori: κ = 1, corresponding to perfect agreement; 1.0 > κ ≥ 0.8, almost perfect agreement; 0.8 > κ ≥ 0.6, substantial agreement; 0.6 > κ ≥ 0.4, moderate agreement; 0.4 > κ ≥ 0.2, fair agreement; and κ < 0.2, slight agreement.

External validation

The neural network model was externally validated with the data that were prospectively collected from another university hospital between October 2020 and July 2021. Digital X-ray machines (Model: Innovision DXII, DK Medical, Seoul, Korea; Model: Discovery XR656, GE Healthcare, WI, USA) at 70 to 78 kV and 200 to 500 tube current were used in hip radiography, and BMD measurement by DXA was performed using Lunar Prodigy Advance system (GE Healthcare, WI, USA). A total 117 datasets (86 osteoporosis; 31 nonosteoporosis) were used for external validation. The overall accuracy, sensitivity, specificity, PPV, and NPV were calculated. The ROC curve and AUC value were calculated. We also used Grad-CAM to visually verify the model performance.

Ethical approval and informed consent

This study was approved by the Institutional Review Board of Asan Medical Center, Seoul, Republic of Korea (IRB No. 2019-1489), informed consent was waived due to the retrospective nature of the study, and the analysis used anonymous clinical data.

Consent for publication

All authors agreed to publish this manuscript.

Data availability

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to conditions of the ethics committee of our university.

References

Gharib, H. et al. American Association of Clinical Endocrinologists, Associazione Medici Endocrinologi, and European Thyroid Association Medical guidelines for clinical practice for the diagnosis and management of thyroid nodules. Endocr. Pract. 16, 1–43. https://doi.org/10.4158/10024.Gl (2010).
Article PubMed Google Scholar
Group, W. H. O. WHO Scientific Group on the Assessment of Osteoporosis at Primary Health Care Level 10–11 (World Health Organization, 2004).
Google Scholar
Force, U. S. P. S. T et al. Screening for osteoporosis to prevent fractures: US preventive services task force recommendation statement. JAMA 319, 2521–2531. https://doi.org/10.1001/jama.2018.7498 (2018).
Article Google Scholar
Yamamoto, N. et al. Deep learning for osteoporosis classification using hip radiographs and patient clinical covariates. Biomolecules https://doi.org/10.3390/biom10111534 (2020).
Article PubMed PubMed Central Google Scholar
Chin, K. Y. & Ima-Nirwana, S. Calcaneal quantitative ultrasound as a determinant of bone health status: What properties of bone does it reflect?. Int. J. Med. Sci. 10, 1778–1783. https://doi.org/10.7150/ijms.6765 (2013).
Article PubMed PubMed Central Google Scholar
Subramaniam, S., Ima-Nirwana, S. & Chin, K. Y. Performance of osteoporosis self-assessment tool (OST) in predicting osteoporosis: A review. Int. J. Environ. Res. Public Health https://doi.org/10.3390/ijerph15071445 (2018).
Article PubMed PubMed Central Google Scholar
Rubin, K. H., Friis-Holmberg, T., Hermann, A. P., Abrahamsen, B. & Brixen, K. Risk assessment tools to identify women with increased risk of osteoporotic fracture: Complexity or simplicity? A systematic review. J. Bone Miner. Res. 28, 1701–1717. https://doi.org/10.1002/jbmr.1956 (2013).
Article PubMed Google Scholar
Silverman, S. L. & Calderon, A. D. The utility and limitations of FRAX: A US perspective. Curr. Osteoporos. Rep. 8, 192–197. https://doi.org/10.1007/s11914-010-0032-1 (2010).
Article PubMed PubMed Central Google Scholar
Sapthagirivasan, V. & Anburajan, M. Diagnosis of osteoporosis by extraction of trabecular features from hip radiographs using support vector machine: An investigation panorama with DXA. Comput. Biol. Med. 43, 1910–1919. https://doi.org/10.1016/j.compbiomed.2013.09.002 (2013).
Article CAS PubMed Google Scholar
Yang, S. & Berdine, G. The receiver operating characteristic (ROC) curve. Southwest Respir. Crit. Care Chron. https://doi.org/10.12746/swrccc.v5i19.391 (2017).
Article Google Scholar
Pisani, P. et al. Screening and early diagnosis of osteoporosis through X-ray and ultrasound based techniques. World J. Radiol. 5, 398–410. https://doi.org/10.4329/wjr.v5.i11.398 (2013).
Article PubMed PubMed Central Google Scholar
Tafraouti, A., Hassouni, M. E., Toumi, H., Lespessailles, E. & Jennane, R. In 2014 Tenth International Conference on Signal-Image Technology and Internet-Based Systems 73–77 (2014).
Harrar, K., Hamami, L., Akkoul, S., Lespessailles, E. & Jennane, R. In 2012 3rd International Conference on Image Processing Theory, Tools and Applications (IPTA) 217–221 (2012).
Kavitha, M. S., Asano, A., Taguchi, A. & Heo, M.-S. The combination of a histogram-based clustering algorithm and support vector machine for the diagnosis of osteoporosis. Imaging Sci. Dent. https://doi.org/10.5624/isd.2013.43.3.153 (2013).
Article PubMed PubMed Central Google Scholar
Zhang, B. et al. Deep learning of lumbar spine X-ray for osteopenia and osteoporosis screening: A multicenter retrospective cohort study. Bone https://doi.org/10.1016/j.bone.2020.115561 (2020).
Article PubMed PubMed Central Google Scholar
Vaswani, A. et al. In Advances in neural information processing systems 5998–6008.
Nguyen, A. et al. Domain invariant representation learning with domain density transformations. Prepreint at https://arxiv.org/abs/2102.05082 (2021).
Cosman, F. et al. Clinician’s guide to prevention and treatment of osteoporosis. Osteoporos. Int. 25, 2359–2381. https://doi.org/10.1007/s00198-014-2794-2 (2014).
Article CAS PubMed PubMed Central Google Scholar
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv preprint (2014).
Wang, X., Girshick, R., Gupta, A. & He, K. In Proceedings of the IEEE conference on computer vision and pattern recognition 7794–7803.
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint (2014).
Selvaraju, R. R. et al. In Proceedings of the IEEE international conference on computer vision 618–626.

Download references

Author information

These authors contributed equally: Ryoungwoo Jang and Jae Ho Choi.

Authors and Affiliations

Department of Biomedical Engineering, University of Ulsan College of Medicine, Seoul, Republic of Korea
Ryoungwoo Jang
University of Ulsan College of Medicine, Seoul, Republic of Korea
Jae Ho Choi
Department of Convergence Medicine, University of Ulsan College of Medicine, Asan Medical Center, Seoul, Republic of Korea
Namkug Kim
Department of Orthopaedic Surgery, Good Gangan Hospital, Busan, Republic of Korea
Jae Suk Chang
Department of Orthopaedic Surgery, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Republic of Korea
Pil Whan Yoon
Department of Orthopaedic Surgery, Chung-Ang University Hospital, Chung-Ang University College of Medicine, Seoul, Republic of Korea
Chul-Ho Kim

Authors

Ryoungwoo Jang
View author publications
You can also search for this author in PubMed Google Scholar
Jae Ho Choi
View author publications
You can also search for this author in PubMed Google Scholar
Namkug Kim
View author publications
You can also search for this author in PubMed Google Scholar
Jae Suk Chang
View author publications
You can also search for this author in PubMed Google Scholar
Pil Whan Yoon
View author publications
You can also search for this author in PubMed Google Scholar
Chul-Ho Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.J. designed the study, performed the experiments, analyzed the data, and prepared the manuscript; J.H.C. designed the study, performed the experiments, analyzed the data, and prepared the manuscript; N.K. validated the data; J.S.C. collected the data; P.W.Y. collected the data; C.-H.K. designed the study, collected and interpreted the data, and prepared and revised the manuscript.

Corresponding author

Correspondence to Chul-Ho Kim.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure 1.

Supplementary Figure 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jang, R., Choi, J.H., Kim, N. et al. Prediction of osteoporosis from simple hip radiography using deep learning algorithm. Sci Rep 11, 19997 (2021). https://doi.org/10.1038/s41598-021-99549-6

Download citation

Received: 19 June 2021
Accepted: 22 September 2021
Published: 07 October 2021
DOI: https://doi.org/10.1038/s41598-021-99549-6

This article is cited by

Predicting the risk of inappropriate depth of endotracheal intubation in pediatric patients using machine learning approaches
- Jae-Geum Shim
- Eun Kyung Lee
- Jin Hee Ahn
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Segment anything in medical images

Towards a general-purpose foundation model for computational pathology

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

Introduction

Results

Validation of the deep learning model performance

Visualization of model performance: gradient-based class activation map (Grad-CAM) result

External validation

Discussion

Methods

Study design and patient selection

Dataset preparation

Training details

Model evaluation

External validation

Ethical approval and informed consent

Consent for publication

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Figure 1.

Supplementary Figure 2.

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Predicting the risk of inappropriate depth of endotracheal intubation in pediatric patients using machine learning approaches

Comments

Search

Quick links