Machine Learning for Diagnosis of Hematologic Diseases in Magnetic Resonance Imaging of Lumbar Spines

Hwang, Eo-Jin; Jung, Joon-Yong; Lee, Seul Ki; Lee, Sung-Eun; Jee, Won-Hee

doi:10.1038/s41598-019-42579-y

Download PDF

Article
Open access
Published: 15 April 2019

Machine Learning for Diagnosis of Hematologic Diseases in Magnetic Resonance Imaging of Lumbar Spines

Eo-Jin Hwang¹,
Joon-Yong Jung¹,
Seul Ki Lee¹,
Sung-Eun Lee² &
…
Won-Hee Jee¹

Scientific Reports volume 9, Article number: 6046 (2019) Cite this article

1840 Accesses
12 Citations
1 Altmetric
Metrics details

Subjects

Abstract

We aimed to assess feasibility of a support vector machine (SVM) texture classifier to discriminate pathologic infiltration patterns from the normal bone marrows in MRI. This retrospective study included 467 cases, which were split into a training (n = 360) and a test set (n = 107). A sagittal T1-weighted lumbar spinal MR image was normalized by an intervertebral disk, and bone marrows were segmented. The various kernel functions and SVM input dimensions were experimented to construct the most optimal classifier model. The accuracy and sensitivity increased as the number of training set sizes increased from 180 to 360. The test set was analyzed by SVM and two independent readers, and the accuracy and sensitivity of the SVM classifier, reader 1 and reader 2 were 82.2% and 85.5%, 79.4% and 82.3%, and 82.2% and 83.9%, respectively. The area under receiver operating characteristic curve (AUC) of the SVM classifier, reader 1 and reader 2 were 0.895, 0.879 and 0.880, respectively. The SVM texture classifier produced comparable performance to radiologists in isolating the hematologic diseases, which could support inexperienced physicians with spinal MRI to screen patients with marrow diseases, who need further diagnostic work-ups to make final decisions.

Segment anything in medical images

Article Open access 22 January 2024

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Article Open access 15 April 2024

Best practices for single-cell analysis across modalities

Article 31 March 2023

Introduction

Magnetic resonance imaging (MRI) of the spine is frequently performed in patients with low back pain because it directly visualizes vertebral column, spinal cords and nerves, and supporting soft tissue structures¹. Bone marrow is an important part to be interpreted in reading the spinal MRI. However, diffuse bone marrow infiltration may appear as normal due to its generalized and repetitive pattern involving the entire marrow spaces². In addition, age-dependent variabilities and marrow reconversion in response to physiological oxygen demands complicate the bone marrow interpretation^3,4. Therefore, it is often challenging for physicians to make decisions with bone marrow signals on MRI and to determine whether to proceed with further clinical and laboratory work ups.

A number of machine-learning based and automated imaging diagnoses and classification of diseases have been studied in the field of medical imaging^5,6,7,8. From a methodological point of view, features extracted from texture were used as inputs to the machine learning algorithm to build a decision-making model^6,7,8. Texture analysis provides quantitative means to describe tissue properties and pathological stages to reveal information that is often invisible to the human eyes⁹. Based on the diffuse and redundant nature of the bone marrow infiltration on MRI, diffuse bone marrow diseases are propitious candidates for texture analysis. A previous study demonstrated the feasibility of texture analysis in determining the treatment response to the multiple myeloma¹⁰. Another recent study attempted to differentiate normal from abnormal marrows in metastases patients using a machine learning algorithm with textural inputs¹¹. However, no attempt has been made to distinguish between normal and diffuse marrow infiltrative diseases using textural differences. We hypothesized that the machine-learning based algorithm with bone marrow textures as input would be able to discriminate the diffuse bone marrow infiltration from the normal bone marrows. Previously, a number of studies in the field of artificial intelligence had utilized support vector machine (SVM) as a classifier to differentiate non-medical images with various textures^12,13,14. Therefore, the purpose of our study was to construct a machine learning based algorithm using a SVM texture classifier and to isolate infiltration patterns suspicious of hematologic diseases on lumbar spine MRI (L-spine MRI). We built a SVM texture classifier model that was most suitable for marrow differentiation and compared its performance to experienced radiologists on the separate data set. In addition, we estimated a sample size required to reach target classification accuracy.

Results

Effects of SVM kernel types and feature dimensions on predictive performance

Figure 1 illustrates the effect of SVM kernels and feature dimensions on differentiating diseased marrows from the normal marrows. Overall, the classification accuracy and sensitivity were more affected by the choice of kernel types than they were on feature dimensions. The 3^rd order polynomial kernel produced the classification accuracy and sensitivity at the range of 60 to 80 percent depending on the feature dimension sizes. However, the training sensitivity only ranged from 40 to 50 percent (Fig. 1a). The tangent hyperbolic kernel produced varying accuracies and sensitivities between 60 to 80 percent, and the values did not increase linearly with the feature dimension sizes (Fig. 1b). The radial basis function kernel produced the accuracies and sensitivities at the range of 70 to 90 percent (Fig. 1c). The effect of feature dimension was minimal, and the results were consistent across all feature dimensions we tested. However, the feature dimension of 56,862 pixels produced the highest training sensitivity, and the difference between the training and test set results was minimal. We chose the radial basis function kernel and the feature dimension of 56,862 pixels to build the final model.

Predictive performance of SVM on differentiating diseased marrows from the normal marrows

Table 1 illustrates the overall predictive performance of our SVM classifier model with respect to increasing training set sizes. Overall, the predictive performance of the marrow differentiation gradually increased with respect to the number of training set sizes. When the training set size was 360, the classification accuracy, sensitivity and specificity of the training sets and were 82.8%, 81.7%, 83.9%, respectively, and AUC was 0.895 (P < 0.001).

Table 1 The two-class SVM results with varying sizes of the training sets.

Full size table

Comparison of performances and interobserver agreements between the SVM classifier and human readers using a separate data set

Table 2 illustrates the classification accuracy of SVM classifier and two independent readers for the same test set. There was no significant difference in accuracy, sensitivity and specificity between SVM and each reader. Figure 2 illustrates the ROC curves and AUCs of the SVM classifier, and two independent readers, respectively. There was no significant difference in AUCs between SVM and each reader.

Table 2 Classification accuracy, sensitivity and specificity of SVM, Reader 1 and Reader 2.

Full size table

Interobserver agreements between SVM and readers were moderate: κ = 0.425 with reader 1 and κ = 0.599 with reader 2. This was similar to interobserver agreements between the two readers (κ = 0.560). Benign marrow signal changes involving at least one level of vertebra were vertebral hemangioma (n = 19), Modic type change (n = 37), and fracture (n = 30). A multivariate analysis revealed that the factor associated with SVM classification results was fracture (P = 0.018), but not hemangioma (P = 0.283) or Modic type change (P = 0.872). Nine false positive and false negative cases occurred by the SVM classifier in the test phase. Among them, the five false positive cases exhibited diffusely and heterogeneously decreased bone marrow signal intensities, which may be regarded as red marrow hyperplasia. Moreover, 6 false negative cases were multiple myeloma with normal marrow patterns.

Predicting sample size required for classification

Figure 3 illustrates the change of classification accuracy and sensitivity with respect to the number of training samples and fitted curves to the inverse power law function. With the three coefficients calculated by curve-fit, it was estimated that the training samples more than 553 would be required to reach classification accuracy of 85% (Fig. 3a) and sensitivity of 91.4% (Fig. 3b). The SSE and RMSE values for predicting classification accuracy were close to zero, which indicated good fits to the learning curves. The three coefficients calculated by non-linear regression to the learning curve model and goodness-of-fit evaluation results are summarized in Supplementary Table S1.

Discussion

We constructed a SVM texture classifier model to discriminate diffuse infiltrative patterns suspicious of hematologic diseases, which require further clinical and laboratory work-up to make a final decision. Its predictive performance was comparable to experienced radiologists, which successfully demonstrated the feasibility of SVM texture classifiers to differentiate bone marrow with hematologic diseases from those without diseases. The use of intervertebral disks or skeletal muscle signals in T1-weighted images has been internal standards in clinical practice since the initial study reported that they have helped determine pathologic marrow infiltrations with high accuracy¹⁵. However, the challenge for diagnosing marrow infiltrative diseases has been attributed to the presence of an overlap between normal hypercellular marrow and diffuse marrow infiltrations¹⁶. Moreover, a considerable proportion of multiple myeloma and the most of MGUS have been deluded as normal marrow patterns^17,18. The majority of the falsely interpreted cases by the SVM classifier also falls under these challenges probably because the marrow signal was normalized by intervertebral disks. On the other hand, benign focal marrow lesions, such as vertebral hemangioma and Modic type changes, were not associated with the false interpretations. The association between fractures and SVM classifier was probably due to the fact that fracture is closely related to multiple myeloma in the diseased group^16,17. The learning curve model predicted that both accuracy and sensitivity would increase with more number of training samples. Therefore, the future study would be needed to examine whether the accuracy would improve further with a larger sample size and outperform human observers.

We experimented with various user-defined parameters to find the most optimal SVM classifier model. The effects of different kernel types and input dimensions were evaluated and the combination of a kernel and input dimension with the best performance was determined. The SVM classifier searches for an optimal separating hyperplane that maximizes the margin of the nearest data points. These subsets of the data points are called support vectors (SVs), which fall closest to the separating hyperplane. The operation of SVM for texture classification is two-fold; nonlinear mapping of a texture space into a high-dimensional feature space and construction of a separating hyperplane in the feature space. For nonlinear mapping, a kernel function is used to map a textural input space into a high dimensional feature space¹³. In this study, three different types of kernels were experimented to find which one was most suitable for marrow differentiation, including the 3^rd order polynomial kernel, tangent hyperbolic kernel and radial basis function kernel. Among the three kernels, the radial basis function kernel produced the highest accuracy and sensitivity. The radial basis function kernel projects the data into infinite dimensions to find a linear separation¹⁹. Unlike the two other kernels, it builds a non-parametric model, which means the complexity of the model can grow infinitely with the size of the data. If one has the unlimited data and very weak prior knowledge about the data, the non-parametric model is always better than the parametric model, which makes the radial basis function kernel a popular and a good default kernel for SVM²⁰.

For constructing a separate hyperplane in the features space, SVM is capable of using nonlinearly mapped input textures as features for classification. The textural input image is decomposed into a set of feature images using a bank of filters before classification is performed¹². A multiple number of channels corresponding to different filters are necessary to capture specific characteristics of the input textures, which makes filter selection a major issue to discriminate appropriate textural properties. In high-dimensional feature space, SVM searches for SVs, which plays a role of filters that capture critical measures from the input image, which are identified by the operation performed by a kernel function¹². Because SVM implicitly involves a process equivalent to feature selection, no additional feature selection was necessary. Therefore, we provided the gray-level pixels from the bone marrow images to SVM without additional user-defined feature selection methods. In general, a texture study follows the sequential steps of post-processing, feature extraction, feature selection, and classification²¹. A classification model confronts a risk of overfitting if too many features are included²², which makes feature selection an essential step in building a classification model^6,23. In this study, we avoided the issues with feature selection by preserving the textural information from the data as itself. Moreover, avoiding features selection substantially reduced time and effort to build the final model.

Since the raw pixels from the images were directly used as inputs to SVM, no spatial smoothing or further filtering process was involved in our study. Non-medical image classification using SVM had required additional modelling or filtering, such as a multiresolution simultaneous autoregressive model²⁴ or wavelet transform^25,26. In fact, none of these methods improved the overall accuracy, which could be attributed to irregular patterns of the bone marrow textures. In addition, our method was designed to accept heterogeneity of the T1-weighted images from multiple MR machines and acquisition parameters. Each scanner might involve its own and distinctive internal filtering operation before the final image is generated, but we were unable to control these vender-specific filters, nor did the internally smoothed images seem to affect much on our results. Although the images from a single vendor and a single scan protocol may have produced better results with less number of sample sizes, we tried to minimize the discrepancy by adjusting the signal intensity levels with respect to the disk signal of the same subject.

The limitations were the following. First, there was a bias on our population in that a major proportion of our study population was multiple myeloma among other hematologic diseases, and the cases with hypercellular marrow secondary to anemia, which is relatively frequent in clinical practice, were not included. However, these hematologic diseases share similar imaging features in T1-weighted images because the marrow signal intensity is determined by the relative composition of cellular and fat components. Our main objective was to diagnose bone marrow infiltration, not to distinguish each category separately. Therefore, this biased population might not have influenced on the overall results. Second, we sampled the disease positive and negative data from the separate cohorts in a case-control manner. This sampling pattern carries a risk of biases that the samples might not adequately reflect the spectrum in real clinical practice²⁷. Third, we did not consider demographic parameters. Age and gender are well-established factors accounting for heterogeneous signal intensities in the normal bone marrows^3,28. In particular, exclusion of females aged less than 40 and males less than 30 due to the concerns for hypercellular marrows may narrow the applicability. In the future study, a model regarding demographic factors should also be considered. Fourth, external validation was not performed. We split the data for training and validation of the SVM classifier model. However, external validation using geographically different data set is preferred to ensure generalizability²⁷. A method to control balance between false positives and false negatives should also be incorporated to increase and stabilize the sensitivity values²⁹. Finally, both accuracy and sensitivity could be improved by including multi-modal images as inputs to SVM classifier such as short T1 inversion recovery (STIR) images, dynamic contrast-enhanced images, chemical shift images and diffusion-weighted images, which are frequently used to help determine marrow disease status in conjunction with the T1-weighted images^30,31,32.

In summary, we introduced a machine learning method to differentiate diffuse marrow infiltrative diseases from the normal bone marrows based on the L-spine MRI. The SVM texture classifier model demonstrated comparable performance to experienced radiologists in isolating the marrows with hematologic diseases from the normal ones. In this respect, the SVM texture classifier has the potential to support physicians to determine whether the bone marrow signals suspicious of hematologic diseases would require further diagnostic work-ups.

Methods

Subjects

This retrospective study was approved by the institutional review board, and informed consent was waived. Figure 4 illustrates the flowchart of the subject inclusion criteria. The diseased and control cases were collected from the separate cohorts with different selection criteria, which were entirely based on the clinicopathological features without considering MR findings. For the diseased group, patients who visited the hematology department of our hospital and received L-spine MRI between March 2010 and June 2017 were searched in our PACS system (n = 1032). Among them, included were 273 patient cases from 256 patients (17 patients received MRI twice, one at the initial diagnosis and the other at relapse after complete remission) who met the following criteria: (1) confirmative diagnosis of having active hematologic disease based on the clinicopathological criteria of each disease category, (2) prior to initiation of the therapy in the first diagnosed patient, or re-initiation of the therapy in the relapsed patients. After reviewing 273 images, 31 were additionally excluded because bone marrow in their images were not appropriate for texture input due to combined diseases: severely collap sed, multilevel (>3 vertebral body segments) compression fracture (n = 19), extensive osteonecrosis (n = 7), spondylitis involving multiple levels (>3 vertebral body segments) (n = 5). The diseased group consisted of multiple myeloma (n = 159), leukemia (n = 32), lymphoma (n = 28), monoclonal gammopathy of unknown significance (MGUS) (n = 12), myelodysplastic syndrome (MDS) (n = 7), myelofibrosis (n = 3) and hypereosinophilia (n = 1).

For the control group, 350 cases were randomly selected from our PACS system among those who received L-spine MRIs between March 2010 and June 2017. Marrow cellularity usually depends on age and gender³³. It has been known that young men and middle-aged women have less than 50% of fat components on average³⁴. Therefore, the majority of people in this age group exhibited low bone marrow signal intensities on the T1-weighted MRI, which frequently leads to inconclusive interpretations. Because the aim of this study was to separate abnormal bone marrow signal intensities solely based on the texture information from MR imaging, females aged less than 40 and males less than 30 were excluded. Furthermore, the cases with history of malignancy or anemia, chronic diseases such as liver cirrhosis or chronic renal failure, and patients with transfusion history were excluded. The medical records and laboratory results longitudinally followed up for more than one year to confirm the absence of bone marrow pathology. Finally, 242 diseased cases (mean age: 60.3 ± 11.59, male: female = 131:111), and 225 control cases (mean age: 65.3 ± 12.41, male: female = 90:135) were included in this study. Among them, 180 cases were randomly selected for a training set from the diseased and control groups, respectively. The remaining cases were denoted as a test set. Consequently, 360 patients were assigned as a training group (mean age: 62.7 ± 11.76, male: female = 201:159, control: disease = 180:180), and 107 patients as a test group (mean age: 62.8 ± 13.77, male: female = 45:62, control: disease = 45:62).

MRI acquisition

MR images were acquired using multiple MRI vendors (Table 3). The T1-weighted images used in our study were heterogeneous in terms of manufacturers, model names, magnetic fields and scanning parameters.

Table 3 Summary of the sequence parameters for multiple MR vendors.

Full size table

Image post-processing

All algorithms for post-processing were written and executed using a MATLAB software package (MATLAB and Statistics Toolbox 2017a, The Mathwork, Inc, Natick, MA, USA). To compensate for signal heterogeneities, the acquired T1-weighted images were normalized by subtracting the whole pixels from the annulus fibrosus of nondegenerated intervertebral disk of the same subject. The intervertebral disk was separated into 5 regions with equal distance from anterior to posterior, and the first and last regions were regarded as annulus fibrosus. The disk-normalized marrows were segmented using a 3-dimensional GrowCut algorithm, which is a semi-automatic algorithm to segment the area of interest from multiple slices of an image³⁵. The sagittal T1-weighted images from multiple vendors and the processed images of the normal controls and patients with hematologic diseases can be found as Supplementary Fig. S1.

Data preparation for SVM model construction

The parameters used to construct a final SVM classifier model were experimented using LIBSVM version 3.21 (Library for Support Vector Machines, https://www.csie.ntu.edu.tw/~cjlin/libsvm/)³⁶. Figure 5 summarizes the study process from data preparation to classification using SVM. A rectangular window was created to cover the marrow signals of the vertebral body from a single subject. Since each individual had different shapes and sizes, a minimal window size was selected to encompass the smallest marrow among the subject population. From a single slice, 6 windows were created from S1 to L1 vertebral bodies, and the same numbers of windows were selected from the three and nine consecutive slices, respectively. Lastly, the 2-dimensional window array was reshaped into a 1-dimensional vector, each of which was concatenated to a single textural feature per subject.

Once the most optimal kernel function and feature dimension were determined, the final SVM classifier model was experimented on the varying numbers of training sizes from 180 to 360. For each size of the training set, training was repeated 30 times, each with different combinations of the training sets randomly selected from the training population. The pixel values within the feature matrices were scaled from 0 to 1. The grid-search method was employed to find two kernel parameters, the cost function C and gamma, γ, which identifies the pair of C and γ with the best internal cross-validation accuracy. The 5-fold cross validation was performed to the training set to estimate overall performance of the model. A final SVM classifier model was generated to the entire training set using the optimized parameters.

Comparison of predictive performances of SVM to human readers

The constructed SVM model was applied to the test set. Accuracy, sensitivity and specificity from the test set were regarded as the final outcomes. Two readers (S.K.L and J.Y.J with 3 and 10 years of experience in musculoskeletal radiology, respectively) blinded to clinical and laboratory results independently reviewed the test set. They determined the presence of hematologic diseases with five-level confidence scores: 0 = definitely absent, 1 = probably absent, 2 = equivocal, 3 = probably present, 4 = definitely present.

Sample size estimation using a learning curve

A learning curve model was employed to estimate target classification accuracy at a given number of the training set size. The curve model is represented as an inverse power law function, where the classification accuracy is expressed as a function of a training set size given unknown coefficients of a, b and c. The learning curve is modeled by the following equation¹:

$${\rm{y}}={\rm{f}}\,({\rm{x}};{\bf{a}},{\bf{b}},{\bf{c}})=(1-{\bf{a}})-{\bf{b}}{{\rm{x}}}^{{\bf{c}}}$$

(1)

where x is the training set size and y is the classification accuracy; a, b and c represent the minimum achievable error, learning rate and decay rate, respectively.

Using the observed classification accuracy at 13 different training sizes (0, 10, 20, 40, 60, 80, 100, 120, 140, 180, 240, 300, and 360), the unknown coefficients were estimated using a non-linear regression. In the MATLAB software, a function ‘fit’ was implemented using a Levenberg-Marquardt algorithm. The target sensitivity at the given training size was also estimated using the same equation.

Statistical Analysis

The classification accuracy, sensitivity, specificity and diagnostic accuracy for training phases were estimated using an area under the receiver operating characteristic (ROC) curve (AUC)³⁷. Sensitivity is the proportion of test positives among those who are truly diseased. Specificity is the proportion of test negatives among those who are not diseased. AUC is the measure of classification performance at various threshold settings, which tells the capability of the model in distinguishing different classes in the range of 0.5 and 1.

For the calculation of sensitivity, specificity, and interobserver agreements in human readers, 0–2 was regarded as negative, while 3–4 regarded as positive. Interobserver agreements (к) were calculated between SVM and readers. The κ values can be interpreted as poor (κ = 0), slight (κ = 0.0–0.2), fair (κ = 0.21–0.40), moderate (κ = 0.41–0.60), substantial (κ = 0.61–0.80), and almost perfect (κ = 0.81–1.00)³⁸.

Sensitivity, specificity and accuracy between the SVM classifier and human readers were compared by McNemar statistics. AUCs were compared between the SVM classifier and two readers³⁹. A multivariate logistic regression analysis was performed to estimate the influence of benign marrow signal changes including vertebral hemangioma, Modic type change, and fracture on classification results. P < 0.05 was considered significant for aforementioned statistics.

Finally, goodness-of-fit to the learning curve was evaluated using sum of squares due to error (SSE) and root mean squared error (RMSE). SSE measures the total deviation between the observed (y_i) and predicted accuracies (y). The weight (w_i) is the weighting applied to each data point and is usually w_i = 1 (Eq. 2). RMSE is the square root values of SSE divided by the residual degrees of freedom, which is defined as the number of data points (n) minus the number of fitted coefficients (m) (Eq. 3). SSE and RMSE values close to zero indicate a better fit. The curve-fitting and evaluation were separately performed for classification accuracy and sensitivity.

$${\rm{SSE}}=\,\sum _{i=1}^{n}wi{(yi-y)}^{2}$$

(2)

$${\rm{RMSE}}=\sqrt{\frac{SSE}{(n-m)}}$$

(3)

References

Chou, R. et al. Diagnosis and Treatment of Low Back Pain: A Joint Clinical Practice Guideline from the American College of Physicians and the American Pain Society. Ann Intern Med 147, 478, https://doi.org/10.7326/0003-4819-147-7-200710020-00006 (2007).
Article PubMed Google Scholar
Shah, L. M. & Hanrahan, C. J. MRI of Spinal Bone Marrow: Part 1, Techniques and Normal Age-Related Appearances. American Journal of Roentgenology 197, 1298–1308, https://doi.org/10.2214/ajr.11.7005 (2011).
Article PubMed Google Scholar
Ricci, C. et al. Normal age-related patterns of cellular and fatty bone marrow distribution in the axial skeleton: MR imaging study. Radiology 177, 83–88, https://doi.org/10.1148/radiology.177.1.2399343 (1990).
Article CAS PubMed Google Scholar
Navarro, S. M. et al. Musculoskeletal Imaging Findings of Hematologic Malignancies. RadioGraphics 37, 881–900, https://doi.org/10.1148/rg.2017160133 (2017).
Article PubMed Google Scholar
Park, Y. S. et al. Texture-Based Quantification of Pulmonary Emphysema on High-Resolution Computed Tomography: Comparison With Density-Based Quantification and Correlation With Pulmonary Function Test. Investigative Radiology 43, 395–402, https://doi.org/10.1097/rli.0b013e31816901c7 (2008).
Article PubMed Google Scholar
Juntu, J., Sijbers, J., De Backer, S., Rajan, J. & Van Dyck, D. Machine learning study of several classifiers trained with texture analysis features to differentiate benign from malignant soft-tissue tumors in T1-MRI images. Journal of Magnetic Resonance Imaging 31, 680–689, https://doi.org/10.1002/jmri.22095 (2010).
Article PubMed Google Scholar
Larroza, A. et al. Support vector machine classification of brain metastasis and radiation necrosis based on texture analysis in MRI. Journal of Magnetic Resonance Imaging 42, 1362–1368, https://doi.org/10.1002/jmri.24913 (2015).
Article PubMed Google Scholar
Mannil, M., von Spiczak, J., Manka, R. & Alkadhi, H. Texture Analysis and Machine Learning for Detecting Myocardial Infarction in Noncontrast Low-Dose Computed Tomography. Investigative Radiology 53, 338–343, https://doi.org/10.1097/rli.0000000000000448 (2018).
Article PubMed Google Scholar
Zhang, J., Yu, C., Jiang, G., Liu, W. & Tong, L. 3D texture analysis on MRI images of Alzheimer’s disease. Brain Imaging and Behavior 6, 61–69, https://doi.org/10.1007/s11682-011-9142-3 (2011).
Article Google Scholar
Zhou, C. et al. Quantitative Analysis of MR Imaging to Assess Treatment Response for Patients with Multiple Myeloma by Using Dynamic Intensity Entropy Transformation: A Preliminary Study. Radiology 278, 449–457, https://doi.org/10.1148/radiol.2015142804 (2016).
Article PubMed Google Scholar
Larhmam, M. A., Mahmoudi, S., Drisis, S. & Benjelloun, M. In Bioinformatics and Biomedical Engineering 198–211 (Springer International Publishing, 2018).
Li, S., Kwok, J. T., Zhu, H. & Wang, Y. Texture classification using the support vector machines. Pattern Recognition 36, 2883–2893, https://doi.org/10.1016/s0031-3203(03)00219-x (2003).
Article MATH Google Scholar
Kwang In, K., Keechul, J., Se Hyun, P. & Hang Joon, K. Support vector machines for texture classification. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 1542–1550, https://doi.org/10.1109/tpami.2002.1046177 (2002).
Article Google Scholar
Liu, Y., Peng, Y. & Zhou, X. In Lecture Notes in Computer Science 1042–1046 (Springer Berlin Heidelberg, 2006).
Carroll, K. W., Feller, J. F. & Tirman, P. F. J. Useful internal standards for distinguishing infiltrative marrow pathology from hematopoietic marrow at MRI. Journal of Magnetic Resonance Imaging 7, 394–398, https://doi.org/10.1002/jmri.1880070224 (1997).
Article CAS PubMed Google Scholar
Shigematsu, Y. et al. Distinguishing Imaging Features between Spinal Hyperplastic Hematopoietic Bone Marrow and Bone Metastasis. American Journal of Neuroradiology 35, 2013–2020, https://doi.org/10.3174/ajnr.a4012 (2014).
Article CAS PubMed Google Scholar
Koutoulidis, V. et al. Quantitative Diffusion-weighted Imaging of the Bone Marrow: An Adjunct Tool for the Diagnosis of a Diffuse MR Imaging Pattern in Patients with Multiple Myeloma. Radiology 282, 484–493, https://doi.org/10.1148/radiol.2016160363 (2017).
Article PubMed Google Scholar
Bauerle, T. et al. Multiple myeloma and monoclonal gammopathy of undetermined significance: importance of whole-body versus spinal MR imaging. Radiology 252, 477–485, https://doi.org/10.1148/radiol.2522081756 (2009).
Article PubMed Google Scholar
Chen, D. G., He, Q. & X. Z. Wang. On linear separability of data sets in feature space. Neurocomputing 70(13–15) 2441–2448 (2007).
Raissi, M. Parametric Gaussian Process Regression for Big Data, arXiv:1704.03144 [stat.ML] (2017).
Kassner, A. & Thornhill, R. E. Texture Analysis: A Review of Neurologic MR Imaging Applications. American Journal of Neuroradiology 31, 809–816, https://doi.org/10.3174/ajnr.a2061 (2010).
Article CAS PubMed Google Scholar
Saeys, Y., Inza, I. & Larranaga, P. A review of feature selection techniques in bioinformatics. Bioinformatics 23, 2507–2517, https://doi.org/10.1093/bioinformatics/btm344 (2007).
Article CAS PubMed Google Scholar
Zacharaki, E. I. et al. Classification of brain tumor type and grade using MRI texture and shape in a machine learning scheme. Magnetic Resonance in Medicine 62, 1609–1618, https://doi.org/10.1002/mrm.22147 (2009).
Article PubMed PubMed Central Google Scholar
Durgamahanthi, V., Rangaswami, R., Gomathy, C. & Victor, A. C. J. Texture Analysis Using Wavelet-Based Multiresolution Autoregressive Model: Application to Brain Cancer Histopathology. Journal of Medical Imaging and Health Informatics 7, 1188–1195, https://doi.org/10.1166/jmihi.2017.2255 (2017).
Article Google Scholar
Arivazhagan, S. & Ganesan, L. Texture classification using wavelet transform. Pattern Recognition Letters 24, 1513–1521, https://doi.org/10.1016/s0167-8655(02)00390-2 (2003).
Article MATH Google Scholar
Wang, Z. Z. & Yong, J. H. Texture analysis and classification with linear regression model based on wavelet transform. IEEE Trans Image Process 17, 1421–1430, https://doi.org/10.1109/TIP.2008.926150 (2008).
Article ADS MathSciNet PubMed Google Scholar
Park, S. H. & Han, K. Methodologic Guide for Evaluating Clinical Performance and Effect of Artificial Intelligence Technology for Medical Diagnosis and Prediction. Radiology 286, 800–809, https://doi.org/10.1148/radiol.2017171920 (2018).
Article PubMed Google Scholar
Schellinger, D. et al. Normal Lumbar Vertebrae: Anatomic, Age, and Sex Variance in Subjects at Proton MR Spectroscopy—Initial Experience. Radiology 215, 910–916, https://doi.org/10.1148/radiology.215.3.r00jn42910 (2000).
Article CAS PubMed Google Scholar
Garde, A., Voss, A., Caminal, P., Benito, S. & Giraldo, B. F. SVM-based feature selection to optimize sensitivity–specificity balance applied to weaning. Computers in Biology and Medicine 43, 533–540, https://doi.org/10.1016/j.compbiomed.2013.01.014 (2013).
Article PubMed Google Scholar
Rahmouni, A. et al. Bone Marrow with Diffuse Tumor Infiltration in Patients with Lymphoproliferative Diseases: Dynamic Gadolinium-enhanced MR Imaging. Radiology 229, 710–717, https://doi.org/10.1148/radiol.2293020748 (2003).
Article PubMed Google Scholar
Zajick, D. C., Morrison, W. B., Schweitzer, M. E., Parellada, J. A. & Carrino, J. A. Benign and Malignant Processes: Normal Values and Differentiation with Chemical Shift MR Imaging in Vertebral Marrow. Radiology 237, 590–596, https://doi.org/10.1148/radiol.2372040990 (2005).
Article PubMed Google Scholar
Padhani, A. R., van Ree, K., Collins, D. J., D’Sa, S. & Makris, A. Assessing the Relation Between Bone Marrow Signal Intensity and Apparent Diffusion Coefficient in Diffusion-Weighted MRI. American Journal of Roentgenology 200, 163–170, https://doi.org/10.2214/ajr.11.8185 (2013).
Article PubMed Google Scholar
Ishijima, H., Ishizaka, H., Horikoshi, H. & Sakurai, M. Water fraction of lumbar vertebral bone marrow estimated from chemical shift misregistration on MR imaging: normal variations with age and sex. AJR Am J Roentgenol 167, 355–358, https://doi.org/10.2214/ajr.167.2.8686603 (1996).
Article CAS PubMed Google Scholar
Liney, G. P., Bernard, C. P., Manton, D. J., Turnbull, L. W. & Langton, C. M. Age, gender, and skeletal variation in bone marrow composition: a preliminary study at 3.0 Tesla. J Magn Reson Imaging 26, 787–793, https://doi.org/10.1002/jmri.21072 (2007).
Article PubMed Google Scholar
Vladimir Vezhnevets, V. K. In Graphicon Vol. 1 150–156 (2005).
Chang, C.-C. & Lin, C.-J. LIBSVM. ACM Transactions on Intelligent Systems and Technology 2, 1–27, https://doi.org/10.1145/1961189.1961199 (2011).
Article ADS Google Scholar
Parmar, C., Grossmann, P., Bussink, J., Lambin, P. & Aerts, H. J. W. L. Machine Learning methods for Quantitative Radiomic Biomarkers. Scientific Reports 5, https://doi.org/10.1038/srep13087 (2015).
Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977).
Article CAS Google Scholar
DeLong, E. R., DeLong, D. M. & Clarke-Pearson, D. L. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44, 837–845 (1988).
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Radiology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
Eo-Jin Hwang, Joon-Yong Jung, Seul Ki Lee & Won-Hee Jee
Department of Hematology, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea
Sung-Eun Lee

Authors

Eo-Jin Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Joon-Yong Jung
View author publications
You can also search for this author in PubMed Google Scholar
Seul Ki Lee
View author publications
You can also search for this author in PubMed Google Scholar
Sung-Eun Lee
View author publications
You can also search for this author in PubMed Google Scholar
Won-Hee Jee
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.J. Hwang and J.Y. Jung designed the study, E.J. Hwang, J.Y. Jung, S.K. Lee, S.E. Lee and W.H. Jee participated in data acquisition, E.J. Hwang, J.Y. Jung and S.K. Lee analyzed the data, E.J. Hwang, J.Y. Jung and S.E. Lee wrote and edited the manuscript, and W.H. Jee provided advice on the analysis.

Corresponding author

Correspondence to Joon-Yong Jung.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary table and figure

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hwang, EJ., Jung, JY., Lee, S.K. et al. Machine Learning for Diagnosis of Hematologic Diseases in Magnetic Resonance Imaging of Lumbar Spines. Sci Rep 9, 6046 (2019). https://doi.org/10.1038/s41598-019-42579-y

Download citation

Received: 19 December 2018
Accepted: 03 April 2019
Published: 15 April 2019
DOI: https://doi.org/10.1038/s41598-019-42579-y

This article is cited by

Curation of myeloma observational study MALIMAR using XNAT: solving the challenges posed by real-world data
- Simon J. Doran
- Theo Barfoot
- Andrea Rockall
Insights into Imaging (2024)
Combined radiomics-clinical model to predict malignancy of vertebral compression fractures on CT
- Choong Guen Chee
- Min A Yoon
- Hye Won Chung
European Radiology (2021)
Artificial Intelligence based Models for Screening of Hematologic Malignancies using Cell Population Data
- Shabbir Syed-Abdul
- Rianda-Putra Firdani
- Erik Dovgan
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Segment anything in medical images

Microenvironmental reorganization in brain tumors following radiotherapy and recurrence revealed by hyperplexed immunofluorescence imaging

Best practices for single-cell analysis across modalities

Introduction

Results

Effects of SVM kernel types and feature dimensions on predictive performance

Predictive performance of SVM on differentiating diseased marrows from the normal marrows

Comparison of performances and interobserver agreements between the SVM classifier and human readers using a separate data set

Predicting sample size required for classification

Discussion

Methods

Subjects

MRI acquisition

Image post-processing

Data preparation for SVM model construction

Comparison of predictive performances of SVM to human readers

Sample size estimation using a learning curve

Statistical Analysis

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Supplementary table and figure

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Curation of myeloma observational study MALIMAR using XNAT: solving the challenges posed by real-world data

Combined radiomics-clinical model to predict malignancy of vertebral compression fractures on CT

Artificial Intelligence based Models for Screening of Hematologic Malignancies using Cell Population Data

Comments

Search

Quick links