Background & Summary

Low back pain (LBP) is the leading cause of disability worldwide, accounting for more years lived with disability than any other disease1. As a consequence, lumbar spine magnetic resonance imaging (MRI) for LBP is one of the most frequently performed procedures in musculoskeletal imaging2. In the United States, 93% of lumbar MRI referrals were appropriate according to the American College of Radiology guidelines, even though only 13% of the scans contributed to clinical decision making3. Automatic image analysis might be key to improving the diagnostic value of MRI by enabling more objective and quantitative image interpretation. A first step toward automatic assessment of lumbar spine MRI is segmentation of the relevant anatomical structures, such as the vertebrae, intervertebral discs (IVDs) and the spinal canal.

With recent advances in machine learning and artificial intelligence (AI), state-of-the-art spine segmentation algorithms are generally learning-based and require well-curated training data. The development of vertebra segmentation algorithms for CT images has benefitted considerably from multiple large, publicly available datasets with CT images and reference segmentations4,5. No comparably large, high-quality datasets are currently available for lumbar spine MRI. Existing datasets are either small, segment only the vertebral body6,7, or are annotated only in the midsagittal slice (2D)8,9. Moreover, most datasets cover just one of the anatomical structures relevant for assessing multifactorial disorders such as LBP, i.e., only the vertebrae10,11,12,13,14 or the IVDs15,16,17.

To advance the development of segmentation algorithms, and ultimately automatic image analysis, for lumbar spine MRI, this study has three primary goals:

  1. To present a large multi-center lumbar spine MR dataset with reference segmentations of vertebrae, IVDs and the spinal canal, combined with per-level radiological gradings.

  2. To introduce a continuous lumbar spine MRI segmentation challenge that allows algorithm developers to submit their models for evaluation.

  3. To provide reference performance metrics for two algorithms that segment all three spinal structures automatically: a baseline AI algorithm, which was used in the data collection process, and nnU-Net, a popular algorithm for 3D segmentation tasks for which training and inference code is publicly available.

Methods

Data collection

In total, 257 lumbar spine studies from patients with a history of LBP were retrospectively collected, with each study consisting of up to three MRI series. Of these, 39 patient studies, containing 97 MRI series, were sequestered for public benchmarking. The public data release described in this paper consists of 218 patient studies with 447 series. The study was approved by the institutional review board at Radboud University Medical Center (IRB 2016–2275). Informed consent was exempted, given the retrospective scientific use of deidentified MRI scans. Studies were collected from four different hospitals in the Netherlands: one university medical center (UMC), two regional hospitals and one orthopedic hospital (data acquired between January 2019 and March 2022). All involved hospitals signed either a data transfer agreement or a public data sharing form in which public sharing of the data under a CC-BY 4.0 license was disclosed.

Data originating from the UMC comprised all available lumbar spine MRI studies of patients presenting with (chronic) low back pain between January 2019 and November 2020 that included a T2 SPACE sequence. This sequence produces images with almost isotropic spatial resolution (voxel size: 0.90 × 0.47 × 0.47 mm). All studies also contained both a standard sagittal T1 and T2 sequence (voxel size: 3.30 × 0.59 × 0.59 mm). MRI studies were excluded only if the image quality was considered too low for fully manual segmentation (n = 4). Data originating from the other three hospitals were sets of consecutive lumbar spine MRI studies of patients presenting with (chronic) low back pain with at least a sagittal T1 or a sagittal T2 sequence. The voxel size of these images ranged from 3.15 × 0.24 × 0.24 mm to 9.63 × 1.06 × 1.23 mm. At each center, we included a fixed number of consecutive MRI studies that met these inclusion criteria. There were no other exclusion criteria, except that a number of studies across all four contributing centers were excluded from publication to serve as a hidden hold-out test set for an algorithm-development challenge (see the Segmentation data section for further details). Additional dataset characteristics are given in Table 1.

Table 1 Overview of the dataset.

Segmentation data

In all included MRI series, all visible vertebrae (excluding the sacrum), intervertebral discs, and the spinal canal were manually segmented. The segmentation was performed by a medical trainee who was trained and supervised by both a medical imaging expert (JG) and an experienced musculoskeletal radiologist (SB). Three-dimensional MRI annotation is a complex and laborious task, especially for the vertebral arch of the lumbar vertebrae. Therefore, we worked with an iterative data annotation approach in which our automatic baseline segmentation method (baseline 1: iterative instance segmentation) was trained with a small part of the dataset, enabling semi-automatic segmentation of the remaining images. During semi-automatic segmentation, the automatic method was used to obtain an initial segmentation, which was subsequently reviewed and manually corrected. This process was repeated several times by retraining the automatic segmentation model until the entire dataset was annotated.

Initially, twenty randomly selected high-resolution T2 (SPACE) series of the UMC data were manually annotated using 3D Slicer version 5.0.318. All structures were segmented in their entirety, which for the vertebrae also includes the vertebral arch, because the vertebral arch is essential in the diagnosis of disorders such as foraminal stenosis, facet joint arthrosis, and spondylolysis. The initial manual annotations were performed only on high-resolution series because the near-isotropic resolution enables detailed viewing in the sagittal, axial and coronal directions. Annotations of the corresponding standard sagittal T1 and T2 images were obtained by resampling the T2 SPACE segmentations to the resolution of the T1 and T2 images. The resampled segmentations were reviewed for misalignment due to patient movement between the acquisitions and corrected if needed. Image windowing was adjusted dynamically in 3D Slicer by the user to enhance the visibility of relevant structures. All twenty fully manually annotated MRI studies were segmented by a medical trainee and reviewed by JG.
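The resampling step described above can be sketched with nearest-neighbour interpolation, which preserves the integer segmentation labels instead of blending them. This is a minimal sketch using scipy; the actual workflow used 3D Slicer, and the function name is illustrative:

```python
import numpy as np
from scipy import ndimage

def resample_segmentation(seg, source_spacing, target_spacing):
    """Resample a label volume to a new voxel spacing.

    order=0 (nearest neighbour) is essential for segmentations:
    it keeps the original integer labels intact.
    """
    zoom_factors = np.asarray(source_spacing) / np.asarray(target_spacing)
    return ndimage.zoom(seg, zoom_factors, order=0)

# Toy example: downsample a fine grid to a grid with twice the voxel size.
seg = np.zeros((8, 8, 8), dtype=np.uint8)
seg[2:6, 2:6, 2:6] = 3  # a single structure with label 3
coarse = resample_segmentation(seg, source_spacing=(1, 1, 1),
                               target_spacing=(2, 2, 2))
print(coarse.shape)  # (4, 4, 4)
```

In the actual dataset the transfer went from the near-isotropic T2 SPACE grid to the much coarser sagittal T1/T2 grids, after which the result was reviewed for motion-induced misalignment.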

All other segmentations were created by first generating initial segmentations with the automatic segmentation method, trained on all data for which annotations were available at that point. The annotated portion of the dataset grew iteratively by repeating this process four times, each time adding the newly annotated data to the training data of the automatic method. In the first iteration, the algorithm was trained only on the twenty fully manually annotated studies to predict the segmentations of the remaining UMC data. In the following three iterations, a retrained version of the network generated segmentations for the data from each of the other hospitals. The predicted segmentations were reviewed and corrected manually, slice by slice, in 3D Slicer by JG. The diagram in Fig. 1 shows the iterative segmentation pipeline. The main benefit of this approach was that larger and easier structures did not require manual delineation, allowing the annotator to focus on smaller details and imperfections. The partial volume effect in the non-isotropic images was handled by viewing the annotations in all three directions and by using the smoothing functionalities in 3D Slicer when appropriate.

Fig. 1
figure 1

Diagram of the iterative annotation process.

All structures were given a separate segmentation label. The reference segmentations provided in this dataset are labelled from the bottom up, with the most caudal lumbar vertebra labelled as 1. This label should not be interpreted as an anatomical label, however, as this vertebra is not necessarily L5, the fifth lumbar vertebra, due to anatomical variations and irregular numbers of lumbar vertebrae. A lumbosacral transitional vertebra occurs in 35.5% of the population, and a sixth lumbar vertebra is present in 6.6%19. A larger field of view, axial MRI studies covering the complete lumbar spine, or additional imaging would be required to determine the anatomical labels accurately20. These were not available for the majority of studies in this dataset. The use of traditional clinical nomenclature was therefore considered infeasible due to the risk of mislabeling vertebrae.
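The bottom-up labelling convention can be illustrated as follows. This is a toy sketch, not the authors' code; it assumes axis 0 of the volume increases in the cranial direction, so the most caudal instance has the smallest mean coordinate along that axis:

```python
import numpy as np

def relabel_caudal_to_cranial(instance_mask, axis=0):
    """Relabel vertebra instances so the most caudal one gets label 1.

    Assumes coordinates along `axis` increase in the cranial direction
    (an illustrative convention, not necessarily the dataset's layout).
    """
    ids = [int(i) for i in np.unique(instance_mask) if i != 0]
    # Mean position of each instance along the cranio-caudal axis.
    centroids = {i: np.argwhere(instance_mask == i)[:, axis].mean() for i in ids}
    relabeled = np.zeros_like(instance_mask)
    for new_label, old_label in enumerate(sorted(ids, key=centroids.get), start=1):
        relabeled[instance_mask == old_label] = new_label
    return relabeled

# Two instances stored with arbitrary labels: 4 (caudal) and 7 (cranial).
mask = np.zeros((10, 4, 4), dtype=np.uint8)
mask[1:3] = 4   # caudal instance
mask[6:8] = 7   # cranial instance
out = relabel_caudal_to_cranial(mask)
print(np.unique(out[1:3]))  # the caudal instance becomes label 1
```

Whatever relabeling is applied, label 1 must still be read as "most caudal lumbar vertebra", not as an anatomical level such as L5.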

For the final evaluation of the algorithms, all data (n = 257) were divided into a training set (n = 179), a validation set (n = 39), and a test set (n = 39). The test set was removed from the publicly available dataset to allow for public benchmarking (https://spider.grand-challenge.org/) and to avoid overfitting, ensuring a fair comparison. This test set consists of 39 lumbar MRI studies of unique patients and includes 15 of the 20 fully manually annotated studies that were used in the iterative data annotation scheme. The remainder of the test set originates from the same four hospitals and was randomly selected with a distribution similar to that of the presented dataset. The 5 remaining fully manually annotated studies were placed in the validation set. The training and validation sets consist of 179 studies (82%) and 39 studies (18%), respectively. Series belonging to the same patient were always placed in the same set. The challenge is freely available to all users of the dataset, allowing other researchers to replicate the baseline results presented in this paper.

Radiological gradings

To allow users of this dataset to identify the healthier and the more diseased cases, the dataset includes radiological gradings at all IVD levels. We graded either the presence or the severity of the following degenerative changes, which can be observed on MRI and have a known or suspected relation to low back pain21:

  1. Modic changes (type I, II or III)

  2. Upper and lower endplate changes/Schmorl nodes (binary)

  3. Spondylolisthesis (binary)

  4. Disc herniation (binary)

  5. Disc narrowing (binary)

  6. Disc bulging (binary)

  7. Pfirrmann grade (grade 1 to 5)

All features were manually scored per IVD level by an expert musculoskeletal radiologist (MR, 26 years of experience).

Baseline 1: Iterative instance segmentation

By presenting this baseline algorithm, we establish a reference point for evaluating performance and give users insight into the algorithm employed in generating the dataset. This section summarizes the iterative instance segmentation (IIS) method. An automatic AI-based segmentation algorithm for vertebra segmentation14 was extended to also segment the IVDs and the spinal canal. The algorithm uses a 3D patch-based iterative scheme to segment one vertebra and its inferior IVD at a time, together with the portion of the spinal canal covered by the image patch. A schematic image of the network architecture is shown in Fig. 2.

Fig. 2
figure 2

Schematic drawing of the network architecture. The input image and both memory states are fed into the Spine U-net, which produces predictions for the segmentation of the vertebrae (red), IVD (yellow), spinal canal (blue), as well as anatomical and completeness predictions. The vertebra and IVD segmentations are added to their respective memory states, which will be used as input to the network in the next iteration (see the “iterative segmentation approach” section for more detail).

Instance memory

Because the MR volume is segmented by consecutively analyzing 3D patches, one vertebral level at a time, a method is needed to keep track of the segmentation progress. An instance memory volume saves the structures that have already been segmented and serves as an extra input channel that tells the network which structures can be ignored. In contrast to the original vertebra-focused method, we introduced separate memory state volumes for the vertebrae, the IVDs, and the spinal canal. The spinal canal memory state is used only to save the segmentation progress, not as an extra input for the network, because the spinal canal is an elongated structure that cannot be covered by a single patch. The network is therefore trained to always segment any visible portion of the spinal canal, and these portions are stitched together across all patches that pass through the network. In total, the network has three input channels: the two memory states and the corresponding image patch.
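As a sketch, the network input for one patch could be assembled as below. The channel ordering is our assumption; the text only specifies that the input consists of the image patch plus the vertebra and IVD memory states:

```python
import numpy as np

def assemble_input(image_patch, vertebra_memory_patch, ivd_memory_patch):
    """Stack the image patch with the two memory-state patches.

    The spinal-canal memory only records stitching progress and is
    deliberately not fed back into the network (see text).
    """
    assert image_patch.shape == vertebra_memory_patch.shape == ivd_memory_patch.shape
    return np.stack([image_patch, vertebra_memory_patch, ivd_memory_patch], axis=0)

patch_shape = (64, 192, 192)  # patch size used by the IIS baseline
x = assemble_input(np.zeros(patch_shape, dtype=np.float32),
                   np.zeros(patch_shape, dtype=np.float32),
                   np.zeros(patch_shape, dtype=np.float32))
print(x.shape)  # (3, 64, 192, 192)
```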

Network architecture

The segmentation approach is based on a single 3D U-net-like fully convolutional neural network. In contrast to the original vertebra segmentation algorithm14, a patch size of 64 × 192 × 192 voxels with a resolution of 2 × 0.6 × 0.6 mm was used, because the created dataset contains sagittal MR images exclusively. These generally have a higher slice thickness than the data used by Lessmann et al.14. This choice achieves a higher in-plane resolution of the predicted segmentation while still ensuring that a vertebra fits completely within one patch. The network has three output channels, one for each anatomical structure.

Iterative segmentation approach

The patch-based scheme is structured such that only relevant parts of the MR volume are processed. The patch systematically moves through the image until it finds a fragment of the first vertebra, which is always the lowest vertebra. Subsequently, the patch moves to the center of mass of that fragment, after which a new segmentation is made. This process continues until the vertebra's volume stabilizes, which means that the detected vertebra is completely visible within the patch. Binary masks of that vertebra, its underlying IVD, and the spinal canal are then added to their respective memory states. The same patch is segmented again with the updated memory states as input, which causes a fragment of the next vertebra to be segmented. This iterative process, illustrated in Fig. 3, continues until no more vertebra fragments are detected or the top of the MR volume is reached.
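The center-of-mass convergence step can be illustrated on a toy binary volume. This sketches the traversal logic only; the real method runs the network on each patch rather than cropping a precomputed mask, and all names are illustrative:

```python
import numpy as np

def converge_patch(structure_mask, patch_size, start_center, max_steps=20):
    """Move a patch to the center of mass of the detected fragment until the
    segmented volume inside the patch stops changing, i.e. the structure is
    fully contained in the patch."""
    center = np.asarray(start_center, dtype=float)
    half = np.asarray(patch_size) / 2.0
    prev_volume = -1
    for _ in range(max_steps):
        lo = np.maximum(np.round(center - half).astype(int), 0)
        hi = lo + np.asarray(patch_size)
        fragment = structure_mask[lo[0]:hi[0], lo[1]:hi[1], lo[2]:hi[2]]
        volume = int(fragment.sum())
        if volume == prev_volume:
            break  # converged: the structure is completely inside the patch
        prev_volume = volume
        coords = np.argwhere(fragment)
        if coords.size == 0:
            break  # nothing detected inside the patch
        # Center of mass of the fragment, in absolute volume coordinates.
        center = lo + coords.mean(axis=0)
    return center, prev_volume

# Toy "vertebra": a 10x10x10 cube; the patch starts off-center.
vol = np.zeros((64, 64, 64), dtype=bool)
vol[30:40, 30:40, 30:40] = True
_, captured = converge_patch(vol, patch_size=(20, 20, 20), start_center=(35, 25, 25))
print(captured)  # 1000: the whole cube ends up inside the patch
```

In the actual method, once the patch has converged on a vertebra, the masks are written to the memory states and the same patch position is segmented again to pick up a fragment of the next vertebra.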

Fig. 3
figure 3

Illustration of the iterative segmentation approach’s functionality. The process involves traversing the 3D patch (depicted by the light area with a green border) along the spine with alternating steps for segmentation of the vertebrae (shown in images 1, 3, and 5) and IVD and spinal canal (displayed in images 2, 4, and 6). The right-hand side displays the final automatic segmentation result alongside the reference segmentation.

Completeness and label prediction

The most cranial vertebra is often only partially visible within the field of view of the MR image. The segmentation method therefore includes an additional compression path after the compression path of the U-net, with a single binary output value that predicts whether a vertebra is complete. The original vertebra segmentation method also contained a similar compression path for predicting the anatomical label. This output was not used in our experiments, however, since no accurate anatomical labels regarding lumbosacral transitional vertebrae were available for our dataset.

Training of the algorithm

Preprocessing of the images consisted of resampling to a standard resolution of 2 × 0.6 × 0.6 mm and reorientation into axial slices. Standard data augmentation steps were applied, such as random elastic deformation, addition of random Gaussian noise, random Gaussian smoothing, and random cropping along the longitudinal axis. The loss function used during training consisted of three parts: (1) the segmentation error, defined as the weighted sum of false positives and false negatives combined with the binary cross-entropy loss; (2) the labeling error, defined as the absolute difference between the predicted label and the ground truth; and (3) the completeness classification error, defined as the binary cross-entropy between the true label and the predicted probability. For the final evaluation, the algorithm was trained on the training dataset while the validation set was used to monitor the training process.
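A minimal sketch of this three-part loss follows. The exact weights and reductions of the original method are not specified in the text, so `w_fp` and `w_fn` and the per-voxel averaging are illustrative assumptions:

```python
import numpy as np

def composite_loss(pred_seg, true_seg, pred_label, true_label,
                   pred_complete, true_complete, w_fp=1.0, w_fn=1.0):
    """Three-part training loss: segmentation + labeling + completeness (sketch)."""
    eps = 1e-7
    p = np.clip(pred_seg, eps, 1 - eps)
    # (1) Segmentation: weighted soft FP/FN terms plus binary cross-entropy.
    bce = -np.mean(true_seg * np.log(p) + (1 - true_seg) * np.log(1 - p))
    soft_fp = np.mean(p * (1 - true_seg))
    soft_fn = np.mean((1 - p) * true_seg)
    seg_loss = w_fp * soft_fp + w_fn * soft_fn + bce
    # (2) Labeling: absolute difference between predicted and true label.
    label_loss = abs(pred_label - true_label)
    # (3) Completeness: binary cross-entropy on the completeness output.
    pc = np.clip(pred_complete, eps, 1 - eps)
    complete_loss = -(true_complete * np.log(pc)
                      + (1 - true_complete) * np.log(1 - pc))
    return seg_loss + label_loss + complete_loss

true = np.zeros((4, 4, 4)); true[1:3, 1:3, 1:3] = 1.0
perfect = composite_loss(true, true, pred_label=3, true_label=3,
                         pred_complete=1.0, true_complete=1)
print(perfect < 1e-4)  # a perfect prediction yields (near-)zero loss
```

Note that, as stated above, the labeling output was not used in the experiments on this dataset; the term is included here only to mirror the three-part loss description.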

Baseline 2: nnU-Net

In addition to adapting a segmentation method that was specifically developed for vertebra segmentation, reference results for nnU-Net are provided. nnU-Net is a self-configuring, deep learning-based framework for medical image segmentation22. It has been widely accepted in the medical image analysis community as a state-of-the-art approach to 3D image segmentation tasks after winning the Medical Segmentation Decathlon23 and performing well in several other segmentation challenges. A 3D full resolution nnU-Net was trained on the training and validation datasets with 5-fold cross validation, which is its recommended training strategy22. Data pre-processing, network architecture and other training details were automatically determined by the nnU-Net framework. The network was trained on both the T1- and T2-weighted MRI series after which the overall performance was compared to the IIS baseline algorithm.

Evaluation

The segmentation performance was evaluated using two metrics: (1) the Dice coefficient (measured in 3D) to quantify volume overlap, and (2) the average absolute surface distance (ASD) as an indication of the segmentation accuracy along the surface of all structures. Both metrics were calculated separately for all individual structures and averaged per anatomical structure (vertebrae, IVDs, or spinal canal). Additionally, the average Dice coefficient and average ASD per MRI sequence (T1 vs. T2) were calculated for each anatomical structure. To ensure that the Dice score and ASD are not influenced by labeling differences, the individual structures of the reference segmentation were matched to the structures in the predicted segmentation based on the largest found overlap. The completeness classification performance was determined by the percentage of accurate predictions, as well as the average number of false positives and false negatives. Evaluation was performed on a sequestered test set, which is a subset of the presented dataset.
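Both metrics can be sketched as follows. This uses one common definition of the symmetric average absolute surface distance; the paper does not spell out the exact variant used:

```python
import numpy as np
from scipy import ndimage

def dice(a, b):
    """3D Dice overlap between two boolean masks."""
    return 2.0 * np.logical_and(a, b).sum() / (a.sum() + b.sum())

def surface_voxels(mask):
    """Boundary voxels: the mask minus its erosion."""
    return mask & ~ndimage.binary_erosion(mask)

def average_surface_distance(a, b, spacing=(1.0, 1.0, 1.0)):
    """Symmetric average absolute surface distance (in mm, given mm spacing)."""
    sa, sb = surface_voxels(a), surface_voxels(b)
    # Distance from every voxel to the nearest surface voxel of the other mask.
    d_to_b = ndimage.distance_transform_edt(~sb, sampling=spacing)
    d_to_a = ndimage.distance_transform_edt(~sa, sampling=spacing)
    return (d_to_b[sa].sum() + d_to_a[sb].sum()) / (sa.sum() + sb.sum())

ref = np.zeros((16, 16, 16), dtype=bool); ref[4:12, 4:12, 4:12] = True
print(dice(ref, ref), average_surface_distance(ref, ref))  # 1.0 0.0
```

In the actual evaluation these metrics are computed per matched structure pair, after reference and predicted instances have been matched by largest overlap.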

Data Records

The complete dataset can be found at https://doi.org/10.5281/zenodo.10159290 and is available under the CC-BY 4.0 license24. The assignment of MRI studies to the training and validation sets is given in the overview file. This file also provides the biological sex of all patients and, where available, their age, as well as a number of scanner and acquisition parameters for each individual MRI study. The radiological gradings are provided in a separate file, per IVD level.

To generate this dataset, a total of 218 lumbar MRI studies of patients with low back pain were included. Each study consisted of up to three sagittal MRI series, either T1-weighted or T2-weighted (regular resolution, or high resolution generated using a SPACE sequence), for a total of 447 series. Of all included patients, 63% were female. A total of 3125 vertebrae, 3147 IVDs, and 447 spinal canal segmentations were included over all series combined. An overview of the complete dataset divided by the different hospitals is shown in Table 1. An overview of the training and validation sets and all included structures is shown in Table 2. The radiological gradings per IVD level and per patient are summarized in Table 3.

Table 2 Overview of the distribution of data between the training and validation set.
Table 3 Overview of the radiological gradings per intervertebral disc (IVD) level and per patient for the training and validation set.

All MR images and their corresponding segmentation masks are stored in MHA format in separate directories. An image and its corresponding mask share the same file name, which is a combination of the MRI study identifier and the specific sequence type (T1, T2, or T2 SPACE). Note that all MRI series from the same MRI study share the same identifier.
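Under this layout, images and masks can be paired by file name. The directory names below are hypothetical; only the shared-name convention comes from the dataset description:

```python
from pathlib import Path

def pair_images_and_masks(image_dir, mask_dir):
    """Match each MR image to its segmentation mask via the shared file name."""
    pairs = {}
    for image_path in sorted(Path(image_dir).glob("*.mha")):
        mask_path = Path(mask_dir) / image_path.name
        if mask_path.exists():
            pairs[image_path.name] = (image_path, mask_path)
    return pairs
```

Because all series of one study share the study identifier, any custom data split should group files by that identifier so series of the same patient do not leak across sets.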

Technical Validation

The performance of the IIS baseline algorithm, which was used to generate initial segmentation masks of unseen images from the dataset, was assessed on a hidden test set. These results are presented to assess the data-annotation strategy, as well as to establish a reference performance for users of the dataset. The results for the different structures and sequences are shown in Table 4. The overall mean (SD) Dice score was 0.93 (±0.05), 0.85 (±0.10) and 0.92 (±0.04) for the vertebrae, IVDs and spinal canal, respectively. The overall mean (SD) ASD was 0.49 mm (±0.95 mm), 0.53 mm (±0.46 mm) and 0.39 mm (±0.45 mm) for the vertebrae, IVDs and spinal canal, respectively. The spinal canal was identified in all scans. One of the 656 vertebrae and nine of the 688 IVDs (three in T1 images and six in T2 images) were not found. The completeness prediction was correct for 650 of the 656 vertebrae (99.1%). An nnU-Net was trained on the same training data to enable a comparison between the IIS baseline algorithm and the nnU-Net baseline. The results of both networks are displayed in Table 5. Figure 4 shows a collection of segmentations obtained by both networks.

Table 4 Overview of all results of the IIS baseline algorithm.
Table 5 Comparison between the iterative segmentation algorithm and nnU-Net.
Fig. 4
figure 4

Examples of cases segmented by both baseline algorithms. Each column represents one case. Column (A) shows a case without any major pathologies and without significant segmentation errors. Columns (B and C) show cases where nnU-Net (B) and the IIS baseline algorithm (C) made mistakes, indicated by a white arrow. Column (D) shows a case with severe degenerative features present and no substantial segmentation errors.

The IIS model, which was used in the iterative annotation of the dataset, demonstrates strong performance on our dataset, comparable to other MR segmentation methods in the literature7,25,26. The performance of the IIS model was validated and benchmarked against a second baseline algorithm. nnU-Net was chosen because of its widely acknowledged status as the current gold standard in medical image segmentation. The results of the two algorithms are nearly identical, indicating that the IIS baseline model was an accurate tool in the iterative data annotation workflow and is a reasonable benchmark for comparison.

The iterative data annotation approach proved to be an effective strategy. One strength of this approach is its ability to improve the quality of the dataset over time by incorporating corrections of the segmentation predictions into the training data, which reduces errors and increases accuracy in subsequent iterations. The approach was also faster and more efficient than fully manual annotation. However, several limitations should be addressed. Firstly, the iterative process of training the network on a small dataset, generating segmentation predictions on unseen images, and manually correcting the predictions before adding them to the dataset can introduce bias in the final dataset. This strategy was nevertheless chosen to shorten the time needed for manual annotation and thus enable the creation of a larger dataset. A key advantage was that larger and simpler areas did not need manual delineation, although more intricate structures such as the vertebral lamina, the annulus of the IVDs, and the epidural fat still regularly demanded manual corrections. On average, a fully manual segmentation took approximately five hours, whereas correcting a predicted segmentation took approximately one hour. Secondly, the use of only high-resolution T2 series for the initial manual annotation may not be representative of the entire population, as it is limited to patients from one hospital who underwent this specific imaging sequence.

In the era of machine learning and AI, lumbar spine segmentation can serve as the basis for automated, accurate lumbar spine MR analysis, assisting clinical radiologists and imaging-minded spinal surgeons in their daily practice. It can generate robust, quantitative MR results that serve as inputs to larger models of lumbar spine disease in clinical practice and research settings. The availability of public datasets and benchmarks plays a crucial role in advancing the field. While datasets exist for CT vertebra segmentation, such as VerSe, the largest available vertebra segmentation dataset27, no comparable public datasets for MRI spine segmentation are currently available. Our dataset is of similar size to VerSe27 and provides full segmentation of all relevant spinal structures on MR images. This enables wider participation and collaboration in the field of spine segmentation, as the dataset can be used to train and evaluate algorithms and to compare them against algorithms trained on other datasets. The presented algorithms provide the baseline results to which other algorithms can be compared.

Usage Notes

In order to allow for a fair comparison between different algorithms, including both baseline algorithms, a public segmentation challenge is hosted on the Grand Challenge platform and can be found at https://spider.grand-challenge.org/. Algorithms are evaluated on a sequestered test set which contains 97 MRI series of 39 unique patients.