AI-assisted quantification of hypothalamic atrophy in amyotrophic lateral sclerosis by convolutional neural network-based automatic segmentation

Vernikouskaya, Ina; Müller, Hans-Peter; Roselli, Francesco; Ludolph, Albert C.; Kassubek, Jan; Rasche, Volker

doi:10.1038/s41598-023-48649-6

Download PDF

Article
Open access
Published: 06 December 2023

AI-assisted quantification of hypothalamic atrophy in amyotrophic lateral sclerosis by convolutional neural network-based automatic segmentation

Ina Vernikouskaya¹^na1,
Hans-Peter Müller²^na1,
Francesco Roselli^2,3,
Albert C. Ludolph^2,3,
Jan Kassubek^2,3^na2 &
…
Volker Rasche^1,4^na2

Scientific Reports volume 13, Article number: 21505 (2023) Cite this article

806 Accesses
1 Citations
11 Altmetric
Metrics details

Subjects

Abstract

The hypothalamus is a small structure of the brain with an essential role in metabolic homeostasis, sleep regulation, and body temperature control. Some neurodegenerative diseases such as amyotrophic lateral sclerosis (ALS) and dementia syndromes are reported to be related to hypothalamic volume alterations. Despite its crucial role in human body regulation, neuroimaging studies of this structure are rather scarce due to work-intensive operator-dependent manual delineations from MRI and lack of automated segmentation tools. In this study we present a fully automatic approach based on deep convolutional neural networks (CNN) for hypothalamic segmentation and volume quantification. We applied CNN of U-Net architecture with EfficientNetB0 backbone to allow for accurate automatic hypothalamic segmentation in seconds on a GPU. We further applied our approach for the quantification of the normalized hypothalamic volumes to a large neuroimaging dataset of 432 ALS patients and 112 healthy controls (without the ground truth labels). Using the automated volumetric analysis, we could reproduce hypothalamic atrophy findings associated with ALS by detecting significant volume differences between ALS patients and controls at the group level. In conclusion, a fast and unbiased AI-assisted hypothalamic quantification method is introduced in this study (whose acceptance rate based on the outlier removal strategy was estimated to be above 95%) and made publicly available for researchers interested in the conduction of hypothalamus studies at a large scale.

Virtual reality-empowered deep-learning analysis of brain cells

Article Open access 22 April 2024

Native-state proteomics of Parvalbumin interneurons identifies unique molecular signatures and vulnerabilities to early Alzheimer’s pathology

Article Open access 01 April 2024

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

Article 07 December 2020

Introduction

The hypothalamus has a crucial role in the regulation of the human body, involved in metabolic¹, neuroendocrine², immune³, and cardiovascular activity⁴. Detailed hypothalamic imaging has become of major interest to better characterize disease-associated clinical abnormalities including metabolism⁵. Several neurodegenerative diseases such as frontotemporal dementia (FTD)⁶ and Huntington disease (HD)⁷ are assumed to be related to hypothalamic volume atrophy. Specifically in amyotrophic lateral sclerosis (ALS), previous studies from different groups indicated that the total volume of the hypothalamus is substantially reduced in patients with ALS compared with controls^8,9,10. ALS is traditionally conceptualized as a neurodegenerative disease affecting primarily the motor neurons whose degeneration is responsible for the severe motor phenotype of ALS. However, within the additional, non-motor symptoms of ALS with substantial impact on patient well-being and overall survival, proofs are available of a substantial hypermetabolic phenotype which predates, accompanies, and influences the clinical onset of ALS¹¹. The cause of the hypermetabolic state in ALS has been subject to several mechanistic investigations, including the demonstration of an altered hypothalamic physiology^8,9,10. The quantification of hypothalamic atrophy is thus an in vivo measure for the neuroimaging phenotype of neurodegenerative diseases like ALS and might be used for correlation analyses e.g., with the individual metabolic characteristics.

Due to the low size of the hypothalamus, the volumetric analysis in high resolution magnetic resonance imaging (MRI) is challenging. Due to limited image contrast in the vicinity of the hypothalamus, morphological landmarks require experience by the rater to be exactly determined in manual segmentation from MRI. As a consequence, results of different studies regarding hypothalamic volumetry show high variability during manual segmentation^12,13. Moreover, manual segmentation is a very time-consuming and tedious procedure. Therefore, there is a need in neuroimaging for a reliable and unbiased technique to perform reproducible hypothalamic segmentation and volumetric analysis of large datasets, with a minimum of human intervention.

The success of deep learning methods in image classification has extended their use to solve more complex tasks including semantic segmentation¹⁴, which is the task of labeling pixels with a corresponding class of what is being represented. While there have been previous attempts at segmentation tasks, it was not until Ronneberger et al.¹⁵ with U-Net that a significant improvement in biomedical image segmentation performance was achieved¹⁶. The network is based on fully convolutional network (FCN) which consists of a contracting path (encoder) constituted by the general convolutional process to capture context and a symmetric expanding path (decoder) constituted by transposed convolutional layers that enables precise localization. Trained end-to-end from very few images, it outperforms the previously best methods and represents the state-of-the-art class of methods in terms of segmentation accuracy¹⁵. Because of its simplicity and effectiveness, U-Net has been widely adopted within the medical imaging community, improving the originally fully-convolutional network approach¹⁷. Since its inception in 2015, U-Net has seen many advancements in its architecture, e.g., U-Net architecture can be built with many different styles of backbone which is the architectural element which defines how the layers are arranged in the encoder network and determines how the decoder network should be built. Different classification networks as the backbone of the semantic segmentation network may show different performance¹⁸. There are several state-of-the-art pre-trained networks widely explored in the literature. Some famous examples in computer science applications are VGG16, ResNet50, Inceptionv3, and EfficientNetB0. VGG, ResNet, and Inception families are fundamental deep learning backbones already used for years for different tasks achieved excellent backbone-building performance¹⁸. EfficientNet networks are a recent family of architectures that have been shown to significantly outperform other networks in classification tasks while having fewer parameters¹⁹ and have been explored for medical image segmentation as an encoder²⁰.

However, very few deep-learning based methods are available in the literature for hypothalamus segmentation on T1-weighted MR images. First, Rodrigues et al.²¹ implemented a fully automatic method based on max-tree to detect a bounding box around the hypothalamus in axial, sagittal, and coronal MR images and convolutional neural networks (CNNs) of 2D U-Net architecture to segment the hypothalamus within the detected region in each of three views with subsequent creation of a consensus from all three models’ outputs in order to help eliminate false positives. Their consensus model achieved a Dice coefficient of 0.77. Later, Billot et al.²² used a 3-D U-Net-based architecture with aggressive data augmentation to segment the hypothalamus and its subunits from one dataset with a Dice coefficient of 0.83 for the whole hypothalamus. Finally, Rodrigues et al.²³ provided the first public benchmark composed of a diverse annotated dataset and achieved a Dice coefficient of 0.83 with the Teacher-Student-based model composed of modified EfficientNetB4 architectures for segmentation and correction.

The rationale of the current study is to present a fully automated approach to segment the hypothalamus on T1-weighted MRI scans and, based on this segmentation, to perform volumetric analyses of the hypothalamus in a large sample of healthy controls and in ALS patients to identify volume differences at the group level, i.e. atrophy associated with ALS. The method relies on application of CNN to segment both hypothalamus and intracranial volume (ICV) for hypothalamic volume normalization. The final validation of the AI-based segmentation was performed by comparison with volumetric results of an established manual delineation procedure, which has been used in previous studies and already obtained a high level of reproducibility in some hundred sporadic ALS cases and controls⁹.

Results

Performance of neural network hypothalamic segmentation

Figure 1 shows the comparison of segmentations predicted by four different models with ground truth segmentations overlaid on the MR images in five exemplary coronal slices of a single control test dataset. Visual inspection of the automated segmentations shows that the overall anatomy of the hypothalamus is well learned by all networks. Disagreements between the ground truth and predicted segmentations were observed at the edge slices (anterior, posterior) where false pixels were predicted by all networks to some extent.

Table 1 summarizes the comparison of performance of hypothalamic segmentation by four investigated network architectures in the test dataset. All investigated network architectures achieved similar performance in terms of intersection over union (IoU) metric which measures the amount of overlapping between prediction and ground truth. The highest value was achieved for EfficientNetB0 (0.88). High Recall values for all models indicated that a high fraction of pixels that should be predicted as hypothalamus was also predicted as hypothalamus. The highest Precision, i.e., most of the pixels predicted as hypothalamus were true predictions, was achieved with EfficientNetB0 (0.87). Dice similarity coefficient was also highest for EfficientNetB0 (0.87) balancing Precision and Recall better than other models. The fastest prediction per image was achieved with EfficientNetB0, permitting segmentation of the whole hypothalamus (50 slices) in 1.43 s on a GPU.

Table 1 Comparison of performance of hypothalamic segmentation by four investigated network architectures in the test dataset.

Full size table

No significant difference between the ground truth volume and the volume segmented by EfficientNetB0 was observed, whereas all other networks significantly overestimated the segmented volumes.

Statistical results were confirmed by Bland–Altman plots (Fig. 2). The minimal mean difference between prediction and ground truth among the investigated networks was achieved with EfficientNetB0 (Fig. 2a) which only slightly underestimated the hypothalamus volumes (mean difference of − 0.01 cm³). The data points are equally distributed within a rather narrow range of the limits of agreement ([− 0.14, 0.11] cm³). Inceptionv3 consistently overestimated the hypothalamic volume (mean difference of 0.14 cm³ with 95% limits of agreement [0.0, 0.28] cm³) (Fig. 2b). ResNet50 also tends to overestimate the volumes (confidence interval of the mean difference above zero line) (Fig. 2c). Although the confidence interval on the mean difference for VGG16 contains the zero line (Fig. 2d), the range of limits of agreement is the largest ([− 0.19, 0.26] cm³), and an outlier falling outside of the confidence interval of − 1.96 SD limit of agreement is observed in these data, which may significantly impact the results by erroneously shifting the mean difference towards zero line.

Moreover, EfficientNetB0 has a much lower number of parameters, meaning faster training and lighter model.

Volumetric analysis in the test group

After being re-trained on augmented data including images with limited contrast, EfficientNetB0 model achieved a Dice coefficient of 0.84 ± 0.03 in ALS and 0.86 ± 0.02 in controls. 95% HD was 0.82 ± 0.39 mm in ALS and 0.70 ± 0.38 mm in controls.

Non-significant differences in calculated hypothalamic volumes between prediction and ground truth were observed both in ALS patients (0.80 ± 0.10 cm³ vs. 0.77 ± 0.09 cm³ in ground truth, p = 0.15) and controls (0.86 ± 0.07 cm³ vs. 0.86 ± 0.08 cm³ in ground truth, p = 0.76). Fig. 3a demonstrates qualitative results of hypothalamus segmentation in control and ALS patient as compared to ground truth.

To allow adjustments of the hypothalamic volumes between subjects with different head sizes, hypothalamic volumes in the test group were further normalized to intracranial volumes (ICV) (i.e., the sum of gray matter, white matter, and cerebrospinal fluid), which were automatically segmented using the same neural network approach. During generation of the training data for the ICV segmentation, the two-threshold technique did not work perfectly due to intensity inhomogeneities in the whole head image so that random errors at the borders (such as partial overlap with other brain structures) could appear (Fig. 3b as an example). Due to the intrinsic properties of the CNN approach, these non-associated random errors were not “learned” by the algorithm and the predicted volumes do not show these effects anymore.

Normalized to the ICV (Fig. 4a), significant differences in hypothalamic volumes (− 10%, p = 0.0011) could be obtained between the ALS (775 ± 62 mm³) and control group (863 ± 66 mm³). Respective average volumes of manually segmented hypothalamus normalized to ICV comprised 870 ± 96 mm³ (controls) and 750 ± 66 mm³ (ALS) (ALS: − 14% vs. controls). Thus, differences between ground truth and automatically segmented hypothalamic volumes comprised 3% on average in the ALS group and did not exceed 1% in controls.

Hypothalamic analysis in the group comparison

EfficinetNetB0 model trained without data augmentation could not perform any segmentation in 57 ALS cases and 2 control cases due to limited contrast of MRI scans, whereas the model trained with augmented data (with modified contrast distribution) in the training dataset could deliver segmentation results in 100% of cases.

According to box-whiskers plot for all data included in our dataset, three outliers above the upper limit of agreement in the ALS group were removed because the segmented ICV volume was highly underestimated (781 ± 191 cm³) in comparison to average ICV volume (1481 ± 165 cm³) in this group (p = 0.035). For 18 outliers below the lower limit of agreement in ALS group, the hypothalamus was significantly under-segmented (0.48 ± 0.16 cm³) as compared to average hypothalamus segmentation in this group (0.78 ± 0.11 cm³, p < 0.0005). In the control group, lower hypothalamus volumes (0.58 ± 0.09 cm³ vs. average 0.84 ± 0.09 cm³, p = 0.033) in combination with large ICV volumes (1714 ± 75 cm³) were detected as outliers after normalization in three cases; in two cases the ICV volume was underestimated (1258 ± 33 cm³ vs. average 1532 ± 139 cm³, p = 0.036), leading to outliers above the upper limit after normalization. With outliers, average hypothalamic volume in ALS group was calculated to be 812 ± 127 mm³ and in controls 847 ± 99 mm³ (ALS: − 4% vs. controls, p = 0.0009).

After outlier removal (Fig. 4b), a significant reduction in hypothalamic volume could still be achieved in the ALS group (823 ± 84 mm³) as compared to controls (852 ± 77 mm³) (ALS: − 3% vs. controls, p = 0.002).

In conclusion, based on the outlier removal strategy, the acceptance rate of the proposed automatic approach for hypothalamus segmentation with consecutive brain normalization was estimated to be above 95%.

Discussion

Hypothalamic atrophy, together with alterations in hypothalamic peptides controlling energy metabolism, is known to be associated with metabolic derangements in ALS²⁴. Given that this hypothalamic involvement can, like metabolic alterations in general, be regarded as a potential treatment target in ALS⁸, hypothalamus volume quantification might be developed as an imaging-based biological marker by unbiased, time-efficient approaches. However, segmenting the hypothalamus is very challenging due to its low size and low contrast in its vicinity where it is surrounded by grey matter structures. Although a growing number of neuroimaging studies in the literature aims to assess volume alterations of the hypothalamus^6,7,25,26,27, only few have focused on implementing an automated method to reduce human variability and enhance studies’ robustness²³.

This study has to be considered to include several strengths. In the first line, we have presented an artificial intelligence-based approach to automatically segment the hypothalamus (and the intracranial volume) in T1-weighted MR images of the brain by use of convolutional neural networks of U-Net architecture. U-Net has been developed primarily for image segmentation tasks and obtained a high utility within the medical imaging community. There are several variants which adopt an encoder-decoder architecture of U-Net, aiming to improve performance compared to the original fully-convolutional network approach. In this study, a comparison is performed using four significant U-Net variants on the same dataset to observe an effect on segmentation performance as well as trade-offs with respect to computational time and complexity.

Most data-driven methods are very susceptible to data variability: this challenge is especially apparent when applying deep learning to brain MRI, where intensities and contrasts vary due to acquisition protocol, scanner-, and center-specific factors²⁸. Our data originated from the same center and had been acquired at the same scanner, but were heterogenous in terms of protocol, software release, and operator, since acquired over several years. Thus, our algorithm was complemented by data augmentation to implicitly regularize our trained network by making it robust against contrast variations and increasing generalization at inference. Our segmentation approach permitted extremely fast hypothalamus segmentation at inference (in less than 1.5 s on GPU) as compared to even semi-automated approaches requiring 20–40 min processing time per hypothalamus²⁹. With the Dice coefficients of 0.86 ± 0.02 in controls and 0.84 ± 0.03 in ALS patients, the state-of-the-art automatic segmentation methods presented in the literature (0.77 from²¹ and 0.83 from^22,23, respectively) could be improved. The predicted differences between ALS patients and controls were about 10% in the test group, whereas differences between manual and AI-based segmentation did not exceed 3% in this group.

Finally, we performed a volumetric analysis of the hypothalamus normalized to ICV in ALS patients vs. controls in a large neuroimaging dataset as a typical scenario for which our approach was intentionally developed for. Basically, our results were in line with previous independent MRI studies from different groups, reporting hypothalamic atrophy in ALS^9,10,30, recently confirmed in a neuropathological study³¹. This significant hypothalamic volume reduction in ALS in comparison to controls at the group level could be confirmed in the current dataset. The predicted differences between ALS and controls in the large dataset comprised 3%. This value is highly significant, however, lower than previously reported^9,31. This accuracy is sufficient for studies at the group level; however, in order to obtain robust clinical results at the individual level e.g., in clinical diagnostic procedures of a given disorder, further improvements should be necessary.

This study also has to be regarded in the context of its limitations. It might be regarded as a limitation that the technique did not work in all cases. Inaccuracy in ICV volume prediction of 5% leads to a maximum inaccuracy in hypothalamus volume calculation of about 10%. This value approaches the expected difference in the hypothalamic volume between ALS and healthy controls⁹. Therefore, outliers identified in predicted ICV volumes which fell out of the 5% range of average ICV volumes were considered to be outliers and were removed from the analysis, simultaneously defining the acceptance rate of AI-based approach. So, automatic segmentation of hypothalamus and/or ICV was rejected in 5 controls (out of 112) and 21 ALS (out of 432) cases, resulting in a rejection rate of the method below 5%. We identified as main reason for failure in hypothalamus segmentation the distortions of the MR images during pre-processing or due to breathing-related motion artifacts, especially in patients with high disease burden. This challenge can be tackled in the future by augmenting the training dataset with spatially deformed images. ICV segmentation failed in MR volumes with reduced contrast since such data were not part of the training. Generally, the challenge of contrast variation in MR images can, alternatively to the augmentation approach applied here, be addressed with z-score normalization of the data without the need for data augmentation. Better contrast can be obtained at a 3.0 T MRI scanner, potentially resulting in an improvement of the accuracy of segmentation. However, for the current study, some hundred T1-weighted MRI scans of MND patients as part of the standard clinical MRI protocol were available at 1.5 T. Future studies with the use of 3 T (or higher) should be performed. A final limitation of the current study is the use of data from a single imaging center which can lead to performance losses when predicting images from different datasets than those used in the training.

In conclusion, we present an AI-based technique for automated hypothalamus segmentation and volumetric analysis to be performed in an unbiased, reproducible manner and at a large scale. We applied this technique to study hypothalamic atrophy associated with ALS at the group level. Future work will focus on extending this automated analysis to applications to other neurological diseases, such as dementia syndromes (like FTD or Alzheimer’s disease), Huntington disease or Parkinson’s disease. To encourage other researchers to reproduce our results on their own datasets, we provide the source code as well as the trained models on GitHub: https://github.com/vernikouskaya/hypothalamus_segmentation.

Material and methods

Ethical approval

The ethics application includes the recording and the analysis of MRI data, irrespective of the analysis technique; no additional MRI scans have been performed for the current study. Previous studies on the analysis of MRI data have already been performed (^9,32) and have been approved by the Ethics Committee of the University of Ulm (references #19/12 and #20/12) in accordance with the ethical standards laid down in the 1964 Declaration of Helsinki and its later amendments. Written informed consent was obtained from all individual participants included in the study.

MRI dataset

Six-hundred-and-sixty-four T1-weighted whole head MRI datasets were acquired on a 1.5 T MRI scanner (Symphony, Siemens Medical, Erlangen, Germany) (Table 2A). Morphological data were obtained with a MPRAGE sequence (144 sagittal slices, no gap, 1.0 × 1.2 × 1.0 mm³ voxels, 256 × 192 × 256 matrix, TE = 4.2 ms, TR = 1600 ms), which is part of a standard clinical MRI examination protocol for patients with motor neuron diseases (MND). One-hundred-and-fifty-four healthy subjects without any neurological/psychiatric disease or other medical condition composed the control group. Five-hundred-and-ten patients with sporadic ALS were recruited in the outpatient and inpatient settings of the Department of Neurology, University of Ulm, Germany and composed the ALS group. One-hundred-and-twenty datasets (78 ALS patients and 42 controls) out of these groups were available with corresponding manual delineations of hypothalamus. After pre-processing based on a visual quality check, 12 ALS hypothalamic volumes were removed from the dataset due to limited contrast of the MR images. Thus, a subset of 108 hypothalamic volumes (50 slices each) were used for the training of different network architectures and the comparison of their performance. These data were randomly split on the subject level into training (47 ALS, 24 controls), validation (4 ALS, 3 controls), and previously unseen test (15 ALS, 15 controls) datasets, respectively, at a ratio of 66%/6%/28%, resulting in 3550 images for training, 350 images for validation, and 1500 images for test (Table 2B). Controls and ALS groups in the test sample were gender- and age-matched.

Table 2 Overview of available datasets and their splits during hypothalamic segmentation: (A) Total amount, (B) Network implementation, (C) Neuroimaging dataset.

Full size table

Data-preprocessing and manual segmentation protocol of hypothalamus

T1-weighted MRI data were used for manual delineation of the hypothalamus in the coronal plane in three-step pre-processing pipeline using the Tensor Imaging and Fiber Tracking (TIFT) software package expanded by a volumetric extension package³³. The ground truth was obtained as a subsample from results of a previous analysis of some hundred sporadic ALS cases and 112 healthy controls⁹. First, the rigid body normalization of T1-weighted MRI data was performed along the anterior commissure (AC)—posterior commissure (PC) axis such that the coronal cutting plane was perpendicular with respect to the AC-PC axis to correct for individual tilt of the head and to minimize potential partial volume effects. Then, spatial upsampling was performed to improve the accuracy in visually identifying landmarks and hypothalamic borders. The hypothalamic section of each dataset was pre-selected in 50 slices of 0.5 mm thickness. Finally, manual delineation of the left- and right-hemispheric hypothalamus was performed using the highly reproducible technique adapted from Gabery et al.¹², as previously described^9,34,35. In short: boundaries in coronal sections were defined anterior when the optic chiasm was first seen to be attached to the ventral part of the septal area and posterior where the fornix appears to be merged with the mammillary nucleus. The hypothalamus was medially bounded by the third ventricle, the inferior border was defined by the junction of the optical chiasm for the anterior part, and by the border of the cerebrospinal fluid for the more posterior slices. The hypothalamus was laterally bounded by the diagonal band of Broca in the preoptic area, the internal capsule and the cerebral peduncle for the more posterior slices together with non-hypothalamic grey matter structures such as the fields of Forel on the most posterior slices. The optical tract was excluded from all slices. An appropriate visualization of the hypothalamic localization is provided in Fig. 5. With this manual delineation procedure, a high level of reproducibility was obtained with an intra-rater variability with a coefficient of variation < 4% and an intraclass correlation coefficient > 0.9 for inter-rater variability.

Choice of network architecture

In the first stage of our work we compared the performance of four classification networks (VGG16, ResNet50, Inceptionv3, and EfficientNetB0) as backbones in the U-Net network for segmentation of the hypothalamus in the same experimental environment and with the same data.

All mentioned backbones weights were pre-trained on The ImageNet dataset³⁶ to shorten the learning procedure, to speed up convergence, and to achieve high performance as compared to a non-pre-trained model. The 50 images representing the hypothalamic region had a size of 512 × 512 pixels with an in-plane resolution of 0.125 × 0.125 mm². All models were trained on a GeForce GTX 1060 6 GB GPU for 25 epochs with early stopping with a batch size of 4 samples per pass. The loss function was the sum of the categorical Cross Entropy and Jaccard loss. Adaptive Moment Estimation (Adam) with the Keras default settings was used as the optimizer. The IoU was used as a metric to evaluate the model during training. Training was stopped when the validation loss was observed to have ceased improving for 10 consecutive epochs and the model with the lowest validation loss was chosen for prediction.

The segmentation performance of each CNN model was evaluated in terms of IoU. Further, true-positives ($TP$), i.e., the intersection between segmentation and ground truth; true-negatives ($TN$), i.e., part of the image beyond the union between segmentation and ground truth; false-positives ($FP$), i.e., segmented parts not overlapping the ground truth; false-negatives ($FN$), i.e., missed parts of the ground truth—were calculated for each volume to estimate the average Precision, Recall, and Dice coefficient. Average prediction time per image was assessed.

Data augmentation

Because the contrast variation in the acquired MRI scans was a significant segmentation challenge, the training dataset was extended by the 12 ALS datasets excluded in the previous stage (resulting in 120 datasets) and the dataset was augmented by shifting the contrast of each image with some sampled values, such that the variability in the training set was similar to what is seen in real world clinical data, while preserving anatomical information, in order to make the network robust against contrast variations. That way, 27,350 images were obtained for training and 2250 images for validation (Table 2B). In the second stage of this work we re-trained the best model from the previous experiment on these data and repeated the evaluation in the same test dataset consisting of 15 ALS and 15 control datasets.

Total intracranial volume—segmentation and normalization

We utilized the neural network with the best performed backbone used in previous experiments to automatically segment the ICV from original MRI volumes. To generate ground truth data of the ICV for training the network, manual delineation by visual intensity-based 3-dimensional marking of the ICV was performed by the TIFT software. In total, 10 ICV volumes (5 volumes from each test group from the previous experiment) were available for training (9 volumes, 4608 images) and validation (1 volume, 512 images) of the network.

Since no ICV ground truth data were available for the investigated hypothalamic test group, the evaluation was performed visually by 3-D reconstruction of the automatically segmented ICVs.

Finally, volumetric analysis was performed by calculating the normalized hypothalamus volume as:

$${{V}_{hypoth{al}_{norm}}=V}_{hypothal}/{V}_{ICV}*{V}_{{ICV}_{mean}\left(control\right)},$$

(1)

where $V$ denotes the automatically segmented volume of individual hypothalamus or ICV, respectively, and ${V}_{{ICV}_{mean}\left(control\right)}$ is the average ICV volume of controls.

Application to hypothalamic volumetry in ALS

Then, we accessed the ability of the previously trained neural networks to reliably segment the hypothalamus (and consequently also the ICV) in a neuroimaging group level study (Table 2C), which represents a major application for the proposed method. Table 3 provides gender and age characteristics of ALS patients and controls in the neuroimaging dataset, as well as ALS-FRS-R score and disease duration for ALS patients.

Table 3 Overview of subject characteristics of the whole neuroimaging dataset. ALS-FRS-R—revised ALS functional rating scale³⁷.

Full size table

Since manual delineations were not available for the whole dataset, a quality check of the automated segmentation was performed based on the outliers detected in the predicted hypothalamus volume and ICV. The outliers were identified by applying interquartile range (IQR): any point outside the range [Q1 − 1.5*IQR; Q3 + 1.5*IQR] was considered to be an outlier.

Statistical analysis

We used two evaluation criteria of the segmentation performance: first, Dice similarity coefficient measuring the overlap between predicted segmentation and ground truth and second, 95% Hausdorff distance which is similar to maximum HD, but based on the calculation of the 95th percentile of the distances between boundary points in the ground truth and prediction in order to eliminate the impact of a very small subset of the outliers.

The agreement in volume quantification between the ground truth and automatic segmentation provided by each model was analysed based on Bland–Altman plots. Comparison between the ground truth of the hypothalamic volumes and the hypothalamic volumes predicted by the networks was performed by applying a paired t-test or Wilcoxon signed-rank test as appropriate according to Shapiro–Wilk test for normality. The differences between ICV normalized hypothalamus volumes in the control and the ALS group were assessed applying unpaired t-test. A p-value < 0.05 was assumed statistically significant. The mean value and the standard deviation of the differences are reported.

Data availability

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author on reasonable request.

References

Coll, A. P. & Yeo, G. S. The hypothalamus and metabolism: Integrating signals to control energy and glucose homeostasis. Curr. Opin. Pharmacol. 13, 970–976 (2013).
Article CAS PubMed Google Scholar
Clarke, I. J. Hypothalamus as an endocrine organ. In Comprehensive Physiology 217–253 (Wiley, 2014). https://doi.org/10.1002/cphy.c140019.
Guijarro, A., Laviano, A. & Meguid, M. M. Hypothalamic integration of immune function and metabolism. Prog. Brain Res. 153, 367–405 (2006).
Article CAS PubMed PubMed Central Google Scholar
Rahmouni, K. Cardiovascular regulation by the arcuate nucleus of the hypothalamus. Hypertension 67, 1064–1071 (2016).
Article CAS PubMed Google Scholar
Spindler, M. & Thiel, C. M. Quantitative magnetic resonance imaging for segmentation and white matter extraction of the hypothalamus. J. Neurosci. Res. 100, 564–577 (2022).
Article CAS PubMed Google Scholar
Piguet, O. et al. Eating and hypothalamus changes in behavioral-variant frontotemporal dementia. Ann. Neurol. 69, 312–319 (2011).
Article PubMed PubMed Central Google Scholar
Bartlett, D. M. et al. Investigating the relationships between hypothalamic volume and measures of circadian rhythm and habitual sleep in premanifest Huntington’s disease. Neurobiol. Sleep Circadian Rhythms 6, 1–8 (2019).
Article PubMed Google Scholar
Ahmed, R. M., Steyn, F. & Dupuis, L. Hypothalamus and weight loss in amyotrophic lateral sclerosis. Handb. Clin. Neurol. 180, 327–338 (2021).
Article PubMed Google Scholar
Gorges, M. et al. Hypothalamic atrophy is related to body mass index and age at onset in amyotrophic lateral sclerosis. J. Neurol. Neurosurg. Psychiatry 88, 1033–1041 (2017).
Article PubMed Google Scholar
Liu, S. et al. Hypothalamic subregion abnormalities are related to body mass index in patients with sporadic amyotrophic lateral sclerosis. J. Neurol. 269, 2980–2988 (2022).
Article PubMed Google Scholar
Dupuis, L., Pradat, P.-F., Ludolph, A. C. & Loeffler, J.-P. Energy metabolism in amyotrophic lateral sclerosis. Lancet Neurol. 10, 75–82 (2011).
Article CAS PubMed Google Scholar
Gabery, S. et al. Volumetric analysis of the hypothalamus in huntington disease using 3T MRI: The IMAGE-HD study. PLoS ONE 10, e0117593 (2015).
Article PubMed PubMed Central Google Scholar
Tognin, S. et al. Enlarged hypothalamic volumes in schizophrenia. Psychiatry Res. 204, 75–81 (2012).
Article PubMed Google Scholar
Bendiabdallah, M. H. & Settouti, N. A comparison of U-net backbone architectures for the automatic white blood cells segmentation. WAS Sci. Nat. WASSN ISSN 2766-7715 4. https://worldascience.com/journals/index.php/wassn/article/view/24 (2021).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. ArXiv150504597 Cs (2015).
Siddique, N., Paheding, S., Elkin, C. P. & Devabhaktuni, V. U-Net and its variants for medical image segmentation: A review of theory and applications. IEEE Access 9, 82031–82057 (2021).
Article Google Scholar
Kugelman, J. et al. A comparison of deep learning U-Net architectures for posterior segment OCT retinal layer segmentation. Sci. Rep. 12, 14888 (2022).
Article ADS PubMed PubMed Central Google Scholar
Zhang, R., Du, L., Xiao, Q. & Liu, J. Comparison of backbones for semantic segmentation network. J. Phys. Conf. Ser. 1544, 012196 (2020).
Article Google Scholar
Elharrouss, O., Akbari, Y., Almaadeed, N. & Al-Maadeed, S. Backbones-Review: Feature Extraction Networks for Deep Learning and Deep Reinforcement Learning Approaches. https://doi.org/10.48550/arXiv.2206.08016 (2022).
Mathews, M. R., Anzar, S. M., Kalesh Krishnan, R. & Panthakkan, A. EfficientNet for retinal blood vessel segmentation. In 2020 3rd International Conference on Signal Processing and Information Security (ICSPIS) 1–4. https://doi.org/10.1109/ICSPIS51252.2020.9340135 (2020).
Rodrigues, L. et al. Hypothalamus fully automatic segmentation from MR images using a U-Net based architecture. In Proceedings of the 15th SIPAIM, 11330. International Society for Optics and Photonics. https://doi.org/10.1117/12.2542585 (2020).
Billot, B. et al. Automated segmentation of the hypothalamus and associated subunits in brain MRI. NeuroImage 223, 117287 (2020).
Article PubMed Google Scholar
Rodrigues, L. et al. A benchmark for hypothalamus segmentation on T1-weighted MR images. NeuroImage 264, 119741 (2022).
Article PubMed Google Scholar
Guillot, S. J., Bolborea, M. & Dupuis, L. Dysregulation of energy homeostasis in amyotrophic lateral sclerosis. Curr. Opin. Neurol. 34, 773–780 (2021).
Article PubMed Google Scholar
Breen, D. P. et al. Hypothalamic volume loss is associated with reduced melatonin output in Parkinson’s disease. Mov. Disord. 31, 1062–1066 (2016).
Article CAS PubMed PubMed Central Google Scholar
Shapiro, N. L. et al. In vivo hypothalamic regional volumetry across the frontotemporal dementia spectrum. NeuroImage Clin. 35, 103084 (2022).
Article PubMed PubMed Central Google Scholar
Ye, S. et al. MRI volumetric analysis of the thalamus and hypothalamus in amyotrophic lateral sclerosis. Front. Aging Neurosci. 13, 610332 (2021).
Article ADS PubMed Google Scholar
Meyer, M. I. et al. A contrast augmentation approach to improve multi-scanner generalization in MRI. Front. Neurosci. 15, 708196 (2021).
Article PubMed PubMed Central Google Scholar
Wolff, J. et al. A semi-automated algorithm for hypothalamus volumetry in 3 Tesla magnetic resonance images. Psychiatry Res. Neuroimaging 277, 45–51 (2018).
Article PubMed Google Scholar
Chang, J. et al. Lower hypothalamic volume with lower body mass index is associated with shorter survival in patients with amyotrophic lateral sclerosis. Eur. J. Neurol. 30, 57–68 (2023).
Article PubMed Google Scholar
Gabery, S. et al. Loss of the metabolism and sleep regulating neuronal populations expressing orexin and oxytocin in the hypothalamus in amyotrophic lateral sclerosis. Neuropathol. Appl. Neurobiol. 47, 979–989 (2021).
Article CAS PubMed Google Scholar
Münch, M., Müller, H.-P., Behler, A., Ludolph, A. C. & Kassubek, J. Segmental alterations of the corpus callosum in motor neuron disease: A DTI and texture analysis in 575 patients. NeuroImage Clin. 35, 103061 (2022).
Article PubMed PubMed Central Google Scholar
Müller, H.-P., Unrath, A., Ludolph, A. C. & Kassubek, J. Preservation of diffusion tensor properties during spatial normalization by use of tensor imaging and fibre tracking on a normal brain database. Phys. Med. Biol. 52, N99-109 (2007).
Article ADS PubMed Google Scholar
Gorges, M. et al. Morphological MRI investigations of the hypothalamus in 232 individuals with Parkinson’s disease. Mov. Disord. 34, 1566–1570 (2019).
Article PubMed Google Scholar
Kassubek, R. et al. Morphological alterations of the hypothalamus in idiopathic intracranial hypertension. Ther. Adv. Chronic Dis. 13, 20406223221141350 (2022).
Article CAS PubMed PubMed Central Google Scholar
Russakovsky, O. et al. ImageNet Large Scale Visual Recognition Challenge. https://doi.org/10.48550/arXiv.1409.0575 (2015).
Cedarbaum, J. M. et al. The ALSFRS-R: A revised ALS functional rating scale that incorporates assessments of respiratory function. BDNF ALS Study Group (Phase III). J. Neurol. Sci. 169, 13–21 (1999).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank the Ulm University Center for Translational Imaging MoMAN for its support.

Funding

Open Access funding enabled and organized by Projekt DEAL. This study was supported by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG Grant Number LU 336/15-1) and the German Network for Motor Neuron Diseases (BMBF 01GM1103A).

Author information

These authors contributed equally: Ina Vernikouskaya and Hans-Peter Müller.
These authors jointly supervised this work: Jan Kassubek and Volker Rasche.

Authors and Affiliations

Department of Internal Medicine II, Ulm University Medical Center, Albert-Einstein-Allee 23, 89081, Ulm, Germany
Ina Vernikouskaya & Volker Rasche
Department of Neurology, University of Ulm, Ulm, Germany
Hans-Peter Müller, Francesco Roselli, Albert C. Ludolph & Jan Kassubek
German Center for Neurodegenerative Diseases (DZNE), Ulm, Germany
Francesco Roselli, Albert C. Ludolph & Jan Kassubek
Core Facility Small Animal MRI, University of Ulm, Ulm, Germany
Volker Rasche

Authors

Ina Vernikouskaya
View author publications
You can also search for this author in PubMed Google Scholar
Hans-Peter Müller
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Roselli
View author publications
You can also search for this author in PubMed Google Scholar
Albert C. Ludolph
View author publications
You can also search for this author in PubMed Google Scholar
Jan Kassubek
View author publications
You can also search for this author in PubMed Google Scholar
Volker Rasche
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.V.: Methodology, Data curation, Formal analysis, Investigation, Software, Visualization, Validation, Writing—original draft, Writing—review & editing. H.P.M.: Data curation, Methodology, Software, Writing—review & editing. F.R.: Conceptualization, Resources, Writing—review & editing. A.C.L.: Resources, Funding acquisition, Writing—review & editing. J.K.: Conceptualization, Resources, Supervision, Project administration, Writing—review & editing. V.R.: Conceptualization, Supervision, Project administration, Writing—review & editing.

Corresponding author

Correspondence to Ina Vernikouskaya.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vernikouskaya, I., Müller, HP., Roselli, F. et al. AI-assisted quantification of hypothalamic atrophy in amyotrophic lateral sclerosis by convolutional neural network-based automatic segmentation. Sci Rep 13, 21505 (2023). https://doi.org/10.1038/s41598-023-48649-6

Download citation

Received: 03 May 2023
Accepted: 29 November 2023
Published: 06 December 2023
DOI: https://doi.org/10.1038/s41598-023-48649-6

This article is cited by

AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)
- Ina Vernikouskaya
- Hans-Peter Müller
- Volker Rasche
International Journal of Computer Assisted Radiology and Surgery (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Virtual reality-empowered deep-learning analysis of brain cells

Native-state proteomics of Parvalbumin interneurons identifies unique molecular signatures and vulnerabilities to early Alzheimer’s pathology

nnU-Net: a self-configuring method for deep learning-based biomedical image segmentation

Introduction

Results

Performance of neural network hypothalamic segmentation

Volumetric analysis in the test group

Hypothalamic analysis in the group comparison

Discussion

Material and methods

Ethical approval

MRI dataset

Data-preprocessing and manual segmentation protocol of hypothalamus

Choice of network architecture

Data augmentation

Total intracranial volume—segmentation and normalization

Application to hypothalamic volumetry in ALS

Statistical analysis

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

AI-assisted automatic MRI-based tongue volume evaluation in motor neuron disease (MND)

Comments

Search

Quick links