Introduction

The analysis and classification of radiological images are highly time-consuming and require trained observers. In the X-ray domain, biological tissues and their pathology-induced modifications can have very similar attenuation properties, so that their discrimination can be very difficult or, in some cases, impossible. When a new treatment is being studied, the early stages of a disease, characterized by tiny signs and structures, have to be visualized; this requires highly sensitive imaging methods and high spatial resolution images. The standard reference in clinical pathology is the histological examination, which provides a 2D analysis of thin planar slices of small portions of tissue. The method requires labor-intensive protocols (including tissue preparation, cutting, staining/labeling and analysis) that can take up to tens of hours in the case of 3D histology reconstruction1. Full-organ volumetric representation is still challenging: (1) the minimal sampling in the third dimension is limited by the slice thickness; and (2) the frequent tearing of the tissue during the sectioning phase causes artifacts and thus makes the slice-to-slice alignment prone to error. In addition, histological techniques fall short of a complete characterization of the tissue, because the derived information heavily depends on the quality of the staining and labeling of the tissue. Last, the diagnosis relies on skilled operators at all the different steps of the histological exam.

X-ray phase contrast imaging (PCI) has been proven to provide enhanced sensitivity and accuracy for pathology detection in a non-destructive way2. The technique is based on the detection of both the amplitude (i.e. attenuation) and the phase changes induced by an object in the X-ray beam. The image contrast produced by PCI can be orders of magnitude higher than that given by standard absorption-based radiography in the energy range of interest for biomedical imaging and, in particular, for soft tissues3,4,5. As a result, the visibility of low-absorbing structures and of features with similar attenuation properties is largely enhanced. Combined with computed tomography (CT) methodologies, PCI can provide a highly contrasted 3D representation of the imaged volumes, which can then be virtually sliced along any plane of choice (and at different sampling steps), performing what is nowadays called in the literature X-ray "virtual histology"6,7.

Previous works have shown that PCI enables the depiction of different stages of articular cartilage degradation (i.e. osteoarthritis, OA)8,9,10,11,12,13,14 and of liver fibrosis in a rat animal model15,16. In those studies, images were evaluated and classified by experienced radiologists. Osteoarthritis is a degenerative disease in which the cartilage wears down over time17, leading to restricted mobility and pain18. Liver fibrosis is the excessive accumulation of extracellular matrix proteins, including collagen, that occurs in most types of chronic liver disease19. Conventional X-ray imaging techniques are sensitive only to the advanced stages of these pathologies, when therapeutic strategies are less effective.

Convolutional Neuronal Networks (CNNs) are artificial neuronal systems that offer a highly accurate and observer-independent classification of images20.

As reported in the literature, CNNs have been successfully used in different fields, such as object detection21, face recognition22 and medical imaging23. CNNs were applied for the first time to PCI cartilage images by Abidin et al.24. This pilot study inspired us to test multiple advanced CNNs on two different computed tomography datasets: one containing cartilage images acquired at much higher resolution than reported by Abidin et al., using an alternative PCI technique, and a second one including PCI images of liver fibrosis and fatty liver.

Our objective is to apply and compare the performance of different CNN systems in terms of their capability to discriminate different stages of cartilage and liver diseases using, as input, image datasets acquired with highly sensitive PCI methods. We aim at identifying the optimal approach and settings in order to establish a procedure for OA and liver data classification that is time-effective, observer-independent and more accurate than what has already been reported in the literature.

Methods

Artificial neural networks

Artificial Neuronal Networks (ANNs) are computing systems inspired by the structure and functioning of biological neuronal networks, which “learn” how to perform tasks from given input examples, without being specifically programmed for the task. ANNs are a sub-category of the more general machine learning algorithms25. In this work, the convolutional neuronal network (CNN) was used, which is a special kind of ANN having four different types of layers: an input, a convolutional, a pooling, and a fully connected layer.

The input layer receives the image that is given to the network to be analyzed. The convolutional layer has a kernel with trainable weights and a size that can be varied: usual sizes are 3 × 3, 5 × 5 or 7 × 7 pixels. The input image is convolved with the kernel, which thus acts as a filter. The pooling layer downsamples the input, with parameters that depend on the kernel size and stride length (the step of displacement after each convolution).

In this study, the so-called max-pooling layer is used: it takes the maximum value within the kernel as input for the next layer. At the end of the network, classification and activation functions are performed in the fully connected layer, where all artificial neurons are connected26. A minimal sketch of such a network is given below.
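To make the layer vocabulary concrete, the following sketch stacks the four layer types in Keras (the library used in this work); the toy architecture itself, with its filter count and layer widths, is illustrative and is not one of the networks tested here.

```python
from tensorflow.keras import layers, models

# Toy CNN with the four layer types: input, convolutional, pooling,
# fully connected.
model = models.Sequential([
    layers.Conv2D(32, kernel_size=(3, 3), activation='relu',
                  input_shape=(224, 224, 3)),         # input + convolutional
                                                      # layer (3 x 3 kernel)
    layers.MaxPooling2D(pool_size=(2, 2), strides=2), # max-pooling: keeps the
                                                      # maximum in each window
    layers.Flatten(),                                 # 2D maps -> 1D vector
    layers.Dense(2, activation='softmax'),            # fully connected
])                                                    # classification layer
model.summary()
```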

In the CNN language, one epoch is one iteration of the network over the data. An epoch consists of two parts: the forward processing of the images to classify them, and the backpropagation to change/train the weights and converge towards an improved classification. In the forward processing, the image data go from the input layer to the classification layer; at this last layer, a function (the error function) calculates the error between the predicted classification and the a priori classification information, considering the effect of every weight. To minimize the error, an optimizer27 is used, which adjusts the weights according to the learning rate set by the user (in the range [0, 1]). Depending on the number of epochs and the learning rate, the CNN converges and classifies the data; therefore, the learning rate is set to obtain the fastest convergence (as defined at the end of this section) and the best classification.
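As an illustration, in Keras the error function, the optimizer and the learning rate are set at compilation time, and the number of epochs at training time. The sketch below reuses the toy model from above; x_train, y_train, x_val and y_val are placeholders for the image data and labels, and the learning rate value is illustrative.

```python
from tensorflow.keras.optimizers import RMSprop

model.compile(optimizer=RMSprop(learning_rate=1e-6),  # user-set learning rate
              loss='categorical_crossentropy',        # error function
              metrics=['accuracy'])

# One epoch = one forward pass over all training images followed by
# backpropagation of the classification error.
history = model.fit(x_train, y_train,
                    validation_data=(x_val, y_val),
                    epochs=25)
```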

In this study, we used the so-called transfer learning method, which works with CNN weights pre-trained on large image datasets28; in our case, the weights were trained on the ImageNet dataset29. A custom-designed network was implemented by removing the classification layer of the pre-trained network and adding a fully connected layer and a new classification layer with two or three outputs, depending on the dataset. The pre-trained CNN acts as a feature extractor for the self-designed part of the network: during backpropagation, the weights of the self-designed part are adjusted, whereas the weights of the pre-trained part remain fixed.

We tested three pre-trained CNNs in this study: VGG1630, Inception V331 and the Xception network32. We selected these specific networks because of their high performance in the image classification competition organized by the ImageNet project, the Large Scale Visual Recognition Challenge (LSVRC)33. The accuracy achieved in the challenge was 90.1% for VGG16, 94.1% for Inception V3 and 94.5% for Xception32.

The VGG16 CNN is a network with 13 convolutional layers with a kernel size of 3 × 3 pixels, two fully connected layers and a classification layer with 1000 outputs (16 weight layers in total). The size of the input images is 224 × 224 pixels (RGB images)30. VGG16 is a computationally heavy network, with long training times and a large number of weights (for a total size of 533 MB).

The Inception network was introduced by Szegedy et al.34. The idea is to construct the network "wider" instead of "deeper". To make a network "deeper", layers are stacked one behind the other; in the Inception network, layers are instead added in a parallel configuration, so the network becomes "wider" and the layers work in parallel. In this way, the size of the pre-trained weights is reduced to 96 MB35.

The Xception network architecture builds on depth-wise separable convolutional layers. The weight size of this network is 91 MB32.
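All three architectures are available with their ImageNet pre-trained weights through keras.applications; a sketch of how they can be loaded is given below (include_top=False drops the original 1000-class classification layer, in preparation for the transfer learning described in the next section).

```python
from tensorflow.keras.applications import VGG16, InceptionV3, Xception

# Pre-trained convolutional bases without the original classification head.
vgg = VGG16(weights='imagenet', include_top=False,
            input_shape=(224, 224, 3))
inception = InceptionV3(weights='imagenet', include_top=False,
                        input_shape=(224, 224, 3))
xception = Xception(weights='imagenet', include_top=False,
                    input_shape=(224, 224, 3))
```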

The analysis was performed on a Fujitsu workstation with 8 Intel Xeon CPUs with 4 cores each, running at 2.6 GHz. The calculations were carried out on an NVIDIA Quadro P1000 graphics card with 4 GB of memory. The entire code is written in Python and based on the Keras library36, a deep learning library interfacing tensorflow-GPU37 as a backend.

Transfer learning process

We used the transfer learning of the CNNs, which is achieved through a three-step process (a minimal code sketch follows the list):

  1. The network is divided into two sections: the first section is trained on a large dataset of annotated images, which is not related to the later task.

  2. All the parameters in the first section are fixed and a second training is performed, using a dataset related to the classification task, to train the second section of the network. In this way, the algorithm learns how to classify the images.

  3. The algorithm is applied to the dataset of interest in a fully automated manner.
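A minimal Keras sketch of this process, assuming the cartilage case with two output classes; the width of the added fully connected layer (256) is a hypothetical choice for illustration.

```python
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

# Step 1: the first section, pre-trained on a large unrelated dataset
# (ImageNet).
base = VGG16(weights='imagenet', include_top=False,
             input_shape=(224, 224, 3))
base.trainable = False                       # step 2: fix its parameters

# Step 2 (continued): the task-specific second section that is trained.
model = models.Sequential([
    base,                                    # frozen feature extractor
    layers.Flatten(),
    layers.Dense(256, activation='relu'),    # added fully connected layer
    layers.Dense(2, activation='softmax'),   # new classification layer
])

# Step 3: after training, model.predict() classifies the dataset of
# interest in a fully automated manner.
```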

Fine tuning of the networks

For the fine tuning of the networks, we tried to improve them in two ways: (1) by studying the influence of parameters such as the optimizer, the learning rate and the number of epochs; (2) by adding elements to the pre-trained network, such as a fully connected layer, a drop-out layer, etc.

Concerning the first method, we kept the optimizer algorithm RMSProp38 constant in all cases, as well as the number of epochs. The number of epochs was chosen to assure convergence of the CNN; in this study, we defined a convergence criterion based on a threshold (± 0.5% between two consecutive epochs of the validation data). The learning rate was always adjusted to push the network to its best performance and to avoid overfitting.
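One way to express this convergence criterion in Keras is the standard EarlyStopping callback; the mapping of the ± 0.5% threshold onto min_delta and patience below is an illustrative reading, and x_train, y_train, x_val, y_val are placeholders.

```python
from tensorflow.keras.callbacks import EarlyStopping

# Stop once the validation accuracy changes by less than 0.5% between two
# consecutive epochs.
stop_on_plateau = EarlyStopping(monitor='val_accuracy',
                                min_delta=0.005,   # 0.5% threshold
                                patience=1,        # consecutive epochs
                                mode='max')

history = model.fit(x_train, y_train,
                    validation_data=(x_val, y_val),
                    epochs=50,                     # upper bound
                    callbacks=[stop_on_plateau])
```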

Using the second method, the best results were obtained by adding one fully connected layer; a drop-out variant that we also tested is sketched below.
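For illustration, one of the tested kinds of additions, a drop-out layer inserted into the custom head, could look like the following sketch; it reuses the frozen base from the previous sketch, and the drop-out rate of 0.5 is hypothetical.

```python
from tensorflow.keras import layers, models

model_with_dropout = models.Sequential([
    base,                                    # frozen pre-trained section
    layers.Flatten(),
    layers.Dense(256, activation='relu'),    # added fully connected layer
    layers.Dropout(0.5),                     # drop-out layer (rate 0.5)
    layers.Dense(2, activation='softmax'),   # classification layer
])
```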

Phase contrast imaging and dataset description

In X-ray PCI, the image contrast derives from the perturbations of the X-ray wave-front induced by the presence of an object along its propagation path. This contrast mechanism has been proven to lead to a superior image contrast3,4,5 with respect to standard X-ray attenuation, especially in the case of soft tissues. In this work, we applied X-ray PCI to investigate cartilage and liver biological specimens.

Cartilage samples and dataset

For the cartilage evaluation, human cadaveric patellae were used. According to the regulations for experiments involving cadaveric samples, this study was waived by the ethics committee of the Ludwig-Maximilians-University, Munich, Germany. However, the required informed consent was obtained from the legally authorized representative/next of kin of the deceased prior to the extraction of the patella. Samples were extracted in compliance with the relevant guidelines and regulations by the forensic medicine department of the Ludwig-Maximilians-University, including testing for infectious diseases. Four cartilage samples (plugs), cylinders of 7 mm in diameter, were harvested from a human patella (67-year-old woman) within 24 h of death. The plugs were divided into two groups, based on the OARSI assessment system39, by two experienced pathologists: the control group with healthy cartilage samples and the group with OA-degraded cartilage. The samples were imaged at the Biomedical beamline (ID17) of the European Synchrotron (ESRF, France) using X-ray propagation-based PCI micro-CT40 with a polychromatic, filtered X-ray beam with a peak energy around 40 keV41. The detection system consisted of a PCO edge 5.5 sCMOS camera42 coupled with a 10 × optics and a 19 µm thick GGG scintillator screen, leading to a final pixel size of 0.7 × 0.7 µm2. From the reconstructed CT volumes, sagittal CT slices (1024 × 1024 pixels) of the transitional and mid zone of the cartilage were extracted, downscaled to 224 × 224 pixels to fit the CNN input requirements and finally normalized to values in the [0, 1] range (see the sketch below). The analysis was performed on images with a voxel size of 0.7 × 0.7 × 0.7 µm3. Examples are shown in Fig. 1A,B: (1A) is a sagittal PCI CT image of healthy cartilage, whereas (1B) is the sagittal slice of an osteoarthritic cartilage sample, with a small crack in the tissue visible on the right side. From every group, 3800 images were extracted and split into three categories: 60% were used as training data, 20% as validation data during training and 20% as testing data after training. Half of the images corresponded to healthy samples, the other half to pathological ones. In addition, for a separate test we split the dataset by sample: two samples (one from each group) were used for training the network, and the images of the remaining two samples were split equally into validation and testing datasets43,44.
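A minimal sketch of this slice preparation, assuming the extracted slices are available as NumPy arrays; the replication of the grayscale slice to three channels is our assumption, since the pre-trained networks expect RGB input.

```python
import numpy as np
from skimage.transform import resize

def prepare_slice(ct_slice):
    """Downscale a CT slice (e.g. 1024 x 1024) to 224 x 224, normalize to
    [0, 1] and replicate it to 3 channels."""
    small = resize(ct_slice, (224, 224), anti_aliasing=True)
    small = (small - small.min()) / (small.max() - small.min())
    return np.stack([small] * 3, axis=-1)
```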

Figure 1

Examples of PCI micro-CT images (224 × 224 pixels) used as input for the neuronal network systems; (A,B): images acquired with a detector pixel size of 0.7 × 0.7 µm2 of a healthy (A) and degenerated (B) cartilage specimen, respectively; (C–E): PCI images acquired at a final voxel size of 2.2 × 2.2 × 2.2 µm3 of a healthy (C), fibrotic 4-week (D) and fatty (E) liver, respectively.

Liver samples and dataset

Six male Lewis rats (Charles River Wiga, Sulzfeld, Germany; body weight 170–190 g) underwent syngeneic orthotopic liver transplantation with a surgical technique described in detail elsewhere45,46. Organs were stored in cold University of Wisconsin solution (4 °C; DuPont de Nemours, Bad Homburg, Germany). All experiments were carried out in accordance with the German legislation on animal protection and with the "Principles of Laboratory Animal Care" (NIH publication no. 86-23, revised 1985)15. All experimental protocols for the rat studies were approved by the local government (Regierung von Oberbayern, Munich, Germany) and were reported to the responsible authorities regularly. The liver samples were divided into three groups: (1) healthy; (2) fibrotic, four weeks (4-week perfusion); (3) fatty livers. The samples were all paraffin-embedded and were imaged with X-ray PCI micro-CT with a polychromatic X-ray beam with a mean energy of 24 keV at the ID19 beamline of the ESRF. The detection system was a PCO Edge with a 6.5 µm pixel size coupled with a 2.9 × optics, thus determining a final effective pixel size of 2.2 × 2.2 µm2. The analysis was performed on CT slices with a voxel size of 2.2 × 2.2 × 2.2 µm3. Both CT datasets were pre-processed and reconstructed with the PyHST2 software47.

The 512 × 512 pixel liver images were extracted from the 3D reconstructed volumes, their intensity normalized to the [0, 1] range and likewise reduced to 224 × 224 pixels by binning via linear interpolation. This dataset originates from 6 different liver samples: two healthy (Fig. 1C), two fibrotic 4-week (Fig. 1D) and two fatty livers (Fig. 1E). A total of 3600 images were obtained, with 600 images extracted from each liver sample. By again applying a 60/20/20 split ratio, 2160 images were used for training the network, 720 for validation during the training and 720 for testing the trained network. In addition, the total number of input images for training and testing was increased by rotating the original images by 90, 180 and 270 degrees and adding them to the respective groups of images. This method is referred to as "data augmentation" and, in this case, it increased the total number of available images by a factor of four48 (see the sketch below). In a further test, we split the samples as follows: one sample from each group was used for training, and the images of the remaining three samples were used for the validation and testing datasets43,44.
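A minimal sketch of this rotation-based data augmentation for an image stack stored as a NumPy array:

```python
import numpy as np

def augment_by_rotation(images):
    """images: array of shape (n, height, width, channels)."""
    # Rotations by 90, 180 and 270 degrees (k = 1, 2, 3 quarter turns).
    rotated = [np.rot90(images, k=k, axes=(1, 2)) for k in (1, 2, 3)]
    return np.concatenate([images] + rotated, axis=0)  # 4x the image count
```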

The training data were used for training and updating the weights of the network. The validation data were used to evaluate the accuracy of the network after each iteration. The testing dataset was used to evaluate the accuracy of the trained model on new data. The accuracy of the networks was calculated as the ratio of the sum of the true positive and true negative cases over the total number of input images:

$$\mathrm{Accuracy}=\frac{\sum \mathrm{True\ Positive}+\sum \mathrm{True\ Negative}}{\mathrm{Total\ number\ of\ images}}$$
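Equivalently, for a confusion matrix such as those in Figs. 2 and 4, the accuracy is the sum of the diagonal (the correctly classified images of every class) divided by the total number of images:

```python
import numpy as np

def accuracy_from_confusion(cm):
    """cm: square confusion matrix; rows = true labels, cols = predictions."""
    return np.trace(cm) / cm.sum()
```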

Results

All validations were done with respect to the histological data, taken as reference. For the cartilage data, the VGG16 network provided a testing accuracy of 99.8% (validation accuracy 99.8%, training accuracy 99.9%) (Fig. 3 top) after 25 epochs and with a learning rate of 7 × 10−7. In this case, none of the 760 images were falsely classified as degenerated instead of healthy (false positives) and 3 out of 760 images were classified as healthy instead of degenerated (false negatives), as shown in the so-called confusion matrix in Fig. 2A. The time to train and validate this network was 34 min and 42 s.

Figure 2

Confusion matrices of the VGG16, Inception V3 and Xception networks applied to the cartilage dataset. They show the "true" label (resulting from the histologic analysis) and the label predicted by the CNN. (A) The test accuracy in classifying healthy and degenerated (i.e. OA-affected) tissues is 99.8% with the VGG16. (B) The test accuracy of the Inception V3 network is 84.7%. (C) The Xception has a test accuracy of 72.6%. The confusion matrices in this figure were generated with matplotlib version 2.2.2 (https://www.matplotlib.org.cn/en/).

The Inception V3 network classified the data with a test accuracy of 84.7% (validation accuracy 86.8%, training accuracy 96.6%) (Fig. 3 mid) after 25 epochs with a learning rate of 1 × 10−6; in numbers, 224 images were predicted as false negatives and no false positive cases were found (Fig. 2B): 536 healthy images and 760 images with signs of degeneration were classified correctly out of the total of 1520 images (Fig. 2B). The calculation time for this network (training, validation and testing) was 19 min and 41 s. The Xception network had 308 images predicted as false positives and no false negatives; 376 healthy and 760 degenerated images were predicted correctly (Fig. 2C). Figure 3 shows the accuracy plots (training and validation) as a function of the number of epochs for the VGG16, Inception V3 and Xception networks, respectively; it shows how they converge toward a stable solution for the same number of iterations.

Figure 3

Accuracy plots as a function of the number of epochs for the VGG16, Inception V3 and Xception networks. The Xception has the largest gap between training and validation accuracy, while the highest accuracy is achieved by the VGG16 network. The line plots in this figure were generated with matplotlib version 2.2.2 (https://www.matplotlib.org.cn/en/).

With the Xception network, a validation accuracy of 70.2% and a testing accuracy of 72.6% were achieved after 25 epochs. The training, validation and testing with this network took 37 min and 25 s. The accuracy on the training data was 88.7% (Fig. 3 bottom). Using 42 epochs instead, the plateau (< ± 0.5% difference between epochs) was reached for the training accuracy (97.2%), the validation accuracy increased to 79.7% and the testing accuracy to 81.3%. The time required for this calculation was 1 h 14 min and 33 s. Training the network over more epochs thus increases the testing accuracy of the Xception network, but it does not reach the testing accuracy of the Inception V3 or VGG16 networks.

In the next step, the cartilage data were split by sample: images of one healthy and one degenerated sample were used for training, and the images of the other samples were used for validation and testing. This gives a 50/25/25 percent split: 1900 images were used for training (950 from the healthy sample, 950 from the degenerated sample) and 950 images each were used for validation and testing. The training with this dataset lasted 25 epochs. The testing (validation) accuracy of the VGG16 network was 68.6% (70.0%), whereas the training accuracy was 99.9%; the training/validation and testing took 32 min and 45 s. The Inception V3 network achieved a testing (validation) accuracy of 65.9% (66.6%) and a training accuracy of 99.0%; the calculation time was 29 min and 38 s. The Xception network reached a testing (validation) accuracy of 75.7% (79.8%) and a training accuracy of 99.8%. When splitting the dataset by sample, the testing accuracies of the VGG16 and Inception V3 networks declined, whereas the Xception network increased its testing accuracy from 72.6% to 75.7%.

For the liver data, we report the confusion matrices in Fig. 4. For all three CNNs, convergence was obtained after 50 epochs. The VGG16 network performed with a test accuracy of 96.0% (validation accuracy 96.1%) with a learning rate of 1 × 10−6; the training accuracy was slightly lower, at 94.6%. Two healthy, 26 fatty liver and 1 fibrotic (4-week) images were mistakenly classified (Fig. 4A). The computational time was 34 min and 21 s.

Figure 4

Confusion matrices representing the image classification capability of the VGG16 (A), Inception V3 (B) and Xception (C) networks. The testing accuracy of the VGG16 network is 96.0%, of Inception V3 43.6% and of Xception 53.8%. The confusion matrices in this figure were generated with matplotlib version 2.2.2 (https://www.matplotlib.org.cn/en/).

The Inception V3 network, with a learning rate of 1 × 10−6, performed on this dataset with a test accuracy of 43.6% in 18 min and 54 s; it falsely classified 442 out of 720 images (Fig. 4B). The Xception network (learning rate of 1 × 10−6) reached a test accuracy of 53.8% on this dataset in 36 min and 1 s (Fig. 4C).

To increase the number of input images for the network and obtain a more general classifier, we repeated the classification with the rotation-augmented dataset. The training/validation/testing ratio was kept at 60/20/20. As a result, 14,400 images (4 × 3600 original images) were available: 8640 images were used for training the CNNs, 2880 for validation and, finally, 2880 for testing the network. The networks converged faster with the larger number of images, so the number of epochs was reduced to 15 for this calculation. With the VGG16 network, a test (validation) accuracy of 95.6% (95.0%) was achieved. The Inception V3 network reached a testing (validation) accuracy of 38.8% (35.8%) and the Xception network a test (validation) accuracy of 50.3% (48.8%), while their training accuracies were 86.2% and 95.4%, respectively. The calculation times increased compared to the previous liver cases: 42 min and 11 s for the VGG16, 23 min and 10 s for the Inception V3 and 44 min and 19 s for the Xception network.

In a further step, we tested the network performances by training the system with images of one set of samples and then validating and testing it with another set. For training, 1800 images from three samples of different groups were used (600 images from one healthy liver sample, 600 from one fatty liver sample and 600 from one four-week perfusion sample). The classification accuracy on the testing dataset with the VGG16 network was 75.4%, whereas the training (validation) accuracy was 99.8% (73.7%); the training (15 epochs) and testing of this network took 10 min and 33 s. The Inception V3 network obtained a testing accuracy of 39.8% and a training (validation) accuracy of 99.94% (41.0%); its training (15 epochs) and testing took 6 min and 10 s. The Xception network achieved a testing accuracy of 42.0%, with training and validation accuracies of 98.7% and 46.4%, respectively; its training (15 epochs) and testing lasted 11 min and 17 s.

Discussion and conclusions

In this work, we have investigated the possibility of using convolutional neuronal networks for the classification of healthy and pathological biological tissues, considering two different biomedical cases: osteoarthritic cartilage and liver fibrosis. The evaluation of the samples was carried out by two experienced pathologists on the basis of the histological results, which served as the gold standard.

We applied three CNNs (VGG16, Inception V3 and Xception) and compared their performances in terms of classification accuracy and computation time. The VGG16 network provided the highest accuracy in the analyzed cases, compared to the Inception V3 and Xception networks. In the VGG16, the entire image is convolved, whereas in the Inception V3 and Xception networks the image to be analyzed is split into different regions. This subdivision of the images can lead to overfitting, which causes poor performance when the networks are applied to the validation and testing data. This explains the discrepancy between the training and the validation/testing accuracy curves of the Inception V3 and Xception networks, as well as the discrepancy of our results with respect to the LSVRC competition.

We also tested the effect of training the network with images of two cartilage samples (one healthy and one degenerated) and validating and testing it with images of the remaining samples: the testing accuracy decreased for all three networks. However, the Xception network retained the highest testing accuracy, showing that it is the best-generalizing model when the dataset is split by sample.

Several strategies, such as adding fully connected layers or drop-out layers, changing the optimizer algorithm and adjusting the learning rate, were used to reduce overfitting in the Xception and Inception V3 networks. The results presented here were obtained after this optimization procedure (best accuracy and lowest overfitting); the results of the intermediate optimization steps are not reported.

In the case of cartilage, other computer-aided diagnosis tools, such as texture analysis, are also available. This kind of analysis of cartilage PCI images for characterizing osteoarthritis gives good results for both 2D images49 and 3D volumes50.

The Inception V3 network, with its inception modules, is much faster in training, validation and testing than the other two networks. For the cartilage dataset, Inception V3 was 56.7% faster than the VGG16 network and 47.4% faster than the Xception. For the liver dataset, it was 26.6% and 29.5% faster than the VGG16 and Xception networks, respectively. The reason for this speed lies in the inception module structure, which reduces the number of trainable weights and thereby the computation time.

With the data augmentation of the liver dataset, we showed that the networks converge faster when the number of input images is increased, so that fewer epochs are needed. For the VGG16 network, the testing accuracy stayed approximately the same (95.6%), but the computational time increased by 23.5%, from 34 min and 21 s to 42 min and 11 s, because the number of input images was increased by a factor of 4. We can conclude that more input data lead to a faster convergence of the VGG16 network at comparable accuracy, but this does not come with shorter computational times.

When we used data from different liver samples for the training and the testing of the networks, the testing accuracy of all the networks decreased, whereas the training accuracy increased. This result shows that overfitting occurs and that the networks do not generalize sufficiently; to overcome this limitation, a larger number of samples should be used. The network with the best testing accuracy was again the VGG16, with 75.4%, as in the calculation without the sample-based split. Both the Inception V3 and Xception testing accuracies were below 50% in this test: both networks did not perform well on the liver data, in contrast to the cartilage data, where both had a testing accuracy above 65%.

The testing accuracy strongly depends on the data splitting method used. If the slices for training and testing the CNN are extracted from the same sample, the data used in the two processes may look very similar and the networks may overfit during training. In this case, the generalization of the CNN to new samples is uncertain and may be severely hindered.

This study shows that the combination of advanced, highly sensitive X-ray imaging techniques (PCI) with newly available algorithms for data classification based on the neuronal network concept can significantly support the discrimination between healthy-normal and pathological-abnormal conditions of biological tissues. The proof of concept of this methodology was performed here on small tissue samples (cylindrical bone/cartilage plugs of 7 mm in diameter). This method could be an important asset towards the automation of diagnostic procedures. The application of CNNs to our datasets showed that these tools (in our specific case, the VGG16 network was identified as the most accurate one) make it possible to analyze and classify sets of 9616 images of 224 × 224 pixels in less than 25 min, providing a robust, fast and observer-independent method of diagnosis.