Abstract
Time-lapse fluorescence microscopy is an essential technique for quantifying various characteristics of cellular processes, i.e. cell survival, migration, and differentiation. To perform high-throughput quantification of cellular processes, nuclei segmentation and tracking should be performed in an automated manner. Nevertheless, nuclei segmentation and tracking are challenging tasks due to embedded noise, intensity inhomogeneity, shape variation as well as a weak boundary of nuclei. Although several nuclei segmentation approaches have been reported in the literature, dealing with embedded noise remains the most challenging part of any segmentation algorithm. We propose a novel denoising algorithm, based on sparse coding, that can both enhance very faint and noisy nuclei signal but simultaneously detect nuclei position accurately. Furthermore our method is based on a limited number of parameters, with only one being critical, which is the approximate size of the objects of interest. We also show that our denoising method coupled with classical segmentation method works properly in the context of the most challenging cases. To evaluate the performance of the proposed method, we tested our method on two datasets from the cell tracking challenge. Across all datasets, the proposed method achieved satisfactory results with 96:96% recall for the C. elegans dataset. Besides, in the Drosophila dataset, our method achieved very high recall (99:3%).
Similar content being viewed by others
Introduction
In cellular and molecular biology research, automatic segmentation and tracking of biological structures, i.e. cells or their nuclei are important tasks for further understanding of cellular processes. Time-lapse fluorescence microscopy (TLFM) is one of the most appreciated imaging techniques which can be used to quantify various characteristics of cellular processes, such as cell survival1, proliferation2, migration3, and differentiation4. The quantification of these processes plays a significant role in studying embryogenesis, cancer cells, stem cells, and other applications in the fields of molecular and developmental biology.
In TLFM imaging, not only spatial information is acquired, but also temporal information as well as spectral information, that produces up to five-dimensional (X, Y, Z + Time + Channel) images. Typically, the generated datasets consist of several (hundreds or thousands) images, each containing hundreds to thousands of objects to be analysed5. These large volumes of data cannot easily be parsed and processed, via visual inspection or manual processing within any reasonable time.
Nowadays, there is a growing consensus that automated cell segmentation methods are necessary to manage the time issue and provide a level of reliability and validity. Accordingly, the implementation of automated high-throughput cell nuclei detection and segmentation techniques may be able to improve the clinical diagnosis, predict the treatment outcome, and help to enhance therapy planning.
Although TLFM imaging is a very powerful technique and can capture valuable information from cellular structures, it still poses several limitations that can be summarised as follows: 1) non-uniform background illumination because of the fluorescence in cytoplasm and mounting medium; 2) low contrast and weak boundaries of non-obvious nuclei; 3) the degradation of image intensity over time due to photo-bleaching of fluorophores5. As a result of these limitations, obtained images become very noisy and difficult to interpret, which might lead to false detection and segmentation results.
Over the last few years, several methods have been proposed for filtering and denoising of cell nuclei microscopy images. Darbon et al.6 have used a Non-Local Mean approach to reduce the noise in microscopy images. This method relied on replacing the intensity value of a pixel with the average intensity values of the most similar pixels in the image. The algorithm demonstrated good performance in enhancing particles contrast, and reducing the noise in electron cryo-microscopy images. For Transmission Electron Microscopy (TEM) images, several digital filters have been introduced by Kushwaha et al.7 such as median or Wiener filter. A similar work proposed by Sim et al.8 and Aguirre9 based on employing an adaptive Wiener filter to enhance the effectiveness of the classical Wiener filter by considering the noise variance. Luisier et al.10 have suggested a Poisson Unbiased Risk Estimation-Linear Expansion of Thresholds (PURE-LET) technique for denoising images corrupted with Poisson noise. The method is based on three criteria: 1) minimising of an unbiased estimate of Mean Square Error (MSE) for Poisson noise, 2) linear parametrisation of the denoising process 3) preserving of Poisson statistics across scales. This algorithm is particularly promising for large datasets as well as images having a low signal to noise ratio. In addition, it has limited system requirements.
Over the past few years, deep learning methods have been successfully introduced for biological data processing. For instance, Liu et al.11 suggested a convolutional encoder-decoder neural network method for denoising of cell nuclei in fluorescence microscopy images. This method based on using the stochastic characteristics of noise as well as the shape of nuclei for learning step, and then regenerating the clean nuclei image based on the learned priori knowledge. Weigert et al.12 proposed a method to enhance the axial resolution in 3D microscopy images by reconstructing isotropic 3D data from non-isotropic acquisitions using a convolutional neural network. Another approach has also been proposed by Weigert et al.13 that presented a content-aware image restoration (CARE) networks method to denoise fluorescence microscopy data. This method introduced a solution to the problem of missing training data for deep learning in fluorescence microscopy by generating training data without the need for laborious manual annotations.
For cell nuclei segmentation literature, the reported approaches are classified into two categories: simple approaches such as thresholding method14,15,16,17, edge detection18 and shape matching19,20,21, and more sophisticated approaches like region growing22,23,24,25, energy minimization26 and machine learning11,27,28,29.
In the first category, R. Bise et al.14, Arteta et al.15, Liao et al.16 and Gul-Mohammed et al.17 applied a thresholding-based approach to segment cell nuclei. This approach assumes that the cell is usually brighter than its surrounding areas and there often exists an optimal threshold where individual cells can be segmented as separate objects. This assumption is not applicable in the challenging regions, because it is impossible to find a suitable threshold to separate all touching cells. Wählby et al.18 suggested to use an edge detection approach, in which an edge filter is applied to the image and therefore pixels are classified as edge or non-edge. These edges are usually detected by the first or the second order derivative method. Nevertheless, this method fails to detect the non-obvious cell’s boundary. Cicconet et al.19 and Türetken et al.20 proposed to use a shape matching-based approach. This approach depends on the assumption that cells, particularly nuclei, have round shapes. Hence, multi-scale blob detection30 can be employed to detect and segment cells.
In the second category, Cliffe et al.22, Liu et al.23, Tonti et al.24 and Gul-Mohammed and Boudier25 employed region-based segmentation techniques in which the basic idea of these approaches is to combine the neighbouring pixels of initial seed points which have similar properties to form individual cells.
On the other hand, machine learning-based approaches can be implemented for cell segmentation. To give some examples, Ronneberger et al.27 proposed U-Net convolutional networks to segment cells by assigning class labels to every pixel in the image. A two-stage convolutional neural network method was presented by Akram et al.28 to precisely segment the cells. On the first stage, CNN is used to predict a regression on the cell bounding box. On the second stage, another CNN is employed to segment cells within the regressed bounding box. Liu et al.11 reported the use of a convolutional encoder-decoder network to segment the cell nuclei by learning stochastic characteristics of noise and shape of nuclei. Thus, segmented cell nuclei images can be generated. Moreover, Sadanandan et al.29 advised to use the Deep Convolutional Neural Networks (DCNNs) to segment cells in fluorescence microscopy images. The aforementioned filtering and segmentation methods have achieved good results in microscopy images. However, most of these methods require tuning a set of parameters. Moreover, they may be used effectively only for specific applications. As a result, identifying a proper and robust approach for various datasets regarding high variation in cell nuclei volume, shape, and stain distribution as well as high cell density has become a major challenge in image analysis.
The contribution of this paper is to design a generic method for denoising of 3D cell nuclei images based on a sparse representation model. Consequently, a classical segmentation method can be used to segment cell nuclei without the need for more complicated segmentation approaches. The concept of dictionary learning, and sparse representation is already well established by M. Elad and M. Aharon31 as well as it is implemented in different application such as image denoising32,33,34, and image classification35,36. However, the added value of the proposed approach is employing the advantage of the sparse representation to find the potential locations of nuclei in microscopy image. The novelty here is to obtain a denoised image and a detection map simultaneously. We believe that no similar studies have been reported in existing literature for denoising and simultaneously predicting objects location in images.
Results
Datasets description
Since the main motivation of our work is to automate the detection and segmentation of cell nuclei in time-lapse fluorescence microscopy images, we focused on applying our algorithm to development biology datasets, where only a limited number of existing methods had provided satisfactory results. The proposed framework is extensively tested on three real datasets for embryonic cells and one dataset of synthetic images with different values for the signal to noise ratio (SNR) and the object size. SNR is a performance measure for the sensitivity of imaging systems which is defined as the ratio of the average signal level (μsignal) to the standard deviation (σnoise) of the background noise level: SNR = μsignal/σnoise and expressed in logarithmic function as SNR(dB) = 20 × log10(μsignal/σnoise)37.
Synthetic dataset
In order to measure the robustness of the proposed method, we generated synthetic images of size (XYZ) equal (100 × 100 × 20) voxels containing spheres of two radii: 7 and 9 voxels. As it is common in fluorescence microscopy images to have low contrast and low signal to noise ratio (SNR), as a consequence of weak fluorescent staining or microscope properties, the images are distorted with different levels of Poisson-Gaussian noise, resulting in SNRs of 2 dB, −1 dB, −5 dB and −7 dB, respectively. Furthermore, the images include touching spheres where these conditions simulate the same characteristics existing in the real datasets as shown in Supplementary Fig. 1.
Real dataset
The first dataset comes from the work of Gul-Mohammed17 and it is named as CE-UPMC. The other two datasets come from the cell tracking challenge38,39, namely Fluo-N3DH-CE and Fluo-N3DL-DRO. The last two datasets are proven to be the hardest to be fully segmented automatically38. Each dataset from the cell tracking challenge contains 2 sequences. For the cell tracking challenge datasets, all pixels belonging to objects including the centroid are labelled as object by the ground truth. However, for the other dataset (i.e. CE-UPMC), only the centroid of each object is labelled. The datasets are described as follows:
CE-UPMC dataset: It involves the C. elegans embryonic cells. The size (XYZT) of dataset is 512 × 512 ×31 × 160. The cells were acquired every 1 minute using a spinning-disk confocal microscope. This dataset is very challenging, as the intensity of the images is decaying over time due to the labelling technique and acquisition system. Thus, the quality of the acquired images is low.
Fluo-N3DH-CE dataset: It includes the C. elegans embryonic cells. The size (XYZT) of the first sequence is 708 × 512 × 35 × 250 and of the second sequence is 712 × 512 × 31 × 250. Both sequences are 8–bit images with cells imaged every 1.5 minutes. The cells are acquired using a Zeiss LSM 510 Meta Confocal Microscope. This dataset is challenging as well, since it has a low signal to noise ratio (SNR = 6.74 dB), in addition, the fluorescence can fade when the cells divide. Furthermore, the cells become smaller over time.
Fluo-N3DL-DRO dataset: It contains the Drosophila melanogaster embryonic cells. The size (XYZT) of each sequence is 1272 × 603 × 125 × 49. Both sequences are 8–bit images with cells imaged every 30 second. The cells are acquired using a SIMView light-sheet microscope. This dataset is very challenging as it has a large number of densely packed cells. In addition, it has a low signal to noise (SNR = 2.46 dB).
Experimental setup and suitable parameters selection
Synthetic datasets are generated to study the effect of parameters (described in Table 1) on cell nuclei detection and segmentation, as well as to understand the overall mechanism for selecting and tuning the significant parameters of various datasets (summarised in Table 2).
Our approach is based on the building of a dictionary (small patches of the image) that will be eventually used for denoising and detection of cell nuclei. We investigated the optimal size of the patches and the number of patches (called atoms) in the dictionary. Then, the randomly created dictionary is updated, and so we investigated the number of iteration for the update. Finally, we investigated the sparsity level (i.e., the number of used atoms) for the reconstruction of the denoised image and the detection map. To start with, we tested several values for patch size (p = 5 × 5 × 5, 10 × 10 × 5, 15 × 15 × 5 and 20 × 20 × 5), dictionary size (K = 64, 128, 256 and 512), sparsity level (L = 3, 6 and 9) and number of iterations (N = 5, 10, 15, 20, 25 and 30) at different noise levels (SNR = 2, −1, −5 and −7 dB). Since the coefficient of variation (CV) is a useful statistical descriptor for comparing the degree of variation from one data series to another one. The CV is defined as the ratio of the standard deviation to the mean40. Thus, we employed the CV to measure the effect of changing the parameters on the result. The average CV from the patch size, dictionary size, sparsity level and the number of iterations over the four noise levels are approximately 15, 2, 2 and 2% respectively.
In all aforementioned parameters, we observed that the patch size is considered as a critical parameter where the change in this value has a major impact on the subsequent segmentation results as shown in Supplementary Fig. 2(a). On the contrary, changes in the other parameters i.e. dictionary size, sparsity level and the number of iterations often achieve very close results as shown in Supplementary Fig. 2(b–d). Therefore, we fixed all parameters while tuned the patch size according to the object’s size present in the images.
In order to confirm the importance of patch size tuning, we conducted more analysis in the term of cell nuclei detection as shown in Supplementary Fig. 3. For example, at the first three noise levels 2, −1 and −5 dB all cell nuclei are correctly detected for different patch size values. However, at noise level equal −7 dB, many objects are falsely detected with the patch size equivalent to p = 5 × 5 × 5 and one nucleus is not detected at patch size equal 20 × 20 × 5. Though, for patch size equal 10 × 10 × 5 and 15 × 15 × 5, all cell nuclei are correctly detected.
For all previously mentioned patch size values, the average of recall, precision, F-measure, and Jaccard index with different noise levels are presented in Supplementary Fig. 3(b), where these measures are high when the patch size values equal p = 10 × 10 × 5 and 15 × 15 × 5 compared to the measures of the other two values.
Following the above experiment, we observed that, for robust detection and segmentation results, the patch size should not be less than 25% and not more than 100% of the average cell nuclei volume in images.
Denoising of 3D cell nuclei images
In this work, a sparse representation model31,41 is employed to obtain the denoised images. Our method is compared with PURE-LET10, which is one of the most efficient, fast and automatic methods for denoising of multi-dimensional fluorescence microscopy images. The main motivation behind the need for cell nuclei denoising is assisting better segmentation of cell nuclei images. Therefore, the comparison between the denoising methods is performed in the context of improving segmentation results. For instance, the results in Supplementary Fig. 4 (first row), Supplementary Figs 6(a,c) and 7(a,c) show that our method is able to reduce, and almost remove the noise as well as enhance the contrast of cell nuclei. We have also noticed a better contrast than PURE-LET results as shown in Supplementary Fig. 4 (third row), Supplementary Figs 6(b,d) and 7(b,d).
For further assessment, thresholding-based approach is applied to the denoised images to obtain the segmentation mask. It can be noted from Supplementary Fig. 4 (second row), Supplementary Figs 6(e) and 7(e) that our method succeeded to segment all nuclei in comparison with the other method which failed to detect some nuclei as demonstrated in Supplementary Fig. 7(f). Even though, when the PURE-LET method is able to detect all cell nuclei shown in Supplementary Fig. 6(f), the size of segmented nuclei are smaller than their original size. Unfortunately, this method can not detect any cell nuclei at very low signal to noise ratios (−5 dB and −7 dB,) as presented in Supplementary Fig. 4 (fourth row).
Segmentation of 3D cell nuclei images
Following the denoising step, a local adaptive thresholding42 is applied to the denoised image to get the segmentation mask of candidates regions. In order to obtain the candidates locations of cell nuclei centres, we used a novel representation called the detection map. Each voxel in this map is computed as the summation of the patch coefficients that are used to reconstruct the denoised image. We then define a maximum response image by multiplying the denoised image with the detection map. This maximum response image is used to detect the local maxima (Fig. 1). Afterwards, the obtained local maxima are used as an input for a 3D marker-controlled watershed segmentation of the cell nuclei (Fig. 2).
For the synthetic dataset, great performance is observed on very low signal to noise ratios (2 dB, −1 dB, −5 dB and −7 dB), in which our method is capable of correctly identifying and segmenting all cell nuclei at the various noise levels as presented in Supplementary Fig. 8. Furthermore, our method has similar performance as the top-ranked KTH algorithm from the cell tracking challenge38,39, as shown in Supplementary Fig. 5.
Regarding the CE-UPMC dataset, there is an intensity decay over time owing to the labelling technique and acquisition system. As a result, the acquired image quality is low. Table 3 and Supplementary Fig. 9, illustrate the results obtained at certain time points. For instance, at early time points (40, 60, 80 and 100) all cell nuclei are correctly detected. In addition, few false positives are also detected (2 objects out of 119 cell nuclei). Even though the image quality at advanced time points (120, 140 and 160) is low, only 11 cell nuclei out of 247 are not detected, also there exists a small number of false positives (3 objects out of 247 cell nuclei which is displayed as yellow arrows in Supplementary Fig. 9(f)). Typically, the reason for missing cell nuclei is the detection of clustered cell nuclei (indicated by red arrows in Supplementary Fig. 9(f)) rather than detecting them separately. Supplementary Fig. 10 shows the segmentation results of our method and the results from the original paper17 for the CE-UPMC dataset.
In the Fluo-N3DH-CE dataset, the proposed approach is able to identify and segment correctly more than 96% of total cell nuclei. Furthermore, it detects a small number of false positives (9 objects out of 876 cell nuclei) as well as a small number of false negative (29 cell nuclei). The achieved F-measure of approximately 97.8%, which is comparable to the competing algorithm, i.e. KTH38,39 (Table 4, Supplementary Figs 11 and 12).
For the Fluo-N3DL-DRO dataset, despite our method succeeds to detect more cell nuclei (99% recall), it has low precision (3%) due to the annotated ground truth, which considered only the cell nuclei located in the early nervous system and all other nuclei are deemed as false positives. As a result, the obtained F-measure is low with an approximate value of 6.4% Table 4. Furthermore, our method has a comparable segmentation accuracy with KTH competing approach38,39 (Table 4, Supplementary Figs 13 and 14).
The results achieved by our method for the two datasets obtained from cell tracking challenge are compared with the top-ranked KTH algorithm38,39. KTH algorithm is chosen for the reason that, it presented the best overall performance in the challenge. This algorithm is mainly based on adopting the band-pass filter to detect and segment cell nuclei.
Regarding the KTH algorithm, some detected objects are actually noise and some cell nuclei are not detected. This is because the algorithm detected clustered cell nuclei instead of detecting them separately. For example, at time point 28 from Fluo-N3DH-CE (seq1) and at time point 106 from Fluo-N3DH-CE (seq2), KTH algorithm failed to resolve the fusion of two nuclei (as presented by the red arrows in Supplementary Fig. 12(b). In contrast, our method succeeds to identify and segment each nucleus individually as shown in Supplementary Fig. 12(a,c). For the Fluo-N3DL-DRO dataset, although our method succeeds to detect more cell nuclei than KTH approach (Table:4 and Supplementary Fig. 14), the evaluation method considered those cell nuclei as false positives, due to the annotation method which considered only the cell nuclei located in the early nervous system.
We have found that, the proposed method is less sensitive to some parameters such as dictionary size (K), sparsity level (L) and number of iterations (N). All these parameters are being fixed for different datasets and experiments with the subsequent values K = 64, L = 3 and N = 15. However, the proposed method is more sensitive to fundamental parameters, i.e., such as patch-size, and in a less critical manner to Sensitivity Factor and Nuclei Seed Dilation. As these parameters are easy to understand, this makes them easier to tune-up if needed. We need to stress that all parameters, except patch size, are quite robust, as we only need to use three sets of parameters for all datasets. The set of empirically determined parameter values being applied to the datasets are listed in the Table 2.
Concerning the Fluo-N3DH-CE dataset, although the average cell nuclei size in sequence (2) is slightly greater than the average size in sequence (1), we have decided to use the same parameter (i.e., patch size) for denoising of both sequences. As a result, we tuned Nuclei Seed Dilation to avoid detection of multiple local maxima for the same object as explained in section Marker points detection.
We have also presented a Supplementary Table 1 to show the detection and segmentation results among various datasets considering the patch size percentage (related to average cell nuclei volume).
In order to test the genericity of the algorithm, we have conducted additional experiments on real datasets coming from various tissues such as thymus tissue (provided by J. Sheridan, Walter and Eliza Hall Institute of Medical Research (WEHI)), lymphoid tissue (provided by JR. Groom, WEHI), and islets of Langerhans tissue (from Tran et al.43), where robust cell segmentation is still challenging. Despite the noisy and crowded environment, the obtained results from our method are quite encouraging as presented in Supplementary Fig. 15.
Methods
This section introduces a novel method for denoising and detection of cell nuclei in 3D TLFM images based on a sparse representation approach31,41. The use of sparse signal representation is becoming popular in several fields such as face recognition44, image denoising32,33,34 and inpainting45, and image classification35,36. Indeed, natural images represent very sparse data, especially in biology where numerous instances of the same structure, i.e. cell or nucleus, are present in the image. Moreover, a dictionary-based approach is usually linked to unsupervised learning since the data itself can be used to learn the basis vectors to build a sparse representation matrix.
The sparse representation method (shown in Fig. 3) is implemented as described by M. Elad and M. Aharon31, we have only changed the construction of the initial dictionary as depicted in the following steps. Firstly, the patches are extracted by moving a window with a step size of one pixel over the raw image. For each extracted patch, pixels’ intensities are summed up. Then, the average intensity over all patches is calculated. Secondly, an initial dictionary is constructed by selecting random patches from extracted patches among those having intensities greater than the obtained average intensity. By doing that, we are ensuring the presence of cell nuclei patches in the initial dictionary. Thirdly, a technique based on K-clustering with singular value decomposition (K-SVD)41 is implemented to update and obtain the final dictionary. Fourthly, the updated dictionary is used to reconstruct the denoised image as well as the detection map that will be used for detection of cell nuclei.
In the cell nuclei segmentation stage, the maximum response image, which is obtained by multiplying the denoised image with the detection map is used to detect the potential location of cell nuclei. Then, a thresholding-based approach42,46 is proposed to get the segmentation mask. Finally, a marker-controlled watershed approach47 is used to obtain the final cell nuclei segmentation result.
An introduction to sparse representation
The idea of sparse representation is to obtain an efficient representation of a signal as a linear combination of few atoms chosen from a dictionary. Given a dictionary D ∈ Rn×K that contains K atoms as column vectors \({d}_{j}\in {R}^{n},j=\mathrm{1,}\,\mathrm{2,}\ldots ,K\). The sparse representation problem of a signal y ∈ Rn can be described as finding the sparsest vector α ∈ RK where \(y\simeq D\alpha \). The problem can be formulated as an energy optimization problem as follows:
Where y is the signal, α denotes the sparse representation coefficients, \({\Vert \alpha \Vert }_{0}\) is the L0 pseudo-norm that counts the number of non-zeros of α and L is a predetermined sparsity threshold.
Solving the previous optimisation problem is NP-hard and numerically intractable, thus several methods have been developed to get an approximate solution for this particular problem. The first type of methods uses L0–norm minimisation, such as matching pursuit (MP)48 or orthogonal matching pursuit (OMP)49. The second type of method uses L1–norm for optimisation. The objective of L1–norm is to make the optimisation problem convex, which can be addressed efficiently using basis pursuit (BP)50.
The crucial issue in practical applications is to select the dictionary D. Basically, dictionaries are of two types: (1) fixed dictionaries, (2) adaptive dictionaries. The fixed dictionary as is the case for curvelet, discrete cosine, wavelet, ridgelet, or bandlet which use pre-defined and fixed atoms. This type of dictionary may not ensure a well-defined representation of all given signals. As a result, it is more appealing to use an adaptive dictionary approach to learn the dictionary directly from the data itself.
Learning the dictionary requires twosteps, the first step is to compute an initial dictionary. It is usually computed by taking random patches directly from the raw image. These patches are overlapped with a step size of one pixel. To ensure the presence of patches containing nuclei in the initial dictionary beside background patches, we select patches among those having intensity greater than the average intensity of all patches extracted from the image. The second step is to update the initial dictionary by using the K-SVD algorithm41. This algorithm is a standard unsupervised adaptive dictionary learning algorithm that generalizes the well-known K-means clustering approach. It jointly learns a dictionary \(D=[{d}_{1},{d}_{2},\,\ldots ,\,{d}_{K}]\), dj ∈ Rn, j = 1, 2, …, K and a related sparse representation matrix \(\alpha =[{\alpha }_{1},{\alpha }_{2},\,\ldots ,\,{\alpha }_{m}]\), αi ∈ RK from a set of training signals \(Y=[{y}_{1},{y}_{2},\,\ldots ,\,{y}_{m}]\), where each yi ∈ Rn by solving the following problem:
This technique solves the optimization problem by alternating between finding the sparse representation coefficients α and the dictionary D using an iterative approach. Assuming that D is known, the best sparse representation matrix is constructed by solving Eq. 2 using an orthogonal matching pursuit algorithm (OMP). Following the sparse representation stage, the representation vectors (α) are assumed to be fixed. Subsequently, the best dictionary is computed. Since finding the whole dictionary at the same time is impractical, the dictionary is updated atom by atom. Once the best dictionary and sparse representation coefficients are obtained, the denoised image and detection can be constructed.
Images with sparse representation
In this section, we present the reconstruction of the denoised image and detection map which will be used later in the detection and segmentation of cell nuclei
-
Denoised image reconstruction. The denoised image with dictionary learning is formed by solving the following problem:
$$E(X)=\mathop{{\rm{a}}{\rm{r}}{\rm{g}}{\rm{m}}{\rm{i}}{\rm{n}}}\limits_{X,\alpha ,D}\,\lambda \,{\Vert X-Y\Vert }_{2}^{2}+{\Vert X-D\alpha \Vert }_{2}^{2}\,s\mathrm{.}t\mathrm{.}\,{\Vert \alpha \Vert }_{0}\le L$$(3)Where X is the denoised reconstructed image, Y is the noisy image and λ is the regularisation parameter.
-
Detection map reconstruction. In point of fact, the denoised image does not have sufficient contrast to completely separate touching nuclei. In order to improve cell nuclei detection, a detection map image that indicates the potential locations of cell nuclei will be built. The construction of this image is based on the computation of the sparse coefficients (αi) of each image patch. It can be obtained by:
Where, C is a normalisation term, Pi is the probability value corresponding to the i–th patch and \({\alpha }_{i}^{k}\) denotes the k–th element of αi. Notably, voxels within the centre of the nucleus have very high sparse coefficients values, in contrast to voxels far away from the centre having low values. Consequently, the pi value of the patches containing nuclei tend to be large compared with the pi value of background patches. As a result, dictionary learning technique with sparse representation can capture strong structures of biological images as well as restrain the noise.
Cell nuclei segmentation
-
Initial cell nuclei segmentation. A local adaptive thresholding approach42 is applied to the denoised image. The general concept of the algorithm is that for every image’s voxel the threshold is determined by the following equation:
$${{\rm{T}}}_{{\rm{local}}}={{\rm{mean}}}_{{\rm{local}}}\,\ast \,\mathrm{(1}-{\rm{Sensitivity}}\,{\rm{Factor}})$$(5)where, meanlocal is the mean intensity value in the neighbourhood of each voxel and the Sensitivity Factor is a scalar value within a range from zero to one which controls sensitivity towards thresholding more voxels as foreground. Accordingly, voxels with intensity values larger than Tlocal are set to 1, all others are set to 0. Small regions detected as foreground and smaller than a predefined volume denoted by Min Nuclei Volume are discarded. This threshold corresponds to the volume of the smallest cell nucleus and is determined prior to the segmentation step. The resulting image is called the segmentation mask.
-
Marker points detection. For splitting of touching cell nuclei, we employed a marker-controlled watershed technique. The marker points are obtained as follow: first, the denoised image is multiplied by the detection map to provide a maximum response image. Second, The maximum response image is processed to detect the local maxima (Fig. 4). The obtained local maxima image is multiplied by the segmentation mask to discard local maxima detected in the background. Third, a morphological dilation operator of certain radius denoted by Nuclei Seed Dilation is employed to avoid detection of multiple local maxima for the same object by merging those maxima that were in close proximity to each other. Finally, the modified image determining the marker points is fed to the subsequent watershed algorithm.
-
3D marker-controlled watershed segmentation. Marker-controlled watershed segmentation is presented to separate connected cell nuclei clusters. The basic principle of watershed approach is to flood the denoised image, which contains merged objects starting at marker points as sources. Sometimes the flooding process is not stopped at the border of a cell nucleus, therefore the denoised image is multiplied by the segmentation mask prior to the flood. Eventually, watershed dams are built when different sources meet during the flooding process. This approach allows splitting clusters of apparently touching cell nuclei.
Evaluation method and metrics
To assess the performance of the proposed algorithm, three metrics are employed. The first two metrics are the recall51 and precision51 of object detection. The recall is the proportion of the number of relevant detected cell nuclei to the total number of relevant cell nuclei in ground truth. Precision is the proportion of the number of relevant detected cell nuclei to the total number of irrelevant and relevant detected cell nuclei. These parameters are defined as follows:
where, True Positive (TP) represents the total number of correctly detected nuclei, False Negative (FN) represents the number of undetected nuclei and False Positive (FP) represents the number of falsely detected nuclei. To compute these values, we used the following steps: first, we calculated the distance between the centroids of ground truth nuclei and centroids of segmented objects. Second, a weight is assigned to each pair of segmented and ground truth objects, equal to the distance between them. Third, Hungarian algorithm is used to solve this assignment problem. Objects with no match to any other object are considered as FP, objects absent in ground truth, but they appear in the segmentation result are deemed FP, and FN were objects absent in the segmentation result despite these objects appear in ground truth.
The third metric is the Jaccard index52 that measures the segmentation accuracy of the segmented objects. The Jaccard index for each set of segmented (A) and ground truth (B) objects is defined as the intersection between them divided by their union.
The final measure is then the average of the Jaccard indices of matched pairs.
Implementation details
The image analysis framework is developed using MATLAB (R2017b) on a Windows-based computer (Intel Core i 7, 3.07 GHz, and 16 GB RAM). Furthermore, the 3D ImageJ viewer plugin53 along with tools from the 3D ImageJ suite54 are used for three-dimensional visualisation of the final segmentation result. The source code, as well as datasets, are available upon request. Data processing using the complete framework took 5 mins for the synthetic dataset, 11 mins for the CE-UPMC dataset, 35 mins for Fluo-N3DH-CE dataset and 48 mins for Fluo-N3DL-DRO dataset to process only one time point of 3D image from the complete dataset. Regarding cell tracking challenge, the web site (http://www.codesolorzano.com/celltrackingchallenge) provides access to the datasets with the ground truth. In addition, it provides access to Windows and Linux executable files for the evaluation software as well as an executable program that includes the process description for KTH work.
Conclusion
In this paper, we have presented a novel generic method for the denoising and detection of 3D cell nuclei in 3D time-lapse fluorescence microscopy images, based on a sparse representation model. We showed significant improvements over other denoising methods, and consequently, classical methods can be used for segmentation. We suggested, as to propose a complete workflow for denoising, detection and segmentation, to pair our denoising algorithm with a rather classical local thresholding method and showed that we obtained similar or better results than state of the art algorithm. We observed than our denoising algorithm is performing extremely well for very noisy data and can hence help to detect very faint or previously undetectable nuclei. As the strength of our workflow is the denoising part, not so much the segmentation part, we observed (data not shown) little improvements for non-noisy data.
Over the last few years, deep learning approaches have achieved promising results in several domains, including denoising. However, they have some limits, for example, they are implemented to solve a specific problem, i.e., any new dataset will require a new training step. Furthermore, deep learning approaches required ground truth labels. Similarly to machine learning approaches, including deep learning, our method is based on a learning technique, but in dictionnary-based methods the learning is completely unsupervised and hence can be performed for any new data without any change.
We also showed that our algorithm can also lead to accurate detection of nuclei centroids, we coupled this detection to a classical segmentation method and showed very good results on challenging datasets. We believe than coupling our algorithm with a more powerful segmentation method may lead to even better results, but this was not the purpose of this article. We focused on a robust and powerful denoising method coupled with classical segmentation to provide a effective workflow with minimal tuning. The fundamental parameters which have a more noticeable impact on the result are patch-size, Sensitivity Factor, and Nuclei Seed Dilation. All of these parameters are based on the average of cell nuclei volume present in the image. As these fundamental parameters are easy to understand, this makes them easier to tune-up if needed.
The obtained final segmentation results are quite good and stable. In addition, the training step is unsupervised and the dictionary can be directly learned from the image itself. We believe that no similar studies have been reported in existing literature for denoising and simultaneously predicting objects location in images. As a future work, we will investigate an online learning method to handle the time issue to reduce the processing time needed for dictionary learning.
The proposed method can handle the most challenging cases involving noisy, densely packed and multiple touching cell nuclei. In addition, it can produce the denoised image and simultaneously the potential locations of cell nuclei. The proposed method is adapted to the segmentation of cell nuclei in 3D time-lapse fluorescence microscopy images, nevertheless, it can be employed to detect and segment the nearly interacting intracellular organelles, including the endosomes, lysosomes, and lipid droplets. Our method is successfully evaluated on two embryo models, the C. elegans, and the Drosophila datasets. The overall detection and segmentation results are comparable to the existing methods, which is a good starting point for automated cell nuclei tracking process.
References
Payne, S. L. et al. In vitro maturation of human ipsc-derived neuroepithelial cells influences transplant survival in the stroke-injured rat brain. Tissue Engineering Part A 24, 351–360 (2018).
Rapoport, D. H., Becker, T., Mamlouk, A. M., Schicktanz, S. & Kruse, C. A novel validation algorithm allows for automated cell tracking and the extraction of biologically meaningful parameters. PloS one 6, e27315 (2011).
Bise, R., Kanade, T., Yin, Z. & Huh, S.-i. Automatic cell tracking applied to analysis of cell migration in wound healing assay. In Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE, 6174–6179 (IEEE, 2011).
Zhang, J., Moradi, E., Somekh, M. G. & Mather, M. L. Label-free, high resolution, multi-modal light microscopy for discrimination of live stem cell differentiation status. Scientific reports 8, 697 (2018).
Meijering, E., Smal, I., Dzyubachyk, O. & Olivo-Marin, J.-C. Time-lapse imaging. Microscope Image Processing 401–440 (2008).
Darbon, J., Cunha, A., Chan, T. F., Osher, S. & Jensen, G. J. Fast nonlocal filtering applied to electron cryomicroscopy. In 2008 5th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 1331–1334 (2008).
Kushwaha, H. S., Tanwar, S., Rathore, K. S. & Srivastava, S. Denoising filters for TEM (transmission electron microscopy) image of nanomaterials. In 2012 Second International Conference on Advanced Computing Communication Technologies, 276–281 (2012).
Sim, K. S., Teh, V. & Nia, M. E. Adaptive noise wiener filter for scanning electron microscope imaging system. Scanning 38, 148–163 (2016).
Aguirre, P. Image processing of microscopic cellular samples. In 2012 16th IEEE Mediterranean Electrotechnical Conference, 366–369 (2012).
Luisier, F., Vonesch, C., Blu, T. & Unser, M. Fast interscale wavelet denoising of poisson-corrupted images. Signal Processing 90, 415–427 (2010).
Liu, Z. et al. Nucleinet: A convolutional encoder-decoder network for bio-image denoising. In Engineering in Medicine and Biology Society (EMBC), 2017 39th Annual International Conference of the IEEE, 1986–1989 (IEEE, 2017).
Weigert, M., Royer, L., Jug, F. & Myers, G. Isotropic reconstruction of 3d fluorescence microscopy images using convolutional neural networks. In International Conference on Medical Image Computing and Computer-Assisted Intervention, 126–134 (Springer, 2017).
Weigert, M. et al. Content-aware image restoration: pushing the limits of fluorescence microscopy. bioRxiv 236463 (2017).
Bise, R. & Sato, Y. Cell detection from redundant candidate regions under nonoverlapping constraints. IEEE Transactions on Medical Imaging 34, 1417–1427 (2015).
Arteta, C., Lempitsky, V. S., Noble, J. A. & Zisserman, A. Detecting overlapping instances in microscopy images using extremal region trees. Medical image analysis 27, 3–16 (2016).
Liao, M. et al. Automatic segmentation for cell images based on bottleneck detection and ellipse fitting. Neurocomputing 173, 615–622 (2016).
Gul-Mohammed, J., Arganda-Carreras, I., Andrey, P., Galy, V. & Boudier, T. A generic classification-based method for segmentation of nuclei in 3D images of early embryos. BMC Bioinformatics 15, 9 (2014).
Wählby, C., Sintorn, I.-M., Erlandsson, F., Borgefors, G. & Bengttsson, E. Combining intensity, edge and shape information for 2D and 3D segmentation of cell nuclei in tissue sections. Journal of Microscopy 215, 67–76 (2004).
Cicconet, M., Geiger, D. & Gunsalus, K. C. Wavelet-based circular hough transform and its application in embryo development analysis. In VISAPP (2013).
Türetken, E., Wang, X., Becker, C. J., Haubold, C. & Fua, P. Network flow integer programming to track elliptical cells in time-lapse sequences. IEEE Transactions on Medical Imaging 36, 942–951 (2017).
Türetken, E., Wang, X., Becker, C. J. & Fua, P. Detecting and tracking cells using network flow programming. CoRR abs/1501.05499 (2015).
Cliffe, A. et al. Quantitative 3D analysis of complex single border cell behaviors in coordinated collective cell migration. Nature Communications 8, 14905 (2017).
Liu, K. et al. Optical flow guided cell segmentation and tracking in developing tissue. In 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI), 298–301 (2014).
Tonti, S., Cataldo, S. D., Bottino, A. & Ficarra, E. An automated approach to the segmentation of hep-2 cells for the indirect immunofluorescence ana test. Computerized Medical Imaging and Graphics 40, 62–69 (2015).
Gul-Mohammed, J. & Boudier, T. Classified region growing for 3D segmentation of packed nuclei. In Biomedical Imaging (ISBI), 2014 IEEE 11th International Symposium on, 842–845 (IEEE, 2014).
Tarnawski, W. et al. A robust algorithm for segmenting and tracking clustered cells in time-lapse fluorescent microscopy. IEEE Journal of Biomedical and Health Informatics 17, 862–869 (2013).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Navab, N., Hornegger, J., Wells, W. M. & Frangi, A. F. (eds) Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015, 234–241 (Springer International Publishing, Cham, 2015).
Akram, S. U., Kannala, J., Eklund, L. & Heikkilä, J. Cell segmentation proposal network for microscopy image analysis. In Carneiro, G. et al. (eds.) Deep Learning and Data Labeling for Medical Applications, 21–29 (Springer International Publishing, Cham, 2016).
Sadanandan, S. K., Ranefall, P., Le Guyader, S. & Wählby, C. Automated training of deep convolutional neural networks for cell segmentation. Scientific reports 7, 7860 (2017).
Akram, S. U., Kannala, J., Eklund, L. & Heikkilä, J. Joint cell segmentation and tracking using cell proposals. In 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI), 920–924 (2016).
Elad, M. & Aharon, M. Image denoising via sparse and redundant representations over learned dictionaries. IEEE Transactions on Image Processing 15, 3736–3745 (2006).
Nasser, L., Coronado, P., David, E. & Boudier, T. A dictionary-based approach to reduce noise in fluorescent microscopy images. In Signal and Image Processing (ICSIP), 2017 IEEE 2nd International Conference on, 144–148 (IEEE, 2017).
Qiu, Z., Yang, L. & Lu, W. A new feature-preserving nonlinear anisotropic diffusion for denoising images containing blobs and ridges. Pattern Recognition Letters 33, 319–330 (2012).
Boulanger, J. et al. Patch-based nonlocal functional for denoising fluorescence microscopy image sequences. IEEE transactions on medical imaging 29, 442–454 (2010).
Alegro, M. et al. Automating cell detection and classification in human brain fluorescent microscopy images using dictionary learning and sparse coding. Journal of neuroscience methods 282, 20–33 (2017).
Theodorakopoulos, I., Kastaniotis, D., Economou, G. & Fotopoulos, S. Hep-2 cells classification via sparse representation of textural features fused into dissimilarity space. Pattern Recognition 47, 2367–2378 (2014).
Johnson, D. H. Signal-to-noise ratio. Scholarpedia 1, 2088 (2006).
Ulman, V. et al. An objective comparison of cell-tracking algorithms. Nature Methods (2017).
Maška, M. et al. A benchmark for comparison of cell tracking algorithms. Bioinformatics 30, 1609–1617 (2014).
Kesteven, G. The coefficient of variation. Nature 158, 520 (1946).
Aharon, M., Elad, M. & Bruckstein, A. k -svd: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing 54, 4311–4322 (2006).
Bradley, D. & Roth, G. Adaptive thresholding using the integral image. Journal of Graphics Tools 12, 13–21 (2007).
Nhu, H. T. T., Drigo, R. A. E., Berggren, P.-O. & Boudier, T. A novel toolbox to investigate tissue spatial organization applied to the study of the islets of langerhans. Scientific reports 7, 44261 (2017).
Adamo, A., Grossi, G., Lanzarotti, R. & Lin, J. Robust face recognition using sparse representation in lda space. Machine Vision and Applications 26, 837–847 (2015).
Ogawa, T. & Haseyama, M. Image inpainting based on sparse representations with a perceptual metric. EURASIP Journal on Advances in Signal Processing 2013, 179 (2013).
Otsu, N. A threshold selection method from gray-level histograms. IEEE transactions on systems, man, and cybernetics 9, 62–66 (1979).
Parvati, K., Rao, P. & Mariya Das, M. Image segmentation using gray-scale morphology and marker-controlled watershed transformation. Discrete Dynamics in Nature and Society 2008 (2008).
Mallat, S. G. & Zhang, Z. Matching pursuits with time-frequency dictionaries. IEEE Transactions on Signal Processing 41, 3397–3415 (1993).
Mallat, S. & Zhang, Z. Adaptive time-frequency transform. In 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, 241–244 vol.3 (1993).
Chen, S. S., Donoho, D. L. & Saunders, M. A. Atomic decomposition by basis pursuit. SIAM Journal on Scientific Computing 20, 33–61 (1998).
Powers, D. M. Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation (2011).
Rohlfing, T. Image similarity and tissue overlaps as surrogates for image registration accuracy: widely used but unreliable. IEEE transactions on medical imaging 31, 153–163 (2012).
Schmid, B., Schindelin, J., Cardona, A., Longair, M. & Heisenberg, M. A high-level 3D visualization api for java and Image. J. BMC bioinformatics 11, 274 (2010).
Ollion, J., Cochennec, J., Loll, F., Escude, C. & Boudier, T. Tango: a generic tool for high-throughput 3D image analysis for studying nuclear organization. Bioinformatics 29, (1840–1841 (2013).
Author information
Authors and Affiliations
Contributions
L.N. wrote the manuscript text, analysed the data and developed the proposed algorithm; T.B. contributed to the conception and reviewed the manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nasser, L., Boudier, T. A novel generic dictionary-based denoising method for improving noisy and densely packed nuclei segmentation in 3D time-lapse fluorescence microscopy images. Sci Rep 9, 5654 (2019). https://doi.org/10.1038/s41598-019-41683-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-41683-3
This article is cited by
-
Comparison of particle image velocimetry and the underlying agents dynamics in collectively moving self propelled particles
Scientific Reports (2023)
-
Detection and Spatiotemporal Analysis of In-vitro 3D Migratory Triple-Negative Breast Cancer Cells
Annals of Biomedical Engineering (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.