Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Improved small blob detection in 3D images using jointly constrained deep learning and Hessian analysis

## Abstract

Imaging biomarkers are being rapidly developed for early diagnosis and staging of disease. The development of these biomarkers requires advances in both image acquisition and analysis. Detecting and segmenting objects from images are often the first steps in quantitative measurement of these biomarkers. The challenges of detecting objects in images, particularly small objects known as blobs, include low image resolution, image noise and overlap between the blobs. The Difference of Gaussian (DoG) detector has been used to overcome these challenges in blob detection. However, the DoG detector is susceptible to over-detection and must be refined for robust, reproducible detection in a wide range of medical images. In this research, we propose a joint constraint blob detector from U-Net, a deep learning model, and Hessian analysis, to overcome these problems and identify true blobs from noisy medical images. We evaluate this approach, UH-DoG, using a public 2D fluorescent dataset for cell nucleus detection and a 3D kidney magnetic resonance imaging dataset for glomerulus detection. We then compare this approach to methods in the literature. While comparable to the other four comparing methods on recall, the UH-DoG outperforms them on both precision and F-score.

## Introduction

There is great interest in tailoring diagnostic and therapeutic tools to individual patients. This concept reflects the growing recognition that there is significant variability between individuals. As therapies focus on molecular targets, diagnostic medical imaging tools must reveal focal pathologies and the effects of therapy in each patient. High-resolution object detection and image segmentation are thus critical to obtaining meaningful data in a heterogeneous image.

In image analysis, detection is used to identify objects such as organs and tumors, and segmentation is used to isolate the objects from an image. While large objects can often be automatically or semi-automatically isolated, small objects (blobs) are difficult to detect and segment. Blobs can range in size and location in images. Examples of blobs include cells or cell nuclei in images from optical microscopy1, exudative lesions in images of the retina2, breast lesions in ultrasound images3, and glomeruli in magnetic resonance (MR) images of the kidney4,5,6. Major challenges to detecting these blobs include low image resolution and high image noise. The small blobs are often numerous and can overlap each other. Many approaches have been proposed for blob detection7,8,9 of which intensity thresholding is among the most common10. Intensity thresholding assumes that the blobs have consistently different intensities from the background. Global differences can be addressed with a fixed threshold and local differences can be addressed with an adaptive threshold11,12. However, the assumptions required for consistent thresholding are often violated, and thresholding alone can lead to erroneous detection or segmentation. To address this, researchers have proposed multi-step pipelines13,14 in which thresholding is only the first step. Intensity-based features are then derived using filters for improved detection. One popular class of filters is based on mathematical morphology15,16. Operators such as erosion, dilation, opening and closing allow geometrical and topological properties of objects. This approach often begins with selected seed points in the image and iteratively adds connected points to form labeled regions. Mathematical morphology is preferred when the blobs are relatively large in size and small in number. Weaknesses of this approach include the tendency to under-segment and diminished performance in the presence of noise. Under-segmentation occurs when multiple blobs within close proximity are detected as one, resulting in an erroneously low detected number. Another type of filter is based on space transformation. For example, Radial-Symmetry17, a point detector for small blobs, uses radially symmetric space as a transformation space to detect radially symmetric blobs. SIFT18, SURF19 and BRISK20 are region detectors. Each of the region detectors extracts scale invariant features to detect small objects but may suffer from poor performance in optical imaging21. Recently, the Laplacian of Gaussian (LoG) detector22,23, from scale space theory, has attracted attention in blob detection8,24. Similar to the radially symmetric detector, the LoG detector is unreliable in detecting rotationally asymmetric blobs. To solve this, LoG extensions have been proposed, including the Difference of Gaussian (DoG)18,25,26,27 and the Generalized Laplacian of Gaussian (gLoG)21. While each approach detects small blobs to some extent, non-blob objects are detected as false blob candidates resulting in over-detection. A post-pruning procedure can remove false blob candidates, but results have been inconsistent28.

Here we focus on detecting individual glomeruli in MR images of the kidney as a specific blob detection problem. To date, most biomarkers of kidney pathology have come from histology using destructive techniques that estimate glomerular number29,30,31. A non-destructive imaging approach to measuring nephron endowment provides a new marker for renal health and susceptibility to kidney disease. Cationic ferritin enhanced MRI (CFE-MRI) enables the detection of glomeruli in animals32,33 and in human kidneys5,32,33,34. Because each glomerulus is associated with a nephron, CFE-MRI may provide an important imaging marker to detect changes in the number of nephrons and susceptibility to renal and cardiovascular disease5. Glomerulus detection by CFE-MRI presents difficulties because glomeruli are small and have a spatial frequency similar to image noise. Zhang et al. developed the Hessian-based Laplacian of Gaussian (HLoG) detector1 and the Hessian-based Difference of Gaussian (HDoG) detector28 to automatically detect glomeruli in CFE-MR images. They employed the LoG or DoG to smooth the images, followed by Hessian analysis of each voxel for pre-segmentation. Since LoG and DoG suffer from over-detection, a Variational Bayesian Gaussian Mixture Model (VBGMM) was implemented as a final step. LoG and DoG were the first two detectors applied to MR images of the kidney to identify glomeruli. However, deriving Hessian-based features from each blob candidate is computationally expensive, limiting high-throughput studies. In addition, unsupervised learning using the VBGMM in the post-pruning procedure requires a number of carefully tuned parameters for optimal clustering. Here we propose a new approach, termed UH-DoG, which applies joint constraints from spatial probability maps derived from U-Net, a deep learning model, and Hessian convexity maps derived from Hessian analysis on the DoG detector. The theoretical foundation of Hessian analysis guarantees that pre-segmentation will recognize all true convex blobs and some non-blob convex objects, resulting in a blob superset. Joining probability maps allows us to distinguish true blobs from the superset. The joint-constraint extension of the detector requires no post-pruning and thus is robust, generalizable and computationally efficient.

Within the field of deep learning, the Convolutional Neural Network (CNN) has been successfully implemented in medical imaging applications ranging from object detection and segmentation to classification35,36,37. The first generation of CNN models was used to classify images through fully connected layers. Shelhamer et al.38 first proposed a Fully Convolutional Network (FCN) that transfers the fully connected layers to deconvolutional layers and provides a dense class map with arbitrarily-sized input image. The FCN changes “image-label” mapping to “pixel/voxel-label” mapping for object detection and image segmentation. One limitation of the FCN for medical imaging is the need for large training datasets. A lightly weighted FCN model, the U-Net39, employs a modified FCN architecture to require fewer training images but yield precise, fast segmentation. U-Net has been implemented in various medical segmentation tasks such as nucleus, cell, and breast lesion segmentation40,41,42, all drawn from limited datasets. The U-Net yields a probability map where each pixel or voxel indicates the likelihood of being within the imaging object. However, based on our previous study43, U-Net does not reliably separate glomeruli within close proximity. Therefore, we choose to adopt the probability map as part of UH-DoG in conjunction with Hessian analysis for glomerulus detection from CFE-MR images.

There are three main advantages of the UH-DoG method. First, a global blob likelihood constraint from the U-Net probability map reduces over-detection by DoG. Second, a local convex constraint from the Hessian convexity map reduces under-segmentation. Third, integrating the probability map constraint with the Hessian convexity map eliminates the need for post-pruning. To validate the performance of UH-DoG, four methods were chosen from the literature: HLoG1, gLoG21, LoG22, and Radial-Symmetry17. We tested these on dataset of 2D fluorescent images (n = 200) where the locations of blobs were known. UH-DoG outperformed the other four methods in F-score and performed comparably to the other four methods in recall. Next, we compared blob detection of these methods on a 3D kidney MR dataset against the HDoG method. The differences between UH-DoG and HDoG were negligible but the average computation time of UH-DoG was 35% shorter than that of HDoG.

## Methods

We propose UH-DoG, a joint constraint-based detector for glomeruli detection. UH-DoG consists of three steps (Fig. 1). Step 1 is to use the Difference of Gaussian (DoG) to smooth the images, followed by Hessian analysis to identify possible blob candidates based on local convexity. Step 2 is to use a trained U-Net to generate a probability map, which captures the most likely blob locations. Step 3 is to combine the probability map from Step 2 with blob candidates from Step 1 as joint constraints to identify true blobs. Each step is discussed in detail in the following sections.

### Hessian analysis and hessian convexity map

Before implementing Hessian analysis, DoG is used to smooth the images. By employing a convolution operator, DoG can filter image noise and enhance objects at the selected scale24. DoG is a fast approximation of the LoG filter to highlight blob structure4 and is thus computationally efficient18.

Let a 3D image be $$f:{R}^{3}\to R.\,$$The scale-space representation $$L(x,y,z;\sigma )$$ at point (x, y, z), with scale parameter σ, is the convolution of image f(x, y, z) with the Gaussian kernel $$G(x,y,z;\sigma )$$:

$$L(x,y,z;\sigma )=\,G(x,y,z;\sigma )\ast f(x,y,z),$$
(1)

where * is the convolution operator and the Gaussian kernel $$G(x,y,z;\sigma )=\,\frac{1}{{(2\pi {\sigma }^{2})}^{\frac{3}{2}}}{e}^{-\frac{({x}^{2}+{y}^{2}+{z}^{2})}{2{\sigma }^{2}}}$$. The Laplacian of $$L(x,y,z;\sigma )$$ is:

$${\nabla }^{2}L(x,y,z;\sigma )=\frac{{\partial }^{2}L(x,y,z;\sigma )}{\partial {x}^{2}}+\frac{{\partial }^{2}L(x,y,z;\sigma )}{\partial {y}^{2}}+\frac{{\partial }^{2}L(x,y,z;\sigma )}{\partial {z}^{2}}.$$
(2)

According to18, $${\rm{\sigma }}{\nabla }^{2}L(x,y,z;\sigma )={L}_{\sigma }(x,y,z;\sigma ).\,$$We approximate the partial derivative $${L}_{\sigma }(x,y,z;\sigma )$$ by a one-sided difference quotient, the DoG approximation of LoG is:

$$\begin{array}{rcl}{\nabla }^{2}L(x,y,z;\sigma ) & = & \frac{{L}_{\sigma }(x,y,z;\sigma )\,}{{\rm{\sigma }}}\approx \frac{L(x,y,z;\sigma +\Delta \sigma )-L(x,y,z;\sigma )}{{\rm{\sigma }}\Delta \sigma }\\ & = & f(x,y,z)\ast \frac{(G(x,y,z;\sigma +\Delta \sigma )-G(x,y,z;\sigma ))}{{\rm{\sigma }}\Delta \sigma }\end{array}$$
(3)

To locate an optimum scale for the blobs, similar to1, we add γ-normalization to form the normalized DoG detector $${\sigma }^{\gamma }{\nabla }^{2}L(x,y,z;\sigma )$$, which is:

$$Do{G}_{nor}(x,y,z;\sigma )={\sigma }^{\gamma -1}f(x,y,z)\ast \frac{(G(x,y,z;\sigma +\Delta \sigma )-G(x,y,z;\sigma ))}{\Delta \sigma },$$
(4)

where γis introduced to automatically determine the optimum scale for the blobs. We set γ to 2 here. For details on tuning γ, refer to1. The normalized DoG transformation underlies Hessian-based convexity analysis to detect blobs.

After the image is smoothed by the normalized DoG, for a voxel (x, y, z) in the normalized DoG image $$Do{G}_{nor}(x,y,z;\sigma )$$ at scale σ, the Hessian matrix for this voxel is:

$$H(Do{G}_{nor}(x,y,z;\sigma ))=[\begin{array}{ccc}\frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial {x}^{2}} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial x\partial y} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial x\partial z}\\ \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial x\partial y} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial {y}^{2}} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial y\partial z}\\ \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial x\partial z} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial y\partial z} & \frac{{\partial }^{2}Do{G}_{nor}(x,y,z;\sigma )}{\partial {z}^{2}}\end{array}].$$
(5)

In a normalized DoG-transformed 3D image, each voxel of a transformed bright blob has a negative definite Hessian matrix28. We define a binary indicator matrix, $$HI(x,y,z;\sigma )$$, termed the Hessian convexity map. $$HI(x,y,z;\sigma )=1$$ when $$H(Do{G}_{nor}(x,y,z;\sigma ))$$ is negative definite; otherwise, $$HI(x,y,z;\sigma )=0$$.

To determine a single optimum scale σ*, the maximum value of the normalized DoG is used here28. Let the average DoG value per blob candidate voxel measure BGoG be:

$${B}_{DoG}(\sigma )=\frac{{\sum }_{(x,y,z)}DoG(x,y,z)HI(x,y,z;\sigma )}{{\sum }_{(x,y,z)}HI(x,y,z;\sigma )}.$$
(6)

We have $${\sigma }^{\ast }=argmax\,{B}_{DoG}(\sigma )$$. σ* is used to generate the optimum Hessian convexity map $$HI(x,y,z;{\sigma }^{\ast })$$. This map is the local convexity constraint for detecting the convex blob regions. Result is a set of convex objects including all true blobs and some non-blob convex objects.

### U-Net and Probability Map

A classical CNN usually consists of multiple convolutional layers followed by pooling layers, activation layers, and fully connected layers. Convolutional layers learn hierarchical and high-level feature representation. Pooling layers can reduce feature dimensions and capture spatial feature invariance. The final fully-connected layers categorize the images into different groups. Compared to classical CNNs, FCNs replace all fully-connected layers with a fully convolutional layer. There are several advantages to this approach. First, the input image size can be arbitrary because all models consist of convolution layers, so output size only depends on input size. Second, the FCN can be trained from whole images without patch sampling, thus the effects of patch-wise training need not be considered. However, FCNs require a large dataset for training. To address this issue, a modified FCN model, the U-Net, was proposed. Interested readers are referred to U-Net39 for details. The output of U-Net is a probability map in [0, 1].

In U-Net, let the input images be $$X\in {R}^{{I}_{1}\times {I}_{2}\times {I}_{3}}$$, and the output map be $$Y\in {R}^{{I}_{1}\times {I}_{2}\times {I}_{3}}$$, where $${y}^{{i}_{1},{i}_{2},{i}_{3}}\in [0,1]$$. A binary cross entropy loss function is used in the training process to obtain the output map:

$$L=-\,\frac{1}{{I}_{1}{I}_{2}{I}_{3}}\mathop{\sum }\limits_{k=1}^{{I}_{1}\times {I}_{2}\times {I}_{3}}{Y}_{k}\Delta \,\log ({\hat{Y}}_{k})+(1-{Y}_{k})\Delta \,\log (1-{\hat{Y}}_{k}),$$

where Yk is the true label and $${\hat{Y}}_{k}\,$$is the predicted probability for voxel k.

In our U-Net output, we obtain a probability map $$U(x,y,z),\,$$where x and y are dimensions of each 2D image slice and z is the slice number. The probability map is the global blob likelihood constraint in our joint constraints operations to detect the most likely blob regions. By setting a probability threshold, most noise is removed. However, some touching blobs could have a higher probability than the threshold in the boundary and might not be split up, resulting in reduced detection known as under-segmentation. Joining the Hessian convexity map with U-Net probability map will address the challenges of small blob detection.

### Joint constraint operation for true blob identification

Given a 3D image $$f:{R}^{3}\to R,$$ Hessian analysis is applied to render a convexity map $$\,HI(x,y,z;{\sigma }^{\ast })$$, U-Net is applied to render a probability map $$U(x,y,z)$$. We introduce a joint operator

$$UH(x,y,z)=HI(x,y,z;{\sigma }^{\ast })\circ I(x,y,z),$$
(7)

where $$I(x,y,z)$$ is a binary indicator matrix. Given a probability threshold $${\delta }_{b},\,I(x,y,z)$$ = 1 when $$U(x,y,z) > {\delta }_{b}$$; otherwise, $$I(x,y,z)$$ = 0. We define the true blob candidate as a 27-connected voxel44, and the blob set is represented as:

$${S}_{blob}=\{(x,y,z)|(x,y,z)\in Do{G}_{nor}(x,y,z;\sigma ),UH(x,y,z)=1\,\}.$$
(8)

To illustrate, Fig. 2 shows images of blobs detected during the joint constraint operation of the U-Net probability map and the Hessian convexity map. The blue circle in Fig. 2a shows only one blob. The same blue circle on Fig. 2b, after application of the Hessian convexity map, shows there is one “bigger” blob in the middle and a number of smaller blobs around the boundary of the blue circle. Figure 2c shows the correct outcome, only one blob in the middle. This clearly illustrates the sensitivity of the Hessian matrix to noise. Even though the Hessian analysis guarantees the detection of the convex object, some non-blob convex objects (noise) will also be detected, resulting in over-detection. This noise can be readily filtered by the U-Net probability map (Fig. 2c). We conclude that U-Net may be useful for denoising, which alleviates the over-detection of Hessian analysis.

The red circle in Fig. 2a shows overlapped blobs. They are still overlapping in the U-Net probability map from Fig. 2c. But they are split up in the Hessian convexity map from Fig. 2b. By joining the Hessian convexity map and U-Net probability map with a single global threshold, the overlapped blobs in the red circle are visualized as distinct entities, as shown in Fig. 2d. We conclude Hessian analysis could alleviate the under-segmentation issue from U-Net.

Our proposed UH-DoG integrates the probability map from U-Net and convexity map from Hessian analysis to guarantee robustness to noise and effective blob detection. The detailed steps of UH-DoG are shown in Table 1.

### Experiments

Two experiments were conducted to validate of the performance of our proposed UH-DoG detector. The first experiment validated the UH-DoG on 200 fluorescence, 2D light microscopy images for cell detection45. The 2D cell images were of interest because (1) to the best of our knowledge, there are no 3D small blob datasets available for comparison; (2) the blobs from these images are small and each image could be used to test the performance of the algorithm in the presence of background noise; (3) this dataset has the ground truth of the locations of each blob. The detection accuracy measured by recall, precision, and F-score can be used to compare this approach with methods from the literature. The second experiment validated the performance of UH-DoG on CFE-MR images of mouse kidneys where each glomerulus was detected. All experiments were approved by the University of Virginia Institutional Care and Use Committee, in accordance with the NIH Guide for the Care and Use of Laboratory Animals.

## Results

### Training dataset and data augmentation

We used a public dataset46 to train our deep learning model, based on optical images of cell nuclei. This dataset has 141 optical microscopy pathology images (2,000 × 2,000 pixels), as shown in Fig. 3a. The 12,000 ground truth annotations are typically done by an expert, which involves delineating object boundaries over 40 hours46. Due to the large amount of time and effort required, the annotated nuclei in this dataset only represents a small fraction of the total number of nuclei present in all images. Since we aim to facilitate U-Net to denoise our blobs images based on the ground truth labeled images, as shown in Fig. 3b, we generated Gaussian distributed noise with $${\mu }_{noise}=0$$ and $${\sigma }_{noise}^{2}=0.01$$ and we added it to the ground truth labeled images, resulting in 141 simulated training images, as shown in Fig. 3c. Data were augmented to increase the in variance and robustness properties of U-Net39. We generated the augmented data by a combination of rotation shift, width shift, height shift, shear, zoom, and horizontal flip.

### Experiment I: Validation experiments using 2D fluorescent images

Figure 4 illustrates an example fluorescent image (256 × 256 pixels). Since this was a 2D image, our proposed UH-DoG must incorporate a modified 2D DoG because comparison algorithms were from the 2D LoG and its extensions.

To revise the DoG to a 2D version, for 2D images $$f(x,y)$$ with the Gaussian kernel $$G(x,y;\sigma )$$, we modified the normalized 3D DoG detector from Eq. 4 in a 2D format:

$$Do{G}_{nor}(x,y;\sigma )={\sigma }^{\gamma -1}f(x,y)\ast \frac{(G(x,y;\sigma +\Delta \sigma )-G(x,y;\sigma ))}{\sigma \Delta \sigma }.$$
(9)

Then the corresponding Hessian matrix were modified from Eq. 5 as follows:

$$H(Do{G}_{nor}(x,y;\sigma ))=[\begin{array}{cc}\frac{{\partial }^{2}Do{G}_{nor}(x,y;\sigma )}{\partial {x}^{2}} & \frac{{\partial }^{2}Do{G}_{nor}(x,y;\sigma )}{\partial x\partial y}\\ \frac{{\partial }^{2}Do{G}_{nor}(x,y;\sigma )}{\partial x\partial y} & \frac{{\partial }^{2}Do{G}_{nor}(x,y;\sigma )}{\partial {y}^{2}}\end{array}].$$
(10)

The parameter settings for Hessian analysis and DoG were as suggested in28. γ is set to 2. σ varies from 0.5 to 3 with step-size 0.5. Δσ is set to 0.001.

We used precision, recall and F-score to evaluate the performance of our proposed algorithm. Precision measures the fraction of retrieved candidates confirmed by the ground-truth. Recall measures the fraction of ground-truth data retrieved. F-score measures overall performance. Since ground truth data were provided in the form of dots (the coordinates of the blob centers), as in the literature1,28, a candidate was considered a true positive if its intensity centroid was within a threshold d of the corresponding ground truth dot. Specifically, if the Euclidian distance Dij between dot i and blob candidate j was less than or equal to d, the blob was considered a true positive. To avoid duplicate counting, the number (#) of true positives TP was calculated by Eq. 11. Precision, recall, and F-score are calculated by Eqs. 12, 13, 14 respectively:

$$TP=\,\min \{\#\{j:{\min }_{i=1}^{m}{D}_{ij}\le d\},\,\#\{i:{\min }_{j=1}^{n}{D}_{ij}\le d\}\},$$
(11)
$$precision=\frac{TP}{n},$$
(12)
$$recall=\frac{TP}{m}$$
(13)
$$F-score=2\times \frac{precision\times recall}{(precision+recall)},$$
(14)

where m is the number of ground-truth and n is the number of blob candidates; d is a thresholding parameter set to a positive value $$\,(0,+\infty )$$. If d is small, fewer blob candidates are counted since the distance between the blob candidate centroid and ground-truth should be small. If d is too large, more blob candidates are counted. Here, since local intensity extremes could be anywhere within a small blob with an irregular shape, we set d to the average diameter of the blobs: $$d=2\times \sqrt{\frac{{\sum }_{(x,y)}I(x,y;\sigma )}{\pi }}$$.

Since the results of detection by the complete versions of HLoG, gLoG, Radial-Symmetry and LoG on 200 pathological images are available online1,17,21, the results were directly used from these papers for comparison.

Figure 5 shows a comparison of UH-DoG to the HLoG, gLoG, LoG and Radial-Symmetry algorithms. While UH-DoG is comparable to HLoG, gLoG and Radial Symmetry algorithms in recall, it significantly outperforms the four algorithms in both precision and F-score (Table 2). The standard deviation of F-score in UH-DoG was 0.025, compared to 0.0377 with the HLoG method, compared to 0.1436 with the gLoG method, 0.0795 with the Radial-Symmetry method, and 0.0385 with the LoG method. We conclude that UH-DoG provides more accurate and robust detection of blobs in this dataset. In addition, statistical analysis was performed with the results summarized in Table 2. While comparable to the four algorithms on recall, our approach statistically outperformed the others on precision and F-score.

### Experiment II: Validation experiments using 3D Kidney MRI

In this section, we conducted experiments on CF-labeled glomeruli from a dataset of 3D magnetic resonance images (256 × 256 × 256 voxels) to measure number (Nglom) and apparent size (aVglom) of glomeruli in diseased kidneys and healthy control kidneys. Acute kidney injury was induced in adult male C57Bl/6 mice using an intraperitoneal injection of folic acid (125 mg). A subset of the group receiving folic acid, the AKI group (n = 4) was euthanized 4 days after the folic acid was administered and the remainder of those that received folate were euthanized 4 weeks later and termed the chronic kidney disease (CKD) group, n = 3. The control groups for AKI (n = 5) and CKD (n = 6) were age-matched adult male C57Bl/6 mice that received intraperitoneal sodium bicarbonate.

For improved detection, we adopted a preprocessing step to segment the medulla from the image because no glomeruli are located there. Based on the segmented kidney image, shown in Fig. 6a, we converted it to a binary mask (Fig. 6b). Then we generated a distance mask, seen in Fig. 6c. With the map showing the distance between each kidney’s voxel and the kidney boundary, we set up a distance threshold to remove regions farther from the boundary than this threshold. Figure 6d shows the 2D image slice after removing the medulla.

Then we performed the proposed UH-DoG method to segment the kidney glomeruli in Fig. 6d. The parameter settings are as follows: γ is set to 2. σ varies from 0.5 to 1.8 with step-size 0.1. Δσ is set to 0.001. Example segmentation results are shown in Figs. 7 and 8. The number of glomeruli (Nglom), mean apparent glomerular volume (aVglom) and median aVglom are reported in Table 3, where the UH-DoG method is compared to the HDoG method. We used the method of calculating apparent glomerular volume from the paper34. Similarly, Table 4 summarizes the results from the AKI and control groups.

We performed quality control by visually checking the identified glomeruli in kidney images. Figure 7 shows glomerular identification for CKD and control kidneys. Figure 8 shows glomerular identification for kidneys in the AKI and control groups.

### Discussion: computation cost

UH-DoG significantly decreases computation time compared to the HDoG algorithm28, as shown in Tables 5 and 6. The training time of U-Net is not included in the estimates of computation time as it is trained beforehand and can be used to test on all images.

### Discussion: Clinical translation

The use of imaging biomarkers in humans has increased both for disease early detection and disease severity assessment. Additionally, imaging biomarkers can serve as surrogate endpoints in clinical trials, reducing cost and burden associated with these studies. For example, total kidney volume has recently been accepted as a surrogate marker for disease progression in autosomal dominant polycystic kidney disease trials47. Although the importance of glomerular number has been universally accepted, the detection of glomerular number and size has been limited because the only methodology to obtain these metrics were destructive stereological approaches that could only be performed post mortem. With the advent of CFE-MRI, the need for image analysis tools is paramount. However, it is critical to the success of any imaging biomarker that the marker be accurate and rapidly obtained. This study demonstrates some of the challenges in detecting small objects, such as glomeruli, particularly in the settings of low image resolution, image noise and overlap of objects. It also shows the promise of rapid acquisition where data can be used in a timeframe to influence patient care. Further work is necessary to validate the accuracy of the detection of diseased glomeruli to apply this algorithm to a wider range of renal disease models.

## Conclusion

Discovering imaging biomarkers is important to inform disease diagnosis, prognosis, therapy development and treatment assessment. Of particular interest in this research is to identify quantitative glomeruli biomarkers from CFE-MR image. This is a challenging problem because the number of glomeruli is large, the size is small. In addition, the limitation from imaging acquisition such as hardware and variable acquisition parameters often renders the images with less desirable resolution resulting the overlapping glomeruli. In this paper, we demonstrated a new small blob detector by joining the Hessian convexity map and probability map from U-Net. This joint constraint-based approach overcomes under-segmentation by U-Net and over-detection by Hessian analysis. While it was successfully implemented in segmenting the kidney glomeruli, there are still some limitations. First, the assumption that the blobs are convex and similar in size may not be robust for non-convex objects with difference sizes. A future possible improvement is to enhance ability of U-Net to detect both convex and non-convex small objects. Second, the probability map is sensitive to the threshold. We plan to explore the use of thresholding to improve UH-DoG.

## Data availability

The datasets generated during and/or analyzed during the current study will be made available upon request.

## References

1. Zhang, M., Wu, T. & Bennett, K. M. Small Blob Identification in Medical Images Using Regional Features From Optimum Scale. IEEE Transactions on Biomedical Engineering (2015).

2. Sánchez, C. I. et al. Contextual computer-aided detection: Improving bright lesion detection in retinal images and coronary calcification identification in CT scans. Medical Image Analysis (2012).

3. Moon, W. K. et al. Computer-aided tumor detection based on multi-scale blob detection algorithm in automated breast ultrasound images. IEEE Transactions on Medical Imaging (2013).

4. Beeman, S. C. et al. Measuring glomerular number and size in perfused kidneys using MRI. AJP: Renal Physiology 300, F1454–F1457 (2011).

5. Beeman, S. C. et al. MRI-based glomerular morphology and pathology in whole human kidneys. AJP: Renal Physiology (2014).

6. Gao, F., Zhang, M., Wu, T. & Bennett, K. M. 3D small structure detection in medical image using texture analysis. In Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS (2016).

7. Meijering, E., Dzyubachyk, O., Smal, I. & van Cappellen, W. A. Tracking in cell and developmental biology. Seminars in Cell and Developmental Biology (2009).

8. Lindeberg, T. Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention. International Journal of Computer Vision (1993).

9. Crocker, J. C. & Grier, D. G. Methods of digital video microscopy for colloidal studies. Journal of Colloid and Interface Science (1996).

10. Wu, Q., Merchant, F. A. & Castleman, K. R. Microscope Image Processing. Microscope Image Processing (2007).

11. Phansalkar, N., More, S., Sabale, A. & Joshi, M. Adaptive local thresholding for detection of nuclei in diversity stained cytology images. In ICCSP 2011 - 2011 International Conference on Communications and Signal Processing (2011).

12. Li, G. et al. 3D cell nuclei segmentation based on gradient flow tracking. BMC Cell Biology (2007).

13. Bergmeir, C., García Silvente, M. & Benítez, J. M. Segmentation of cervical cell nuclei in high-resolution microscopic images: A new algorithm and a web-based software framework. Computer Methods and Programs in Biomedicine (2012).

14. Guo, Y. et al. An image processing pipeline to detect and segment nuclei in muscle fiber microscopic images. Microscopy Research and Technique (2014).

15. Dalle, J., Racoceanu, D. & Putti, T. C. Nuclear Pleomorphism Scoring by Selective Cell Nuclei Detection. In IEEE Workshop on Applications of Computer Vision (2009).

16. Malpica, N. et al. Applying watershed algorithms to the segmentation of clustered nuclei. Cytometry (1997).

17. Loy, G. & Zelinsky, A. Fast radial symmetry for detecting points of interest. IEEE Transactions on Pattern Analysis and Machine Intelligence (2003).

18. Lowe, D. G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision (2004).

19. Bay, H., Ess, A., Tuytelaars, T. & Van Gool, L. Speeded-Up Robust Features (SURF). Computer Vision and Image Understanding (2008).

20. Leutenegger, S., Chli, M. & Siegwart, R. Y. BRISK: Binary Robust invariant scalable keypoints. In Proceedings of the IEEE International Conference on Computer Vision (2011).

21. Kong, H., Akakin, H. C. & Sarma, S. E. A generalized laplacian of gaussian filter for blob detection and its applications. IEEE Transactions on Cybernetics (2013).

22. Witkin, A. P. Scale-space Filtering. In Proceedings of the 8th International Joint Conference on Artificial Intelligence (1983).

23. Koenderink, J. J. The structure of images. Biological Cybernetics (1984).

24. Lindeberg, T. Feature Detection with Automatic Scale Selection. International Journal of Computer Vision (1998).

25. Mikolajczyk, K. & Schmid, C. Scale & affine invariant interest point detectors. International Journal of Computer Vision (2004).

26. Mikolajczyk, K. & Schmid, C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence (2005).

27. Tuytelaars, T. & Mikolajczyk, K. Local Invariant Feature Detectors: A Survey. Foundations and Trends® in Computer Graphics and Vision (2007).

28. Zhang, M. et al. Efficient Small Blob Detection Based on Local Convexity, Intensity and Shape Information. IEEE Transactions on Medical Imaging (2016).

29. Bertram, J. F., Soosaipillai, M. C., Ricardo, S. D. & Ryan, G. B. Total numbers of glomeruli and individual glomerular cell types in the normal rat kidney. Cell & Tissue Research (1992).

30. Cullen-Mcewen, L. A., Armitage, J. A., Nyengaard, J. R. & Bertram, J. F. Estimating nephron number in the developing kidney using the physical disector/fractionator combination. Methods in Molecular Biology (2012).

31. Bertram, J. F. Analyzing Renal Glomeruli with the New Stereology. International Review of Cytology (1995).

32. Bennett, K. M. et al. MRI of the basement membrane using charged nanoparticles as contrast agents. Magnetic Resonance in Medicine (2008).

33. Baldelomar, E. J. et al. Phenotyping by magnetic resonance imaging nondestructively measures glomerular number and volume distribution in mice with and without nephron reduction. Kidney International (2016).

34. Baldelomar, E. J., Charlton, J. R., Beeman, S. C. & Bennett, K. M. Measuring rat kidney glomerular number and size in vivo with MRI. American Journal of Physiology-Renal Physiology 314, F399–F406 (2017).

35. Hoo-Chang Member, S. et al. Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning and Daniel Mollura are with Center for Infectious Disease Imaging HHS Public Access. IEEE Trans Med Imaging 35, 1285–1298 (2016).

36. Kamnitsas, K. et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Medical Image Analysis 36, 61–78 (2017).

37. Gao, F. et al. SD-CNN: A shallow-deep CNN for improved breast cancer diagnosis. Computerized Medical Imaging and Graphics (2018).

38. Shelhamer, E., Long, J. & Darrell, T. Fully Convolutional Networks for Semantic Segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence (2017).

39. Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015).

40. Raza, S. E. A. et al. MIMO-Net: A multi-input multi-output convolutional neural network for cell segmentation in fluorescence microscopy images. Proceedings - International Symposium on Biomedical Imaging 337–340 (2017).

41. Yap, M. H. et al. Automated Breast Ultrasound Lesions Detection Using Convolutional Neural Networks. IEEE Journal of Biomedical and Health Informatics 22, 1218–1226 (2018).

42. Gao, F. et al. Deep Residual Inception Encoder-Decoder Network for Medical Imaging Synthesis. IEEE Journal of Biomedical and Health Informatics (2019).

43. Xu, Y. et al. U-net with optimal thresholding for small blob detection in medical images. In IEEE International Conference on Automation Science and Engineering (2019).

44. Toolbox, I. P. Image Processing Toolbox. Image Processing (2004).

45. Lempitsky, V. & Zisserman, A. Learning To Count Objects in Images. Neural information processing systems (NIPS) 1–9 (2010).

46. Janowczyk, A. & Madabhushi, A. Deep learning for digital pathology image analysis: A comprehensive tutorial with selected use cases. Journal of Pathology Informatics (2016).

47. Tangri, N. et al. Total kidney volume as a biomarker of disease progression in autosomal dominant polycystic kidney disease. Canadian Journal of Kidney Health and Disease (2017).

## Acknowledgements

This research was supported by funds from the National Institute of Health award under grant number R01DK110622, R01DK111861. This work used the Bruker ClinScan 7 T MRI in the Molecular Imaging Core which was purchased with support from NIH grant 1S10RR019911-01 and is supported by the University of Virginia School of Medicine. The U.S. Government is authorized to reproduce and distribute for governmental purposes notwithstanding any copyright annotation of the work by the author(s). The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of NIH or the U.S. Government.

## Author information

Authors

### Contributions

Y.X., T.W., F.G., J.R.C. and K.M.B. wrote the main manuscript. Y.X., T.W. and F.G. developed the computational methods and conducted the experiments. J.R.C. and K.M.B. acquired and provided the imaging data. All authors reviewed and approved this manuscript.

### Corresponding author

Correspondence to Teresa Wu.

## Ethics declarations

### Competing interests

T.W., K.M.B., and J.R.C. are co-owners of Sindri Technologies, LLC. K.B. has a research agreement with Johnson & Johnson. K.B. has a sponsored research agreement with Jannsen Pharmaceuticals, Inc.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Xu, Y., Wu, T., Gao, F. et al. Improved small blob detection in 3D images using jointly constrained deep learning and Hessian analysis. Sci Rep 10, 326 (2020). https://doi.org/10.1038/s41598-019-57223-y

• Accepted:

• Published:

• DOI: https://doi.org/10.1038/s41598-019-57223-y

• ### Nephron number and its determinants: a 2020 update

• Jennifer R. Charlton
• Edwin J. Baldelomar
• Kevin M. Bennett

Pediatric Nephrology (2021)