AtPCa-Net: anatomical-aware prostate cancer detection network on multi-parametric MRI

Zheng, Haoxin; Hung, Alex Ling Yu; Miao, Qi; Song, Weinan; Scalzo, Fabien; Raman, Steven S.; Zhao, Kai; Sung, Kyunghyun

doi:10.1038/s41598-024-56405-7

Download PDF

Article
Open access
Published: 08 March 2024

AtPCa-Net: anatomical-aware prostate cancer detection network on multi-parametric MRI

Haoxin Zheng^1,2,
Alex Ling Yu Hung^1,2,
Qi Miao¹,
Weinan Song³,
Fabien Scalzo^2,4,
Steven S. Raman¹,
Kai Zhao¹ &
…
Kyunghyun Sung¹

Scientific Reports volume 14, Article number: 5740 (2024) Cite this article

528 Accesses
6 Altmetric
Metrics details

Subjects

Abstract

Multi-parametric MRI (mpMRI) is widely used for prostate cancer (PCa) diagnosis. Deep learning models show good performance in detecting PCa on mpMRI, but domain-specific PCa-related anatomical information is sometimes overlooked and not fully explored even by state-of-the-art deep learning models, causing potential suboptimal performances in PCa detection. Symmetric-related anatomical information is commonly used when distinguishing PCa lesions from other visually similar but benign prostate tissue. In addition, different combinations of mpMRI findings are used for evaluating the aggressiveness of PCa for abnormal findings allocated in different prostate zones. In this study, we investigate these domain-specific anatomical properties in PCa diagnosis and how we can adopt them into the deep learning framework to improve the model’s detection performance. We propose an anatomical-aware PCa detection Network (AtPCa-Net) for PCa detection on mpMRI. Experiments show that the AtPCa-Net can better utilize the anatomical-related information, and the proposed anatomical-aware designs help improve the overall model performance on both PCa detection and patient-level classification.

Prostate cancer malignancy detection and localization from mpMRI using auto-deep learning as one step closer to clinical utilization

Article Open access 27 December 2022

Deep learning for fully automatic detection, segmentation, and Gleason grade estimation of prostate cancer in multiparametric magnetic resonance images

Article Open access 22 February 2022

Prostate Cancer Detection using Deep Convolutional Neural Networks

Article Open access 20 December 2019

Introduction

Prostate cancer (PCa) is the second leading cancer-related cause of death and the most common cancer among men in the United States¹. Multi-parametric MRI (mpMRI) is the preferred non-invasive imaging tool for PCa diagnosis before biopsies^2,3. According to the Prostate Imaging Reporting and Data System, version 2.1 (PI-RADS)^2,3, a combination of mpMRI findings is used for predicting the probability of clinically significant PCa, where the different combinations are used depending on the lesion location, in either transition zone (TZ) or peripheral zone (PZ) of the prostate. For example, following the PI-RADS, T2-weighted imaging (T2WI) is the primary imaging component for lesions in TZ with an additional assessment by diffusion-weighted imaging (DWI), while in the PZ, DWI/apparent diffusion coefficient (ADC) is the essential imaging component with an addition of dynamic contrast-enhanced (DCE) MRI^2,3. Therefore, given the significance of varying imaging appearances of PCa on mpMRI between TZ and PZ, there is considerable potential to improve PCa detection models further when this anatomical prior is thoughtfully incorporated.

With advances in deep learning, many studies proposed deep learning models for the detection of PCa using mpMRI. However, the different appearances of PCa lesions in TZ and PZ on different mpMRI components were generally not fully integrated into the model design^{4,5,6,7,8,9,10}. Overlooking this zonal-related anatomical prior, but treating all lesions equally regardless of the locations, could lead to potential suboptimal model performance^2,3. A design that can reflect both the zonal appearance differences and the commonality of them being PCa lesions is the key to improving the model’s performance.

Hierarchical label and loss design embed structural information hierarchically among different classes into the loss function to better guide the model training^11,12,13. The design transforms the binary labeling to a more structured label space and is able to account for the distinct inter-class property differences while preserving shared properties among different classes^11,12,13. In this study, we propose an anatomical-aware hierarchical loss design, the Zonal Loss (ZL). The ZL can direct the model to learn both the unique and shared characteristics of PCa lesions across different prostate zones in accordance with clinical practice, thus enhancing the model’s detection capabilities.

Furthermore, studies have shown that PCa, benign prostatic hyperplasia (BPH), and the central zone (CZ) of the prostate can occasionally present with visual similarities^14,15. This undesirable resemblance between PCa and other prostate tissues complicates the diagnosis process. In clinical practice, symmetric-related information as a reference is valuable for distinguishing BPH and CZ from PCa. Research indicates that BPH and CZ tend to be visually symmetric^14,15, while PCa is generally presented asymmetrically^16,17. Illustrations of PCa lesions and PCa-like visual patterns are shown in Fig. 1. We can observe that BPH and CZ are shown to be similar to PCa lesions-low intensity on both ADC and T2WI images and high intensity on high-B DWI images.

The visual similarity of non-PCa prostate tissues not only complicates the diagnostic process but also leads to performance degradation in PCa detection models due to the generation of undesired false positive (FP) predictions-a common issue in existing deep-learning-based PCa detection models^{4,5,6,7,8,9,10}. By taking symmetric-related patterns into consideration, FP predictions may be reduced as PCa lesions can be further distinguished from BPH and CZ by their asymmetrical appearance differences. Therefore, integrating symmetry-related anatomical priors into the design of PCa detection models may be crucial in reducing potential FP predictions.

Existing study has shown that the human visual system recognizes symmetric patterns by comparing visual differences between the original image and the mirrored image after reflecting with respect to an imagined center-axis¹⁸. Inspired by how symmetric patterns stimulate human visual perception, we propose a symmetric-aware network architecture that utilizes both original and mirrored mpMRI images for PCa detection. Simulating how the human vision system reacts to the symmetric patterns, the network can help distinguish the PCa lesions from other prostate tissue with visual similarities, like BPH and CZ, and thus reduce FP predictions^14,15,16,17.

In this study, we introduced PCa-related anatomical priors into the deep learning framework design and developed an anatomical-aware deep learning network for PCa detection on mpMRI. The proposed network leverages symmetry-related information and PCa zonal appearance differences on mpMRI images to form a 3D anatomically-aware PCa detection network (AtPCa-Net), enhancing the accuracy of PCa lesion detection. Our main contributions include the following:

1.
We exhibit that the introduction of the PCa-related anatomical priors into the DL network architecture design helps improve model performance. The extensive experiments demonstrate that either one of the anatomical-aware designs of the proposed AtPCa-Net can help improve the PCa detection and patient-level classification performance, and the integration of both designs can achieve the best model performance on both PCa detection and patient-level classification tasks.
2.
We incorporate the symmetric-related clinical priors into the network architecture design to suppress potential FP predictions. By utilizing symmetric-related visual appearance differences, the design could help distinguish PCa from other prostate tissues shown similar visual patterns on mpMRI, thereby reducing possible FP predictions. To the best of our knowledge, our study is the first study to achieve FP reductions on PCa detection using anatomical-related clinical prior.
3.
We integrate the zonal appearance differences of PCa on mpMRI explicitly into the loss design by proposing the Zonal Loss (ZL). Compared with existing models overlooking this property and treating all lesions equally regardless of their location, the ZL treats PCa in different prostate zones differently following the clinical guideline, and thus helps improve model performance.
4.
Compared with other baselines, the AtPCa-Net achieved lower FP predictions while maintaining same sensitivity. Although the model still needs to be improved in order to be deployed in clinical practice, the results suggests the potential to further reduce the number of unnecessary target biopsies could be caused by using current computer-aided diagnosis models.

Related works

Prostate cancer detection

The deep-learning models for PCa detection and classification based on mpMRI have been widely investigated^19,20,21,22. The models are generally built by convolutional neural networks (CNNs) for their outstanding performance on classification, segmentation, and detection tasks. Recent studies exhibit the feasibility of using CNNs for PCa detection using mpMRI^{4,5,6,7,8,9,10,23}. Li et al.²³ designed a multi-scale two-branch dilated-convolution-based deep learning network to segment both PCa lesions and prostate from mpMRI. Seetharaman et al.⁴ designed a PCa detection network to identify indolent and aggressive PCa separately with the help of two different encoder branches for T2WI and ADC images and the fusion of feature maps from multiple levels of the encoder branches. Cao et al.⁸ introduced the idea of ordinal encoding for PCa with different severity doing a multi-class classification. The author also designed the mutual finding loss mimicking the process of how radiologists interpret the T2WI and ADC images to detect PCa. Cao et al.⁹ modified the FocalNet⁸ to have a stack of adjacent slides as input and did a comprehensive evaluation of the PCa detection performance between the radiologists and their proposed model.

There are some existing studies that tried to integrate clinical priors related to different diagnostic focuses for PCa in different zones into the PCa detection network design^3,6. Hosseinzadeh et al.⁵ utilized this zonal-related anatomical prior by stacking the prostate zonal segmentation masks together with mpMRI images as part of the input to the model and showed the detection performance improved. Vente et al.⁶ discovered the idea of both using ordinal encoding for PCa with different aggressiveness and also feeding prostate zonal masks into the PCa detection networks to let the model learn the anatomical relationship between the prostate zones and PCa appearance. Duran et al.²⁴ discussed the performance differences between using a prostate mask or using a PZ mask as part of the segmentation model input and observed the latter approach got better lesion segmentation performance.

Although they provide zonal information from the input, the cross-entropy (CE) loss with binary labeling they used explicitly treated all lesions identically regardless of the location^2,3. The ignorance of the lesion appearance differences in different prostate zones might lead to suboptimal PCa detection performance, which could be further improved.

As PCa detection models generally suffer from undesired FP predictions, some studies have introduced network designs aiming for better FP reduction ability^7,10. Yu et al.⁷ proposed a multi-scale patch-wise network together with a squeeze-and-excitation (SE) block²⁵. The design tried to reduce FP predictions by letting the model learn the FP patterns from the context information provided by the multi-scale patches automatically. To suppress FP predictions, Saha et al.¹⁰ first introduced an auxiliary network to classify if a given image patch contains PCa lesions or not and then multiplied the classification results with the detection probability map to conduct the final output. The experiments showed that the FP predictions could be suppressed by the patch-wise classification results. However, neither study has considered achieving FP reduction using anatomical-related clinical prior, which could also be capable of helping effectively reduce FP predictions.

Anatomical-aware design for other diseases

There are existing studies investigating how to incorporate anatomical-related clinical priors into the network architecture design for various tasks related to other diseases^26,27,28. Sun et al.²⁶ introduced a weakly-supervised knowledge distillation model for breast mass segmentation, with auxiliary networks for reconstruction and aggressiveness classification. The anatomy property was designed to be learned from the encoder of the teacher model, an autoencoder network reconstructing the input image, and then transferred to the student model, the desired breast mass segmentation network. Kamal et al.²⁷ proposed a semi-supervised CNN for thoracic disease classification on chest Chest X-ray images. The anatomical information was brought by the prediction masks of lung and heart generated from the auxiliary segmentation network and then fed into the main classification network as an anatomy-informed reference for an attention module. Ma et al.²⁸ proposed a dual-branch cascaded CNN for the segmentation of retinal layers and fluid from optical coherence tomography (OCT) images. The model first calculated a relative positional map based on the retinal layer boundaries and then fed them into the final segmentation network to inform the model of the anatomical relationships among different retinal layers. All the studies showed improvements in model performance when including anatomical-aware network architecture design. In this study, the anatomical-aware designs are not only composed of a symmetric-aware network architecture for FP prediction reduction but also shown through the design of the hierarchical loss, the ZL, considering the diagnostic differences of lesions on different prostate zones following clinical guideline^2,3.

Methods

Overview

We propose a 3D anatomical-aware PCa detection network (AtPCa-Net) to detect whole-mount histopathology (WMHP) confirmed clinically significant PCa (csPCa) utilizing the PCa-related anatomical priors. The proposed AtPCa-Net consists of two parts. First, a 3D symmetric-aware network takes the symmetric-related information into consideration to suppress FP predictions. Second, the ZL structurally integrates the PCa-related zonal differences into the label and loss design. The overall architecture of AtPCa-Net is illustrated in Fig. 2. We adhered to the structure of nnU-Net as the backbone for AtPCa-Net because of its good performance on detection and segmentation of medical imaging tasks²⁹.

Dataset

Study population and mpMRI images

This retrospective study was carried out in compliance with the United States Health Insurance Portability and Accountability Act (HIPAA) of 1996 with approval from the institutional review board (IRB) of our institution with a waiver of the requirement for informed consent. All experiments conducted in this study adhered strictly to the relevant guidelines and regulations. The whole dataset consists of 652 patients. It is composed of two parts: (1) pre-operative mpMRI images from patients (N = 220) who had confirmed PCa lesions (N = 246) with whole-mount histopathology after radical prostatectomy (RP), and (2) mpMRI images from patients (N = 432) who did not have indications of PCa lesions, confirmed by systematic biopsies followed by negative mpMRI (PI-RADS$\le $2). We included mpMRI images with no indications of PCa lesions to balance the data distribution on model training and testing, as well as to perform patient-level classification evaluation. We used 5-fold cross-validation to validate and evaluate the model performance, in which each fold contains 130/131 patients assigned randomly from the entire dataset.

All mpMRI images are performed on Siemens 3T scanners with the standardized clinical prostate mpMRI protocol^2,3, including T2WI and DWI. We exclude the DCE-MRI images given the limited role of DCE-MRI^2,3,30,31. For T2WI, the repetition time (TR) and echo time (TE) are 3000–5900 ms and 101–109 ms, the field of view (FOV) of 20 cm $\times $ 20 cm with an in-plane resolution of 0.625 mm $\times $ 0.625 mm and through-plane resolution of 3 mm. For DWI, we use TR and TE of 4800–5300 ms and 60–81 ms, FOV of 26 cm $\times $ 21 cm with in-plane resolution of 1.625 mm $\times $ 1.625 mm and through-plane resolution of 3.6 mm. The ADC maps were calculated using linear least squares curve fitting of voxels in the four DWIs against their corresponding b values (0/100/400/800 s/mm2 ). We also denote the DWI images with b = 1400 s/mm2 as high-B value DWI (high-B DWI).

Clinical interpretation and annotations

The mpMRI images were reviewed by three genitourinary (GU) radiologists (10+ years of clinical prostate MRI reading) as part of the standard clinical procedure following the clinical guideline^2,3. Lesion findings with PI-RADS score $\ge $ 3 are reported as MRI-positive findings, and the findings with PI-RADS score < 3 are interpreted as MRI-negative findings in this study.

The ground truth of the lesion annotations is confirmed by WMHP after RP matched to mpMRI prior to RP in this study. Blinded to all MRI-related information, the sliced WMHP specimens are examined and reported by three GU pathologists (with 14, 8, and 5 years of experience in clinical prostate histopathology interpretation) as part of the standard clinical procedure. Every PCa lesion was contoured and assigned a corresponding Gleason Score (GS) on WMHP. PCa lesions with GS$\ge 7$ are defined as csPCa and are the detection targets of our proposed detection model in this study.

GU radiology research fellows, under the supervision of GU radiologists, retrospectively reviewed each mpMRI exam and contoured the region of interest (ROI) of MRI-visible lesions on T2WI images referring to the WMHP examination reports. MRI-positive findings are categorized as true positive if the radiological findings and the pathological findings are matched or false positive if no corresponding PCa lesion is found in histopathology reports. We defined the prospectively missed lesions that are retrospectively identified in the re-review procedure as false-negative (FN) lesions. The remaining PCa lesions that are MRI non-visible and also retrospectively unidentifiable on mpMRI are not included in the study as we cannot accurately contour them.

Compared to data consisting of biopsy-confirmed PCa, ground truth confirmed by WMHP offers additional insights into how the model would react to FN cases, which are generally harder to recognize in clinical practice. Understanding FN lesions is crucial, as overlooked or underestimated PCa can lead to insufficient treatment and undesired oncological outcomes^32,33.

The prostate zonal segmentation of TZ and PZ are treated as part of the AtPCa-Net’s input, shown in Fig. 2. The zonal masks are generated using a separate automatic prostate zonal segmentation model, CAT-Net³⁴, to explicitly provide the PCa-related anatomical information.

Preprocessing

The T2WI images underwent N4 bias filed correction to compensate for the low-frequency intensity non-uniformity³⁵. The high-B DWI and ADC images are registered and resampled with respect to T2WI images using rigid spatial transformation while utilizing real-world coordinates information for each patient since the DWI and T2WI sequences are acquired temporally closed and only minimal patient motion are found^8,36,37. After the registration, high-B DWI, ADC, and T2WI images are rotated with respect to the center line, generated by connecting the volumetric center of the prostate and the TZ, to show the symmetric appearance. Then, high-B DWI, ADC, and T2WI images are center cropped with the size of 128 $\times $ 128 pixels from the original 320 $\times $ 320 pixels images as all prostates are allocated in the center of the acquired MR images following the clinical protocol^2,3. The intensity value of voxels in high-B DWI and T2WI images are linearly normalized to have a value in the range of [0, 1]. As the values of ADC maps are quantitative, the voxel intensities are consistent across patients^2,3,8,38. Therefore, the intensity on ADC maps is first clipped by a patient-independent value and then normalized to be in the range of [0, 1]⁸.

Symmetric-aware network architecture

We first introduce the proposed symmetric-aware network architecture design that is capable of taking symmetric information into consideration explicitly. The detailed network architecture can be seen in Fig. 2. In this study, we implement a UNet-like backbone structure since the UNet-like structures have shown great performances in medical-imaging-related segmentation and detection tasks²⁹. The inputs are the 3D volumetric stack of images with a dimension of [$N \times C_{in} \times D_{in} \times H_{in} \times W_{in}$], where N is the batch size, $C_{in}$ is the number of channels, $D_{in}$ is through-plane resolution, $[H_{in}, W_{in}]$ are in-plane resolution of the input mpMRI images. Different categories of images (T2WI, ADC, high-B DWI, binary mask of TZ, or binary mask of PZ) correspond to different channels of the input, and each imaging modality has the same volumetric size of $[D_0, H_0, W_0]$. The network takes two inputs: one is the original 3D stack of images, and the other is the 3D stack of images mirrored across the vertical axis.

The weights of all the convolution blocks (ConvBlock) in each encoder layer of the network are shared by both the original and mirrored paths. The design of shared-weight encoders has proven to be useful when visual comparisons are applied in downstream tasks in the DL network architecture designs³⁹, similar as how the symmetric-related anatomical priors were used to distinguish between benign and cancerous prostate tissue in the clinical practice. By sharing weights on the two encoders, features extracted from both pathways will maintain symmetric information. This, in turn, assists the network in learning how to utilize these symmetric features, resembling the human visual system, and thus enhances the model’s decision-making process. In each level i other than the bottom level, the extracted feature maps $X_{i}^{ori}\in $ [$N \times C_{i} \times D_{i} \times H_{i} \times W_{i}$] and $X_{i}^{mir}\in $ [$N \times C_{i} \times D_{i} \times H_{i} \times W_{i}$] from both sides will be first concatenated channel-wisely and format a combined feature map $X_{i}^{cat}\in $ [$N \times 2C_{i} \times D_{i} \times H_{i} \times W_{i}$]. Then $X_{i}^{cat}$ will pass through a bridge block (BridgeBlock), composed of two 3D convolution blocks, and finally concatenated together with the upscaled feature map $X_{i+1}^{up}$ from level $i+1$. The final concatenation will then be used to do further feature extraction at that level. Detail representations can be seen in the sub-figure of Decoder Blocks in Fig. 2.

The output of the network is the detection probability map of where suspicious csPCa is allocated. The difference between the probability map and the ground truth mask is measured by the proposed ZL, which will be introduced in the following sub-section.

Zonal loss

Current labeling strategies in PCa detection models generally inadequately account for the significance of PCa’s zonal appearance differences, but using CE loss treating all PCa lesions identically regardless of their location^{2,3,4,5,6,7,8,9,10}. We propose an anatomical-aware hierarchical label and loss design, the ZL, to guide the model to learn the different appearances of PCa lesions in different zones with anatomy-informed constraints.

We denote the set of voxels of the PZ region as ${\mathbb {P}}$, the TZ region as ${\mathbb {T}}$, and the csPCa lesion region as ${\mathbb {L}}$ for a given prostate mpMRI image. Given an input image ${{\mathscr {I}}} \in {\varvec{R}}^{\ C\times D\times H\times W}$ and the corresponding binary mask ${{\mathscr {M}}} \in \{0, 1\}^{D\times H\times W}$, where $C,\ D,\ H,\ W$ are the channel number, depth, height and width of the input image ${{\mathscr {I}}}$, for any voxel $v \in {{\mathscr {I}}}$, the corresponding label voxel m on ${{\mathscr {M}}}$ in binary CE loss design is given as:

$$\begin{aligned} m=\left\{ \begin{array}{ l l } 1 &{} \quad \text {if }v\in {\mathbb {L}},\\ 0 &{} \quad \text {otherwise} \end{array} \right. \end{aligned}$$

(1)

One of the key points of the hierarchical label and loss design is the multi-level labeling design with respect to the number of properties each class holds—lower-level classes hold fewer properties and constraints, higher-level classes hold more properties and more constraints correspondingly^11,12,13. According to PI-RADS^2,3, PCa lesions in PZ mostly require visual information related to DWI, while lesions in TZ require a combined evaluation of both T2WI and DWI for accurate diagnosis. We adopt this clinical interpretation process using the hierarchical label and loss design by treating the class of TZ lesions as requiring additional information from T2WI images compared with the class of lesions in PZ for improved PCa detection on mpMRI. Hence, the ZL design is adept at acknowledging the distinct zonal appearance of PCa while preserving the anatomical congruence between lesions in the TZ and PZ.

We design a hierarchical labeling with ground truth mask ${{\mathscr {M}}} \in \{0, 1\}^{2\times D\times H\times W}$ for a given image ${{\mathscr {I}}} \in {\varvec{R}}^{\ C\times D\times H\times W}$. For any voxel $v \in {{\mathscr {I}}}$ , the corresponding label vector ${\varvec{m}}=[m_0, m_1]\in \{0,1\}^2$ in ${{\mathscr {M}}}$ in our loss design is given as:

$$\begin{aligned} {\varvec{m}} = {\left\{ \begin{array}{ll} {[}1, 1], \ \ \ \ \text {if}\ v\in {\mathbb {L}}\cap {\mathbb {T}}\\ {[}1, 0], \ \ \ \ \text {if}\ v\in {\mathbb {L}}\cap {\mathbb {P}}\\ {[}0, 0], \ \ \ \ \text {otherwise} \end{array}\right. } \end{aligned}$$

(2)

This label design aims to adopt the clinical prior knowledge to the detection of csPCa lesions. Abnormalities should be observed on image sequences related to DWI in common for both the lesions in TZ and PZ ($m_0$), and additional abnormal observations from T2WI are needed for lesions in TZ in order to make more accurate diagnoses ($m_1$)^2,3.

We denote the probability vector ${\varvec{p}}=[p_0, p_1]\in [0,1]^2$ in the output probability map ${{\mathscr {P}}}\in [0,1]^{2\times D\times H\times W}$, for the corresponding voxel $v \in {{\mathscr {I}}}$. The modified CE loss can be written as:

$$\begin{aligned} \begin{aligned} {\mathscr {L}}({{\mathscr {P}}}, {{\mathscr {M}}})&= \sum \limits _{v}-{\varvec{m}} \log {{\varvec{p}}}-({\varvec{1}}-{\varvec{m}})\log {({\varvec{1}}-{\varvec{p}})} \end{aligned} \end{aligned}$$

(3)

where ${\textbf{1}} = [1,1]$, and:

$$\begin{aligned} p_0= & {} \left\{ \begin{array}{ l l } p_0\ {} &{}\text {if}\ m_0\ = 1\\ max(p_0, p_1) &{}\text {if}\ m_0\ = 0 \end{array} \right. \end{aligned}$$

(4)

$$\begin{aligned} p_1= & {} \left\{ \begin{array}{ l l } min(p_0, p_1)\ {} &{}\text {if}\ m_1\ = 1\\ p_1 &{}\text {if}\ m_1\ = 0 \end{array} \right. \end{aligned}$$

(5)

The modified CE loss is designed to suppress prediction vector patterns that should not exist. Based on our labeling design, label vector pattern $\hat{{\varvec{m}}}=[0, 1]$ is not defined, since solely abnormalities found on T2WI have limited contribution to the diagnosis of suspicious PCa^2,3. Therefore, any output probability vectors with $p_1>p_0$ should be penalized in order to teach the model not to conduct such predictions. However, the original CE loss with binary labeling only computes the loss of each class independently but ignores this inter-class relationship. We intentionally add this constraint onto the original CE loss, which is shown in (4) and (5). In (4), $p_0=max(p_0, p_1)$ when $m_0=0$, and in (5), $p_1=min(p_0, p_1)$ when $m_1=1$ both indicate that $p_0$ should be greater than $p_1$ in any prediction outputs, and any patterns disobey this rule should be penalized. This modification could further help the model converge to a better solution.

In addition, we adopt Focal Loss onto the modified CE loss in Eq. (3) to account for the imbalance in the number of voxels between the csPCa lesions and background⁴⁰. This would reduce the relative weight for well-classified voxels and emphasize focus on hard ones like lesion voxels^8,40. The final ZL form follows:

$$\begin{aligned} \begin{aligned} {\mathscr {L}}^{ZL}({{\mathscr {P}}},{{\mathscr {M}}}) = \sum \limits _{v}-&{\varvec{m}}({\varvec{1}}-{\varvec{p}})^{\gamma }\log {{\varvec{p}}}- ({\varvec{1}}-{\varvec{m}}){\varvec{p}}^{\gamma }\log {({\varvec{1}}-{\varvec{p}})} \end{aligned} \end{aligned}$$

(6)

where ${\textbf{1}} = [1,1]$, and ${\varvec{p}}\in [0,1]^2$ is defined in (4) and (5).

Implementation details

In each of the three levels of the network architecture, the channel number is [64, 128, 256] for each level of the convolutional layers of the encoders, and [256, 128, 64] for each level of the convolutional layers in the decoders, correspondingly²⁹. Each level of the convolutional layers comprises four consecutive ConvBlocks, and each ConvBlocks consists of a $3\times 3\times 3$ 3D convolution kernel, following by a LeakyReLU activation function and an instance normalization, following the settings of the nnU-Net²⁹.

Each training procedure takes 60 epochs, with early-stopping strategy applied when no loss degradation for 30 accumulate epochs was found to avoid potential overfitting issues. Adam optimizer⁴¹ was adopted with the loss function of the Focal Loss⁴⁰ by default, the ZL when specifically stated. All models are trained on an Nvidia RTX3090 GPU.

Results

Quantitative results

For csPCa lesion detection, we evaluate the overall csPCa detection performance of the AtPCa-Net using the free-response receiver operating characteristic (FROC) analysis²⁰. The FROC curve helps analyze the relationship between model detection sensitivity and the level of FP predictions per patient. In the experiment, we consider the local maxima on the output probability map as the csPCa detection points. The csPCa detection point is defined as a true positive (TP) when the point is within 5 mm of any csPCa ground truth ROIs to account for a potential mismatch between the whole-mount specimen and mpMRI of the corresponding ROI^8,20.

We also evaluate the per-patient level classification performance of the proposed AtPCa-Net by defining patients with csPCa as positive cases, and patients without csPCa as negative cases. For each patient, we treat the highest value on the output probability map as its probability of having csPCa. The evaluation of the per-patient level classification performance is done by using Receiver Operating Characteristic (ROC) analysis. In both ROC and FROC analysis, we evaluate the model performances by 5-fold cross-validation after 1000 times bootstrapping. ROCs were compared with DeLong Test⁴², and the sensitivity results at each number of FP predictions per patient were compared using Chi-squared Test, in accordance with 95% confidence interval (95% CI), correspondingly.

We performed comparisons between our proposed model and other popular 3D image segmentation models, including SEResUNet²⁵, ResidualUNet⁴³, VNet⁴⁴, AttentionUNet⁴⁵, VoxResNet⁴⁶, nnUNet²⁹, and UNETR⁴⁷ for csPCa detection and patient-level classification. Figure 3 visualizes the comparison of csPCa detection performance among different models via different FROC curves. Figure 4 visualizes the comparison of patient-level classification performance among different models via ROC curves. Table 1 shows the comparisons of the patient level classification AUCs in the format of mean, and the comparisons of csPCa detection performance via showing the sensitivity results against 0.5/1/1.5/2/2.5 FP predictions per patient. Our proposed AtPCa-Net outperforms all other models on all the FROC measurements on 0.5/1/1.5/2/2.5 FP predictions per patient with higher mean sensitivities (p<0.05). The AtPCa-Net also outperforms all other models on the patient-level classification AUCs (p<0.05).

Compared with some of the existing studies proposing PCa detection using PCa biopsy results^4,5,37, our study uses results confirmed by WMHP, which results in additional FN csPCa annotations as the prospectively missed csPCa lesions were retrospectively annotated. In order to discuss the possible performance discrepancies caused by the dataset’s differences between existing studies and ours, we also perform ROC and FROC analysis to the results using the dataset after excluding FN lesions, shown in Table 5. The proposed AtPCa-Net outperforms all other baseline models^{25,43,44,45,46,47} on both ROC and FROC measurements when using the dataset excluding all FN lesions (p<0.05), which keeps consistent to its performance when including FN lesions, as shown in Table 1.

Table 1 Patient-level classification and csPCa detection performance comparisons among different models.

Full size table

Qualitative Results

We qualitatively evaluate the model performance by showing representative examples of csPCa detection performance comparisons in Fig. 5. In Fig. 5, A and B correspond to two patients with csPCa, and C and D correspond to two patients without csPCa. Overall, the proposed AtPCa-Net conducted fewer FP predictions with the same TP predictions on all cases compared with other models^{25,43,44,45,46,47}.

We can also observe its ability to suppress symmetric FP predictions with its better ability to distinguish symmetric abnormal patterns of csPCa from other normal prostate tissue, like BPH and CZ, compared with other models^{25,43,44,45,46,47}. Patients B and C are representative examples of patients who have BPH, and Patient D is a representative example of patients whose CZ’s appearance could mislead the model’s prediction. The BPH, pointed by green arrows for Patient B and C on the MR images, and the CZ region, pointed by the yellow arrows for Patient D, show visually similar appearances as the csPCa on mpMRI images but with symmetric patterns. We can observe that for all patients, other models that do not take the symmetric-related anatomical information into consideration misidentify the BPH and the CZ as csPCa and result in FP predictions. Our models can correctly detect the csPCa with fewer FP predictions with the help of the symmetric-related anatomical-aware architecture design.

Backbone network extension

To show the generalizable potential of the proposed anatomical-aware design, we also try to transplant the architecture onto another UNet-like backbone network. We implement the nnUNet++³⁴, a UNet++⁴⁸ variant, with the proposed ZL and symmetric-aware architecture. Similar to the nnUNet-based approach, the weights of each non-decoder block are shared. In nnUNet++, the feature maps from both sides of the network merge after each skip connection that ends at the decoder blocks on each level, similar to the implementation with nnUNet as the backbone network.

Table 2 shows the performance of using the two backbone networks on the original dataset, and Table 3 shows the performance comparisons when excluding all FN lesions. The AtPCa-Net(nnUNet) and AtPCa-Net(nnUNet++) represents for AtPCa-Net using nnUNet and nnUNet++ as a backbone network, respectively. From both the Table 2 and the Table 3, we can observe that both nnUNet-based AtPCa-Net and nnUNet++-based AtPCa-Net achieved better detection and classification performance than when only using the nnUNet or nnUNet++, respectively (p<0.05). This indicates the generalizable potential of applying the proposed anatomical-aware design with different backbone networks. From Table 2, we see the nnUNet-based AtPCa-Net performs better on patient-level classification and also achieves higher sensitivities at 0.5/1 FP predictions per patient. When the rate of FP predictions per patient raises to 2/2.5 FP predictions per patient, the nnUNet++-based AtPCa-Net achieves higher sensitivities. In Table 3, nnUNet-based AtPCa-Net outperforms the nnUNet++-based AtPCa-Net on all situations, except similar on patient-level classification AUC and sensitivity at 1.5 FP predictions per patient. We select nnUNet as the backbone as it outperforms the nnUNet++-based AtPCa-Net in the majority of situations and represents the nnUNet-based AtPCa-Net as AtPCa-Net if without any other descriptions in this paper.

Table 2 Model performance comparisons using different backbone networks.

Full size table

Table 3 Model performance comparisons using different backbone networks after excluding FN lesions.

Full size table

Ablation study

We conduct ablation studies to discover the importance of each component of the proposed AtPCa-Net, shown in Table 4. We can observe that either modifying the Focal Loss to the ZL or modifying the network architecture to be in the symmetric-aware architecture improves the performance on both per-patient level classification and csPCa detection. When all the components are included, which formats the proposed AtPCa-Net, it outperforms all other situations when only partial components are included, showing the superiority of our proposed method and the usefulness of integrating all the mentioned prostate anatomical-related prior into the model.

Table 4 Ablation study of effects of including/excluding components of the AtPCa-Net.

Full size table

Table 5 Patient-level classification and csPCa detection performance comparisons of different models without FN lesions.

Full size table

Discussion

Our study demonstrated that the anatomical-aware designs, specifically the symmetric-aware architecture and the ZL, of the AtPCa-Net can help improve the csPCa detection performance and also patient-level classification results. We attribute the improvements in our model not only to the zonal-related knowledge learned under the guidance of the anatomically-aware ZL but also to the ability to reduce FP, which is a direct result of our symmetric-aware network architecture design.

The proposed anatomical-aware designs in AtPCa-Net help improve model performance on both csPCa detection and patient-level classification. The ZL shows its effectiveness by taking the lesion appearance differences on mpMRI images in different prostate zones into consideration. There are several approaches^5,6 trying to utilize the zonal information by stacking the zonal mask as part of input together with the CE loss function and have shown improvement in model performance. However, PCa lesions located in different prostate regions are treated identically by using the CE loss, ignoring the essential anatomical information related to PCa’s zonal appearance differences. By using the ZL, an additional anatomical-aware constraint is added, and the zonal masks are further utilized. In addition, the symmetric-aware architecture of the AtPCa-Net helps reduce the FP predictions that are related closely to other normal prostate tissue with similar visual appearances as the PCa lesions on mpMRI, like BPH and CZ. The symmetric nature of the proposed network design helps distinguish the differences between the asymmetric patterns of PCa and the symmetric patterns of other normal prostate tissue. We can see that the integration of both the anatomical-aware designs, the ZL and the symmetric-aware architecture, helps improve the model performance more compared with the situations when including each individual design only, with 4.1%/2.1%/3.6%/3.3%/2.0% sensitivity per 0.5/1/1.5/2/2.5 FP/Patient and 3.7% AUC improvements.

In this study, all patients with csPCa are confirmed by the WMHP results. Different from some of the existing studies using results confirmed by prostate biopsies^4,5,37, our WMHP dataset has the retrospective annotations for MR-visible FN lesions that are prospectively missed. The model performance regarding the FN lesions is important as the missing lesions or underestimation of the PCa’s volume and significance could result in inadequate therapy and consequently undesired oncologic outcomes^32,33. Although both Tables 1 and 5 show the consistent superiority of the proposed AtPCa-Net compared with other models, it also reveals that all model performances dropped on both ROC and FROC measurements when including FN lesions compared with the situation when all FN lesions are excluded. The results highlight the challenges in automatically identifying FN csPCa lesions via deep-learning models, which aligns with the observations from existing study about the difficulty to identify FN lesions in clinical practice³². FN lesions are typically tiny and sometimes might be affected by the spatial resolution of the MRI imaging, making it hard to be detected³². Future studies could potentially be conducted regarding how to build an effective automatic csPCa detection model focusing on issues related to FN csPCa lesions, in conjunction with advancements MR technology to enhance the resolution of mpMRI.

We also evaluated the proposed AtPCa-Net on patients cohorts grouped by different prostate-specific antigen density (PSAD) level. The PSAD level is one of the clinical factors indicating the level of potential risk of patients having PCa^49,50. The results can be found in Table S2 and Table S3 in Supplementary Information. In all, we showed the csPCa detection and patient-level classification performances of the proposed AtPCa-Net on patient cohorts grouped by cut-off PSAD level of 0.15 ng/ml/ml and 0.20 ng/ml/ml, which used as recommended thresholds for evaluating the risk of patients having PCa in existing studies^49,50. The results exhibited the proposed AtPCa-Net performed better in patient cohort with higher PSAD compared with the cohort with lower PSAD in both cut-off settings. Further improvement could be made to improve the model performance when integrating the clinical information with the DL model design. For example, the DL model may be able to capture the risk for the patient having csPCa by the imported PSAD level, and then learn to enhance the prediction efficacy accordingly. Collecting potential related clinical and demographic information and discovering how to effectively integrating them with the DL model designs could be our future research directions.

Several limitations exist in the study. The model evaluations might be affected by the fact that the WMHP dataset was collected from a single institution and with MR machines from a single vendor. In the future, we will expand the WMHP dataset with multi-center collaborations and multi-vendor data, improving the diversity of the dataset with multiple clinical settings and patient demographics, and validate the proposed model’s generalizability and further solidify our findings. In addition, the real-world diagnosis of PCa generally integrates radiological findings together with clinical test results and demographic information^2,3. However, just like other existing studies^{4,5,6,7,8,9,10,23}, our study is limited on only utilizing information from mpMRI images. Potential performance improvements could be achieved if including clinical test results and demographic information in the csPCa detection model design, since they have shown the ability to improve model performance compared with using imaging information only in other computer-aided disease diagnosis studies^{51,52,53,54,55}. In addition, due to the limit role played by the DCE imaging in the clinical practice, we excluded the DCE imaging from the model design, like other existing studies with the same research objectives^{51,52,53,54,55}. As the DCE imaging can provide microvascular structure information, it could also potentially contribute to improved PCa diagnosis by providing imaging information from another perspectives³. The integration of clinical information and radiological findings, and the inclusion of the DCE imaging could be our future research directions.

We have shown that by taking PCa-specific anatomical priors into consideration, the PCa detection model improves its performance on both csPCa detection and patient-level classification. We believe the advantage comes from the key ideas of fusing the anatomical-related clinical priors into the loss function and network architecture design, which can better guide model training. The achievement could potentially influence the future designs of the DL-based PCa detection models on how the anatomical priors could help enhance the performance when integrating with DL model designs. We hypothesize that integrating the disease-specific anatomical-related knowledge into the model design could also potentially improve the model performance for other diseases, which could be a future research direction.

Conclusions

We have demonstrated that by integrating anatomical priors into the deep learning network architecture design, the model efficacy is enhanced on both clinically significant prostate cancer (csPCa) detection and patient-level classification. Adopted from the clinical interpretation, the anatomical priors are carefully achieved by a symmetric-aware architecture design and the Zonal Loss (ZL), which format the proposed 3D anatomical-aware prostate cancer detection network (AtPCa-Net). Our experiments show that the model performance improves when either symmetric-related anatomical priors or zonal appearance differences of PCa are considered, with the best results achieved when the model incorporates both information. The proposed AtPCa-Net shows superior performance to other baseline models in both csPCa detection and patient-level classification, and shows the potential to further reduce the number of unnecessary biopsies may be caused by using current DL models. Our approach also reveals the potential flexibility of the anatomical-aware designs as they can improve the model performance with different backbone networks. How to generalize the anatomical-aware design idea to other specific diseases and how to integrate the design with clinical test results and demographic information could be our future research directions.

Data availability

The dataset collected from our institution is currently not publicly available, since the IRB only approves its use for internal usage. The data might be available for research purposes on reasonable request or institutional collaborations. Please contact the corresponding author for any dataset-specific requests.

References

Rawla, P. Epidemiology of prostate cancer. World J. Oncol. 10, 63–89 (2019).
Article CAS PubMed PubMed Central Google Scholar
Weinreb, J. C. et al. Pi-rads prostate imaging-reporting and data system: 2015, version 2. Eur. Urol. 69, 16–40 (2016).
Article PubMed Google Scholar
Turkbey, B., Rosenkrantz, A. & Haider, M. e a. Prostate imaging reporting and data system version 2.1: 2019 update of prostate imaging reporting and data system version 2. Eur. Urol. 76, 340–351 (2019).
Article PubMed Google Scholar
Seetharaman, A. et al. Automated detection of aggressive and indolent prostate cancer on magnetic resonance imaging. Med. Phys. 48, 2960–2972 (2021).
Article PubMed Google Scholar
Hosseinzadeh, M. et al. Deep learning-assisted prostate cancer detection on bi-parametric MRI: Minimum training data size requirements and effect of prior knowledge. Eur. Radiol. 32, 2224–2234 (2022).
Article CAS PubMed Google Scholar
Vente, C. d, Vos, P., Hosseinzadeh, M., Pluim, J. & Veta, M. Deep learning regression for prostate cancer detection and grading in bi-parametric MRI. IEEE Trans. Biomed. Eng. (TBME) 68, 374–383 (2020).
Article Google Scholar
Yu, X. et al. False positive reduction using multiscale contextual features for prostate cancer detection in multi-parametric MRI scans. In IEEE International Symposium on Biomedical Imaging (ISBI) 1355–1359 (2020).
Cao, R. et al. Joint prostate cancer detection and gleason score prediction in mp-mri via focalnet. IEEE Trans. Med. Imaging (TMI) 38, 2496–2506 (2019).
Article Google Scholar
Cao, R. et al. Performance of deep learning and genitourinary radiologists in detection of prostate cancer using 3-T multiparametric magnetic resonance imaging. J. Magn. Reson. Imaging 54, 474–483 (2021).
Article PubMed PubMed Central Google Scholar
Saha, A., Hosseinzadeh, M. & Huisman, H. End-to-end prostate cancer detection in bpMRI via 3D CNNs: Effects of attention mechanisms, clinical priori and decoupled false positive reduction. Med. Image Anal. 73, 102155 (2021).
Article PubMed Google Scholar
Verma, N., Mahajan, D., Sellamanickam, S. & Nair, V. Learning hierarchical similarity metrics. In Conference on Computer Vision and Pattern Recognition (CVPR) 2280–2287 (2012).
Bertinetto, L., Mueller, R., Tertikas, K., Samangooei, S. & Lord, N. A. Making better mistakes: Leveraging class hierarchies with deep networks. In Conference on Computer Vision and Pattern Recognition (CVPR) (2020).
Li, L., Zhou, T., Wang, W., Li, J. & Yang, Y. Deep hierarchical semantic segmentation. In Conference on Computer Vision and Pattern Recognition (CVPR) 1236–1247 (2022).
Yu, J. et al. Prostate cancer and its mimics at multiparametric prostate MRI. Br. J. Radiol. 87, 20130659 (2014).
Article CAS PubMed PubMed Central Google Scholar
Panebianco, V. et al. Pitfalls in interpreting mp-MRI of the prostate: A pictorial review with pathologic correlation. Insights Imaging 6, 611–630 (2015).
Article CAS PubMed PubMed Central Google Scholar
Smith, R., Mettlin, C. & Huang, E. Cancer screening and early detection. Holland-Frei Cancer Med. 31, 141 (2003).
Google Scholar
Barentsz, J. O. et al. ESUR prostate MR guidelines 2012. Eur. Radiol. 22, 746–757 (2012).
Article PubMed PubMed Central Google Scholar
Wagemans, J. Characteristics and models of human symmetry detection. Trends Cogn. Sci. 1, 346–352 (1997).
Article CAS PubMed Google Scholar
Booven, D. J. V. et al. A systematic review of artificial intelligence in prostate cancer. Res. Rep. Urol. 13, 31–39 (2021).
PubMed PubMed Central Google Scholar
Litjens, G., Debats, O., Barentsz, J., Karssemeijer, N. & Huisman, H. Computer-aided detection of prostate cancer in mri. IEEE Trans. Med. Imaging (TMI) 33, 1083–1092 (2014).
Article Google Scholar
Bhattacharya, I. et al. A review of artificial intelligence in prostate cancer detection on imaging. Therapeut. Adv. Urol. 14, 859 (2022).
Google Scholar
Sufyan, M., Shokat, Z. & Ashfaq, U. A. Artificial intelligence in cancer diagnosis and therapy: Current status and future perspective. Comput. Biol. Med. 165, 107356 (2023).
Article PubMed Google Scholar
Li, Y., Wu, Y., Huang, M., Zhang, Y. & Bai, Z. Attention-guided multi-scale learning network for automatic prostate and tumor segmentation on mri. Comput. Biol. Med. 165, 107374 (2023).
Article PubMed Google Scholar
Duran, A. et al. Prostattention-net: A deep attention model for prostate cancer segmentation by aggressiveness in mri scans. Med. Image Anal. 77, 102347 (2022).
Article PubMed Google Scholar
Hu, J., Shen, L. & Sun, G. Squeeze-and-excitation networks. In Conference on Computer Vision and Pattern Recognition (CVPR) 7132–7141 (2018).
Sun, Y. & Ji, Y. AAWS-Net: Anatomy-aware weakly-supervised learning network for breast mass segmentation. PLoS ONE 16, e0256830 (2021).
Article CAS PubMed PubMed Central Google Scholar
Kamal, U., Zunaed, M., Nizam, N. B. & Hasan, T. Anatomy-XNet: An anatomy aware convolutional neural network for thoracic disease classification in chest X-rays. IEEE J. Biomed. Health Inf. (JBHI) 26, 5518–5528 (2022).
Article Google Scholar
Ma, D. et al. LF-UNet—a novel anatomical-aware dual-branch cascaded deep neural network for segmentation of retinal layers and fluid from optical coherence tomography images. Comput. Med. Imaging Graph. 94, 101988 (2021).
Article PubMed Google Scholar
Isensee, F. et al. nnu-net: A self-configuring method for deep learning-based biomedical image segmentation. Nat. Methods 2021, 859 (2021).
Google Scholar
Kozlowski, P., Chang, S., Jones, E. & Goldenberg, S. Assessment of the need for dce mri in the detection of dominant lesions in the whole gland: Correlation between histology and mri of prostate cancer. NMR Biomed. 31, 526 (2018).
Article Google Scholar
Brancato, V. et al. Assessment of dce utility for pca diagnosis using pi-rads v2.1: Effects on diagnostic accuracy and reproducibility. Diagnostics 10, 563 (2020).
Article Google Scholar
Borofsky, S. et al. What are we missing? false-negative cancers at multiparametric mr imaging of the prostate. Radiology 286, 186–195 (2018).
Article PubMed Google Scholar
Lamb, B. W. et al. Is prebiopsy MRI good enough to avoid prostate biopsy? A cohort study over a 1-year period. Clin. Genitourin. Cancer 13, 512–517 (2015).
Article PubMed Google Scholar
Hung, A. L. Y. et al. CAT-Net: A cross-slice attention transformer model for prostate zonal segmentation in MRI. In IEEE Transactions on Medical Imaging (TMI) (2022).
Tustison, N. J. et al. N4ITK: Improved N3 bias correction. IEEE Trans. Med. Imaging (TMI) 29, 1310–1320 (2010).
Article Google Scholar
Zheng, H. et al. Integrative machine learning prediction of prostate biopsy results from negative multiparametric MRI. J. Magn. Reson. Imaging 55, 100–110 (2022).
Article PubMed Google Scholar
Liu, P. et al. A prostate cancer computer-aided diagnosis system using multimodal magnetic resonance imaging and targeted biopsy labels. In SPIE Medical Imaging 86701G–86701G–6 (2013).
Vargas, H. A. et al. Diffusion-weighted endorectal MR imaging at 3 T for prostate cancer: Tumor detection and assessment of aggressiveness. Radiology 259, 775–784 (2011).
Article PubMed PubMed Central Google Scholar
Chicco, D. Siamese Neural Networks: An Overview 73–94 (Springer, 2021).
Google Scholar
Lin, T., Goyal, P., Girshick, R., He, K. & Dollar, P. Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. (TPAMI) 42, 318–327 (2020).
Article Google Scholar
Kingma, D. P. & Ba, J. Adam: A Method for Stochastic Optimization (CoRR, 2015).
Google Scholar
Delong, E. R. D. C. P. & Delong, D. M. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 44, 859 (1988).
Article Google Scholar
Bhalerao, M. & Thakur, S. Brain tumor segmentation based on 3d residual u-net. In Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries 218–225 (2020).
Chen, C. et al. Domain knowledge powered deep learning for breast cancer diagnosis based on contrast-enhanced ultrasound videos. IEEE Trans. Med. Imaging (TMI) 40, 2439–2451 (2021).
Article Google Scholar
Oktay, O. et al. Attention u-net: Learning where to look for the pancreas. In Medical Imaging with Deep Learning (2018).
Chen, H., Dou, Q., Yu, L., Qin, J. & Heng, P.-A. Voxresnet: Deep voxelwise residual networks for brain segmentation from 3d mr images. Neuroimage 170, 446–455 (2018).
Article PubMed Google Scholar
Hatamizadeh, A., Yang, D., Roth, H. R. & Xu, D. Unetr: Transformers for 3d medical image segmentation. In 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 1748–1758 (2022).
Zhou, Z., Rahman Siddiquee, M. M., Tajbakhsh, N. & Liang, J. Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support 3–11 (2018).
Yusim, I., Krenawi, M., Mazor, E., Novack, V. & Mabjeesh, N. J. The use of prostate specific antigen density to predict clinically significant prostate cancer. Sci. Rep. 10, 20015 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Washino, S. et al. Combination of prostate imaging reporting and data system (pi-rads) score and prostate-specific antigen (psa) density predicts biopsy outcome in prostate biopsy naïve patients. BJU Int. 119, 225–233 (2017).
Article CAS PubMed Google Scholar
Huang, S.-C., Pareek, A., Seyyedi, S., Banerjee, I. & Lungren, M. P. Fusion of medical imaging and electronic health records using deep learning: A systematic review and implementation guidelines. NPJ Digital Med. 3, 136 (2020).
Article Google Scholar
Reda, I. et al. Deep learning role in early diagnosis of prostate cancer. Technol. Cancer Res. Treatment 17, 1533034618775530 (2018).
Article Google Scholar
Zheng, H. et al. Multiparametric mri-based radiomics model to predict pelvic lymph node invasion for patients with prostate cancer. Eur. Radiol. 32, 5688–5699 (2022).
Article CAS PubMed PubMed Central Google Scholar
Dong, C. et al. Learning from dermoscopic images in association with clinical metadata for skin lesion segmentation and classification. Comput. Biol. Med. 152, 106321 (2023).
Article PubMed Google Scholar
Liu, Y. et al. Functional outcome prediction in acute ischemic stroke using a fused imaging and clinical deep learning model. Stroke 24, 2316 (2023).
Article Google Scholar

Download references

Funding

This research was funded in part by the National Institutes of Health under grants R01-CA248506 and R01-CA272702, and by the Integrated Diagnostics Program of the Departments of Radiological Sciences and Pathology in the UCLA David Geffen School of Medicine.

Author information

Authors and Affiliations

Radiological Sciences, University of California, Los Angeles, Los Angeles, 90095, USA
Haoxin Zheng, Alex Ling Yu Hung, Qi Miao, Steven S. Raman, Kai Zhao & Kyunghyun Sung
Computer Science, University of California, Los Angeles, Los Angeles, 90095, USA
Haoxin Zheng, Alex Ling Yu Hung & Fabien Scalzo
Electrical and Computer Engineering, University of California, Los Angeles, Los Angeles, 90095, USA
Weinan Song
The Seaver College, Pepperdine University, Los Angeles, 90363, USA
Fabien Scalzo

Authors

Haoxin Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Alex Ling Yu Hung
View author publications
You can also search for this author in PubMed Google Scholar
Qi Miao
View author publications
You can also search for this author in PubMed Google Scholar
Weinan Song
View author publications
You can also search for this author in PubMed Google Scholar
Fabien Scalzo
View author publications
You can also search for this author in PubMed Google Scholar
Steven S. Raman
View author publications
You can also search for this author in PubMed Google Scholar
Kai Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Kyunghyun Sung
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.Z., A.H., K.Z. and Q.M. conceived the experiments; H.Z. conducted the experiments; W.S., Q.M., S.R. and K.S. provided clinical insights; H.Z. prepared and analysed the results; H.Z. and K.Z. wrote the main manuscript text; All authors reviewed the manuscript.

Corresponding author

Correspondence to Haoxin Zheng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Tables.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zheng, H., Hung, A.L.Y., Miao, Q. et al. AtPCa-Net: anatomical-aware prostate cancer detection network on multi-parametric MRI. Sci Rep 14, 5740 (2024). https://doi.org/10.1038/s41598-024-56405-7

Download citation

Received: 13 November 2023
Accepted: 06 March 2024
Published: 08 March 2024
DOI: https://doi.org/10.1038/s41598-024-56405-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Prostate cancer malignancy detection and localization from mpMRI using auto-deep learning as one step closer to clinical utilization

Deep learning for fully automatic detection, segmentation, and Gleason grade estimation of prostate cancer in multiparametric magnetic resonance images

Prostate Cancer Detection using Deep Convolutional Neural Networks

Introduction

Related works

Prostate cancer detection

Anatomical-aware design for other diseases

Methods

Overview

Dataset

Study population and mpMRI images

Clinical interpretation and annotations

Preprocessing

Symmetric-aware network architecture

Zonal loss

Implementation details

Results

Quantitative results

Qualitative Results

Backbone network extension

Ablation study

Discussion

Conclusions

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Tables.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links