Deep learning-based optical coherence tomography angiography image construction using spatial vascular connectivity network

Le, David; Son, Taeyoon; Kim, Tae-Hoon; Adejumo, Tobiloba; Abtahi, Mansour; Ahmed, Shaiban; Rossi, Alfa; Ebrahimi, Behrouz; Dadzie, Albert; Ma, Guangying; Lim, Jennifer I.; Yao, Xincheng

doi:10.1038/s44172-024-00173-9

Download PDF

Article
Open access
Published: 09 February 2024

Deep learning-based optical coherence tomography angiography image construction using spatial vascular connectivity network

Communications Engineering volume 3, Article number: 28 (2024) Cite this article

1045 Accesses
Metrics details

Subjects

Abstract

Optical coherence tomography angiography (OCTA) provides unrivaled capability for depth-resolved visualization of retinal vasculature at the microcapillary level resolution. For OCTA image construction, repeated OCT scans from one location are required to identify blood vessels with active blood flow. The requirement for multi-scan-volumetric OCT can reduce OCTA imaging speed, which will induce eye movements and limit the image field-of-view. In principle, the blood flow should also affect the reflectance brightness profile along the vessel direction in a single-scan-volumetric OCT. Here we report a spatial vascular connectivity network (SVC-Net) for deep learning OCTA construction from single-scan-volumetric OCT. We quantitatively determine the optimal number of neighboring B-scans as image input, we compare the effects of neighboring B-scans to single B-scan input models, and we explore different loss functions for optimization of SVC-Net. This approach can improve the clinical implementation of OCTA by improving transverse image resolution or increasing the field-of-view.

Integrated deep learning framework for accelerated optical coherence tomography angiography

Article Open access 25 January 2022

Deep learning-enabled ultra-widefield retinal vessel segmentation with an automated quality-optimized angiographic phase selection tool

Article 09 August 2021

An open-source deep learning network AVA-Net for arterial-venous area segmentation in optical coherence tomography angiography

Article Open access 17 April 2023

Introduction

Optical coherence tomography (OCT) enables the non-invasive visualization of individual retinal layers with micrometer-level resolution. As one modality extension of OCT, OCT-angiography (OCTA) provides unparalleled capability for depth-resolved visualization of retinal vasculature at the microcapillary level resolution. OCTA is label-free and thus is completely non-invasive, compared to traditional fluorescein angiography. Studies have shown that OCTA provides improved capability for detecting subtle vascular distortions associated with the progression of retinal pathology, such as vessel dropout, foveal abnormalities, and increased vessel tortuosity^1,2. Studies demonstrate that OCTA could even detect microaneurysms that were undetected on dilated clinical examination³.

The principle of OCTA is that repeated OCT scans from one location are acquired for temporal vascular connectivity (TVC) processing to map retinal vasculature at the microcapillary resolution. Therefore, OCTA can be obtained from existing OCT systems with the addition of unique scan protocols and data processing algorithms⁴. However, the fundamental similarity between all OCTA instruments is that repeated OCT scans from one location are required for correlation analysis of sequential images to identify regions with active blood flow. Therefore, OCTA requires higher imaging speeds than most currently available OCT systems can provide in order to obtain a densely sampled volume. Conventional OCT device scanning speeds would result in too much trade-off between decreased field of view, lower image quality, and greatly increased scanning time. Additionally, the prolonged scanning time may also increase the potential effect of motion artifacts, such as blinking and microsaccades ⁵.

A potential solution lies in the use of deep learning algorithms. In recent years, deep learning, a subset of machine learning and artificial intelligence, has been making strides in ophthalmic research^6,7,8,9,10. The principle behind deep learning is that the algorithm can learn directly from the training data and can objectively perform the required task. An example application is deep learning for artificial intelligence screening of retinopathies^{11,12,13,14,15}. Current screening procedures require clinicians to manually examine retinal photographs. This can, therefore, lead to inter- and intra-rater variability; the same clinician could classify the same image differently on different days. Furthermore, to manually screen retinal photographs is a time-consuming process. Therefore, the deployment of artificial intelligence algorithms could alleviate these problems. Recent studies in deep learning OCTA have primarily been focused on the classification of eye diseases such as diabetic retinopathy^16,17,18, age-related macular degeneration^19,20,21, and glaucoma^22,23,24. Other applications include improving the image quality of OCTA^25,26 and artery–vein segmentation^{27,28,29,30,31}. Recently, deep learning has also been explored for OCTA construction^32,33,34,35. While deep learning algorithms can detect large blood vessel branches in OCT readily, it is technically challenging to identify microcapillaries reliably.

We hypothesize that deep learning could leverage spatial vascular connectivity (SVC), i.e., brightness connectivity along the vessel direction, in a single-scan-volumetric OCT for OCTA construction. In this study, we train a convolutional neural network, titled SVC-Net, that leverages SVC inputs for OCTA construction. We show that SVC can be used reliably to predict microcapillary structures. We verify the feasibility of a deep learning approach using a dataset composed of single-scan-volumetric OCTs from animal and human eyes. In addition, we compare the differences between TVC and SVC-based signals in traditional OCTA construction and perform ablation studies on the optimization of deep learning models by different loss functions.

Results

The deep learning framework

In this section, we provide an overview of our deep learning-based method for OCTA construction from a single-scan OCT volume. A conceptual diagram of our proposed methodology is shown in Fig. 1a. Our hypothesis is that the blood flow should affect the reflectance brightness profile along the vessel direction (Fig. 1b). In other words, spatial intensity variance among the vessels in a single-scan-volumetric OCT, as shown in Fig. 1c, can be equivalent to the temporal intensity variance of the same vessel location in the sequential images in the multi-scan-volumetric OCT for conventional OCTA construction. Our deep learning-based OCTA construction framework, which we term SVC-Net (for details of network architecture, see “Methods” section), was trained and tested using spectral domain-OCT images acquired from mouse and human retina. The ground truth is based on conventional speckle variance OCTA construction from four repeated OCT B-scans. The input into SVC-Net will be comprised of OCT B-scans from single-scan OCT volume, and the output of SVC-Net will be the OCTA B-scan.

Optimization of neighboring scans

To leverage deep learning as a potential strategy for reliable OCTA construction, we first determine if SVC can provide the required information that the model can learn from. Therefore, we performed an ablation study to evaluate the effects of the different numbers of B-scans used for OCTA construction using the TVC, i.e., repeated B-scans, and the SVC, i.e., adjacent neighbor B-scans. To compare TVC and SVC, we use intensity-based speckle variance to generate OCTA. This procedure will help to determine the optimal number of neighboring B-scans to be used as input into SVC-Net. We illustrate the effect of the number of B-scans used on OCTA image quality using TVC and SVC. In our dataset, each volume contains four repeated B-scans. For qualitative comparison, we calculate OCTA for different numbers of B-scans, namely two, three, and four, which we refer to as 2 N, 3 N, and 4 N, respectively. The results of this ablation study are illustrated in Fig. 2 and Fig. 3 for animal and human datasets, respectively.

**Fig. 2: Comparison of conventional speckle variance OCTA construction in the mouse eye.**

**Fig. 3: Comparison of conventional speckle variance OCTA construction in the human eye.**

For TVC-based speckle variance (SV) processing, visual observations show as the number of repeated B-scans increases, the noise is reduced, as illustrated by Fig. 2 and Fig. 3. Therefore, the four B-scans for TVC-based SV processing have the best image quality. On the other hand, for SVC, there is a different trend; it can be observed that there is an optimal number of B-scans for SVC-based SV processing. For the animal dataset, we can observe that the SVC-2N has the worst image quality and that the SVC-3N and SVC-4N have comparable image quality. However, in the human dataset, SVC-4N, as compared to SVC-3N, decreases the vascular detail due to the increase in a blur effect, which can be visibly observed in the representative en face. This type of artifact is commonly described in OCTA as ‘vessel doubling’ due to poor registration. In the case of SVC-4N, since the vessels are not completely duplicated, we can refer to this artifact as pseudo-vessel doubling.

For the quantitative comparison, since the TVC-4N has the best qualitative performance, it will be the ground truth for comparison. We quantify the multi-scale structural similarity index measure (MS-SSIM) and the peak-signal-to-noise-ratio (PSNR) for the TVC 2 N and 3 N, and the SVC 2 N, 3 N, and 4 N en face images. The statistical information of this analysis is summarized in Tables 1 and 2. For the animal dataset, we confirm our qualitative observations that increasing the number of B-scans improves performance, as the TVC-3N has the best overall MS-SSIM and PSNR. Meanwhile, for the SVC, we observe that for the animal dataset, there is a decreasing trend for the MS-SSIM from SVC-2N to SVC-4N. However, we observed for the PSNR, the optimal number of B-scans was the SVC-3N. For the human dataset, we observe similar trends as compared to the mouse dataset, with the TVC having an improved performance on both metrics when using more B-scans. For the SVC metrics, we observe that for both the MS-SSIM and PSNR, the optimal number of B-scans was using SVC-3N since it has better performance than SVC-2N and SVC-4N. Therefore, for SVC-Net, based on qualitative observation and quantitative analysis, we will use a 3 N input, i.e., comprised of three adjacent neighboring B-scans.

Table 1 Evaluation metrics on conventional speckle variance OCTA using temporal and spatial scans.

Full size table

Table 2 Post hoc analysis of the different methods of conventional speckle variance OCTA.

Full size table

Microcapillary vessels visualization

The primary usage of OCTA is to observe the en face projections of the retinal vascular layers. Therefore, we perform both qualitative and quantitative analyses for the en face projection of the superficial vascular plexus (SVP) and deep vascular plexus (DVP). To compare the effects of the SVC, we qualitatively compare the 1 N and 3 N models to the ground truth en face of the SVP and DVP for the animal eye in Fig. 4. For the SVP, we observe that large vessels were constructed in both the 1 N and 3 N models. However, an example of a large vessel that progressed to a smaller vessel was observed to have poor construction in the 1 N model. Whereas in the 3 N model, the same vessel was constructed properly. Showing that SVC helped preserve the details of smaller vessels. For the DVP, we observed that the 1 N model was able to predict some capillary structures. However, due to the poor contrast, it has a noisier appearance. When compared to the 3 N model, the capillaries are reconstructed in finer detail.

**Fig. 4: Effect of SVC on en face OCTA prediction on mouse eye.**

Example visualization of the deep learning results on the effect of SVC on human eyes are shown in Fig. 5. On the en face OCT of both the SVP and DVP, as shown in Fig. 5, we can observe that there are distinct capillary structures with less contrast as compared to OCTA. Therefore, it explains why we can observe that the 1 N and 3 N models are able to predict both the large and small vessel structures in the SVP, as both the intensity and structural information are present for the model to learn from in order to predict the vessels. In the DVP, we can observe that the contrast in the 3 N model is better than the prediction of the 1 N model. This suggests that the SVC improves the model’s performance to predict the finer structural details in capillary-level vessels.

**Fig. 5: Effect of SVC on en face OCTA prediction on the human eye.**

Effect of the loss function

In this study, we also evaluate the effect of the different loss functions, i.e., mean squared error (MSE) and structural similarity index measure (SSIM) loss functions, for OCTA construction on the 1 N and 3 N models. Examples of output en face images from an animal eye of each model are illustrated in Fig. 6. Comparison between the 1 N models trained with MSE and SSIM, we observe a discernable increase in noise when the model is trained with the MSE loss function. In comparison to the 3 N models trained with MSE and SSIM, it results in noise reduction and contrast improvement, with the 3 N model trained with SSIM having the overall best contrast. In the DVP, we can observe that the 1 N trained with SSIM has a better capillary level structure as compared to the 1 N trained with the MSE, and when SVC is employed, it improves the contrast of the capillaries.

**Fig. 6: Effect of loss function on en face OCTA prediction on mouse eye.**

Next, we compare the effects of the loss functions on the model’s performance in a human eye, as illustrated in Fig. 7. We observe that for the 1 N models, the model trained with the MSE loss function has higher levels of noise and poorer contrast as compared to the 1 N model trained with the SSIM loss function. The modification of the loss function improved the 1 N’s performance to produce the capillary level structures in higher contrast. Similar observations can be seen for the 3 N models. In the model trained with the MSE loss function, in the SVP, we can observe that some of the capillary level structures are predicted. However, there is relatively more noise. Whereas in the model trained with the SSIM loss function, the noise level is reduced. We can also observe that in the DVP, the capillary level structures seem more dilated; this could be due to the lower levels of contrast between the fine vessels. In the human dataset, the 3 N input with SSIM is the best-performing model and can produce vessel structures with higher contrast.

**Fig. 7: Effect of loss function on en face OCTA prediction on the human eye.**

The evaluation metrics, MS-SSIM and PSNR, were quantified on both the SVP and DVP en faces to quantitatively compare the performances of the four models, and statistical analysis is summarized in Tables 3 and 4, respectively. For the MS-SSIM metric, it can be observed that, for both the animal and human datasets, the 1 N model trained with MSE had the lowest performance, followed by the 1 N model trained with the SSIM loss function, which had a slight improvement. The introduction of the SVC significantly improved the similarity between ground truth and predicted en face images. The 3 N model trained with MSE had significantly better results than both 1 N models. With the modification of the loss function, the 3 N model trained with SSIM had the best performance.

Table 3 Evaluation metrics on deep learning results for different combinations of input type and loss function.

Full size table

Table 4 Post hoc analysis of the different deep learning models as compared to the ground truth conventional OCTA.

Full size table

For the PSNR measurement, we observed that in the SVP, there were only slight differences between the loss functions while using the same input types (single or neighbored inputs). This may be due to the presence of large vessels in the SVP, which, regardless of the loss functions, the convolutional neural network (CNN) was able to predict consistently and had maximum pixel intensity, e.g., 255. However, when observing the quantitative evaluations for the DVP, we observe PSNR distinguishable improvements of the loss function and input type on the model’s performance. This may be due to the abundance of smaller capillary level structures, where if the predicted image had an increase in noise, it could be reflected in the PSNR value. The use of SVC and the SSIM loss function reduced the noise level and improved the PSNR. For both the mouse and human datasets, the 3 N model trained with the SSIM loss function had the best overall performance.

Connectivity analysis

To assess disparities in vessel connectivity and overall vessel structure, we conducted a quantitative analysis of OCTA utilizing well-established metrics: vessel area density, vessel skeleton density, and vessel perimeter index. The outcomes of this analysis, which contrast the ground truths with deep learning predictions derived from models utilizing diverse inputs (1 N and 3 N) and loss functions (MSE and SSIM), are presented in Table 5.

Table 5 Comparison of quantitative OCTA feature analysis on en face images from conventional OCTA and different deep learning models.

Full size table

Our examination reveals that the 1 N models consistently exhibit lower p-values in comparison to the ground truth, indicating detectable distinctions in values. Conversely, in the case of the 3 N models, we note both statistically insignificant disparities and relatively high p-values. This suggests that the predicted images maintain comparable structural connectivity to the ground truths. The evaluation of these quantitative metrics bears clinical relevance, as one of the fundamental applications of OCTA is the detection of retinal vascular changes.

Retinopathy

To assess the robustness of our proposed method, we conducted an evaluation using the best-performing model (3 N with SSIM loss) on an eye afflicted with proliferative diabetic retinopathy. Figure 8 presents representative images of this comparison. Notably, it becomes evident that the model’s en face prediction effectively enhances the visualization of microaneurysms when compared to conventional OCTA. Additionally, upon evaluating the cross-sectional B-scans, we can observe a heightened brightness of vessels that exhibit low contrast in the conventional OCTA images, as depicted in the predicted image. An overarching consistency is observed in the blinking artifacts within the OCT en face image, which is markedly conspicuous in the conventional OCTA en face image. However, within the predicted en face image, a noticeable smoothing effect is apparent, yielding an overall improvement in image quality.

Discussion

In this study, we reported a fully automated convolutional network (FCN), SVC-Net, for OCTA construction that leverages the use of spatial vascular connectivity in OCT for vascular structure prediction using a single OCT volume. We quantitatively determined the optimal number of adjacent B-scans for the input into SVC-Net and the differences in the number of B-scans used for SV calculation between TVC and SVC. We conclude that three adjacent neighbors, 3 N, is the most optimal input into SVC-Net. We quantitatively compare the effects of SVC by comparing the performance of using two different inputs, a single OCT B-scan input, 1 N, and three adjacent OCT B-scan inputs, 3 N. We demonstrate that the 3 N model has superior performance compared to the 1 N model. In addition, we also compare the effects of different loss functions, i.e., MSE and SSIM loss functions, on the model’s performance. Our study demonstrates that the SSIM loss function has superior performance over the MSE loss function. Our proposed method has been trained and tested on both animal and human OCT datasets. The ability to leverage single OCT volumes to generate OCTA can increase the speed of image acquisition by alleviating the need for multiple repetitions, reducing eye movement, and potentially increasing the FOV.

OCTA construction requires the acquisition of multiple OCT repetitions at the same imaging location, which, therefore, limits the imaging speed and FOV. In this study, we performed an ablation study to compare the effects of using different numbers of adjacent B-scans using SV calculation for OCTA construction. For quantitative comparison, the 2 N and 4 N had more noise compared to the 3 N. Qualitative observation, in particular for the human dataset, 4 N results in a pseudo vessel doubling artifact due to the larger area used for SV calculation. Therefore, 3 N had the optimal performance and was chosen as input into SVC-Net.

This observation has theoretical support in that the adjacent B-scans correspond to both spatial and temporal differences. Therefore, it carries information that can be used to estimate areas of hemodynamic changes, i.e., vascular tissue. For the SV calculation, the method uses a vector to determine the OCTA, i.e., for 3 N, it uses a vector of length 3. In principle, using an FCN, the model can leverage a localized region. For example, the standard convolutional filter size is of \(3\times 3\times N\), as an input into the first layer of the SVC-Net, the FCN uses a localized region of \(3\times 3\times 3\). In addition, as the information is carried through the FCN, global information is also used in the decision-making process. Therefore, the FCN can better predict vascular tissue compared to the SV method due to the larger number of pixels it can leverage. The performance of SVC-Net using the 3 N model reveals improved vascular connectivity compared to the SV-3N method, supporting the hypothesis that the FCN is able to leverage a larger number of pixels for vessel prediction. On the human dataset, we do note that the smaller vessels in the SVP have less contrast compared to the DVP in the deep learning models. This may be due to the strong signal from the nerve fiber layer, which may minimize the signal for the smaller vessels in the SVP. In the DVP, the vessel structure between the different models, i.e., 1 N and 3 N, are similar because the DVP is bounded by two hypo-reflective layers, namely the inner nuclear layer and the outer nuclear layer. Therefore, the contributing signal for vascular prediction can be clearly determined by the CNN.

There have been a limited number of studies that have explored methods to alleviate this limitation using deep learning. Lee et al. demonstrated the single OCT B-scan input for OCTA construction in a human dataset using a similar U-Net type model³⁴. In their work, they demonstrated that using an input-B-scan to output B-scan strategy, they can primarily predict the large blood vessels. At the same time, the smaller capillary-sized vessels have poor contrast and higher levels of noise. The results in this study for single OCT input are consistent with the results presented in Lee et al. This could primarily be due to the large vessels having better contrast compared to the smaller vessels in the OCT B-scan. In the study by Li et al., they demonstrate an input volume to output volume strategy in animal models using a generative adversarial network³⁵. Where the input is three adjacent OCT B-scans, and the output is three adjacent OCTA B-scans. While their results did not demonstrate capillary-level vessel structures, they did demonstrate that the use of SVC can help the deep learning model to predict higher performance metrics. The results in our study for SVC demonstrate capillary level vessels in both animal and human datasets. Overall, our methodology differs from the two aforementioned studies in that our strategy follows an input-volume-to-output B-scan strategy. The connectivity between the adjacent B-scans can provide the required information to accurately predict vessels of varied sizes.

In deep learning, there are many different hyperparameters that can be optimized for improved performance. Many studies often focus on the network architecture design, e.g., the depth or width of the network, or they develop different operations, e.g., atrous convolutions, depth-wise convolutions, etc. While all these hyperparameters play a role in the model’s performance, one of the most fundamental hyperparameters of a CNN is the loss function layer. The choice of the loss function ultimately drives CNN’s ability to learn its intended task³⁶. In this study, we performed an ablation study to compare two loss functions, the MSE and SSIM. The results of our study, when compared to Lee et al.³⁴, using a single input and optimized with the MSE loss function on the animal and human dataset, demonstrate mainly large vessels are predicted, and the smaller vessels have poorer contrast. In our study, when we optimize the model using the SSIM loss function, we can observe a lower level of noise and improved vascular prediction for the animal dataset and human dataset. There are also quantifiable differences as measured using the PSNR; we can observe an improved PSNR for both the SVP and DVP between the MSE and SSIM models.

Traditionally, MSE has been used for image reconstruction tasks. The MSE compares the ground truth and the predicted CNN image at the individual pixel level. The MSE models a quadratic function. Therefore, it can be easy to optimize due to its singular global minima characteristic. However, in many cases, an individual pixel is related to its surrounding pixel. In this case, there is a limitation to how much the MSE can optimize the deep learning model. On the other hand, the SSIM as a loss function evaluates three different parameters: luminance, contrast, and structure for a localized patch in the image. SSIM has been extensively used as an image quality metric. Therefore, it is reason that applying the SSIM as a loss function in image construction tasks can better optimize the deep learning model. When we combine the SVC and the loss function, we achieve the best-performing model. The model is trained to use the localized connectivity of the input, i.e., the adjacent B-scans, and is further optimized to achieve localized structural similarity in the predicted OCTA B-scan.

We have proposed a novel approach for OCTA construction using a single OCT volume for capillary-level visualization. However, there are some limitations with this study; for each of the dataset types (animal or human), the study is limited to a single OCT device. To demonstrate the generalization of this method, validation on different devices should be implemented. In addition, as an initial study, the dataset is limited to healthy eyes, in particular for human subjects. We evaluated our model on one disease name, namely proliferative diabetic retinopathy. For future considerations, we would need to evaluate this method in different eye conditions and disease states. Different eye conditions may affect the connectivity of the vasculature differently and, therefore, need to be further elucidated. Another variable to consider is that our proposed method has primarily been validated for OCTA constructed using the SV method; future studies should consider validating this method for other types of OCTA construction algorithms, e.g., OMAG and SSADA, as there may be performance differences as different construction algorithms rely on different information, e.g., phase or complex signal.

The SVC-Net for deep learning construction of microcapillary resolution OCTA from single-scan-volumetric OCT has been developed and validated. A comparative study shows that the SVC in single-scan-volumetric OCT provides equivalent information to the TVC in multi-scan-volumetric OCT for robust OCTA construction. The SSIM loss function provides superior performance, compared to the MSE loss function, to optimize deep learning visualization of microstructures, such as microcapillaries, in single-scan-volumetric OCT. The combination of SVC involvement and SSIM loss function enabled robust OCTA construction from single-scan-volumetric OCT. With single-volumetric-scan OCT for rapid OCTA construction, the SVC-Net holds great promise to increase the imaging speed and thus enable rapid wide-field OCTA and dynamic monitoring of vascular changes to advance the clinical management of eye diseases.

Methods

Datasets

OCT volumetric data. For the algorithm development, we optimized and evaluated SVC-Net on two dataset types, namely animal and human OCT datasets. For the animal dataset, 16 volumes comprised the dataset, 9 volumes for training, 1 volume for validation, and 6 volumes for testing. The total number of images used for training, validation, and testing were 5382, 598, and 3588 images, respectively. For testing en faces from 6 volumes were used to evaluate model performance. For the human dataset, 16 volumes were used for training, 1 volume for validation and 5 volumes for testing. The total number of images used for training, validation, and testing were 4768, 298, and 1490 images, respectively. For testing en faces from 6 volumes were used to evaluate model performance. Additionally, one from a patient with retinopathy was used to demonstrate the model’s qualitative performance for vascular abnormalities. For image acquisition, animal and human OCTs were taken with our custom lab-built OCT system. The system designs can be found in Supplementary Notes 1 and 2.

OCTA construction. The OCT scan pre-processing starts with registration of the OCT volume. The method that was employed for frame registration was the Discrete Fourier Transform registration method³⁷. Since the OCT volume contains multiple repeated scans, the first step is to perform intra-frame registration, where each repetitive scan is registered to the first scan. This process is repeated for all scans. Next, inter-frame registration is performed to register each of the scans within the volume. After the OCT Volume Pre-processing, SVC input was generated using the inter-frame registered OCT B-scans. Meanwhile, to generate the ground truth, intensity-based SV processing was applied to intra-frame registered OCT B-scans using the method in³⁸. SV processing algorithm can be found in Supplementary Note 3. Furthermore, for the ablation study to compare TVC and SVC, we used SV processing with varying numbers of repeated B-scans and varying numbers of adjacent neighbors, respectively. In this study, the number of adjacent numbers referred to as two, three, and four neighbors, referred to as 2 N, 3 N, and 4 N, respectively, was performed. To be consistent with the connotation, if only 1 B-scan was used, we refer to it as 1 N.

Ethics declaration

All animal experiments were approved by the local animal care and biosafety office and performed following the protocols approved by the Animal Care Committee (ACC) at the University of Illinois at Chicago (ACC Number: 19-044). This study followed the Association for Research in Vision and Ophthalmology Statement for the Use of Animals in Ophthalmic and Vision Research. All human experiments were approved by the Institutional Review Board of the University of Illinois at Chicago and were in pursuance of the ethical standards stated in the Declaration of Helsinki.

Deep learning implementation

Deep learning model. Our model, SVC-Net, was built using the methods described by Ahmed et al.³⁹, and the design is an encoder-decoder architecture, as shown in Fig. 9a. For the encoder, the EfficientNetB0 neural network⁴⁰ was employed. The decoder was designed using the Keras library, and the individual block components are illustrated in Fig. 9b. Briefly, we used a convolutional neural network to predict vessels in an image regression manner. The input into the CNN was a multichannel input comprised of OCT B-scans, and the output was a grayscale image. For other hyperparameters and training details, see Supplementary Note 4. The parameters of the CNN were optimized by training it on SVC inputs from single-scan OCT with the ground truth corresponding to OCTA images. The model was similarly trained on single channel inputs from OCT as well to determine the effects of SVC. To evaluate our model, we tested it on OCT volumes that were excluded from the training dataset.

**Fig. 9: SVC-Net is based on an encoder-decoder architecture.**

Loss function and evaluation metrics. The loss layer of a neural network compares the output of the network with the ground truth. In this paper, we evaluate the effect of two loss functions, MSE and SSIM, on the performance of the model for OCTA construction. Therefore, in this section, we define the MSE and SSIM loss functions. For the formulation of MSE and SSIM, see Supplementary Note 5. To evaluate the performance of the model, we used PSNR and MS-SSIM. The evaluation metrics were applied to en face projections of SVP and DVP. Statistical methods included one-way analysis of variance (ANOVA) for multi-group comparisons, and post-hoc tests were conducted using pair-wise two-way Student’s t-test. For the formulation of evaluation metrics and methodology of en face projection, see Supplementary Notes 6 and 7, respectively.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The code for the project is available on GitHub⁴¹.

References

Dadzie, A. K. et al. Normalized blood flow index in optical coherence tomography angiography provides a sensitive biomarker of early diabetic retinopathy. Transl. Vis. Sci. Technol. 12, 3–3 (2023).
Article PubMed PubMed Central Google Scholar
Le, D. et al. Comparative analysis of OCT and OCT angiography characteristics in early diabetic retinopathy. RETINA 43, 992-998 (2022).
Sun, Z. et al. Optical coherence tomography angiography in diabetic retinopathy: an updated review. Eye 35, 149–161 (2021).
Article PubMed Google Scholar
Choi, W. J. Imaging motion: a comprehensive review of optical coherence tomography angiography. Adv. Imaging Bio Techn. Converg. Sci. 1310, 343–365 (2021).
De Carlo, T. E. et al. A review of optical coherence tomography angiography (OCTA). Int. J. Retin. Vitreous 1, 1–15 (2015).
Article Google Scholar
Schmidt-Erfurth, U. et al. Artificial intelligence in retina. Prog. Retinal Eye Res. 67, 1–29 (2018).
Article Google Scholar
Fang, L. et al. Automatic segmentation of nine retinal layer boundaries in OCT images of non-exudative AMD patients using deep learning and graph search. Biomed. Opt. Express 8, 2732–2744 (2017).
Article PubMed PubMed Central Google Scholar
Sengupta, S. et al. Ophthalmic diagnosis using deep learning with fundus images—a critical review. Artif. Intell. Med. 102, 101758 (2020).
Article PubMed Google Scholar
Li, Z. et al. Efficacy of a deep learning system for detecting glaucomatous optic neuropathy based on color fundus photographs. Ophthalmology 125, 1199–1206 (2018).
Article PubMed Google Scholar
Schlegl, T. et al. Fully automated detection and quantification of macular fluid in OCT using deep learning. Ophthalmology 125, 549–558 (2018).
Article PubMed Google Scholar
Shankar, K. et al. Automated detection and classification of fundus diabetic retinopathy images using synergic deep learning model. Pattern Recognit. Lett. 133, 210–216 (2020).
Article ADS Google Scholar
Lam, C. et al. Automated detection of diabetic retinopathy using deep learning. AMIA Summits Transl. Sci. Proc. 2018, 147 (2018).
PubMed Central Google Scholar
Peng, Y. et al. DeepSeeNet: a deep learning model for automated classification of patient-based age-related macular degeneration severity from color fundus photographs. Ophthalmology 126, 565–575 (2019).
Article PubMed Google Scholar
Bajwa, M. N. et al. Two-stage framework for optic disc localization and glaucoma classification in retinal fundus images using deep learning. BMC Med. Inform. Decis. Mak. 19, 1–16 (2019).
Google Scholar
Ebrahimi, B. et al. Optimizing the OCTA layer fusion option for deep learning classification of diabetic retinopathy. Biomed. Opt. Express 14, 4713–4724 (2023).
Article PubMed PubMed Central Google Scholar
Le, D. et al. Transfer learning for automated OCTA detection of diabetic retinopathy. Transl. Vis. Sci. Technol. 9, 35–35 (2020).
Article PubMed PubMed Central Google Scholar
Heisler, M. et al. Ensemble deep learning for diabetic retinopathy detection using optical coherence tomography angiography. Transl. Vis. Sci. Technol. 9, 20–20 (2020).
Article PubMed PubMed Central Google Scholar
Zang, P. et al. A diabetic retinopathy classification framework based on deep-learning analysis of OCT angiography. Transl. Vis. Sci. Technol. 11, 10–10 (2022).
Article PubMed PubMed Central Google Scholar
Motozawa, N. et al. Optical coherence tomography-based deep-learning models for classifying normal and age-related macular degeneration and exudative and non-exudative age-related macular degeneration changes. Ophthalmol. Ther. 8, 527–539 (2019).
Article PubMed PubMed Central Google Scholar
Thakoor, K. et al. Hybrid 3d-2d deep learning for detection of neovascularage-related macular degeneration using optical coherence tomography B-scans and angiography volumes. In 2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) (IEEE, 2021).
Thakoor, K. A. et al. A multimodal deep learning system to distinguish late stages of AMD and to compare expert vs. AI ocular biomarkers. Sci. Rep. 12, 1–11 (2022).
Article Google Scholar
Bowd, C. et al. Deep learning image analysis of optical coherence tomography angiography measured vessel density improves classification of healthy and glaucoma eyes. Am. J. Ophthalmol. 236, 298–308 (2022).
Article PubMed Google Scholar
Bowd, C. et al. Deep-learning enface image classifier analysis of optical coherence tomography angiography images improves classification of healthy and glaucoma eyes. Investig. Ophthalmol. Vis. Sci. 62, 1024–1024 (2021).
Google Scholar
Schottenhamml, J. et al. Glaucoma classification in 3 x 3 mm en face macular scans using deep learning in a different plexus. Biomed. Opt. Express 12, 7434–7444 (2021).
Article PubMed PubMed Central Google Scholar
Gao, M. et al. Reconstruction of high-resolution 6× 6-mm OCT angiograms using deep learning. Biomed. Opt. Express 11, 3585–3600 (2020).
Article PubMed PubMed Central Google Scholar
Gao, M. et al. An open-source deep learning network for reconstruction of high-resolution OCT angiograms of retinal intermediate and deep capillary plexuses. Transl. Vis. Sci. Technol. 10, 13–13 (2021).
Article PubMed PubMed Central Google Scholar
Alam, M. et al. AV-Net: deep learning for fully automated artery-vein classification in optical coherence tomography angiography. Biomed. Opt. Express 11, 5249–5257 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gao, M. et al. A deep learning network for classifying arteries and veins in montaged widefield OCT angiograms. Ophthalmol. Sci. 2, 100149 (2022).
Article PubMed PubMed Central Google Scholar
Abtahi, M. et al. MF-AV-Net: an open-source deep learning network with multimodal fusion options for artery-vein segmentation in OCT angiography. Biomed. Opt. Express 13, 4870–4888 (2022).
Article PubMed PubMed Central Google Scholar
Abtahi, M. et al. An open-source deep learning network AVA-Net for arterial-venous area segmentation in optical coherence tomography angiography. Commun. Med. 3, 54 (2023).
Article PubMed PubMed Central Google Scholar
Le, D. et al. Deep learning for artery–vein classification in optical coherence tomography angiography. Exp. Biol. Med. 248, 747–761 (2023).
Article CAS Google Scholar
Liu, X. et al. A deep learning based pipeline for optical coherence tomography angiography. J. Biophotonics 12, e201900008 (2019).
Article PubMed Google Scholar
Jiang, Z. et al. Weakly supervised deep learning-based optical coherence tomography angiography. IEEE Trans. Med. Imaging 40, 688–698 (2020).
Article Google Scholar
Lee, C. S. et al. Generating retinal flow maps from structural optical coherence tomography with artificial intelligence. Sci. Rep. 9, 1–11 (2019).
ADS Google Scholar
Li, P. L. et al. Deep learning algorithm for generating optical coherence tomography angiography (OCTA) maps of the retinal vasculature. Appl. Mach. Learn. 11511, 39–49 (2020).
Google Scholar
Zhao, H. et al. Loss functions for image restoration with neural networks. IEEE Trans. Comput. Imaging 3, 47–57 (2016).
Article Google Scholar
Guizar, M. Efficient subpixel image registration by cross-correlation. MATLAB Central File Exchange (2020).
Son, T. et al. Optical coherence tomography angiography of stimulus evoked hemodynamic responses in individual retinal layers. Biomed. Opt. Express 7, 3151–3162 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ahmed, S. et al. ADC-Net: an open-source deep learning network for automated dispersion compensation in optical coherence tomography. Front. Med. 9, 864879 (2022).
Tan, M. & Le Q. Efficientnet: Rethinking model scaling for convolutional neural networks. Int. Conf. Mach. Learn. 97, 6105–6114 (2019).
Le, D. SVC-Net. GitHub (2023).

Download references

Acknowledgements

This project was funded by the National Eye Institute (P30 EY001792, R01 EY023522, R01 EY030101, R01EY029673, and R01EY030842), Research to Prevent Blindness, and Richard and Loan Hill Endowment.

Author information

Authors and Affiliations

Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, IL, 60607, USA
David Le, Taeyoon Son, Tae-Hoon Kim, Tobiloba Adejumo, Mansour Abtahi, Shaiban Ahmed, Alfa Rossi, Behrouz Ebrahimi, Albert Dadzie, Guangying Ma & Xincheng Yao
Department of Ophthalmology and Visual Sciences, University of Illinois at Chicago, Chicago, IL, 60612, USA
Jennifer I. Lim & Xincheng Yao

Authors

David Le
View author publications
You can also search for this author in PubMed Google Scholar
Taeyoon Son
View author publications
You can also search for this author in PubMed Google Scholar
Tae-Hoon Kim
View author publications
You can also search for this author in PubMed Google Scholar
Tobiloba Adejumo
View author publications
You can also search for this author in PubMed Google Scholar
Mansour Abtahi
View author publications
You can also search for this author in PubMed Google Scholar
Shaiban Ahmed
View author publications
You can also search for this author in PubMed Google Scholar
Alfa Rossi
View author publications
You can also search for this author in PubMed Google Scholar
Behrouz Ebrahimi
View author publications
You can also search for this author in PubMed Google Scholar
Albert Dadzie
View author publications
You can also search for this author in PubMed Google Scholar
Guangying Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer I. Lim
View author publications
You can also search for this author in PubMed Google Scholar
Xincheng Yao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.L. contributed to data preparation, network design, model implementation, data processing, statistical analysis, and paper preparation. T.S. contributed to data preparation, model implementation, data processing, and paper preparation. T.H.K. contributed to data acquisition and preparation. T.A. contributed to data acquisition, data preparation, and analysis tools. M.A. contributed to network design and model implementation. S.A. contributed to data acquisition, data preparation, and model implementation. A.R. contributed to data processing and analysis tools. B.E. contributed to network design and model implementation. A.D. contributed to network design and model implementation. G.M. contributed to data acquisition and preparation. J.I.L. contributed to data acquisition and preparation. X.Y. supervised the project and contributed to the study design and paper preparation. All authors reviewed and approved the paper.

Corresponding author

Correspondence to Xincheng Yao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Engineering thanks Yali Jia and the other anonymous reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Mengying Su, Rosamund Daw. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Le, D., Son, T., Kim, TH. et al. Deep learning-based optical coherence tomography angiography image construction using spatial vascular connectivity network. Commun Eng 3, 28 (2024). https://doi.org/10.1038/s44172-024-00173-9

Download citation

Received: 26 January 2023
Accepted: 24 January 2024
Published: 09 February 2024
DOI: https://doi.org/10.1038/s44172-024-00173-9

Subjects

Abstract

Similar content being viewed by others

Integrated deep learning framework for accelerated optical coherence tomography angiography

Deep learning-enabled ultra-widefield retinal vessel segmentation with an automated quality-optimized angiographic phase selection tool

An open-source deep learning network AVA-Net for arterial-venous area segmentation in optical coherence tomography angiography

Introduction

Results

The deep learning framework

Optimization of neighboring scans

Microcapillary vessels visualization

Effect of the loss function

Connectivity analysis

Retinopathy

Discussion

Methods

Datasets

Ethics declaration

Deep learning implementation

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links