Introduction

For a long time, fluorescein angiography (FA) combined with retinal funduscopy has been used for diagnosing retinal vascular and pigment epithelial-choroidal diseases1. The procedure requires the injection of a fluorescent dye which, depending on the age and cardiovascular structure of the eye, appears in the optic vascular system within 8–12 s and can remain for up to 10 min2. Although generally considered safe, there have been reports of mild to severe complications due to allergic reactions to the dye3,4,5. Side effects range from nausea and heart attack to anaphylactic shock and death6,7,8,9,10. In addition, leakage of fluorescein at the injection site can occur.

Given the complications and risks associated with this procedure, non-invasive, affordable, and computationally effective alternatives to FA are needed. The only current alternative to fluorescein angiography for visualizing retinal vasculature relies on additional hardware and software modifications to Optical Coherence Tomography (OCT)11,12, called OCT Angiography (OCTA)13,14. Despite the ability to generate visual blood flow maps without the adverse side effects of FA, OCTA systems are not widely used in the assessment of retinal vascular diseases due to their cost, the need for multiple acquisitions in the same anatomical location15, and limited field of view (FOV). In addition, the recent COVID-19 pandemic has had a significant negative impact on ophthalmologists' ability to conduct in-clinic exams16, demonstrated the limitations of the current state of tele-ophthalmology17, and highlighted the need for effective, low-cost, and reliable alternatives for both in-home and in-clinic measurements.

The introduction of convolutional neural networks and a gradient-based optimization regime for training them by LeCun et al.18 has resulted in a subsequent deep learning revolution19 in the field of Artificial Intelligence (AI). Not only has deep learning significantly improved the performance of visual object classification20, object detection21, semantic22 and instance segmentation23, and de-noising algorithms24, it has also introduced novel computational frameworks such as super-resolution25, image generation26, and style transfer27,28. Despite its late arrival to ophthalmology compared to other medical domains29,30, deep learning has already started to play a transformative role, from noise removal31, to disease classification32,33,34, to disease marker segmentation33,35,36. These advances resulted in the first automatic AI-enabled Diabetic Retinopathy (DR) system, called IDx-DR37, being approved by the FDA in 201838. Although these AI-inspired ophthalmic systems have produced reasonable results39,40,41, they rely on basic and rudimentary deep learning architectures that are susceptible to data bias42 and require massive amounts of training samples43.

In order to address the inefficiency of generic deep learning models in ophthalmology, our team has developed effective architectures44,45,46 for retinal disease diagnosis. Our proposed architectures are capable of mapping ophthalmic images (e.g. fundus, OCT, angiograms, etc.) onto a high-dimensional feature space (i.e. a latent manifold), rich with clustered information pertaining to the ocular anatomical structures. These latent representations of anatomical structures have the potential to advance the current state of deep learning in ophthalmology. Specifically, the ability to map ocular structures from different imaging modalities such as funduscopy, FA, and OCT into a shared latent manifold will unlock novel approaches to fusing the information acquired from these modalities, enabling far more useful information for disease diagnosis and prognosis. This is the premise of the work proposed here: establishing a relationship between FA and fundus images in their shared latent manifold for the purpose of generating anatomically accurate FA images from fundus photographs. To our knowledge, and unlike the image generation method proposed by Lee et al.47, the work proposed in this paper is the first deep learning application in ophthalmic imaging to generate images from truly different modalities.

Lee et al. leveraged the relationship between OCT and OCTA data to generate OCTA-like images solely from OCT images47. This technique was the first AI model to explore ophthalmic applications of deep learning beyond classification and segmentation. However, the architecture has several limitations that prevent it from performing as an effective domain transfer system (i.e. generating images from inherently different data modalities). First, the input domain (OCT) and the output domain (OCTA) are significantly correlated and do not constitute truly different anatomical imaging modalities. Second, the deep learning module used in this approach is a simple autoencoder48 adapted from an encoder/decoder architecture for segmentation49. As such, this network is not capable of exploiting the significantly different probability distributions governing the input and output modalities for the purpose of generating realistic OCTA images from OCT input data. Finally, although OCT technology is more widely used and less expensive than OCTA, OCT imaging is still expensive and requires clinic visits, preventing truly ubiquitous use as an in-home alternative.

Given the ability of our previous deep learning architectures44,45,46 to produce feature-rich latent manifolds from input ocular structural images, we sought to explore the ability to map paired fundus photographs and FA images onto a shared latent manifold in which the retinal vasculature from both domains shares similar feature representations. This approach has its roots in the recently introduced Generative Adversarial Networks (GANs)27,50,51,52 in the field of deep learning. Although GANs have recently been utilized in ophthalmology, from predicting post-therapeutic OCT images53 to removing shadows from OCT images54, these studies primarily focus on a single domain modality. The proposed study is the first of its kind to demonstrate the viability of cross-modality transformation in the field of ophthalmic imaging. Comparisons of our model with state-of-the-art image generation and style transfer systems showed that our model outperforms these networks, both qualitatively and quantitatively. In addition, expert ophthalmologists were asked, in two trials, to identify the real images in a balanced, randomly mixed set of real FA images and angiograms generated by our model. Results show that the angiograms generated by the proposed network are virtually indistinguishable from real FA images.

It is worth noting that the proposed study is designed as a proof-of-concept framework to demonstrate the technical and computational viability of performing image domain transformation to provide adjunctive information in the absence of FA modalities. As such, this framework is part of an evolving effort to establish shared manifolds between different imaging modes that can be utilized to improve diagnostic capabilities in the absence of a comprehensive battery of tests.

Figure 1

Generative Adversarial Network (GAN) models used for producing anatomically accurate FA images from fundus photographs (A). Details of the generator blocks and discriminator blocks of the proposed network (B). Generator and discriminator block loss values as a function of training iterations (C). Ideal loss curves (D).

Results

We designed a conditional generative adversarial network (GAN) comprising two generator modules and four discriminator modules (Fig. 1A) that takes fundus photographs and produces anatomically accurate FA images inferred from the fundus images. The generator block consists of two generator modules, the fine and coarse generators, which are designed in a U-shaped encoder-decoder manner. The coarse generator is comprised of a reflection+padding block, three convolution (Conv)+batch normalization (BN)+leaky rectified linear unit (ReLU) blocks, and four novel residual blocks44,45 (ResBlk), followed by two transpose convolution (Deconv) layers, one reflection+padding block, one Conv layer, and an output activation layer (Fig. 1B-left). It is responsible for generating the coarse and global structures of the FA image, such as the structures of the macula, optic disc, color, contrast, and brightness. The fine generator is comprised of one reflection+padding block, one Conv+BN+ReLU block, and one Conv layer, followed by three ResBlk, one Deconv, one Conv, and one output activation layer (Fig. 1B-right), and produces local information including retinal venules, arterioles, hemorrhages, exudates, and microaneurysms. The last ResBlk of the coarse generator is added to the first Conv layer of the fine generator to integrate the global features from the coarse generator with the local information in the fine generator. The discriminator blocks of the proposed network are encoders tied to a final fully connected binary classification layer; each takes a pair of real and generated FA images and decides which one is real. The fine discriminators take the pair of real and generated images at full resolution, while the coarse discriminators take images at half resolution (Fig. 1A). Each discriminator is comprised of an initial Conv layer and Conv+BN+ReLU layers, followed by one last Conv layer and the output activation layer (Fig. 1B-bottom). In our implementation the fine generator has 170,305 trainable parameters, the coarse generator has 6,695,041, and each discriminator has 234,785, for a total of around 7.8 million parameters.
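
To make the block structure concrete, the following is a minimal Keras sketch of a Conv+BN+leaky ReLU unit and a residual block with an identity skip connection. The exact internals of the novel ResBlk introduced in refs. 44,45 are not reproduced here; the layer widths and the toy model at the end are illustrative assumptions only.

```python
# Minimal sketch of the Conv+BN+LeakyReLU unit and a generic residual block.
import tensorflow as tf
from tensorflow.keras import layers

def conv_bn_lrelu(x, filters, kernel_size=3, strides=1):
    """Convolution + batch normalization + leaky ReLU, as used in both generators."""
    x = layers.Conv2D(filters, kernel_size, strides=strides, padding="same")(x)
    x = layers.BatchNormalization()(x)
    return layers.LeakyReLU(0.2)(x)

def res_block(x, filters):
    """Illustrative residual block: two convolutional units plus an identity skip."""
    skip = x
    y = conv_bn_lrelu(x, filters)
    y = layers.Conv2D(filters, 3, padding="same")(y)
    y = layers.BatchNormalization()(y)
    return layers.LeakyReLU(0.2)(layers.Add()([skip, y]))

# Toy usage: a 512x512 RGB fundus patch passed through one block of each type.
inputs = layers.Input(shape=(512, 512, 3))
x = conv_bn_lrelu(inputs, 64)
x = res_block(x, 64)
model = tf.keras.Model(inputs, x)
model.summary()
```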

The models were trained over 100 epochs, with each epoch comprising 212 iterations, in a minimax setup in which the generators attempt to produce realistic FA images and the discriminators attempt to correctly identify whether a given FA image is real or produced by the generators. The loss values for the coarse and fine generators as well as the combined loss values of the discriminator block are shown in Fig. 1C. The black dotted curve is the discriminator loss, while the solid blue and dashed red curves are the fine and coarse generator losses, respectively. At the beginning of training, the generators produce images sampled at random from a latent representation of FA images, and thus the discriminators can easily distinguish real from generated FA images. This can be observed as the smaller (better) discriminator loss values compared to generator loss values in the early epochs in Fig. 1C. As training progresses, the generators learn to produce more realistic FA images which become increasingly difficult for the discriminators to identify as not real, as observed from the downward trend in generator loss and the upward trend in discriminator loss in the early epochs of Fig. 1C. The goal of the network is to reach an equilibrium where the loss values for the generators and discriminators stabilize (late epochs in Fig. 1C). The ideal loss curves for a generative adversarial network, in which the network reaches the Nash equilibrium, are shown in Fig. 1D.

For training, we use the fundus and angiography dataset provided by Hajeb et al.55. The dataset includes 30 diabetic retinopathy and 29 normal pairs of FA and fundus images from 59 patients. Fundus photographs are in color, whereas angiograms are in gray-scale. Our proposed network is capable of performing with a high degree of accuracy even on this small dataset. To improve training we perform a randomized data augmentation process in which N random crops of size \(512\times 512\) are extracted from each image. These random crops are then processed through geometric and photometric manipulations and used for training the model. The total number of training samples is therefore \(O(17\times N\times f)\), where f is the number of photometric and geometric manipulations. This process can generate a very large number of training samples for our deep learning algorithm, addressing data limitation issues in deep learning. For example, with \(N=500\) crops from 17 images and 10 geometric and photometric manipulations, the training set consists of 85,000 pairs of fundus photographs and FA images. We used image rotation, horizontal flip, and vertical flip as geometric transformations. Photometric transformations used in the proposed data augmentation include gamma correction, contrast stretching, contrast compression, and color manipulations.
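
The following is a minimal sketch of this augmentation scheme, assuming paired images stored as NumPy float arrays in [0, 1]; the rotation choice (multiples of 90 degrees), the gamma range, and the decision to apply photometric changes to the fundus image only are placeholder assumptions, since the exact manipulation parameters are not specified here.

```python
# Sketch of paired random cropping plus geometric and photometric augmentation.
import numpy as np

def random_crop_pair(fundus, angio, size=512):
    """Take the same random crop from a paired fundus/FA image (H x W x C arrays)."""
    h, w = fundus.shape[:2]
    top = np.random.randint(0, h - size + 1)
    left = np.random.randint(0, w - size + 1)
    return (fundus[top:top + size, left:left + size],
            angio[top:top + size, left:left + size])

def augment_pair(fundus, angio):
    """Apply one random geometric and photometric manipulation to a crop pair."""
    k = np.random.randint(0, 4)                 # rotation by 0/90/180/270 degrees (assumed)
    fundus, angio = np.rot90(fundus, k), np.rot90(angio, k)
    if np.random.rand() < 0.5:                  # horizontal flip
        fundus, angio = fundus[:, ::-1], angio[:, ::-1]
    if np.random.rand() < 0.5:                  # vertical flip
        fundus, angio = fundus[::-1], angio[::-1]
    gamma = np.random.uniform(0.8, 1.2)         # photometric: gamma correction (assumed range)
    fundus = np.clip(fundus, 0.0, 1.0) ** gamma # applied to the fundus input only (assumption)
    return fundus, angio
```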

Figure 2

Our proposed method generates more realistic FA images after each epoch as training progresses.

The results of training over the course of 100 epochs are shown in Fig. 2. In this figure, the first and third rows show the original fundus images, their paired FA images, and the generated results after training at epochs 1, 25, 50, 75, and 100, respectively. The second and fourth rows show magnifications of the red rectangular regions for a better visual representation of the vascular structure generated by our method as training progresses. Our proposed method learns global information, such as the optic disc, fovea position, and large vascular structures, first. It then learns the minuscule vascular structures, e.g., arteries and veins, in a progressive manner.

Figure 3

Results of training on unaligned pairs of fundus and angio images. Original fundus images (A,G) are not aligned with the FA images (C,I), and as a result the generated images (E,K) are deteriorated. (B,D,F,H,J, and L) are the magnified views of the red boxed regions in (A,C,E,G,I, and K), respectively.

Fundus-Angio alignment effects

Training on unaligned images hampers the synthesis of realistic FA images. Although global information such as the overall intensity, contrast, and the location of geometric features such as the optic disc is retained, local information such as vascular structures is distorted or absent from the generated image (Fig. 3). Without proper alignment of the paired fundus and FA images used for training, the deep learning GAN architectures fail to generate accurate vascular structures. To address this problem, the FA images need to be aligned with their fundus counterparts prior to training. Algorithm 1 shows the process by which a misaligned FA image can be aligned with its fundus counterpart. The process takes as input a pair of fundus and FA images and, using SURF56 (a fast SIFT-like feature extractor), finds corresponding features between the two images. Singular value decomposition is then applied to the matched features between the fundus and FA images to recover the transformation (\(\Phi \)) between the two images. This transformation is then used to align the FA image with the fundus photograph.

Algorithm 1: alignment of a misaligned FA image with its paired fundus photograph.
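
A possible implementation of this alignment step is sketched below using OpenCV. SIFT stands in for SURF (which requires the opencv-contrib package), and OpenCV's robust affine estimator stands in for the explicit singular value decomposition step described above. The overall flow (detect and match features, estimate the transformation \(\Phi \), and warp the FA image onto the fundus photograph) is the same.

```python
# Sketch of fundus/FA registration via feature matching and an affine fit.
import cv2
import numpy as np

def align_fa_to_fundus(fundus_gray, fa_gray):
    """Estimate a transform Phi from matched keypoints and warp the FA image."""
    detector = cv2.SIFT_create()  # SIFT used here in place of SURF (assumption)
    kp_f, des_f = detector.detectAndCompute(fundus_gray, None)
    kp_a, des_a = detector.detectAndCompute(fa_gray, None)

    # Match descriptors between the two images and keep the best correspondences.
    matcher = cv2.BFMatcher(cv2.NORM_L2, crossCheck=True)
    matches = sorted(matcher.match(des_a, des_f), key=lambda m: m.distance)[:50]
    src = np.float32([kp_a[m.queryIdx].pt for m in matches])
    dst = np.float32([kp_f[m.trainIdx].pt for m in matches])

    # Estimate the transformation Phi (robust affine fit) and warp the FA image.
    phi, _ = cv2.estimateAffinePartial2D(src, dst, method=cv2.RANSAC)
    h, w = fundus_gray.shape[:2]
    return cv2.warpAffine(fa_gray, phi, (w, h))
```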
Figure 4

Paired fundus photographs (A) and ground truth FA images (B). FA images generated by our proposed method (C) and those generated by traditional GANs proposed by Wang et al. (D)50 and Isola et al. (E)27. Subfigures (F–J) show magnification of the yellow boxed areas from (A–E). Microvascular structures are not quite visible in the fundus photographs (A,F), but are visible in the original FA images (B,G), pointed to by the yellow triangles. Our deep learning GAN is capable of producing these microvascular structures, pointed to by the yellow triangle, as well as macro vascular structures (C,H). The GAN proposed by Wang et al.50 misses these microvascular structures (D,I) while the GAN proposed by Isola et al.27 is not able to produce any reliable vascular structure (E,J)—see red triangles.

Figure 5

Comparison of the proposed deep learning FA generation model with the other GANs and under different conditions. (A) The FIDs for images generated by the proposed method compared with those of Wang et al.50 and Isola et al.27—lower FID measures represent higher accuracy. (B) FID measures of the images generated by the proposed method on input fundus photographs taken under varying conditions—high signal to noise ratio (SNR), motion blur, and color sharpening. (C) FID measures of the images generated by the proposed method on input fundus photographs with slight anatomical alterations—translational vascular shift and rotational vascular shift.

FA image generation

The first experiment was designed to establish the performance of our proposed method in generating anatomically accurate FA images from fundus photographs, and to compare our results with the leading conditional generative adversarial networks proposed by Wang et al.50 and Isola et al.27. Figure 4 shows the results of this comparison. In this experiment we supplied the trained networks with a fundus image (Fig. 4A) while the paired FA image of the same patient (Fig. 4B) was held out as ground truth. Figure 4C–E show the FA images generated by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively. The yellow rectangular regions of Fig. 4A–E are magnified and shown in Fig. 4F–J for comparison purposes. Our generated FA images (Fig. 4H) are of high quality and anatomically accurate compared to the held-out FA images (Fig. 4G): details and even small vascular structures, pointed to by yellow triangles, are preserved and generated, in contrast to the other two methods, pointed to by red triangles (Fig. 4I,J).

Table 1 Comparison of the Fréchet inception distance (FID) achieved by our method with Wang et al.50 and Isola et al.27.

For quantitative evaluation we use two established measures, the Fréchet inception distance (FID)57 and the structural similarity measure (SSIM)58. FID measures the distance between feature vectors calculated for real and generated images. Since FID is a distance metric, lower FID values mean higher accuracy in generating images. FID allows us to compare how accurately the generated FA images represent anatomical features relative to the ground truth. Comparisons of FID measures between our proposed method and those presented by Wang et al.50 and Isola et al.27 show that our method produces significantly more accurate FA images, \(p=.0005\) and \(p=4.5\times 10^{-6}\), respectively (Fig. 5A).
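
For reference, the FID can be computed from Inception-v3 feature vectors as sketched below; the feature extraction itself (e.g. the pooled output of a pretrained InceptionV3) is assumed to have been done beforehand.

```python
# Sketch of the Frechet inception distance between two sets of Inception features.
import numpy as np
from scipy import linalg

def frechet_inception_distance(feats_real, feats_gen):
    """FID between real and generated feature sets (N x 2048 arrays)."""
    mu_r, cov_r = feats_real.mean(axis=0), np.cov(feats_real, rowvar=False)
    mu_g, cov_g = feats_gen.mean(axis=0), np.cov(feats_gen, rowvar=False)
    covmean, _ = linalg.sqrtm(cov_r @ cov_g, disp=False)
    if np.iscomplexobj(covmean):  # numerical noise can produce tiny imaginary parts
        covmean = covmean.real
    diff = mu_r - mu_g
    return float(diff @ diff + np.trace(cov_r + cov_g - 2.0 * covmean))
```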

Table 1 shows the average FID values of our proposed method compared to those of Wang et al.50 and Isola et al.27. The lower the FID, the better the generated image. As can be seen, our method produces consistently lower FID measures when producing FA images from the original fundus image and from its transformed counterparts when motion blur, sharpening, noise, linear shift, and radial shifts are applied.

The SSIM is a well-known quality metric used to measure the similarity between two images. It is considered to be correlated with the quality perception of the human visual system (HVS) and is designed by modeling any image distortion as a combination of three factors: loss of correlation, luminance distortion, and contrast distortion59. Comparisons of SSIM measures between our proposed method and those presented by Wang et al.50 (\(p<.0003\)) and Isola et al.27 (\(p<3\times 10^{-7}\)) show that our method produces significantly more accurate FA images than both.
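
A short sketch of the SSIM comparison using scikit-image is given below; the default window settings and the [0, 1] data range are assumptions, as the exact configuration used in the study is not stated here.

```python
# Sketch of SSIM between a real and a generated angiogram.
from skimage.metrics import structural_similarity as ssim

def angiogram_ssim(fa_real, fa_generated):
    """SSIM between two grayscale FA images given as 2-D float arrays in [0, 1]."""
    return ssim(fa_real, fa_generated, data_range=1.0)
```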

Table 2 Comparison of the structural similarity measure (SSIM) achieved by our method with Wang et al.50 and Isola et al.27.

Table 2 shows the average SSIM values of our proposed method compared to those of Wang et al.50 and Isola et al.27. The higher the SSIM, the better the generated image. As can be seen, our method produces consistently higher SSIM measures when producing FA images from the original fundus image and from its transformed counterparts when motion blur, sharpening, noise, linear shift, and radial shifts are applied.

Results on changes to fundus image acquisition

An important benefit of the proposed method is the robustness of the generated FA images to fundus photographs subject to varying imaging issues such as high signal to noise ratio (SNR), motion blur, and color sharpness. Figure 6 shows the generated FA images from noisy fundus photographs (Fig. 6A) compared to the FA image of the same subject (Fig. 6B). Using a high SNR fundus photograph as input, Fig. 6C–E show FA images produced by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively. The yellow rectangular regions of Fig. 6A–E are magnified and shown in Fig. 6F–J for better viewing. As shown by the yellow triangle in Fig. 6H, small vasculature is preserved and generated by our method, but it is lost in the FA images generated by Wang et al.50 and Isola et al.27—red triangles in Fig. 6I,J. Comparisons of the FA images generated by our proposed method from normal and from high SNR images show no statistically significant difference in FID measures (Fig. 5B).

Figure 6

Paired fundus photographs (A) acquired under high signal to noise ratio (SNR) and ground truth FA images (B). FA images generated by our proposed method (C) and those generated by traditional GANs proposed by Wang et al. (D)50 and Isola et al. (E)27. Subfigures (F–J) show magnification of the yellow boxed areas from (A–E). Microvascular structures are not quite visible in the fundus photographs (A,F), but are visible in the original FA images (B,G), pointed to by the yellow triangles. Our deep learning GAN is capable of producing these microvascular structures, pointed to by the yellow triangle, as well as macro vascular structures (C,H). The GAN proposed by Wang et al.50 misses these microvascular structures (D,I) while the GAN proposed by Isola et al.27 is not able to produce any reliable vascular structure (E,J)—see red triangles.

Figure 7 shows the generated FA images from motion-blurred fundus photographs (Fig. 7A) compared to the FA image of the same subject (Fig. 7B). Using a motion-blurred fundus photograph as input, Fig. 7C–E show FA images produced by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively. The yellow rectangular regions of Fig. 7A–E are magnified and shown in Fig. 7F–J for better viewing. As shown by the yellow triangle in Fig. 7H, small vasculature is preserved and generated by our method, but it is lost in the FA images generated by Wang et al.50 and Isola et al.27—red triangles in Fig. 7I,J. Comparisons of the FA images generated by our proposed method from normal and from motion-blurred images show no statistically significant difference in FID measures (Fig. 5B).

Figure 8 shows the generated FA images from fundus photographs subject to color and contrast sharpening (Fig. 8A) compared to the FA image of the same subject (Fig. 8B). Using a sharpened fundus photograph as input, Fig. 8C–E show FA images produced by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively. The yellow rectangular regions of Fig. 8A–E are magnified and shown in Fig. 8F–J for better viewing. As shown by the yellow triangle in Fig. 8H, small vasculature is preserved and generated by our method, but it is lost in the FA images generated by Wang et al.50 and Isola et al.27—red triangles in Fig. 8I,J. Comparisons of the FA images generated by our proposed method from normal and from sharpened images show no statistically significant difference in FID measures (Fig. 5B).

Figure 7

Paired fundus photographs (A) acquired under motion blur and ground truth FA images (B). FA images generated by our proposed method (C) and those generated by traditional GANs proposed by Wang et al. (D)50 and Isola et al. (E)27. Subfigures (F–J) show magnification of the yellow boxed areas from (A–E). Microvascular structures are not quite visible in the fundus photographs (A,F), but are visible in the original FA images (B,G), pointed to by the yellow triangles. Our deep learning GAN is capable of producing these microvascular structures, pointed to by the yellow triangle, as well as macro vascular structures (C,H). The GAN proposed by Wang et al.50 misses these microvascular structures (D,I) while the GAN proposed by Isola et al.27 is not able to produce any reliable vascular structure (E,J)—red triangles.

Figure 8

Paired color sharpened fundus photographs (A) and ground truth FA images (B). FA images generated by our proposed method (C) and those generated by traditional GANs proposed by Wang et al. (D)50 and Isola et al. (E)27. Subfigures (F–J) show magnification of the yellow boxed areas from (A–E). Microvascular structures are not quite visible in the fundus photographs (A,F), but are visible in the original FA images (B,G) pointed to by the yellow triangles. Our deep learning GAN is capable of producing these microvascular structures pointed to by the yellow triangle, as well as macro vascular structures (C,H). The GAN proposed by Wang et al.50 misses these microvascular structures (D,I) while GAN proposed by Isola et al.27 is not able to produce any reliable vascular structure (E,J)—see red triangles.

Figure 9

Paired color fundus photographs when the vascular structure has small shifts (A) or under large vascular movements (K), and ground truth FA images (B,L). FA images generated by our proposed method (C,M) and those generated by traditional GANs proposed by Wang et al. (D,N)50 and Isola et al. (E,O)27. Subfigures (F–J and P–T) show magnification of the yellow boxed areas from (A–E and K–O). Our deep learning GAN is capable of accurately producing the changes in vascular structures, pointed to by the yellow triangle and circles (H,R). The GAN proposed by Wang et al.50 misses some of these structural changes (I,S) while the network proposed by Isola et al.27 is not able to produce any reliable vascular structure (J,T)—see red triangles and circles.

Results on anatomical structure changes

Although robustness to funduscopy imaging variations should not impact the results of FA image generation, certain anatomical changes to the vascular structure should be identified and utilized in the image generation process. Our proposed approach is capable of generating anatomically correct FA images from fundus photographs that contain two kinds of anatomical changes: a slight linear (translational) shift in the vascular pattern, and radial shifts changing the curvature of blood vessels. Figure 9 shows the generated FA images from fundus photographs subject to anatomical changes to the structure of retinal blood vessels (Fig. 9A,K) compared to the FA images of the same subject (Fig. 9B,L). Figure 9C–E show FA images produced by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively, on a fundus photograph containing a slight blood vessel shift. The yellow rectangular regions of Fig. 9A–E are magnified and shown in Fig. 9F–J for better viewing. As shown by the yellow triangle and circle in Fig. 9H, small vasculature is preserved and generated and the slight vascular shifts are reconstructed with high fidelity by our method, but they are lost in the FA images generated by Wang et al.50 and Isola et al.27—red triangles in Fig. 9I,J. Comparisons of the FA images generated by our proposed method from normal images and from images with linear vasculature shifts show no statistically significant difference in FID measures (Fig. 5C).

Figure 9M–O show FA images produced by our proposed algorithm, the Wang et al. method50, and the Isola et al. method27, respectively, on a fundus photograph containing radial blood vessel shifts. The yellow rectangular regions of Fig. 9K–O are magnified and shown in Fig. 9P–T for better viewing. As shown by the yellow circle in Fig. 9R, the distortions in the blood vessel patterns are reconstructed with high fidelity by our method, but they are lost in the FA images generated by Wang et al.50 and Isola et al.27—red triangles in Fig. 9S,T. Comparisons of the FA images generated by our proposed method from normal images and from images with radial vasculature shifts show a statistically significant difference (\(p=.010\)) in FID measures (Fig. 5C).

Figure 10

Expert evaluation of real versus generated FA images, given real FA images (A) and FA images generated by our proposed approach (B). Experts had difficulty distinguishing between real and generated FA images. For real images, two out of three experts had significantly lower correct classification compared to the ground truth (\({p} < .1\)). Given generated images, all experts had significantly lower correct classification scores compared to the ground truth (\({p} < .1\), \({p} < .001\), and \({p} < .0001\)).

Qualitative evaluations

In the next experiment we evaluated the quality of the generated angiograms by asking experts to identify whether a given angiogram is real, from a collection of 40 balanced (50%, 50%) and randomly mixed angiograms. For this experiment, the experts were not told how many of the images were real and how many were not. The undisclosed ratio of real to generated images was a significant design choice for this experiment, as it allowed us to evaluate three metrics: (1) incorrectly identified generated images, representing how real the generated images look; (2) correctly labeled real images, representing how accurately the experts recognized angiogram salient features; and (3) the confusion metric, representing how effective our proposed method was in confusing the experts over the whole experiment. The results are shown in Fig. 10. Given a real FA image, two out of three experts identified significantly fewer real images than the ground truth (Fig. 10A). Given generated angiograms, all three experts missed significantly more generated angiograms (Fig. 10B). This experiment shows that the FA images generated by our proposed method are virtually indistinguishable from real FA images.
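
For illustration, the three metrics described above can be computed from expert responses as sketched below; the response arrays are hypothetical placeholders, not the data collected in this experiment.

```python
# Sketch of the three evaluation metrics for one expert's responses.
import numpy as np

is_real = np.array([1, 0, 1, 0])    # ground truth: 1 = real FA, 0 = generated (placeholder)
said_real = np.array([1, 1, 0, 1])  # expert's answer for each image (placeholder)

fooled = np.mean(said_real[is_real == 0] == 1)  # generated images mistaken for real
hit = np.mean(said_real[is_real == 1] == 1)     # real images correctly recognized
confusion = np.mean(said_real != is_real)       # overall confusion across the set
print(fooled, hit, confusion)
```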

Discussion

Our study demonstrates that a deep learning generative adversarial network (GAN) can be trained to map anatomical features from different image modalities, i.e. fundus photographs and FA images, onto a shared feature manifold for the purpose of generating one image modality from the other. Once the deep learning network is trained on a dataset of paired fundus and FA images, it is capable of generating anatomically accurate retinal vasculature in the form of FA images. Our deep learning model was capable of generating accurate and reliable FA images from fundus photographs, even under significant noise, motion blur, and color and contrast manipulations. The most significant aspect of the proposed deep learning architecture is that it is the first application of deep learning in ophthalmology capable of translating between two different modalities of data. We also utilized a comprehensive data augmentation method to increase the accuracy of our deep learning system without the need for a very large training dataset.

Building on our study, detailed retinal vascular structures can be created without the need for fluorescein angiography, avoiding its potential side effects. Furthermore, generating vascular images from fundus photography via deep learning generative networks enables a non-invasive, easy to use, and low-cost alternative to FA. Bypassing FA protocols by utilizing our proposed deep learning approach has the potential to enable remote monitoring of patients. In addition, generating FA images from fundus photographs does not impose the need for multiple measurements required by OCTA to reconstruct vascular maps over large areas of the retina.

A potential explanation for how the proposed deep learning approach is capable of inferring the retinal vascular structure from fundus images is that a paired set of FA and fundus images of the same eye shares the same statistical distributions governing the anatomical structure of the eye from which the images are acquired. Although not visible in fundus images, light reflects differently from the blood vessels and their neighboring regions on the retina. These minute differences, locally and globally, are utilized by our deep learning algorithm to establish shared local and global feature representations from paired FA and fundus images. The trained model is then capable of using these learned shared features to infer the structural statistics of an FA image directly from the structural statistics of a given fundus photograph and produce an anatomically accurate FA counterpart. In fact, this shared feature representation learning has recently been used in computer vision to transfer image modalities and styles, e.g. transforming real photographs into the art styles of Monet or Van Gogh paintings60,61,62,63.

The proposed deep learning generative network in our study produces FA images from fundus photographs. This finding has significant clinical applications. Fundus imaging is an easy, low-cost, and non-invasive procedure and is one of the most commonly performed eye procedures, resulting in a very large number of fundus imaging databases. Moreover, fundus imaging can be done at home using a number of recently introduced portable funduscopes64,65,66,67. Our study demonstrates the potential for using fundus images acquired from these portable fundus imaging systems to produce reliable and anatomically accurate retinal vascular structures. The inferred structural measurements of retinal vasculature may allow clinicians to determine the natural history of retinal vascular changes and clinical outcomes of retinal diseases as previously reported from direct analysis of fundus images68,69, but with the accuracy of FA image analysis70,71 or even OCTA72.

In our work we used a multi-scale conditional deep learning network comprised of two components, a generator block and a discriminator block. The generator block is responsible for sampling a probability distribution function to generate an image. The discriminator block is responsible for deciding whether a given image is a real FA image or a generated one. For training, the entire system undergoes a minimax optimization process73, in which the generator tries to produce realistic FA images that the discriminator cannot correctly label as not real, while the discriminator tries to predict, from a pair of real and generated images, which one is real, as accurately as possible. To our knowledge, this architecture is the first of its kind to be designed and utilized for ophthalmic applications to generate one modality of images from another. The multi-scale design of the proposed network enables it to perform more accurately with fewer training samples and overcome the data limitations from which the majority of traditional deep learning architectures suffer28. Our study and data show the superior results of our network compared to recent generative networks. Future work would include designing a side network within this generative network capable of mapping anatomical structures and biomarkers representative of specific pathologies to establish a latent manifold of pathology feature representations. This latent manifold would be instrumental in predicting the future progression of retinal vascular disorders much earlier in the disease stage.

More broadly, we demonstrated that deep learning architectures are capable of translating between different ophthalmic image modalities. A similar approach to our architecture could be utilized to establish relationships between ocular anatomical structure measurements, e.g. OCT, MRI, funduscopy, and visual function such as field perimetry, acuity, and color and contrast sensitivities. For example, several physiological assessments such as OCT and funduscopy, and functional measurements such as visual fields assessments of the same eye are usually performed at each clinic visit. A similarly designed network to the proposed architecture can be utilized to map OCT and fundus photographs along with visual fields onto a manifold of shared feature representations. These shared representations can then be utilized to convert visual field progressions to OCT images establishing retinal fiber layer changes in glaucoma patients without the need to perform OCT measurements. In addition, converting available OCT measurements to a more objective representation of the visual field deficits could help evaluate disease progression in a more objective manner without the need to use subjective field perimetry.

Figure 11

Results of the proposed GAN architecture on unpaired original fundus images (A,C). Compared to the corresponding FA images as ground truth (B,D), the proposed architecture produces unnatural images (C,E) when applied to unpaired data.

It is worth noting that the clinical use of generative adversarial networks without sufficient ablation studies of the neural network and its performance in generating images is dangerous. This is due to the potential of GANs to produce fake features. General GAN approaches simply sample a random distribution to generate fake images, and are therefore susceptible to producing fake features. This is particularly the case for GAN architectures designed for unpaired datasets, as well as for traditional cycleGAN architectures. To avoid this issue we proposed the use of our conditional GAN framework in a paired setup with a hierarchical architecture (Fig. 1A). This is clinically significant, as the proposed method, if applied to unpaired images, will produce unnatural generated FA images. Therefore, natural-looking FA images are assured to include accurate anatomical features learned from paired FA and fundus datasets (Fig. 11).

The limitations of the current study are the use of a single dataset of paired fundus and FA images, the size of the dataset, and, due to data limitations, our inability to perform longitudinal studies of the benefits of the proposed method in evaluating disease progression. While this dataset was sufficient for establishing the performance of the proposed deep learning model, further studies are needed on additional paired fundus and FA images to validate the results. In this study we proposed the use of data augmentation and conditional GANs in a multi-scale architecture to overcome the size limitations of the dataset. We anticipate that using larger training samples acquired from large datasets will further improve the already established superior results of our method. Another limitation of the study relates to the lack of information about the phase of the FA images; the phase information is missing from the dataset on which the proposed method has been evaluated. In future studies we plan to include this information in our analyses. Finally, future longitudinal studies could prove the benefits of utilizing the proposed deep learning method in generating FA images from fundus photographs for regularly monitoring retinal vascular disease progression in ways not possible before, while avoiding the costs and side effects associated with FA.

In conclusion, we demonstrated that a deep learning based generative adversarial network (GAN) is capable of producing FA images, from single fundus photographs alone, that are virtually indistinguishable from real FA images. This approach can be used on any existing fundus photograph dataset or could be integrated into funduscopy systems to produce FA images along with the fundus photographs. Although our proposed framework provides an unrivaled way to translate images from one domain to the other, this study is designed as a proof-of-concept framework to demonstrate the technical and computational viability of performing image domain transformation to provide adjunctive information in the absence of FA modalities. Future studies are needed to validate how diagnostic capabilities may be improved by utilizing our framework in the absence of FA test results.

Methods

This study utilizes publicly available and de-identified paired fluorescein angiograms and fundus photographs from the Isfahan University of Medical Sciences Persian Eye Clinic (Feiz Hospital)55. The study has been approved by the University of Nevada, Reno Institutional Review Board for the use of retrospective de-identified data, and all methods were performed in accordance with the relevant guidelines and regulations. This study only uses anonymized and de-identified data. However, informed consent was obtained from all subjects, who were over the age of 18, as a part of the original study. Retinal images (\(576\times 720\) pixels) were collected and include 30 normal-stage and 40 abnormal-stage cases.

Deep learning conditional generative adversarial network

This study proposes a new conditional generative adversarial network (GAN) built from novel residual blocks44,45 for producing realistic FA images from retinal fundus images. We use two generators (\(G_{fine}\) and \(G_{coarse}\)) in the proposed network, as illustrated in Fig. 1A. The generator \(G_{fine}\) synthesizes fine angiograms from fundus images by learning local information, including retinal venules, arterioles, hemorrhages, exudates, and microaneurysms. On the other hand, the generator \(G_{coarse}\) tries to extract and preserve global information, such as the structures of the macula, optic disc, color, contrast and brightness, while producing coarse angiograms. The generator \(G_{fine}\) takes input images of size \(512\times 512\) and produces output images with the same resolution. Similarly, the generator \(G_{coarse}\) takes an image of half the size (\(256\times 256\)) and outputs an image of the same size as its input. In addition, \(G_{coarse}\) outputs a feature vector of size \(256\times 256 \times 64\) that is added to one of the intermediate layers of \(G_{fine}\). Such hybrid generators are quite powerful for sharing local and global information between multiple architectures, as seen in50,52,74. Both generators use convolution layers for downsampling and transposed convolution layers for upsampling. It should be noted that \(G_{coarse}\) is downsampled twice (\(\times 2\)) before being upsampled twice again with transposed convolutions. In both generators, the proposed residual blocks are used after the last downsampling operation and before the first upsampling operation, as illustrated in Fig. 1B. In \(G_{fine}\), downsampling takes place once with the necessary convolution layer, followed by the addition of the feature vector, repetition of residual blocks, and then upsampling to obtain the fine angiography image. All convolution and transposed convolution operations are followed by Batch-Normalization75 and Leaky-ReLU activations. To train these generators, we start with \(G_{coarse}\) by batch-training it on random samples once, and then we train \(G_{fine}\) once with a new set of random samples. During this time, the discriminators' weights are frozen. Lastly, we jointly fine-tune all the discriminators and generators together to train the GAN.
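
The following Keras sketch illustrates how the two generators are coupled: \(G_{coarse}\) operates at \(256\times 256\) and emits a \(256\times 256\times 64\) feature map that is added to an intermediate layer of \(G_{fine}\), which operates at \(512\times 512\). The layer counts, filter widths, and activations shown are simplified placeholders rather than the exact configuration of the proposed network.

```python
# Sketch of the coarse/fine generator coupling via an added feature map.
import tensorflow as tf
from tensorflow.keras import layers

def build_coarse_generator():
    inp = layers.Input((256, 256, 3))
    x = layers.Conv2D(64, 7, padding="same", activation="relu")(inp)
    x = layers.Conv2D(128, 3, strides=2, padding="same", activation="relu")(x)   # downsample x2
    x = layers.Conv2D(128, 3, strides=2, padding="same", activation="relu")(x)   # downsample x2
    x = layers.Conv2DTranspose(128, 3, strides=2, padding="same", activation="relu")(x)
    feat = layers.Conv2DTranspose(64, 3, strides=2, padding="same", activation="relu")(x)
    out = layers.Conv2D(1, 7, padding="same", activation="tanh")(feat)            # coarse FA (256x256)
    return tf.keras.Model(inp, [out, feat], name="G_coarse")

def build_fine_generator():
    inp = layers.Input((512, 512, 3))
    feat_coarse = layers.Input((256, 256, 64))                                    # from G_coarse
    x = layers.Conv2D(64, 7, padding="same", activation="relu")(inp)
    x = layers.Conv2D(64, 3, strides=2, padding="same", activation="relu")(x)     # down to 256x256
    x = layers.Add()([x, feat_coarse])                                            # fuse global features
    x = layers.Conv2DTranspose(64, 3, strides=2, padding="same", activation="relu")(x)
    out = layers.Conv2D(1, 7, padding="same", activation="tanh")(x)               # fine FA (512x512)
    return tf.keras.Model([inp, feat_coarse], out, name="G_fine")
```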

Multi-scale PatchGAN as discriminator

For synthesizing fluorescein angiography images, GAN discriminators need to adapt to coarse and fine generated images to distinguish between real and generated images. To achieve this, we would either need a deeper architecture or a kernel with a wider receptive field. Both of these solutions result in overfitting and increase the number of parameters, and a large amount of processing power would be required to compute all of them. To address this issue, we exploit the idea of using Markovian discriminators, first introduced in a technique called PatchGAN76, which takes input from different scales as previously seen in50,52. We use four discriminators that have a similar network structure but operate at different image scales. In particular, we downsample the real and generated angiograms by a factor of 2 using Lanczos sampling77 to create an image pyramid of three scales (original, \(2\times \) downsampled, and \(4\times \) downsampled). We group the four discriminators into two sets, \(D_{fine}=[D1_{fine},D2_{fine}]\) and \(D_{coarse}=[D1_{coarse},D2_{coarse}]\), as seen in Fig. 1A. The discriminators are then trained to distinguish between real and generated angiography images at the three distinct resolutions.
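
A sketch of one PatchGAN-style conditional discriminator and the Lanczos image pyramid it is fed is given below, assuming TensorFlow/Keras; the filter counts and depth are illustrative rather than the exact settings of the four discriminators.

```python
# Sketch of a patch-based conditional discriminator and a Lanczos image pyramid.
import tensorflow as tf
from tensorflow.keras import layers

def build_patch_discriminator(size):
    """Discriminator that outputs a grid of real/fake patch scores."""
    fundus = layers.Input((size, size, 3))
    angio = layers.Input((size, size, 1))
    x = layers.Concatenate()([fundus, angio])               # conditional input: fundus + FA pair
    for filters in (64, 128, 256):
        x = layers.Conv2D(filters, 4, strides=2, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.LeakyReLU(0.2)(x)
    patch_scores = layers.Conv2D(1, 4, padding="same")(x)   # e.g. 64x64 patch scores for a 512 input
    return tf.keras.Model([fundus, angio], patch_scores)

def lanczos_pyramid(img):
    """Original, 2x- and 4x-downsampled copies of an image batch using a Lanczos filter."""
    h, w = img.shape[1], img.shape[2]
    half = tf.image.resize(img, (h // 2, w // 2), method="lanczos3")
    quarter = tf.image.resize(img, (h // 4, w // 4), method="lanczos3")
    return img, half, quarter
```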

The outputs of the PatchGAN for \(D_{fine}\) are \(64\times 64\) and \(32\times 32\), and for \(D_{coarse}\) they are \(32\times 32\) and \(16\times 16\). With the given discriminators, the loss function can be formulated as in Eq. 1. It is a multi-task problem of maximizing the loss of the discriminators while minimizing the loss of the generators.

$$\begin{aligned} \min \limits _{G_{fine},G_{coarse}} \max \limits _{D_{fine},D_{coarse}} {\mathscr {L}}_{cGAN}(G_{fine},G_{coarse}, D_{fine},D_{coarse}) \end{aligned}$$
(1)

Despite the discriminators having similar network structures, the one that learns features at a lower resolution has a wider receptive field. It tries to extract and retain more global features, such as the macula, optic disc, color, and brightness, to better distinguish real images. In contrast, the discriminator that learns features at the original resolution pushes the generator to produce fine features such as retinal veins, arteries, and exudates. By doing this we combine feature information at global and local scales while training the generators independently with their paired multi-scale discriminators.

Weighted objective function and adversarial loss

We use LSGAN78 to train our conditional GAN. The objective function for our conditional GAN is given in Eq. 2.

$$\begin{aligned} {\mathscr {L}}_{cGAN}(G,D) = {\mathbb {E}}_{x,y} \big [ (D(x,y) - 1)^2 \big ] + {\mathbb {E}}_{x} \big [ (D(x,G(x)) + 1)^2 \big ] \end{aligned}$$
(2)

where the discriminators are first trained on the real fundus image x and real angiography image y, and then on the real fundus image x and the generated angiography image G(x). We start by training the discriminators \(D_{fine}\) and \(D_{coarse}\) for a couple of iterations on random batches of images. Next, we train \(G_{coarse}\) while keeping the weights of the discriminators frozen. We then train \(G_{fine}\) on a batch of random samples in a similar fashion. We use the Mean-Squared-Error (MSE) for calculating the individual loss of the generators, as shown in Eq. 3.

$$\begin{aligned} {\mathscr {L}}_{L2}(G) = {\mathbb {E}}_{x,y} \Vert G(x) - y \Vert ^2 \end{aligned}$$
(3)

where \({\mathscr {L}}_{L2}\) is the reconstruction loss for a real angiogram, y, given a generated angiogram, G(x). We use this loss for both \(G_{fine}\) and \(G_{coarse}\) so that the model can generate high quality angiograms at different scales. From Eqs. 2 and 3 we can formulate our final objective function as given in Eq. 4.

$$\begin{aligned} \min \limits _{G_{fine},G_{coarse}} \max \limits _{D_{fine},D_{coarse}} {\mathscr {L}}_{cGAN}(G_{fine},G_{coarse}, D_{fine},D_{coarse}) + \lambda \big [\ {\mathscr {L}}_{L2}(G_{fine}) + {\mathscr {L}}_{L2}(G_{coarse})\big ]\ \end{aligned}$$
(4)

Here, \(\lambda \) dictates whether to prioritize the discriminators or the generators. For our architecture, more weight is given to the reconstruction loss of the generators, and thus we pick a large \(\lambda \) value.
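
The adversarial and reconstruction terms can be written in TensorFlow roughly as follows; the value of \(\lambda \) shown is a placeholder (the text only states that a large value is used), and the generator's adversarial target is one common LSGAN choice rather than a setting taken from the paper.

```python
# Sketch of the LSGAN adversarial loss (Eq. 2) and weighted L2 reconstruction loss (Eqs. 3-4).
import tensorflow as tf

LAMBDA = 10.0  # assumed value; the paper only states that a large lambda is used

def discriminator_loss(d_real, d_fake):
    """LSGAN discriminator loss: real patch scores pushed toward 1, generated toward -1."""
    return tf.reduce_mean(tf.square(d_real - 1.0)) + tf.reduce_mean(tf.square(d_fake + 1.0))

def generator_loss(d_fake, fa_real, fa_generated):
    """Generator loss: adversarial term plus lambda-weighted L2 reconstruction.

    Pushing D(x, G(x)) toward 1 is one common LSGAN choice and is an assumption here.
    """
    adversarial = tf.reduce_mean(tf.square(d_fake - 1.0))
    reconstruction = tf.reduce_mean(tf.square(fa_generated - fa_real))
    return adversarial + LAMBDA * reconstruction
```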

Computational resources

The computational resources used for this study included an Alienware Aurora R9 Gaming Desktop with an Intel Core i7-9700 central processing unit (CPU), 16 GB of memory, and an NVIDIA GeForce RTX 2080 SUPER graphics processing unit (GPU). The code was written in Python with the Keras wrapper for TensorFlow.