Large area optimization of meta-lens via data-free machine learning

Zhelyeznyakov, Maksym; Fröch, Johannes; Wirth-Singh, Anna; Noh, Jaebum; Rho, Junsuk; Brunton, Steve; Majumdar, Arka

doi:10.1038/s44172-023-00107-x

Download PDF

Article
Open access
Published: 21 August 2023

Large area optimization of meta-lens via data-free machine learning

Communications Engineering volume 2, Article number: 60 (2023) Cite this article

4137 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 03 October 2023

This article has been updated

Abstract

Sub-wavelength diffractive optics, commonly known as meta-optics, present a complex numerical simulation challenge, due to their multi-scale nature. The behavior of constituent sub-wavelength scatterers, or meta-atoms, needs to be modeled by full-wave electromagnetic simulations, whereas the whole meta-optical system can be modeled using ray/ Fourier optics. Most simulation techniques for large-scale meta-optics rely on the local phase approximation (LPA), where the coupling between dissimilar meta-atoms is neglected. Here we introduce a physics-informed neural network, coupled with the overlapping boundary method, which can efficiently model the meta-optics while still incorporating all of the coupling between meta-atoms. We demonstrate the efficacy of our technique by designing 1mm aperture cylindrical meta-lenses exhibiting higher efficiency than the ones designed under LPA. We experimentally validated the maximum intensity improvement (up to 53%) of the inverse-designed meta-lens. Our reported method can design large aperture ( ~ 10⁴ − 10⁵λ) meta-optics in a reasonable time (approximately 15 minutes on a graphics processing unit) without relying on the LPA.

All dielectric highly efficient achromatic meta-lens using inverse design optimization

Article Open access 01 November 2023

Design of optical meta-structures with applications to beam engineering using deep learning

Article Open access 16 November 2020

Large depth-of-field ultra-compact microscope by progressive optimization and deep learning

Article Open access 11 July 2023

Introduction

In the age of silicon computing, numerical simulations are at the heart of understanding and designing physical systems. For many cases, analytical solutions to complex device geometries are intractable to compute, or simply do not exist. From extremely large systems like rockets¹ to ultra-small nanophotonic devices², numerical simulations provide scientists and engineers with the necessary tools to design nonintuitive structures. In electromagnetics, direct solvers, including the finite difference time domain (FDTD)³ and the finite difference frequency domain (FDFD)^4,5 simulators, are the usual choices when dealing with heterogeneous structures with subwavelength features that require a high degree of numerical accuracy. Most commonly, electromagnetic simulation tools serve to validate the qualitative designs created by engineers based on prior knowledge and intuition. In recent years, the field of nanophotonics has incorporated a new paradigm of computer-aided device design, where a device’s performance is summarized by a quantitative figure of merit (FOM) that is optimized over. This method involves running a forward numerical simulation, computing the FOM, and iteratively modifying the device’s geometry based on an optimization algorithm to reach the desired FOM. Such optimization methods, often termed as inverse design, have already been used to create multi-functional and efficient nanophotonic structures^{2,6,7,8,9,10,11,12,13,14,15}. However, electromagnetic simulators suffer from a computational resource problem when the device dimension becomes large (≳10³λ), where λ is the device’s operating wavelength. As most electromagnetic simulations are performed over a sub-wavelength grid size, with increased size, the number of input variable becomes prohibitively large, making the simulation slow and memory extensive. The limitation of such forward electromagnetic simulators becomes even more severe for inverse design, where many such forward simulations are needed.

Sub-wavelength diffractive optics, also known as meta-optics, present an important test-bed for these problems: the constituent elements of the meta-optics, i.e. meta-atoms, are sub-wavelength, but the dimensions of the whole meta-optics are on the order of ~10³λ − 10⁵λ. Thus the underlying physics of each scatterer has to be modelled using full-wave electromagnetic simulation, but the whole meta-optical system needs to be simulated using ray or wave optics. Such multiscale electromagnetic simulators invariably rely on approximations, the most common of which is the local phase approximation (LPA): the scattering in any small region is taken to be the same as the scattering from a periodic surface⁹. This approximation allows the simulation of each scatterer in a periodic array, abstracting out the electromagnetic response as a simple phase shift. While this significantly reduces the computational complexity of simulating a meta-optic, this approximation fails to consider the coupling of each scatterer with their dissimilar neighbors. In fact, it has already been shown that meta-optical lenses designed under LPA have suboptimal efficiencies¹⁶, especially when the numerical aperture is large. The LPA becomes even more inaccurate when the material used to create the meta-lens has low index, such as SiN¹⁷. We note that, while a full FDTD coupled with adjoint optimization has been used to design a meta-optic without relying on LPA, their size has been limited to only ~100λ¹⁸. LPA can also be bypassed using Mie scattering approaches¹⁹, which however limits the shape of scatterers.

To address the computational bottleneck of large-area inverse design, here we introduce a physics-informed neural network (PINN), to model super-cell subsections of a larger metasurface^20,21,22 which in conjunction with the overlapping boundary method^{23,24,25,26,27}, can replace a traditional FDTD/ FDFD solver to predict the electric field distribution for a given dielectric distribution. PINNs and other model-based deep learning architectures have already been used in modeling physical systems²⁸. We also note that a large number of works already used artificial neural networks to predict spectral responses of meta-optics of varying scatterer geometries^{25,29,30,31,32,33,34,35,36}. However, these works used largely periodic structures for which LPA is accurate. We present a solution via PINNs^37,38 for lenses and devices with spatially varying scatterer geometries, where it is necessary to model the whole electric field from several scatterers and their neighbors. The use of PINNs to accurately model the electromagnetic scattering beyond the LPA is the main contribution of this work. PINNs solve partial differential equations (PDEs) by minimizing a loss function constructed from the PDE itself. This loss function is generally some norm of the residual³⁷ or an energy function derived from the PDE³⁹. PINNs have already seen wide usage in the field of fluid mechanics^40,41,42, biology⁴³, and solving stochastic PDEs⁴⁴. In electromagnetic inverse problems, PINNs have also been employed to design meta-optics and nanophotonic devices^45,46. These works, however, did not clearly demonstrate a simulation speedup, and are limited to the inverse design of only very small devices. We also note that pre-trained PINNs have been used to design small gratings⁴⁷; however their methodology is limited to small gratings that deflect light fields to specific angles, and thus cannot be readily used for the inverse design of arbitrary meta-optics or a meta-lens.

In our work, we train PINNs to predict the electric fields from a parameterized set of dielectric meta-atoms corresponding to rectangular pillars. We then use this as a surrogate model to design cylindrical meta-lenses operating in the visible with a diameter of 1 mm (~1500λ). Large area meta-optics are simulated by partitioning the simulation region into groups of 11 meta-atoms, with the outermost meta-atoms overlapping. After simulation, the fields are stitched together. Our PINNs do not require a training data set. They are trained by randomly generating distributions of dielectric meta-atoms ϵ, feeding them into a neural network NN, and minimizing the residual of the linear Maxwell PDE operator

$${\left\Vert {A}_{{{{{{{{\rm{Maxwell}}}}}}}}}(\epsilon )NN(\epsilon )-b\right\Vert }_{1}$$

(1)

over the neural network training parameters. This means our PINNs are trained without ever invoking a forward numerical simulation of Maxwell’s equations during the training process. Numerical simulations are invoked only to test the neural network performance (see next section, Supplementary Note 6, and Supplementary Fig. 5). A similar data-free approach has been applied to deep-tissue microscopy⁴⁸, however inverse design was not demonstrated. Once trained, this method can calculate the full electromagnetic field response from a 1 mm diameter cylindrical meta-lens at ~630nm in approximately 3 seconds on a graphics processing unit (GPU). Furthermore, we demonstrate an experimental improvement (over 50%) of the maximum intensity of cylindrical metalenses over their forward designed hyperboloid counterparts, signifying the improvement over using LPA. We note that the reported method is robust enough to handle even larger meta-optics, with simulation time scaling only linearly with the aperture of the cylindrical lens (see Supplementary Note 9).

Methods

Deep neural network proxy to Maxwell’s equations

Our problem statement is summarized in Fig. 1c. The monochromatic electromagnetic scattering equation for an inhomogeneous, nonmagnetic material is given by:

$$\nabla \times \nabla \times {{{{{{{\mathcal{E}}}}}}}}(x)-{\omega }^{2}\epsilon (x){{{{{{{\mathcal{E}}}}}}}}(x)=i\omega {{{{{{{\mathcal{J}}}}}}}}(x).$$

(2)

In the 2D case, assuming out of plane polarization $(0,0,{{{{{{{{\mathcal{E}}}}}}}}}_{z})$, and the double curl vector identity, ∇ × ∇ × = ∇ ( ∇ ⋅ ) − ∇² we can simplify Eq. (2) to:

$${\nabla }^{2}{{{{{{{{\mathcal{E}}}}}}}}}_{z}(x)+{\omega }^{2}\epsilon (x){{{{{{{{\mathcal{E}}}}}}}}}_{z}(x)=-i\omega {{{{{{{{\mathcal{J}}}}}}}}}_{z}$$

(3)

where ${{{{{{{{\mathcal{E}}}}}}}}}_{z}$ and ${{{{{{{{\mathcal{J}}}}}}}}}_{z}$ are scalar fields. Equation (3) is defined over all space, with boundary conditions at ∣x∣ → ∞. To simulate this equation, we discretize it on a Yee grid³ by replacing the ∇ operator with a matrix, and treating the field ${{{{{{{{\mathcal{E}}}}}}}}}_{z}(x)$ and current ${{{{{{{{\mathcal{J}}}}}}}}}_{z}$ as vectors E and J at discrete values of x. Similarly, we treat the dielectric distribution ϵ(x) as a diagonal matrix ε. To truncate the simulation to a finite domain, we use perfectly matched boundary layers (PML), by making the transformation on the partial derivative operators $\frac{\partial }{\partial x}\to \frac{1}{1+i\frac{\sigma (x)}{\omega }}\frac{\partial }{\partial x}$. Making these substitutions, Eq. (3) becomes:

$$\left[{D}_{x}^{h}{D}_{x}^{e}+{D}_{y}^{h}{D}_{y}^{e}+{\omega }^{2}\varepsilon \right]E=-i\omega J$$

(4)

with matrices ${D}_{x}^{h},{D}_{x}^{e},{D}_{y}^{h},{D}_{y}^{e}$ being the matrix representations of corresponding derivative operators on a Yee grid with incorporated PML boundaries. See Supplementary Note 5 and Supplementary Fig. 4 for a more detailed description of the matrices. These matrices were extracted from a modified version of the package angler⁴⁹ with constants c, ϵ, μ set to 1 and the length scale set to μm. To build a neural network proxy to solve Eq. (4), we employ a PINN (Fig. 1a and b). PINNs generally use the coordinates of the computational grid as the input to the neural network, and then minimize the residual of the physical equations by approximating the target quantity being solved for with a neural network. This approach is slow since it effectively functions as an iterative solver re-parameterized over neural network weights and biases. It also required retraining the neural network for all different dielectric distributions. Our approach is to build a proxy solver that predicts the field E from a dielectric distribution ε. We pretrain the PINN to predict fields from inputs ε before optimizing our meta-lenses. The minimization problem to train the PINN becomes:

$$\begin{array}{c}\mathop{\min }\limits_{\theta }f(\varepsilon ;\theta )\\ \,{{\mbox{where}}}\,\quad f(\varepsilon ;\theta )={\left\Vert \left[{D}_{x}^{h}{D}_{x}^{e}+{D}_{y}^{h}{D}_{y}^{e}+{\omega }^{2}\varepsilon \right]NN(\varepsilon ;\theta )+i\omega J\right\Vert }_{1}\end{array}$$

(5)

with NN(ε; θ) being the output field from the PINN, and ∣∣ ⋅ ∣∣₁ is the vector l₁ norm. Here θ refers to the weights and biases of the neural network NN. A lower physics informed loss indicates that the neural network is actually satisfying the PDE, and thus predicting the field more accurately. We re-emphasize that there is no data term in f(ϵ; θ), which simplifies the neural network training process. Furthermore, we believe that it mitigates the accumulation of error in the gradients during the inverse design process observed by Chen et. al.⁴⁷. Figure 1 outlines the general strategy for building the proxy model. During each epoch, 10 (batch size) dielectric distributions consisting of rectangular pillars of height h = 0.6 μm with dielectric constant 4 (corresponding to SiN), are generated from 11 random pillar half-widths per batch. The operation wavelength is λ = 0.633 μm. The neural network architecture chosen is a UNET, shown in Fig. 1a and b, due to previously reported good performance with scattering problems⁴⁷. The model is trained for 5 × 10⁵ epochs using the ADAM optimizer⁵⁰ with a learning rate set to 5 × 10⁻⁴. The final residual of the fields predicted by the neural network are of the order of ~0.5, compared to the numerical residual produced by FDFD which is on the order of 10⁻¹⁶. Although there is a large difference, in the next section we show that this still produces a simulator which is capable of outperforming the LPA when optimizing the efficiency of a metalens. Figure 2a shows an example of a field predicted from a random set of pillars by the neural network, by a 2D FDFD code, and their difference, showing good qualitative match. A more quantitative measure of the errors is shown in Fig. 2b, where we show the point-wise error probability density functions for the relative error between the complex fields predicted by FDFD and that predicted by the neural network and the field predicted under LPA, and the absolute error between pillar-wise average transmission coefficients. See Supplementary Note 3 for a more detailed description of the pillar wise transmission coefficient error. The relative error is expressed as:

$$\frac{{\left\Vert {E}_{{{{{{{{\rm{approx}}}}}}}}}-{E}_{{{{{{{{\rm{FDFD}}}}}}}}}\right\Vert }_{2}^{2}}{{\left\Vert {E}_{{{{{{{{\rm{FDFD}}}}}}}}}\right\Vert }_{2}^{2}}.$$

(6)

For the PINN, E_approx is the field predicted from a set of 11 pillars. For the LPA, E_approx is fields predicted from the same set of pillars, and then stitched together over the same region. See Supplementary Fig. 2 for a visual explanation. The mean expected relative error for the neural network is μ = 0.21 with a standard deviation of σ = 0.103. When using the LPA over the same region, we get a mean relative error of μ = 1.01 with a standard deviation of 0.411. Thus, based on the relative field error, our method is 4.8 × more accurate than the LPA. For the pillar-wise transmission coefficient error, we get an expected error of μ = 0.051 for the neural network with a standard deviation of σ = 0.033 and for the LPA method we get an expected error of μ = 0.38 with a standard deviation of 0.14. Thus, based on the transmission coefficient error, our method is 7.2 × more accurate than the LPA.

**Fig. 2: Neural network performance analysis.**

Device optimization

The optimization process based on automatic differentiation functionality of PyTorch for large area meta-optics is outlined in Fig. 3. The forward problem is solved via a pre-trained PINN. Since the input into the neural net is a meshed grid of pillars, a differentiable map from pillar half-widths (3a.) to meshed geometries (3b) must be generated. This is achieved by generating Gaussian functions centered around pillar centers, with standard deviations of pillar half-widths in the x dimension, and pillar height in the y dimension, and then using a modified softmax function to transform the Gaussians into rectangles with slightly rounded edges, making them differentiable via automatic differentiation (see Supplementary Note 4 and Supplementary Fig. 3). The meshed structures are fed into two separate neural networks that have been pre-trained to predict the complex electric field (3c.). The fields are then stitched together with regions of the outer half-widths overlapping. The total field is then propagated using the angular spectrum method (3d). The propagated field is used to calculate the FOM f (3e.)from Eq. (5). We use automatic differentiation to compute the gradients of the FOM with respect to the input half-widths ${\nabla }_{\overrightarrow{r}}f$, and iteratively update them with the ADAM optimizer⁵⁰.

**Fig. 3: Optimization strategy of 2D meta-optics with physics informed neural networks (PINNs).**

Results

We used the PINN surrogate model to optimize 9 different lenses, all with 1 mm aperture, with focal lengths ranging from 250 to 1500 μm in increments of 250 μm. The minimum feature size is set to 75 nm, to ensure fabricability. To compare our optimization approach, we also generated lenses according to the hyperboloid phase equation:

$$\phi (x,y)=\frac{2\pi }{\lambda }\left(\sqrt{{x}^{2}+{F}^{2}}-F\right)$$

(7)

The phase is implemented under LPA using SiN (refractive index 2), a wavelength of 0.633μm, and periodicity of p = 0.443 μm (see Supplementary Fig. 9). We then optimize the lens employing our PINN to increase the intensity at the focal spot, i.e., the FOM is given by:

$$f=| E(x=0,z=F){| }^{2}$$

(8)

Figure 4a and b show the intensity profile of a forward designed and optimized lens with F = 500 μm focal length. Figure 4c shows the normalized intensity slice at the focal spot of both lenses. As seen in Fig. 4d the maximum intensities at the focal spots improve in every case. Figure 4e shows that the efficiency improves in all except for the lens with the highest NA. We also find a trend that the improvement in the maximum intensity of the inverse-designed meta-lens over the forward-design meta-lens increases with increasing NA. As with higher NA, the phase gradient becomes larger, we expect the LPA to be a worse approximation. Interpreting the efficiency improvement is more convoluted. We defined the efficiency as the ratio of the light energy inside a circle of radius of three times the full width half maximum (FWHM) at the focal spot over the total energy in the focal plane. With increasing NA, the FWHM decreases, making the efficiency improvement lower with increased NA. For the highest NA, the FWHM of the inverse-designed meta-lens is significantly lower than the forward-designed meta-lens, making the efficiency lower.

**Fig. 4: Efficiency and intensity sweeps of forward designed lenses and optimized lenses.**

We validated our designs by fabricating and experimentally testing the meta-lenses using a microscope (details of fabrication and characterization in Supplementary Note 1.1 and Supplementary Note 1.2). Figure 5 shows an example of the inverse optimized device. Figure 5a–c shows the scanning electron micrographs (SEMs) of the fabricated optimized lens with focal length F = 500μm. Figure 5d shows the distribution of the dielectric pillar half-widths of the same forward and optimized lens. signifying the two designs are very different. Figure 5e shows the focal spot intensities of the lenses integrated over a r = 3 × FWHM region at the focal spot, which yields a quantitative value to compare the lens efficiency⁵¹ among different devices. Figure 5f plots the maximum intensity plot as a function of the lens NA. For optimized lenses with NA > 0.44, we see improvements of more than 25%, with a maximum improvement of 53% for the NA = 0.9 lens. The experimentally determined intensity integral, which is analogous to the efficiency of a lens, on has improvements of more than 18% in all cases except for the NA=0.9 case. This is because the FWHM of the optimized lens at the NA=0.9 case is actually smaller than the FWHM of the forward designed lens, leading to a smaller integration area when computing the energy. We note that, while a quantitative match between the experiment and design is not obtained, we did observe a similar trend in terms of improved intensity and efficiency as predicted by the theory. Fig. 5g shows experimentally measured field profiles of the forward designed F = 500μm meta-lens. Figure 5h shows the same for an optimized lens. Figure 5i is the slice of the focal spot intensity profile along the z = F plane. In all these figures, the intensity is normalized such that the maximum intensity of the forward designed lens is 1.

Discussion

We have developed a PINN to use as a proxy surrogate model for simulating the full Maxwell’s equations to design dielectric meta-optics. We used the PINN to optimize pillar half-widths to maximize the intensity at the focal spot of 1 mm aperture cylindrical meta-lenses at 633nm. We demonstrated experimental improvements of the maximum intensity of the lenses up to 53%. We also want to note that this method was useful for the inverse design of extended depth of focus lenses¹⁰ (see Supplementary Note 7 and Supplementary Fig. 6). This model did not use the LPA, but simulated meta-atoms by splitting up the device into chunks with overlapping boundaries, and stitching the chunks together to approximate the full field response. We emphasize that FDFD simulations were never carried out to train the PINN, and we only minimized the residual of the PDE itself to train the network. The PINN training took approximately 2 hours on our machine. In our studies, this method provided approximately a 3-5x speedup (see Supplementary Figure 8 and Supplementary Table 1) over conventional FDFD with overlapping boundary conditions, and was much simpler to use as a forward simulator for optimization problems since it can be used as a simple map from ϵ to E-field with gradients computed by automatic differentiation.

We would also like to note that the theoretical intensity and efficiency improvements are smaller than their experimental counterparts. While we do not have a clear explanation for this discrepancy, the theoretical and experimental trends in lens improvement are similar. One hypothesis could be that the the inverse designed lenses may be more tolerant to fabrication imperfections. However, randomly changing the scatterers in our meta-optics by 10% did not give a similar enhancement. As such this aspect of the quantitative match between experiment and theory remains an open problem and further studies will be needed. We would like to point out however the importance of experimentally verifying inverse design methodologies, since in our studies we used open source codes that produce reasonable looking results, but are not experimentally accurate (see Supplementary Note 8 and Supplementary Fig. 7). The inverse design solution we introduced in this paper can be integrated into various computationally intensive tasks which require mate-optical inverse design such as the end-to-end optimization of computational imaging systems and the design of optical neural networks^52,53. It is worth noting, however, that this method is not a general numerical solver. It is limited to predicting electromagnetic field responses from fixed source, material, and boundary parameters. Source type and k-vector, dielectric constant, geometry type (rectangular pillar of fixed height), and boundary conditions must all remain constant for this method to work. If any of these parameters are modified, the PINN must be retrained. Furthermore, the method we presented was only implemented under a 2D approximation. Extending this method to 3 dimensions would take significant effort due to the fact that the electric field ${{{{{{{\mathcal{E}}}}}}}}$ could no longer be treated as a scalar field, and the full vector nature would have to be modeled. On a n × n grid in 2D, the Maxwell operator [∇² + ω²ϵ] results in a n² × n² matrix, while for a n × n × n 3D grid the Maxwell operator [ ∇ × ∇ × − ω²ε] result in 9n³ × 9n³ square matrices due to the additional 2 vector field components that must be modeled. However, these operators are sparse with a small number of nonzero elements that scale as ~ 38n³ in 3D, making small problems still manageable. The other problem with generalizing this method to 3D is the large null-space of the ∇ × ∇ × operator which results in slow convergence of numerical methods^54,55. It is highly likely that this could also affect the training of the PINN, and require regularization or preconditioning which deflates the null space of this operator to properly converge onto a solution. On the other hand, in this work we showed that machine-precision numerical accuracy of numerical solvers may be not be needed for inverse design methods with FDFD. Solvers could be sped up by relaxing the relative error tolerance, such that iterative solvers converge quicker for predicting the forward and adjoint problems. Another interesting aspect will be to understand the optimal PINN to model the meta-optics, and if we can identify a relationship between the number of trainable parameters and size of the problem we are solving. In future work we aim to explore these options.

Data availability

Data sets generated in the paper are available from the corresponding author on reasonable request.

Code availability

Code for the project is available at https://github.com/demroz/pinn-ms.

Change history

03 October 2023
A Correction to this paper has been published: https://doi.org/10.1038/s44172-023-00114-y

References

Şumnu, A., Güzelbey, İ. H. & Öğücü, O. Aerodynamic shape optimization of a missile using a multiobjective genetic algorithm. Int. J. Aerospace Eng. 2020, 1528435 (2020).
Article Google Scholar
Molesky, S. et al. Inverse design in nanophotonics. Nat. Photo. 12, 659–670 (2018).
Article Google Scholar
Yee, K. Numerical solution of initial boundary value problems involving maxwell’s equations in isotropic media. IEEE Trans. Antennas Propagation 14, 302–307 (1966).
Article MATH Google Scholar
Rumpf, R. C., Garcia, C. R., Berry, E. A. & Barton, J. H. Finite-difference frequency-domain algorithm for modeling electromagnetic scattering from general anisotropic objects. Prog. Electromag. Res. B 61, 55–67 (2014).
Article Google Scholar
Shin, W.3D Finite-Difference Frequency-Domain Method for Plasmonics and Nanophotonics. Ph.D. thesis https://www.proquest.com/dissertations-theses/3d-finite-difference-frequency-domain-method/docview/2463202947/se-2 (2013).
Zhan, A., Fryett, T. K., Colburn, S. & Majumdar, A. Inverse design of optical elements based on arrays of dielectric spheres. Appl. Opt. 57, 1437–1446 (2018).
Article Google Scholar
Piggott, A. Y. et al. Inverse design and demonstration of a compact and broadband on-chip wavelength demultiplexer. Nat. Photo. 9, 374–377 (2015).
Article Google Scholar
Piggott, A. Y., Petykiewicz, J., Su, L. & Vučković, J. Fabrication-constrained nanophotonic inverse design. Sci. Rep. 7, 1786 (2017).
Article Google Scholar
Pestourie, R. et al. Inverse design of large-area metasurfaces. Opt. Exp. 26, 33732–33747 (2018).
Article Google Scholar
Bayati, E. et al. Inverse designed metalenses with extended depth of focus. ACS Photo. 7, 873–878 (2020).
Article Google Scholar
Zhelyeznyakov, M. V., Brunton, S. & Majumdar, A. Deep learning to accelerate scatterer-to-field mapping for inverse design of dielectric metasurfaces. ACS Photo. 8, 481–488 (2021).
Article Google Scholar
Elsawy, M. M. R. et al. Global optimization of metasurface designs using statistical learning methods. Sci. Rep. 9, 17918 (2019).
Article Google Scholar
Park, J. et al. Free-form optimization of nanophotonic devices: from classical methods to deep learning. Nanophotonics 11, 1809–1845 (2022).
Article Google Scholar
Zhelyeznyakov, M. V., Zhan, A. & Majumdar, A. Design and optimization of ellipsoid scatterer-based metasurfaces via the inverse t-matrix method. OSA Cont. 3, 89–103 (2020).
Article Google Scholar
Munley, C. et al. Inverse-designed meta-optics with spectral-spatial engineered response to mimic color perception. Adv. Opt. Mater. 10, 2200734 (2022).
Article Google Scholar
Chung, H. & Miller, O. D. High-na achromatic metalenses by inverse design. Opt. Exp. 28, 6945–6965 (2020).
Article Google Scholar
Bayati, E., Zhan, A., Colburn, S., Zhelyeznyakov, M. V. & Majumdar, A. Role of refractive index in metalens performance. Appl. Opt. 58, 1460–1466 (2019).
Article Google Scholar
Mansouree, M. et al. Multifunctional 2.5d metastructures enabled by adjoint optimization. Optica 7, 77–84 (2020).
Article Google Scholar
Zhan, A. et al. Controlling three-dimensional optical fields via inverse mie scattering. Sci. Adv. 5, eaax4769 (2019).
Article Google Scholar
Li, W. F. et al. Transcending shift-invariance in the paraxial regime via end-to-end inverse design of freeform nanophotonics. https://arxiv.org/abs/2302.01712 (2023).
Byrnes, S. J., Lenef, A., Aieta, F. & Capasso, F. Designing large, high-efficiency, high-numerical-aperture, transmissive meta-lenses for visible light. Opt. Exp. 24, 5110–5124 (2016).
Article Google Scholar
Spägele, C. et al. Multifunctional wide-angle optics and lasing based on supercell metasurfaces. Nat. Commun. 12, 3787 (2021).
Article Google Scholar
Hsu, L., Dupré, M., Ndao, A., Yellowhair, J. & Kanté, B. Local phase method for designing and optimizing metasurface devices. Opt. Exp. 25, 24974–24982 (2017).
Article Google Scholar
Phan, T. et al. High-efficiency, large-area, topology-optimized metasurfaces. Light Sci. Appl. 8, 48 (2019).
Article Google Scholar
Lin, Z., Liu, V., Pestourie, R. & Johnson, S. G. Topology optimization of freeform large-area metasurfaces. Opt. Exp. 27, 15765–15775 (2019).
Article Google Scholar
Torfeh, M. & Arbabi, A. Modeling metasurfaces using discrete-space impulse response technique. https://arxiv.org/abs/2003.06683 (2020).
Skarda, J. et al. Low-overhead distribution strategy for simulation and optimization of large-area metasurfaces. Comput. Mate. 8, 78 (2022).
Article Google Scholar
Shlezinger, N., Whang, J., Eldar, Y. C. & Dimakis, A. G. Model-based deep learning. https://arxiv.org/abs/2012.08405 (2020).
Malkiel, I. et al. Plasmonic nanostructure design and characterization via deep learning. Light: Sci. Appl. 7, 60 (2018).
Article Google Scholar
Li, X., Shu, J., Gu, W. & Gao, L. Deep neural network for plasmonic sensor modeling. Opt. Mater. Exp. 9, 3857–3862 (2019).
Article Google Scholar
Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 4 (2018). https://advances.sciencemag.org/content/4/6/eaar4206. https://advances.sciencemag.org/content/4/6/eaar4206.full.pdf.
Kiarashinejad, Y. et al. Knowledge discovery in nanophotonics using geometric deep learning. Adv. Intell. Syst. 2, 1900132 (2019).
Article Google Scholar
So, S., Badloe, T., Noh, J., Bravo-Abad, J. & Rho, J. Deep learning enabled inverse design in nanophotonics. Nanophotonics 9, 1041–1057 (2020).
Article Google Scholar
Liu, D., Tan, Y., Khoram, E. & Yu, Z. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photo. 5, 1365–1369 (2018).
Article Google Scholar
Gao, L., Li, X., Liu, D., Wang, L. & Yu, Z. A bidirectional deep neural network for accurate silicon color design. Adv. Mater. 31, 1905467 (2019).
Article Google Scholar
An, S. et al. A deep learning approach for objective-driven all-dielectric metasurface design. ACS Photo. 6, 3196–3207 (2019).
Article Google Scholar
Raissi, M., Perdikaris, P. & Karniadakis, G. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 378, 686–707 (2019).
Article MathSciNet MATH Google Scholar
Lu, L., Meng, X., Mao, Z. & Karniadakis, G. E. Deepxde: A deep learning library for solving differential equations. SIAM Rev. 63, 208–228 (2021).
Article MathSciNet MATH Google Scholar
Karumuri, S., Tripathy, R., Bilionis, I. & Panchal, J. Simulator-free solution of high-dimensional stochastic elliptic partial differential equations using deep neural networks. J. Comput. Phys. 404, 109120 (2020).
Article MathSciNet MATH Google Scholar
Raissi, M., Yazdani, A. & Karniadakis, G. E. Hidden fluid mechanics: Learning velocity and pressure fields from flow visualizations. Science 367, 1026–1030 (2020).
Article MathSciNet MATH Google Scholar
Zhu, Y., Zabaras, N., Koutsourelakis, P.-S. & Perdikaris, P. Physics-constrained deep learning for high-dimensional surrogate modeling and uncertainty quantification without labeled data. J. Comput. Phys. 394, 56–81 (2019).
Article MathSciNet MATH Google Scholar
Tartakovsky, A., Marrero, C., Perdikaris, P., Tartakovsky, G. & Barajas-Solano, D. Physics-informed deep neural networks for learning parameters and constitutive relationships in subsurface flow problems. Water Res. Res. 56, e2019WR026731 (2020).
Article Google Scholar
Yazdani, A., Lu, L., Raissi, M. & Karniadakis, G. E. Systems biology informed deep learning for inferring parameters and hidden dynamics. PLoS Comput. Biol 16, e1007575 (2020).
Article Google Scholar
Zhang, D., Lu, L., Guo, L. & Karniadakis, G. E. Quantifying total uncertainty in physics-informed neural networks for solving forward and inverse stochastic problems. J. Comput. Phys. 397, 108850 (2019).
Article MathSciNet MATH Google Scholar
Chen, Y., Lu, L., Karniadakis, G. E. & Dal Negro, L. Physics-informed neural networks for inverse problems in nano-optics and metamaterials. Opt Exp. 28, 11618–11633 (2020).
Article Google Scholar
Lu, L. et al. Physics-informed neural networks with hard constraints for inverse design. SIAM J. Sci. Comput. 43, B1105–B1132 (2021).
Article MathSciNet MATH Google Scholar
Chen, M. et al. High speed simulation and freeform optimization of nanophotonic devices with physics-augmented deep learning. ACS Photonics. https://doi.org/10.1021/acsphotonics.2c00876 (2022).
Valantinas, L. & Vettenburg, T. A physics-defined recurrent neural network to compute coherent light wave scattering on the millimetre scale. https://arxiv.org/abs/2208.01118 (2022).
Hughes, T. W., Minkov, M., Williamson, I. A. D. & Fan, S. Adjoint method and inverse design for nonlinear nanophotonic devices. ACS Photo. 5, 4781–4787 (2018).
Article Google Scholar
Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
Arbabi, A., Horie, Y., Ball, A. J., Bagheri, M. & Faraon, A. Subwavelength-thick lenses with high numerical apertures and large efficiency based on high-contrast transmitarrays. Nat. Commun. 6, 7069 (2015).
Article Google Scholar
Tseng, E. et al. Neural nano-optics for high-quality thin lens imaging. Nat. Commun. 12, 6493 (2021).
Article Google Scholar
Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).
Article MathSciNet MATH Google Scholar
Newman, G. A. & Alumbaugh, D. L. Three-dimensional induction logging problems, Part 2: A finite-difference solution. Geophysics 67, 484–491 (2002).
Article Google Scholar
Newman, G. & Weiss, C. Electromagnetic induction in a generalized 3d anisotropic earth, part 2: The lin preconditioner. Geophysics 68, 922–30 (2003).
Article Google Scholar

Download references

Acknowledgements

This research was funded by NSF-GCR-2120774. M.V. Z. is supported by an NSF graduate fellowship. Part of this work was conducted at the Washington Nanofabrication Facility/Molecular Analysis Facility, a National Nanotechnology Coordinated Infrastructure (NNCI) site at the University of Washington, with partial support from the National Science Foundation via Awards NNCI-1542101 and NNCI-2025489.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Washington, Seattle, 98195, Washington, USA
Maksym Zhelyeznyakov, Johannes Fröch & Arka Majumdar
Department of Physics, University of Washington, Seattle, 98195, WA, USA
Johannes Fröch, Anna Wirth-Singh & Arka Majumdar
Department of Mechanical Engineering, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
Jaebum Noh & Junsuk Rho
Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
Junsuk Rho
POSCO-POSTECH-RIST Convergence Research Center for Flat Optics and Metaphotonics, Pohang University of Science and Technology (POSTECH), Pohang, 37673, Republic of Korea
Junsuk Rho
Department of Mechanical Engineering, University of Washington, Seattle, 98195, WA, USA
Steve Brunton

Authors

Maksym Zhelyeznyakov
View author publications
You can also search for this author in PubMed Google Scholar
Johannes Fröch
View author publications
You can also search for this author in PubMed Google Scholar
Anna Wirth-Singh
View author publications
You can also search for this author in PubMed Google Scholar
Jaebum Noh
View author publications
You can also search for this author in PubMed Google Scholar
Junsuk Rho
View author publications
You can also search for this author in PubMed Google Scholar
Steve Brunton
View author publications
You can also search for this author in PubMed Google Scholar
Arka Majumdar
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.V.Z. conceptualized the project. M.V.Z. designed the methodology, wrote the software, trained the machine learning model, and performed the data analysis. J.F. manufactured the metasurfaces and experimentally validated them. A.W.S. validated metasurface designs in simulation. J.N. assisted with validating metasurface designs in simulation. J.R. provided compute resources for training the machine learning model. S.B. and A.M. supervised the project. Initial version of manuscript was written by M.V.Z.

Corresponding authors

Correspondence to Maksym Zhelyeznyakov or Arka Majumdar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

: Communications Engineering thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Rosamund Daw, Mengying Su. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhelyeznyakov, M., Fröch, J., Wirth-Singh, A. et al. Large area optimization of meta-lens via data-free machine learning. Commun Eng 2, 60 (2023). https://doi.org/10.1038/s44172-023-00107-x

Download citation

Received: 25 March 2023
Accepted: 31 July 2023
Published: 21 August 2023
DOI: https://doi.org/10.1038/s44172-023-00107-x