Neural Inverse Design of Nanostructures (NIDN)

Gómez, Pablo; Toftevaag, Håvard Hem; Bogen-Storø, Torbjørn; Aranguren van Egmond, Derek; Llorens, José M.

doi:10.1038/s41598-022-26312-w

Download PDF

Article
Open access
Published: 22 December 2022

Neural Inverse Design of Nanostructures (NIDN)

Pablo Gómez¹,
Håvard Hem Toftevaag¹,
Torbjørn Bogen-Storø¹,
Derek Aranguren van Egmond¹ &
…
José M. Llorens²

Scientific Reports volume 12, Article number: 22160 (2022) Cite this article

2937 Accesses
1 Citations
4 Altmetric
Metrics details

Subjects

Abstract

In the recent decade, computational tools have become central in material design, allowing rapid development cycles at reduced costs. Machine learning tools are especially on the rise in photonics. However, the inversion of the Maxwell equations needed for the design is particularly challenging from an optimization standpoint, requiring sophisticated software. We present an innovative, open-source software tool called Neural Inverse Design of Nanostructures (NIDN) that allows designing complex, stacked material nanostructures using a physics-based deep learning approach. Instead of a derivative-free or data-driven optimization or learning method, we perform a gradient-based neural network training where we directly optimize the material and its structure based on its spectral characteristics. NIDN supports two different solvers, rigorous coupled-wave analysis and a finite-difference time-domain method. The utility and validity of NIDN are demonstrated on several synthetic examples as well as the design of a 1550 nm filter and anti-reflection coating. Results match experimental baselines, other simulation tools, and the desired spectral characteristics. Given its full modularity in regard to network architectures and Maxwell solvers as well as open-source, permissive availability, NIDN will be able to support computational material design processes in a broad range of applications.

A comprehensive deep learning method for empirical spectral prediction and its quantitative validation of nano-structured dimers

Article Open access 20 January 2023

Deep learning approach based on dimensionality reduction for designing electromagnetic nanostructures

Article Open access 04 February 2020

A deep learning method for empirical spectral prediction and inverse design of all-optical nonlinear plasmonic ring resonator switches

Article Open access 09 March 2024

Introduction

Computational tools have been playing an increasingly central role in material design in recent decades. With lower costs and rapid development cycles they are an efficient starting point in the material design process that can later be complemented with experimental validation and refinement¹. Especially machine learning techniques have recently seen widespread adoption in applications ranging from predicting material properties to designing nanostructures^2,3 and this work aims to advance and support these efforts.

Brute-force parameter sweeps are increasingly being replaced by more sophisticated optimization methods in the design of metamaterials and nanophotonic devices^{4,5,6,7,8,9,10}. Indeed, many recent works rely especially on neural network-based approaches for this^4,11,12,13. And, instead of letting human intuition and analytically derived parameters determine the architecture, nanostructured devices can now be autonomously designed for high performance, made possible by the exhaustive exploration of the design space that such algorithms perform⁷. In this context, this process of determining the physical characteristics from certain properties¹⁴, e.g., finding the geometry of a structure for a given optical response, is called inverse design. Recently, there have been a lot of works on inverse design for nanophotonics using different techniques, such as generative adversarial networks⁴, variational auto-encoders⁵ and other machine learning and optimization methods^1,2,3.

One area where this has received a lot of attention is the space sector, where reducing the size, weight, power, and cost of objects put into orbit is not only a high priority, but can in many cases also determine the feasibility of the project. One example are photon sails, which must have sail thicknesses on the order of only a few micrometers (the IKAROS solar sail was only 7.5 μm thick¹⁵) in order to avoid an unsuitable weight. Recently, there has been research on using metasurfaces for solar sails, made with both conventional design¹⁶ and inverse design¹⁷ methods. These metasurfaces might offer, for instance, attitude control¹⁶ and mass reduction¹⁷ while maintaining a high efficiency.

Recent works have also applied metamaterial knowledge to the optical solar reflector (OSR), a structure placed on the external surface of a spacecraft that simultaneously reflects incoming sunlight and emit infrared radiation. Instead of using the conventional design of planar structures, Sun et al. showed that a simple metasurface can reduce the thickness^18,19. The terrestrial cousin of the OSR, radiative cooling structures, have followed a similar trend recently²⁰.

Overall, these methods rely on sophisticated software tools and datasets which has inspired efforts to make those available openly as, e.g., in the work of Jiang et al.²¹. In a similar vain, in this work, we present an open-source software package called Neural Inverse Design of Nanostructures (NIDN) that allows designing nanostructures with neural networks using two different, fully differentiable implementations of Maxwell equations solvers based on the finite-difference time-domain (FDTD) method²² and rigorous coupled-wave analysis (RCWA)²³. RCWA is a natural and efficient choice for stacked materials designed with NIDN. The relevance of this is also evident as another group has also concurrently undertaken the effort of implementing RCWA with PyTorch²⁴. FDTD complements it by supporting design in the visible light range of the spectrum and—depending on geometry—providing higher accuracy²⁵. Both, RCWA and FDTD, enable the machine-learning-based design of multilayer materials with patterned nanoscale grids to achieve a desired spectrum of reflectance, transmittance and absorptance. Thus, NIDN has the potential to support a wide range of the aforementioned applications. In comparison to previous similar efforts such as those by Jiang et al.²⁶ we utilize a fundamentally different technique relying on directly encoding the material inside a neural network.

Building on advances in physics-based deep learning²⁷, in NIDN, we directly optimize the nanostructure described by the neural network to achieve one specific spectrum using gradient information from the physical simulation via a loss. In this approach, no prior knowledge or training datasets are required as gradients are backpropagated through a (differentiable) numerical solver. Also, as the neural network does not need to solve the problem of fully describing the relationship between any material and spectral characteristics the task is more tractable. In many previous approaches^28,29, neural networks effectively need to invert the Maxwell equations to match material and spectral characteristics to each other. In NIDN, by solving for a specific reflectance, transmittance and absorptance spectrum, we—from a mathematical perspective—constrain the inversion to a specific solution of the Maxwell equations. This furthermore avoids being limited to only a few degrees of freedom, such as placing a specific structure on top of another, in the material design^29,30; instead we can fully explore the entire search space of structures. Several nanosurface modification strategies have been employed to tune spectral response in optical devices. NIDN specifically excels in the parametric optimization of heterogeneous, multi-layer devices, which use local composition to tune electromagnetic properties. This is in contrast to the geometric structuring of a single material surface, which has limited local spectral tunability.

There is overall a larger trend in the related research field of topology optimization to use inverse design approaches based on neural networks, where various works have explored this for problems in related domains of material design^31,32,33. However, to the authors knowledge, these works have not explored a similar setup for inverse design in photonics which is challenging as a Maxwell solver requires full support of complex numbers.

With NIDN, the neural network learns to produce a spatially continuous function describing the material permittivity. We demonstrate the validity and versatility of the approach by comparing to existing implementations that we built on. We showcase several examples ranging from simple uniform layers to practical applications related to photonic filter design and perfect anti-reflection. NIDN is also capable of running on graphics cards (GPUs) and is available open-source online (https://github.com/esa/NIDN Accessed: 2022-04-21). The authors hope to build a community around the software, extending its capabilities and fidelity even further in the future.

Results

In the following, we showcase results demonstrating NIDN’s capabilities. In particular, we first validate the numerical Maxwell solvers in NIDN against baselines, then show the efficacy of the inverse design on some simple materials, and, finally, investigate two common design scenarios from the literature and perform a comparison with related approaches.

Material design setup

The following results were obtained using NIDN and its Maxwell solvers derived from the RCWA implementations by Jin et al.¹⁷ and an adaption of an open-source implementation of FDTD in PyTorch 1.9. Both are described in more detail in the Section Methods. NIDN utilizes PyTorch 1.9 to enable automatic differentiation and neural network training to design materials of stacked layers such as shown in Fig. 1. Details on neural network architectures and training as well as the detailed training setup are given in the Section Methods. Data on material permittivity are obtained from an online database.(https://refractiveindex.info Accessed: 2022-04-21) More detailed info on specific references for each material are provided in the integrated database of materials in NIDN (https://github.com/esa/NIDN/tree/main/nidn/materials/data.

Permittivity between provided measurement points is interpolated linearly. Layer thickness if not specified otherwise is 1 μm. Grid dimensions if not specified are ${1}\,{\upmu }\text{m}\times {1}\,{\upmu }\text{m}$. In general, NIDN supports two modes of inverse design which are described in more detail in the Section Methods: The first one is referred to as regression, where the permittivity of the material is predicted within some value range to investigate hypothetical materials achieving the target spectral characteristics. If not specified otherwise, the range for relative permittivity $\epsilon _r$ considered by the network is $[0.01 , 20 + 3i]$. All references to permittivity refer to relative permittivity. The second approach is called classification and the permittivity in this case is chosen as a linear combination of reported permittivity of real materials. All considered materials are listed in the codebase. At present, only normal incidence is considered. Figure 1 gives a high-level overview of the training process.

The number of training iterations depended on the task and solver; for RCWA between 1000 and 5000 training iterations were used, for FDTD not more than 150 iterations due to the higher computation time. No dispersion model is used and frequencies are computed independently. For a single uniform layer, a single training iteration with FDTD requires about 8 s per frequency, with RCWA it takes six milliseconds per frequency per iteration. L1 errors reported in the figures refer to the mean absolute error between the target and obtained spectrum. Inversion with FDTD used a coarser grid in the numerical discretization than for validation to keep the computational cost manageable (32 instead of 100).

Validation and numerical stability

NIDN contains a fully differentiable RCWA implementation that is derived from the Python module GRCWA¹⁷. For RCWA, a full re-implementation of the GRCWA code to use PyTorch was undertaken replacing many components with adequate alternatives in PyTorch. Extensive tests are performed to validate both, RCWA and FDTD, implementations against the original implementations and results of these can be found in the Supplementary Information.

NIDN adapts the FDTD implementation from the Python module with the same name (https://github.com/flaport/fdtd Accessed: 2022-04-21) but adds support for gradient flow through it and performs post processing. A comparison with an experimental baseline is given to validate the implemented post processing. Further, we directly compare results with the original Python module to validate our changes to it. Overall, these implementations are major contributions of this paper as previously no Maxwell solver in PyTorch was available that allowed backpropagation through the entire simulation. This is also particularly challenging given the dependence on using complex numbers for permittivities and the simualtion as, at the start of this research work, there was a lack of support for complex numbers in PyTorch.

A comparison with an experimental measurement of the transmission spectrum of $\text{TiO}_{2}$ is performed for both. The $\text{TiO}_{2}$ layer in the experiment by Wojcieszak et al.³⁴ was 380 nm thick and transmission of wavelengths between 300 and 900 nm were measured using a spectrophotometer. Figure 2 displays transmittance obtained with NIDN RCWA and FDTD using continuous waves with wavelengths from 300 to 900 nm as well as the experimental result. Overall, it can be seen that the transmittance in NIDN resembles the one observed in the experiment. Point clustering in the FDTD version is a discretization effect due to the number of grid points used in the simulation.

Material and structure inference

As a first demonstration of NIDN’s capabilities, we present results on some rudimentary examples. These examples aim to reconstruct a material for which the spectral characteristics were computed with RCWA / FDTD, thus the inversion is certain to have a solution obtainable by the network. In particular, these results show that, depending on the complexity of the spectral characteristics, almost perfect results can be obtained. Furthermore, we can investigate how unique solutions are by studying the similarity of designed materials with the ground-truth baseline.

Uniform $\text{TiO}_{2}$ layer inversion

First, we show that NIDN is capable of recreating the permittivity of a single layer of $\text{TiO}_{2}$, with FDTD in the visible range of the spectrum (for RCWA a more complex three-layer example is tested).

Figure 3 showcases these results. Note that to save computational time, a limited number of frequency points, i.e. 20, were investigated. A clear convergence towards the ground truth is visible with a detailed replication of the desired reflectance spectrum and slightly higher than targeted transmittance. Peaks in the transmittance are at the targeted wavelengths. Overall, the replication of the spectral characteristics is not perfect but strongly resembles the target. However, the predicted permittivity clearly differs from the real one of $\text{TiO}_{2}$ with a higher real permittivity in the learnt material. Thus, the network seems to have found a solution leading to similar spectral characteristics with a different material.

Even though the inversion with the regression approach—in terms of reproducing the target spectrum—was successful, it was not constrained to realistic materials. In Fig. 4 we show results for the classification approach for this single-layer material where we constrain the network further in terms of allowed permittivities. Overall, the inversion is successful and the obtained spectral characteristics are even closer to the target. Furthermore, as seen in Fig. 4b, the used epsilon value now corresponds well to the experimental permittivity of $\text{TiO}_{2}$ and thus the network was able to identify the exact material required for this spectrum.

Three-layer uniform material

The second example is the design of a three-layer material consisting of $\text{TiO}_{2}$, Germanium ($\text{Ge}$) and $\text{Ta}_{2}\text{O}_{5}$. Figure 5 displays the regression results with RCWA for this material. The design of the spectral properties is successful and the reflectance is achieved flawlessly. A small difference is noticeable at the lower end of the spectrum for the transmittance. Similar as for the previous inversion with FDTD, however, permittivities of the different layers are notably further from real materials. The obtained permittivity is not resemblant of the materials to generate the target spectrum. This again demonstrates the ill-posed nature of this inversion task where there are non-unique solutions that lead to the desired spectral characteristics.

Similar to FDTD, with the classification and RCWA approach, the restriction for the permittivity actually leads to a virtually flawless reconstruction of the spectral characteristics and to the network learning the permittivity of the utilized materials in the ground truth for each layer, as can be seen in Fig. 6. Furthermore, the achieved spectral characteristics also almost perfectly match the design goal of the ground truth. Thus, the classification was successfully used to constrain the network to realistic material choices.

Two-layer patterned material

The last case based on a ground truth from the forward models (i.e., the RCWA and FDTD implementations) is a two-layer patterned material consisting of $\text{TiO}_{2}$ and $\text{Ge}$ with a square pattern in the center similar to those shown in the Supplementary Information. In the upper layer the square is made of $\text{TiO}_{2}$ and surroundings of $\text{Ge}$ and vice versa for the bottom layer. This test case is only studied with RCWA as support for patterned layers is currently still in development for FDTD.

Figure 7 displays the obtained spectral characteristics with regression and RCWA. The desired spectrum is reproduced very well as can be seen in Fig. 7a with both, reflectance and transmittance, matching the target spectral characteristics precisely. However, the utilized permittivity is not close to any material available in NIDN as Fig. 7b shows and the reconstructed pattern is not resemblant of the ground truth. Again, the regression finds a solution that leads to an excellent reconstruction of the desired spectral properties with different material properties than used for the ground truth. Interestingly, the utilized permittivities are somewhat less erratic than in the previous cases (compare Figs. 7 and 3). And, it seems that there are two distinct materials (i.e. permittivity) that the network was converging to. These did not match the ones in NIDN’s dataset though.

For the classification approach and RCWA, shown in Figure. 8, comparable errors in terms of reproducing the spectral characteristics are obtained. In this case—as can be seen in Fig. 8b—the permittivity chosen by the network is close to those of used in the ground truth, i.e., $\text{TiO}_{2}$ and $\text{Ge}$. The pattern of the ground truth was not reconstructed implying that the network found a good solution without it.

Filter design

The first test case closer to a potential application is the design of a bandpass filter. In this case, the target spectral properties are defined manually. Hence, the spectral characteristics may actually not be obtainable with the material properties the network is allowed to utilize. A stack of ten uniform layers is used to design the material. Due to the larger computational cost of FDTD, only RCWA was studied in this case. Also, the classification for this test case did not converge—likely because the available materials in database did not suffice to find a solution for this problem.

Regression

With the regression, the chosen range for the relative permittivity is $\epsilon _r \in [0 , 20 + 1i]$. The achieved spectral characteristics with regression and RCWA of the designed filter are shown in Fig. 9. The designed material captures the gap at the chosen wavelength around 1550 nm very well with a transmittance around 0.98 at those frequencies and a reflectance over 0.98 for the other wavelengths. Notably, the utilized permittivity is not similar to those of existing materials with erratic peaks in the real component of the permittivity for some of the layers, especially for lower wavelengths. Interestingly for higher wavelengths, the networks seems to converge to three distinct permittivities in the real part near 0, 20 and about 12.5.

Perfect anti-reflection coating

This second test case related to an application explores the scenario of building a perfect anti-reflection on top of a substrate with relative permittivity $\epsilon _r = 16$. This is closely related to the research described by Kim & Park³⁵. Thus, we use a stack of eight ${0.1}\,{\upmu }\textrm{m}$ thick layers on top of the higher permittivity substrate to achieve an anti-reflection coating (ARC).

Regression

For the regression with RCWA, Fig. 10 displays the achieved spectral characteristics. The obtained transmittance is above 0.9975 for the entire range between 400 and 700 nm, a peak transmittance of 0.9990 is achieved, the average is 0.9985. Thus, in summary the network is able to design a highly effective solution for these spectral characteristics. However, once again we can observe—as seen in Fig. 10b—that the chosen permittivity does not resemble real material properties. In this case, they are particularly erratic in the real part of the permittivity and especially in the lower wavelengths.

Classification

With the classification approach, the material design is still successful as can be seen in Fig. 11. On average, the transmittance is 0.9011 with all values being over 0.7822 and a peak at 0.9756, thus inferior to the regression. As discernible in Fig. 11b, the chosen materials for the layers in this approach are similar to existing ones such as Silicon Nitride and $\text{TiO}_{2}$ and the permittivities are not erratic as for the regression. Hence, with the limited materials currently included in NIDN, this is a better trade-off of obtaining an ARC with realistic materials.

Comparison with previous works and ablation

To study the capabilities of NIDN in detail, we performed two comparisons: First, with the already feasible autograd approach in GRCWA¹⁷ as demonstrated in online provided examples (Example 3) using the Python optimization module NLopt. Secondly, to study the role of having a continuous representation, we compared with directly encoding the permittivities in a voxelized grid. Detailed results are shown in Fig. 12. The test case is the optimization of a five-layer uniform material to maximize reflection. With GRCWA and NLopt, the optimization converged (within changes of $10^{-16}$) after 363 model evaluations to an optimum L1 error of 0.07045. The voxelized grid approach with NIDN achieved the best loss after 2 000 model evaluations at 0.04994. With the neural network approach with NIDN we obtained a loss of 0.06745 after 2 000 model evaluations.

As can be seen in Fig. 12a convergence was most stable with the voxelized grid and quickest with NLopt. From a convergence standpoint, the voxelized grid approach outperforms the other two. In Fig. 12b–d the permittivities produced by each approach are shown. Notably, in the voxelized grid and GRCWA approach in Fig. 12c,d, respectively, produced more erratic results. This is expected, as each permittivity ( in terms of location, layer and wavelength) is fully independent and this is a fully discrete representation. Thus, the neural network approach achieved similar performance while producing a continuous and differentiable representation.

Discussion

Overall, the results presented in this work clearly highlight the capabilities and potential of NIDN. We have shown it is able to correctly reproduce the material properties of several synthetic examples, which demonstrates that the inversion and training is successful for both, RCWA and FDTD. Similarly, the creation of both a 1550 nm filter and a perfect ARC are achieved. However, especially the regression approach is susceptible to finding solutions which are not plausible for real materials. This is a current limitation of the regression approach as the designed permittivities do not resemble existing materials. The main reason for this is the lack of constraints on the chosen permittivity. A future improvement here may be to have the network predict the parameters of a function that produces realistic permittivities. For the moment, the constraints on physical plausibility in NIDN are implicitly given through the numerical simulations for determining the spectral characteristics. An additional component that checks if the predicted permittivity matches a realistic permittivity may be needed. The classification approach is able to partially remedy this, but for the real-world examples of filter design and ARC it is less effective at matching desired spectral characteristics. This is likely due to the limited number (12) of materials currently available in NIDN. The number can be increased, however, this also creates a harder classification problem which may in turn require finding an optimal trade-off of the number and diversity of available materials vs. the design limitations induced by fewer available materials. In the future, an alternative may be approaches from the topology optimization community where different projections are used to clamp materials^36,37 or utilizing a gumbel-softmax³⁸.

One aspect where NIDN is more versatile than previous approaches²⁹ is that no imposition of shape or form is required or performed. In fact, even the discretization into a grid is not a constraint as it is only required by the solver (RCWA or FDTD); the neural network output is spatially continuous. One thing that has been observed is, however, that the simpler case of having uniform layers benefits training convergence when utilizing RCWA. The reason for this is likely the need to iteratively solve an eigenvalue problem if non-uniform layers are present which may deteriorate gradients. This may be remedied, e.g., by returning to a stochastic setting where gradients are averaged over several tested materials to make the training more robust. This does come at an increased computational cost but may be parallelized.

In summary, NIDN demonstrates the flexibility of physics-based deep learning methods. The range of applications for the software goes well beyond the shown test cases of filters and ARC as it conceivably can be used for a broad range applications such as radiative materials^18,19 or solar sails¹⁷. With RCWA and FDTD, two complementary solvers are integrated, RCWA being fast and well-suited for higher wavelength and FDTD being more potent and capable to investigate nanostructures in the visible and infrared spectrum. In the future, optimization and parallelization of the solvers will be critical to improve performance as especially training with FDTD for complex materials becomes prohibitively computationally expensive. Further, FDTD can also be used for the design of non-periodic structures which is a feature not yet implemented in NIDN.

Methods

Rigorous coupled-wave analysis (RCWA)

The RCWA method is a well-established tool for modeling scattering of stacked structures²³. In the NIDN implementation, single material as well as patterned layers can be stacked. A detailed overview of the capabilities can be found in the documentation of the Python module GRCWA¹⁷ which NIDN’s implementation is derived from. Note that the numerical stability of RCWA is dependent on the ratio of the size of the material grid to the investigated wavelength, i.e., for ratios above one RCWA may produce non-physical results.

The implementation in NIDN supports specifying a top or bottom layer in addition to the material; by default a vacuum layer is placed at both. The grid dimensions, number of layers and thicknesses of all layers can be specified manually. The default truncation order for simulations is 11 to keep computational costs low but it is configurable. The implementation employs a relaxation by adding a fictitious material absorption loss^17,39. For now, we only investigated inversion with perpendicular light incidence angles and polarized light, however, both are also configurable in the RCWA implementation.

Finite-difference time-domain (FDTD)

FDTD is a simulation method that models the electric and magnetic field using a regular grid as a spatial discretization with a finite difference method⁴⁰. The electric and magnetic fields at each grid point are calculated with numerical approximations of Maxwell’s equations; more specifically the electric field at each point is calculated using the curl of the magnetic field surrounding the point, and the magnetic field is calculated using the curl of the electric field surrounding the point.

The grid size is set to one tenth of the smallest wavelength used in the simulations, based on recommendations from the fdtd module’s documentation. The time step is set according to the Courant-Friedrichs-Lewy Condition⁴¹, to ensure numerical stability:

$$\begin{aligned} \Delta t = \frac{S_c\cdot \Delta x}{u} \end{aligned}$$

(1)

where $\Delta t$ is the time step, u is the wave velocity, $\Delta x$ is the spatial resolution and $S_c = \sqrt{dim} = \sqrt{2}$ is the Courant number.

In this work, the FDTD simulations were setup with periodic boundaries in the directions orthogonal to the wave-propagation to simulate an infinite plane. In the direction of propagation, Perfectly Matching Layers (PML) were used. PMLs aim to absorb as much of the electromagnetic-wave as possible to avoid reflections at the boundaries of the grid. In the propagation direction, the grid is built up by a PML layer of 1.5 µm, a vacuum layer of 1.0 µm, the material(s) chosen with the specified thicknesses, a vacuum layer of 1.0 µm and finally another PML layer of 1.5 µm. The source is a line across the width of the grid, radiating a continuous wave, placed at the interface between the first PML-layer and the free-space layer.

We measure the electromagnetic wave at placed detectors in the FDTD simulations such that the signal is independent of the direction of the incoming wave. Due to this, a correction of the reflected signal is necessary as the raw signal contains both the forward-going wave before reflection and the reflected wave. For the correction, we use an identical simulation setup without a material, the so-called free-space case, and subtract the measured signal from this simulation from the one with a material.

The transmission and reflection coefficients are calculated by comparing the energy of the wave after the transmission and a free-space case where there is no loss from a material such that

$$\begin{aligned} T = \frac{P_i}{P_t} = \frac{E_{0i}^2}{E_{0t}^2} \end{aligned}$$

(2)

where the subscripts i,t and 0 refer to incident wave, transmitted wave and amplitude of the wave, respectively. Since the mean squared value of an integer number of periods of a sine wave equals $\frac{E_0^2}{2}$, the mean squared value is used to calculate the transmission coefficients. The same is the case for the corrected reflection signal. No dispersion model has been implemented as frequencies are computed independently. By running the simulation for all desired wavelengths, the coefficients from each simulation were accumulated to obtain a transmission and reflection spectrum. The absorption spectrum is calculated by $A = 1-T-R$.

The number of time steps were chosen such that both the reflected signal and transmitted signal have at least two peaks, such that the mean square can be calculated over at least one period. The loss from a material in FDTD comes from the conductivity parameter. This is included in the AbsorbingObject from the fdtd module, which is the only object/material type currently implemented in NIDN. The conductivity $\sigma$ is set to $\sigma = \epsilon '' \cdot \epsilon _0\cdot \omega$.

Model inversion approach

The inversion process in NIDN is inspired by recent research related to physics-based deep learning²⁷ and in directly optimizing physical properties using differentiable simulations⁴². The main advantage of these approaches is that gradients in training neural networks are propagated directly through a numerical simulation (also referred to as forward model in the context of inverse problems). Thus, there is no need to build a dataset of any kind. In particular, the process in NIDN consists of the following steps which are also depicted in Fig. 1:

1.
Specify target spectral characteristics (reflectance, transmittance, absorptance) and material properties (size, allowed permittivity, simulation parameters)
2.
Randomly initialize a neural network that prompted with a position returns the material’s permittivity at that position
3.
Perform an evaluation of the numerical model for all target frequencies, i.e., compute the spectral characteristics for the materials as described by the network
4.
Compute the loss, i.e., the difference in the target spectral characteristics and those obtained through the network’s material
5.
Backpropagate the gradients and update the network parameters to iteratively minimize the difference between target and obtained spectral characteristics
6.
Until results are satisfactory, return to Step 3

After this process, the network is directly encoding the designed material by providing a permittivity for each position in the material. Herein, the conversion from a position to a permittivity is a particularly critical aspect to allow successful inverse design which will now be described in more detail.

Direct permittivity inference

NIDN implements two different ways to determine the relative permittivity $\epsilon _r^P$ at a point $P = (x,y,z,f)$ where x and y are the point’s location in a layer, z is the index of the layer it is on and f is the frequency at which we are evaluating. The first way is referred to as regression because the network herein directly infers a permittivity mapping $P \mapsto \epsilon _r^P$. The values are then clipped at some specified range to restrict it to physically plausible permittivity values. However, as can be seen in the results section, the frequency-dependent permittivity is still fairly unconstrained, allowing for materials that are not plausible as they, e.g., do not necessarily satisfy the Kramers-Kronig relationship⁴³. But, the advantage of this approach is the direct optimization of the material permittivity to obtain a solution that—at least with regard to the Maxwell solver—is plausible and can give a first idea of the hypothetical feasibility of some spectral characteristic assuming few constraints on permittivity. Permittivity values close to 0 are clipped to $\pm 0.01$ to avoid singularities in the Maxwell solvers.

Material classification

In order to obtain materials of higher physical fidelity, NIDN implements a second way of encoding material permittivity in the neural network. Naively, one may assume that it would be easiest to have the network perform a classification by assigning one specific material (and hence its permittivity) to each grid cell. Unfortunately, this is infeasible as the necessary argmax operation in a classification (to select the most probable material) is not differentiable and thus would break the gradient flow. As an alternative, NIDN features a collection of N—at time of writing twelve—materials from which we determine the relative permittivity $\epsilon _r$ at a point P through a linear combination such that

$$\begin{aligned} \epsilon _r^P = \sum _{i=1}^N y^P \epsilon _r^i, \end{aligned}$$

(3)

where $y^P \in [0,1]^N$ is the neural network output for point P and its i-th entry describes the probability of the i-th material at the point P and $\epsilon _r^i$ is the relative permittivity of the i-th material in NIDN, respectively. Thus, the produced permittivity is strongly resemblant of real experimental permittivity as can be seen in the results for the classification approach. Furthermore, to push the network towards picking one material without breaking gradient flow, the output $y^P$ is passed through a softmax function $\sigma (y)$ with $\beta =16$, i.e.

$$\begin{aligned} \sigma (y)_i = \frac{e^{\beta y_i}}{\sum _{j=1}^N e^{\beta y_j}} . \end{aligned}$$

(4)

Neural network training

The neural network training in NIDN is inspired by several previous works in the field of differentiable models^42,44,45,46. Thus, dense neural networks, namely the NeRF⁴⁵ and Siren⁴⁶ architecture are employed. Their parameters are optimized based on the gradient information obtained from running the simulation with the material encoded by the network. Thus, as illustrated in Fig. 1, the gradient of the model parameters, i.e. the permittivities, is computed by backpropagation from the loss through the computational graph formulated by the forward model (be it RCWA or FDTD) to the model. NIDN allows a variety of losses, such as absolute mean error or L2 error. For the classification, an additional regularization loss is used penalizing probabilities for their distance from the values 0 and 1. Training uses the Adam optimizer, a default learning rate of $8\cdot 10^{-5}$ and a learning rate scheduler to reduce the learning rate upon encountering a plateauing loss.

Software structure

In general, NIDN enables users with two potential use cases: The first is to run forward model simulations using either RCWA or FDTD to obtain the spectral characteristics of a material. The second is the inverse design using neural network models to describe a material structure that produces certain spectral characteristics. NIDN’s software repository has Jupyter notebooks describing both processes in detail for RCWA and FDTD to enable users to run their own experiments. The project is open to contributions and under a permissive GPL-3 open-source license to allow customization.

NIDN is designed with modularity in mind, allowing for arbitrary neural network models and Maxwell solvers to be plugged in. It uses a continuous integration methodology with automated unit tests to ensure the correctness of the implementation. Several plotting utilities, such as those used for the figures in this work are available. It is the authors’ hope that the described steps and features will allow others to build and improve upon it.

Data availability

All code and results presented in this paper are available open-source and open-access in the associated GitHub repository under https://github.com/esa/NIDN. Jupyter notebooks in the repository contain all data shown in this article.

References

Wei, J. et al. Machine learning in materials science. InfoMat 1, 338–358 (2019).
Article CAS Google Scholar
Liu, Y., Zhao, T., Ju, W. & Shi, S. Materials discovery and design using machine learning. J. Materiomics 3, 159–177 (2017).
Article ADS Google Scholar
Jiang, J., Chen, M. & Fan, J. A. Deep neural networks for the evaluation and design of photonic devices. Nat. Rev. Mater. 6, 679–700 (2021).
Article ADS Google Scholar
Liu, Z., Zhu, D., Rodrigues, S. P., Lee, K.-T. & Cai, W. Generative model for the inverse design of metasurfaces. Nano Lett. 18, 6570–6576. https://doi.org/10.1021/acs.nanolett.8b03171 (2018).
Article ADS CAS Google Scholar
Ma, W., Cheng, F., Xu, Y., Wen, Q. & Liu, Y. Probabilistic representation and inverse design of metamaterials based on a deep generative model with semi-supervised learning strategy. Adv. Mater. 31, 1901111. https://doi.org/10.1002/adma.201901111 (2019).
Article CAS Google Scholar
Tahersima, M. H. et al. Deep neural network inverse design of integrated photonic power splitters. Sci. Rep.https://doi.org/10.1038/s41598-018-37952-2 (2019).
Article Google Scholar
Piggott, A. Y., Petykiewicz, J., Su, L. & Vučković, J. Fabrication-constrained nanophotonic inverse design. Sci. Rep.https://doi.org/10.1038/s41598-017-01939-2 (2017).
Article Google Scholar
Rodríguez, J. A. et al. Inverse design of plasma metamaterial devices for optical computing. Phys. Rev. Appl.https://doi.org/10.1103/physrevapplied.16.014023 (2021).
Article Google Scholar
Sajedian, I., Badloe, T. & Rho, J. Finding the best design parameters for optical nanostructures using reinforcement learning. arXiv preprintarXiv:1810.10964 (2018).
Sajedian, I., Badloe, T. & Rho, J. Optimisation of colour generation from dielectric nanostructures using reinforcement learning. Opt. Express 27, 5874–5883 (2019).
Article ADS CAS Google Scholar
So, S., Mun, J. & Rho, J. Simultaneous inverse design of materials and structures via deep learning: Demonstration of dipole resonance engineering using core-shell nanoparticles. ACS Appl. Mater. Interfaces 11, 24264–24268 (2019).
Article CAS Google Scholar
So, S., Yang, Y., Lee, T. & Rho, J. On-demand design of spectrally sensitive multiband absorbers using an artificial neural network. Photon. Res. 9, B153–B158 (2021).
Article Google Scholar
So, S., Lee, D., Badloe, T. & Rho, J. Inverse design of ultra-narrowband selective thermal emitters designed by artificial neural networks. Opt. Mater. Express 11, 1863–1873 (2021).
Article ADS Google Scholar
Molesky, S. et al. Inverse design in nanophotonics. Nat. Photon. 12, 659–670. https://doi.org/10.1038/s41566-018-0246-9 (2018).
Article ADS CAS Google Scholar
Mori, O. et al. First solar power sail demonstration by IKAROS. Trans. Jpn. Soc. Aeronaut. Space Sci. Aerosp. Technol. Jpn. 8, To_4_25-To_4_31. https://doi.org/10.2322/tastj.8.to_4_25 (2010).
Article Google Scholar
Ullery, D. C. et al. Strong solar radiation forces from anomalously reflecting metasurfaces for solar sail attitude control. Sci. Rep.https://doi.org/10.1038/s41598-018-28133-2 (2018).
Article Google Scholar
Jin, W., Li, W., Orenstein, M. & Fan, S. Inverse design of lightweight broadband reflector for relativistic lightsail propulsion. ACS Photon. 7, 2350–2355. https://doi.org/10.1021/acsphotonics.0c00768 (2020).
Article CAS Google Scholar
Sun, K. et al. Metasurface optical solar reflectors using AZO transparent conducting oxides for radiative cooling of spacecraft. ACS Photon. 5, 495–501. https://doi.org/10.1021/acsphotonics.7b00991 (2017).
Article CAS Google Scholar
Sun, K. et al. VO$_{2}$ thermochromic metamaterial-based smart optical solar reflector. ACS Photon. 5, 2280–2286. https://doi.org/10.1021/acsphotonics.8b00119 (2018).
Article CAS Google Scholar
Hossain, M. M., Jia, B. & Gu, M. A metamaterial emitter for highly efficient radiative cooling. Adv. Opt. Mater. 3, 1047–1051. https://doi.org/10.1002/adom.201500119 (2015).
Article CAS Google Scholar
Jiang, J. et al. Metanet: A new paradigm for data sharing in photonics research. Opt. Express 28, 13670–13681 (2020).
Article ADS CAS Google Scholar
Sullivan, D. M. Electromagnetic Simulation Using the FDTD Method (Wiley, 2013).
Book Google Scholar
Moharam, M. & Gaylord, T. Rigorous coupled-wave analysis of planar-grating diffraction. JOSA 71, 811–818 (1981).
Article ADS Google Scholar
Kim, C. & Lee, B. Torcwa: Gpu-accelerated Fourier modal method and gradient-based optimization for metasurface design. Comput. Phys. Commun. 282, 108552 (2023).
Article CAS Google Scholar
Han, K. & Chang, C.-H. Numerical modeling of sub-wavelength anti-reflective structures for solar module applications. Nanomaterials 4, 87–128 (2014).
Article Google Scholar
Jiang, X. et al. Implementation of radiative cooling with an inverse-designed selective emitter. Opt. Commun. 497, 127209. https://doi.org/10.1016/j.optcom.2021.127209 (2021).
Article CAS Google Scholar
Thuerey, N. et al. Physics-based deep learning. arXiv preprintarXiv:2109.05237 (2021).
Ma, W. & Liu, Y. A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures. Sci. China Phys. Mech. Astron.https://doi.org/10.1007/s11433-020-1575-2 (2020).
Article Google Scholar
Nadell, C. C., Huang, B., Malof, J. M. & Padilla, W. J. Deep learning for accelerated all-dielectric metasurface design. Opt. Express 27, 27523. https://doi.org/10.1364/oe.27.027523 (2019).
Article ADS CAS Google Scholar
Colburn, S. & Majumdar, A. Inverse design and flexible parameterization of meta-optics using algorithmic differentiation. Commun. Phys.https://doi.org/10.1038/s42005-021-00568-6 (2021).
Article Google Scholar
Chandrasekhar, A. & Suresh, K. Tounn: Topology optimization using neural networks. Struct. Multidiscip. Optim. 63, 1135–1149 (2021).
Article MathSciNet Google Scholar
Shin, S., Shin, D., Kim, M., Ryu, H. & Kang, N. Machine learning-based topology optimization: A review. In The 2021 world congress on advances in structural engineering and mechanics (ASEM21) (IASEM, KAIST, KTA, SNU DAAE, 2021).
Woldseth, R. V., Aage, N., Bærentzen, J. A. & Sigmund, O. On the use of artificial neural networks in topology optimisation. Struct. Multidiscip. Optim. 65, 1–36 (2022).
Article Google Scholar
Wojcieszak, D., Kaczmarek, D. & Domaradzki, J. Analysis of surface properties of semiconducting (ti, pd, eu)ox thin films. Opto-Electron. Rev. 24, 15–19. https://doi.org/10.1515/oere-2016-0003 (2016).
Article ADS Google Scholar
Kim, K.-H. & Park, Q.-H. Perfect anti-reflection from first principles. Sci. Rep. 3, 1–5 (2013).
Google Scholar
Gaynor, A. T., Meisel, N. A., Williams, C. B. & Guest, J. K. Topology optimization for additive manufacturing: Considering maximum overhang constraint. In 15th AIAA/ISSMO multidisciplinary analysis and optimization conference, 2036 (2014).
Kawamoto, A. et al. Heaviside projection based topology optimization by a pde-filtered scalar function. Struct. Multidiscip. Optim. 44, 19–24 (2011).
Article MATH Google Scholar
Jang, E., Gu, S. & Poole, B. Categorical reparameterization with gumbel-softmax. arXiv preprintarXiv:1611.01144 (2016).
Liang, X. & Johnson, S. G. Formulation for scalable optimization of microcavities via the frequency-averaged local density of states. Opt. Express 21, 30812–30841 (2013).
Article ADS Google Scholar
Schneider, J. B. Understanding the finite-difference time-domain method (2010).
De Moura, C. A. & Kubrusly, C. S. The Courant–Friedrichs–Lewy (cfl) condition. AMC 10 (2013).
Izzo, D. & Gómez, P. Geodesy of irregular small bodies via neural density fields: Geodesynets. arXiv preprintarXiv:2105.13031 (2021).
Lucarini, V., Saarinen, J. J., Peiponen, K.-E. & Vartiainen, E. M. Kramers–Kronig relations in optical materials research Vol. 110 (Springer, 2005).
Google Scholar
von Looz, M., Gomez, P. & Izzo, D. Study of the asteroid bennu using geodesyanns and osiris-rex data. arXiv preprintarXiv:2109.14427 (2021).
Mildenhall, B. et al. Nerf: Representing scenes as neural radiance fields for view synthesis. In European conference on computer vision, 405–421 (Springer, 2020).
Sitzmann, V., Martel, J., Bergman, A., Lindell, D. & Wetzstein, G. Implicit neural representations with periodic activation functions. Adv. Neural Inf. Process. Syst. 33, 7462–7473 (2020).
Google Scholar

Download references

Author information

Authors and Affiliations

European Space Agency, Advanced Concepts Team, 2201AZ, Noordwijk, The Netherlands
Pablo Gómez, Håvard Hem Toftevaag, Torbjørn Bogen-Storø & Derek Aranguren van Egmond
Instituto de Micro y Nanotecnología, IMN-CNM, CSIC (CEI UAM+CSIC), Isaac Newton, 8, 28760, Tres Cantos, Madrid, Spain
José M. Llorens

Authors

Pablo Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Håvard Hem Toftevaag
View author publications
You can also search for this author in PubMed Google Scholar
Torbjørn Bogen-Storø
View author publications
You can also search for this author in PubMed Google Scholar
Derek Aranguren van Egmond
View author publications
You can also search for this author in PubMed Google Scholar
José M. Llorens
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.G. and H.H.T. conceived of the presented idea, P.G., H.H.T and T.B.S. developed the computational framework and performed computations, P.G. and T.B.S. conducted the simulations, H.H.T, T.B.S., D.A.E. and J.M.L. contributed to the interpretation of results. J.M.L. provided supporting software and calculations. P.G. took the lead in writing the manuscript. All authors provided critical feedback and helped shape the research, analysis and manuscript.

Corresponding author

Correspondence to Pablo Gómez.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gómez, P., Toftevaag, H.H., Bogen-Storø, T. et al. Neural Inverse Design of Nanostructures (NIDN). Sci Rep 12, 22160 (2022). https://doi.org/10.1038/s41598-022-26312-w

Download citation

Received: 05 August 2022
Accepted: 13 December 2022
Published: 22 December 2022
DOI: https://doi.org/10.1038/s41598-022-26312-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A comprehensive deep learning method for empirical spectral prediction and its quantitative validation of nano-structured dimers

Deep learning approach based on dimensionality reduction for designing electromagnetic nanostructures

A deep learning method for empirical spectral prediction and inverse design of all-optical nonlinear plasmonic ring resonator switches

Introduction

Results

Material design setup

Validation and numerical stability

Material and structure inference

Uniform \(\text{TiO}_{2}\) layer inversion

Three-layer uniform material

Two-layer patterned material

Filter design

Regression

Perfect anti-reflection coating

Regression

Classification

Comparison with previous works and ablation

Discussion

Methods

Rigorous coupled-wave analysis (RCWA)

Finite-difference time-domain (FDTD)

Model inversion approach

Direct permittivity inference

Material classification

Neural network training

Software structure

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links