Introduction

In recent decades, many breakthroughs in optics have led to unprecedented imaging capabilities beyond the diffraction limit, with applications in biology and nanotechnology. In this context, nanophotonics has revolutionized the field of optics in recent years by enabling the manipulation of light–matter interactions with subwavelength structures1,2,3. However, despite the many advances in this field, its impact and penetration in our daily life have been hindered by a convoluted and iterative process, cycling through modeling, nanofabrication and nanocharacterization. The fundamental reason is that predicting the optical response is very time consuming and requires solving Maxwell's equations with dedicated numerical packages4,5 (http://www.lumerical.com/tcad-products/fdtd/). Next, and more significantly, the inverse problem, i.e., designing a nanostructure with an on-demand optical response, is currently a prohibitive task even with the most advanced numerical tools, owing to the high nonlinearity of the problem6,7. In parallel, for many years, computer science has been harnessed to address challenging tasks in nanophotonic imaging, design and characterization. In general, these approaches either targeted enhancing/resolving imaging and characterization beyond the diffraction limit (super-resolution techniques such as PALM, STORM and others https://www.nobelprize.org/nobel_prizes/chemistry/laureates/2014/advanced-chemistryprize2014.pdf 8,9,10) or assisted with the design process of nanophotonic devices11,12,13,14,15,16,17. However, to date, very few computational techniques are capable of addressing both aspects in an integrated manner for nanoplasmonics. In this context, Deep Learning (DL) has emerged in recent years as a very powerful computational method that has achieved state-of-the-art results in various tasks, including computer vision18, speech recognition19, natural language processing20, face recognition and other applications21. Inspired by the layered and hierarchical deep architecture of the human brain, DL uses multiple layers of nonlinear transformations to model high-level abstractions in data. DL has also been successfully employed in research areas beyond computer science, such as particle physics22, ultracold science23, condensed matter24, chemical physics25 and conventional microscopy26,27.

Here we present an integrated DL approach and show how deep neural networks (DNNs) can streamline the design process and provide a unique, robust, time-efficient and accurate characterization capability for complex nanostructures based on their far-field optical responses. The complexity of the DNN can address the high level of nonlinearity of the inference tasks by creating a model that holds bidirectional knowledge. While it is common practice in DL to separate different problems and to train multiple separate networks for each problem, we show that our approach of training a bidirectional network that goes from the optical response spectrum to the nanoparticle geometry and back is significantly more effective for both the design and characterization tasks. Furthermore, we show that this DL approach not only can predict the spectral response of nanostructures with high accuracy but also can address the inverse problem and provide a single nanostructure’s design, geometry and dimensions, for a targeted optical response in both polarizations.

This DL approach provides a method for direct on-demand engineering of plasmonic structures and metasurfaces for applications in sensing, targeted therapy and more. Moreover, the predictive capability of the DL model also holds great promise for multivariate characterization of nanostructures beyond the diffraction limit.

Results

To demonstrate the paradigm shift that is enabled by our Deep Learning approach, we consider the interaction of light with subwavelength structures such as plasmonic nanostructures, metamaterials and composite layered metallic nanostructures embedded in dielectrics, which allow control of the properties of the outgoing light28. Predicting the far-field optical response for a defined nanostructure geometry and composition involves solving the full set of Maxwell equations at each location in space and for each wavelength. However, whereas the far-field spectrum is directly connected to the nanostructure geometry, the solvability of the ‘inverse’ problem, i.e., inferring the nanoscale geometry from a measured or desired far-field spectrum, depends to a large extent on the complexity of the system of interest (Fig. 1a).

Fig. 1: Comparison of the different computational approaches to plasmonic nanostructure design

a To date, the approaches enabled by the available computational tools are efficient only for ‘direct’ modeling, i.e., predicting the optical response in both polarizations of a nanostructure based on its geometry, constituents and surrounding media. However, the ‘inverse’ problem, where the tool outputs a nanostructure for an input desired optical response, is much more relevant from a designer’s point of view and is currently inefficient and time consuming. The more complex the desired optical response, the more unattainable the solution of the inverse problem. b, c Optimization algorithms vs. DL in nanophotonics and the ‘inverse’ problem. b The Genetic Algorithm applied to the prediction of nanophotonic geometries and their responses. The typical Genetic Algorithm contains the following five phases: (A) Create the genetic diversity that serves as an initial population of various nanostructures. (B) Calculate the spectrum for each configuration, which is a time-consuming task for complex nanostructures. (C) Evaluate the fitness. (D) Eliminate the least fit nanostructures (the selection phase). The population is examined to find a set number of top scores, and any solution with a poorer fitness score is eliminated from the population. (E) Clone and mutate the fittest structures (the reproduction phase). The frequencies and types (deletion, addition, variation) of the mutations can vary. The output is the best-fit sample found throughout the generations. GA is a classic example of an optimization algorithm, where a multidimensional space is searched anew for each and every design task. c Building a Deep Learning model typically involves the following four phases: (A) Create a data set with diverse experiments. (B) Calculate the spectrum for each configuration. This step is a time-consuming task for complex nanostructures. (C) Learn an optimized model using the data set. (D) Query the trained model with new experiments. To predict beyond the training data, the network must learn the inner set of rules of the nanostructure optical response phenomena. This learning can therefore be considered an optimization that is performed once for all future cases. Once the inner set of rules is learned, the query phase is extremely fast and accurate
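For reference, the genetic-algorithm workflow of Fig. 1b can be condensed into a short sketch. This is a minimal, illustrative implementation under assumed parameters: the geometry encoding and the toy spectrum model below are placeholders rather than the electromagnetic solver used in this work, and in practice phase (B) requires a full Maxwell solve per individual, which is what makes the search costly.

```python
import numpy as np

# Minimal sketch of the genetic-algorithm phases (A)-(E) of Fig. 1b.
# The geometry encoding and `solve_spectrum` are hypothetical stand-ins.
rng = np.random.default_rng(0)
WAVELENGTHS = np.linspace(600, 1600, 200)            # nm, illustrative grid

def random_geometry():
    return rng.uniform(30, 200, size=5)               # five edge lengths, nm

def solve_spectrum(geometry):
    # toy "solver": one Lorentzian transmission dip per edge (not real physics)
    t = np.ones_like(WAVELENGTHS)
    for length in geometry:
        resonance = 500 + 6 * length                  # toy length-to-resonance map
        t -= 0.3 / (1 + ((WAVELENGTHS - resonance) / 40) ** 2)
    return np.clip(t, 0, 1)

def fitness(spectrum, target):
    return -np.mean((spectrum - target) ** 2)         # (C) higher is better

def mutate(geometry, scale=10.0):
    return np.clip(geometry + rng.normal(0, scale, geometry.shape), 30, 200)

def genetic_search(target, pop=100, generations=50, survivors=10):
    population = [random_geometry() for _ in range(pop)]                   # (A)
    for _ in range(generations):
        scores = [fitness(solve_spectrum(g), target) for g in population]  # (B)+(C)
        ranked = [g for _, g in sorted(zip(scores, population),
                                       key=lambda p: p[0], reverse=True)]
        elite = ranked[:survivors]                                         # (D)
        population = elite + [mutate(elite[rng.integers(survivors)])       # (E)
                              for _ in range(pop - survivors)]
    return max(population, key=lambda g: fitness(solve_spectrum(g), target))
```

The key contrast with the DL workflow of Fig. 1c is that this entire loop, including the costly spectrum evaluations, must be rerun for every new design target, whereas a trained network is queried once per target.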

For a simple nanostructure, which exhibits a single resonance peak in each polarization, one can solve for the response semi-analytically or in an intuitive manner29; however, for a general spectral response associated with more complex geometries, no analytical solution is known, and time-consuming numerical methods such as the Finite Element Method (FEM) or the Finite Difference Time Domain (FDTD) method must be used. Additional optimization methods such as shallow neural networks, evolutionary algorithms and linear regression11,13,30,31 have had some success with the inverse problem. However, current techniques are still limited in accuracy and practical feasibility and fall short in modeling nonlinear problems in which the underlying physical processes are highly complex. To date, none of these approaches can efficiently address the inverse problem, and it still takes many cycles of trial and error in modeling and characterization to predict or design a nanostructure for a desired or measured far-field optical spectral response31.

We emphasize that the Deep Learning approach presented here is fundamentally different from evolutionary approaches since, for every single design task, the evolutionary approaches search the parameter space over dozens (sometimes hundreds) of generations, with each generation encompassing dozens to hundreds of individuals (Fig. 1b). For this reason, the individuals should be simple enough for their electromagnetic response to be solved analytically; otherwise, the optimization task takes a prohibitive amount of time, which limits the usefulness of such an approach. Our approach is radically different (Fig. 1c). We train our DNN on a set encompassing structures that are not trivial and for which responses must be calculated using time-consuming numerical approaches. However, once the data set is created and learned, this effort is nonrecurring, and each design task requires only a query of the DNN that takes no more than a few milliseconds.

To illustrate our approach, we design a novel deep network built from fully connected layers. We introduce a bidirectional deep neural network architecture that is composed of two networks (Fig. 2a), where the first is a Geometry-predicting-network (GPN) that predicts a geometry based on the spectra (the inverse path) and the second is a Spectrum-predicting-network (SPN) that predicts the spectra based on the nanoparticle geometry (the direct path). The geometry predicted by the GPN is fed into the SPN, which, in turn, predicts the spectrum. We thus solve the harder inverse problem first, i.e., predicting the geometry based on the two spectra for both polarizations, and then, using the predicted geometry, we match the recovered spectrum with the original one (see Supplemental Document for further information). It is worthwhile to note that the training of such a bidirectional network requires a dedicated learning procedure, since the input to the SPN is a predicted geometry rather than the actual geometry (see Supplemental Document for more information). Furthermore, we also observe a significant gain from training one network on the entire training set rather than training multiple separate networks. It is crucial to stress that the learning phase of the DNN is a nonrecurring effort, which means that once the data set is learned, the query phase is quasi instantaneous. This approach is a clear departure from evolutionary methods, in which, for every query, the whole parameter space is searched for optimization.
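A minimal PyTorch sketch of this bidirectional arrangement is given below. The layer widths, input sizes and geometry encoding are illustrative assumptions, not the architecture reported in the Supplemental Document; the point is only to show the GPN output being fed directly into the SPN.

```python
import torch
import torch.nn as nn

# Illustrative sketch of the bidirectional arrangement: the GPN maps the two
# polarization spectra (plus material properties) to a geometry vector, and
# the SPN maps that *predicted* geometry back to the spectra. All sizes below
# are placeholders.
N_SPECTRUM = 200   # points per polarization spectrum (assumed)
N_MATERIAL = 2     # metal/host permittivity descriptors (assumed)
N_GEOMETRY = 8     # edge lengths, angle and presence flags (assumed)

def mlp(sizes):
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        layers += [nn.Linear(n_in, n_out), nn.ReLU()]
    return nn.Sequential(*layers[:-1])   # no activation on the output layer

class BidirectionalNet(nn.Module):
    def __init__(self):
        super().__init__()
        # inverse path (GPN): spectra + material properties -> geometry
        self.gpn = mlp([2 * N_SPECTRUM + N_MATERIAL, 256, 256, 256, N_GEOMETRY])
        # direct path (SPN): predicted geometry + material properties -> spectra
        self.spn = mlp([N_GEOMETRY + N_MATERIAL, 256, 256, 2 * N_SPECTRUM])

    def forward(self, spectra, material):
        geometry = self.gpn(torch.cat([spectra, material], dim=-1))    # inverse
        recovered = self.spn(torch.cat([geometry, material], dim=-1))  # direct
        return geometry, recovered
```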

Fig. 2: Deep-learning network architecture and results.

a The deep networks have a cascaded structure of many layers of nonlinear processing units, where each layer uses the output of the previous layer as input. The training of our bidirectional network consists of two phases. We first train the inverse network to predict the geometry based on the transmission spectrum. In the second phase, we train the direct network on top of the first network. The inverse network receives, as input parameters, two spectra and material properties, and for each experiment, it learns the corresponding geometry, material properties and resonances of the unknown geometry. For more detail, see the Supplemental Document. Once this DNN is trained, the nanostructure's geometry is retrieved based on the measured/desired transmission spectrum by querying the inverse network. b, c Deep Learning retrieval results for two different gold nanostructures that we fabricated. The measured spectrum is depicted as a red (blue) dotted line for the horizontal (vertical) input polarization. The DL-predicted geometry is given by the lengths in the table. The DL spectrum based on the predicted geometry is depicted as full circles. The results of the COMSOL simulations based on the DL-predicted geometry are shown as full lines. For all of the nanostructures, the gold thickness is maintained at 40 nm. Further information on the Deep Learning network and a comparison of simple structures can be found in the Supplemental Document

To train our DNN, we created a large set of synthetic data using COMSOL Multiphysics4. The data contain more than 15,000 experiments, where each experiment is composed of a plasmonic nanostructure with a defined geometry, its metal properties, the host's permittivity and the optical response spectra for both horizontal and vertical polarizations of the incoming field. While we maintain a constant thickness of the nanoparticle, the thickness can of course influence the transmission spectra (blueshift and resonance strength). This variable can be added as a parameter to the learning data set to allow refined predictions. In our proof of concept, we choose a nanostructure geometry represented by a general "H" form that can be easily fabricated using top-down approaches, where each of the outer edges can vary in length and angle or can be omitted (Fig. 2). Such a variable geometry is sufficiently complex to span a wide variety of optical response spectra for both polarizations. We then feed these synthetic optical experiments to the DNN and let it learn the multivariate relationship between the spectra and all of the aforementioned geometric parameters. During this training process, the prediction provided by the DNN on a set of synthetic experiments is compared to the COMSOL solutions, and the network weights are optimized to minimize the discrepancy. A set of similarly created samples, unseen during training, is used to evaluate the network’s performance.
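A hedged sketch of how the two-phase training described here and in Fig. 2a could look is shown below, reusing the BidirectionalNet placeholder above. The optimizer, learning rate and epoch counts are assumptions for illustration, not the settings of the Supplemental Document; the essential point is that the SPN in phase 2 is trained on the GPN's predicted geometries.

```python
import torch
import torch.nn as nn

# Sketch of two-phase training against the COMSOL-generated data set.
# `loader` is assumed to yield (spectra, material, geometry) tensor batches.
def train(model, loader, epochs_inverse=50, epochs_direct=50, lr=1e-3):
    mse = nn.MSELoss()

    # Phase 1: inverse network (spectra -> geometry), compared to COMSOL geometries
    opt = torch.optim.Adam(model.gpn.parameters(), lr=lr)
    for _ in range(epochs_inverse):
        for spectra, material, geometry in loader:
            pred_geometry = model.gpn(torch.cat([spectra, material], dim=-1))
            loss = mse(pred_geometry, geometry)
            opt.zero_grad(); loss.backward(); opt.step()

    # Phase 2: direct network trained on top of the GPN's *predicted* geometry
    opt = torch.optim.Adam(model.spn.parameters(), lr=lr)
    for _ in range(epochs_direct):
        for spectra, material, geometry in loader:
            with torch.no_grad():
                pred_geometry = model.gpn(torch.cat([spectra, material], dim=-1))
            recovered = model.spn(torch.cat([pred_geometry, material], dim=-1))
            loss = mse(recovered, spectra)
            opt.zero_grad(); loss.backward(); opt.step()
    return model
```

A held-out split of the same synthetic experiments then serves as the evaluation set mentioned above.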

We then demonstrate our DNN’s ability to accurately predict the fabricated nanostructures’ parameters beyond simulations by fabricating a set of different geometries, including some that the network has never seen before. These geometries were fabricated with gold on ITO-covered glass (see “Methods” section). We measured the transmission spectra on a home-built reflection-transmission setup (see “Methods” section).

We fed these measured spectra into our trained DNN and obtained excellent agreement between the retrieved dimensions and those actually measured by SEM (Fig. 2). These excellent predictions were obtained once the DNN was trained with an additional training set of 1500 simulated geometries (each geometry considered under the two polarization illuminations), from which the network learned the different geometries’ responses in the presence of the measured dispersion of the indium tin oxide (ITO) layer. We emphasize that our DNN retrieves, from the family of subwavelength H-geometries, the geometrical dimensions and optical properties of a subwavelength structure that reproduce its far-field spectra.

This finding demonstrates, to our knowledge, an impressive capability of multivariate parameter retrieval. We note that this achievement is enabled by the unique bidirectional architecture and the simultaneous learning process shared by the GPN and SPN, which leads to co-adaptation between the networks. We observed that the performance of a GPN and an SPN trained separately is significantly inferior to that of the simultaneously trained bidirectional network.

The bidirectionality, whereby the output of the inverse network serves as input to the direct network and is used to predict the two spectra of the predicted geometry, constitutes a unique feature of our network and is therefore investigated further. As an example, we demonstrate the advantage of bidirectionality in the case of the dispersive ITO. This advantage is apparent from the Mean Squared Error (MSE) of the error function for the two approaches, i.e., the bidirectional network versus a composite of the direct (SPN) and inverse (GPN) networks (more information can be found in the Supplemental Document). The bidirectional network exhibits a significantly lower MSE of 0.16 compared with the MSE of 0.37 achieved with the composite approach.
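The comparison can be expressed compactly as the spectrum-reconstruction MSE of the two pipelines on the same held-out experiments; the sketch below is schematic, with placeholder model and tensor names, and does not reproduce the 0.16 and 0.37 values reported above.

```python
import torch
import torch.nn as nn

# Schematic comparison: bidirectional network vs. composite of separately
# trained GPN and SPN, both scored by spectrum-reconstruction MSE.
mse = nn.MSELoss()

def composite_recover(gpn, spn, spectra, material):
    # inverse then direct, with the two networks trained independently
    geometry = gpn(torch.cat([spectra, material], dim=-1))
    return spn(torch.cat([geometry, material], dim=-1))

def spectrum_mse(recover_fn, spectra, material):
    with torch.no_grad():
        return mse(recover_fn(spectra, material), spectra).item()

# spectrum_mse(lambda s, m: bidirectional(s, m)[1], test_spectra, test_material)
# spectrum_mse(lambda s, m: composite_recover(gpn_only, spn_only, s, m),
#              test_spectra, test_material)
```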

To gain insight into the effect of the network’s depth on the prediction performance, we conduct an extensive comparison between different network architectures and show that the network depth has a dramatic effect on the results. We vary the number of fully connected layers in the second part of the inverse network and observe a significant effect on the prediction accuracy, as seen in Fig. 3. We find that the best inverse network architecture for our case is three parallel group layers followed by eight sequential fully connected join layers. Interestingly, we observe a significant gain in accuracy when using eight join layers compared with five or seven layers in the sequential part of the network. The benefit of such a deep network derives directly from the complexity and nonlinearity of the underlying physical process. We observe a significant gain from training one network on the entire training set over the alternative of training multiple separate networks. While this finding can be attributed to the so-called transfer of knowledge32, where knowledge learned from one problem is transferred to another, we are not aware of other instances in which it is so crucial to train one single generalist network instead of applying a divide-and-conquer strategy with multiple specialist networks.
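The architecture sweep can be sketched as follows; branch widths and input sizes are again placeholders, and only the structure (three parallel group branches followed by a variable number of join layers) reflects the description above.

```python
import torch
import torch.nn as nn

# Sketch of the GPN used for the depth/parallelism sweep of Fig. 3: three
# parallel "group" branches (one per polarization spectrum, one for material
# properties) are concatenated and passed through n_join fully connected
# "join" layers. Widths are placeholders.
class ParallelGPN(nn.Module):
    def __init__(self, n_join, width=256, n_spectrum=200, n_material=2, n_geometry=8):
        super().__init__()
        self.branch_h = nn.Sequential(nn.Linear(n_spectrum, width), nn.ReLU())
        self.branch_v = nn.Sequential(nn.Linear(n_spectrum, width), nn.ReLU())
        self.branch_m = nn.Sequential(nn.Linear(n_material, width), nn.ReLU())
        join, n_in = [], 3 * width
        for _ in range(n_join):
            join += [nn.Linear(n_in, width), nn.ReLU()]
            n_in = width
        join.append(nn.Linear(n_in, n_geometry))
        self.join = nn.Sequential(*join)

    def forward(self, spec_h, spec_v, material):
        features = torch.cat([self.branch_h(spec_h),
                              self.branch_v(spec_v),
                              self.branch_m(material)], dim=-1)
        return self.join(features)

# Sweeping the join depth as in Fig. 3 (training loop omitted):
# for depth in (1, 3, 5, 7, 8, 10):
#     model = ParallelGPN(n_join=depth)
#     ...train and record the validation MSE...
```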

Fig. 3: Analysis of the depth and parallelism of the network.

The performance of a GPN trained with different architectures is presented. For each number of parallel layers, we change the number of fully connected layers, namely, the Join-layers depth. In this case, a deeper DNN leads to better accuracy. However, there is a certain depth, in our case eight layers, beyond which the addition of extra layers does not improve the results or leads to the vanishing-gradient phenomenon. The importance of the parallel group-layer part of the GPN is also demonstrated. Adding parallel layers forces the network to represent each of the three data groups separately and only then join them together into one large inner representation. Using parallel group layers improves the results, but the number of layers should be chosen carefully. For comparison, a classical shallow network that has only one hidden layer completely fails at the prediction task, with an MSE of over 0.7

To test the boundaries of the DNN retrieval, we check the performance of our DNN on radically unseen cases, such as no nanostructure being present in the queried spectra (meaning that the spectra are approximately flat, with 100% transmission along both axes). The DNN is presented with the corresponding horizontal and vertical input polarization spectra.

We observe (Fig. 4) that out of all of the infinite possibilities (the returned lengths could, for example, have blown up), the network output matched reality without previously seeing this geometry. This finding shows that the DNN is not simply “interpolating”, as there is nothing even close to the “none” case in the training set; in fact, the DNN performs genuine generalization. Additionally, it is worth mentioning that even the angle parameters and the two lengths output by the network were on the appropriate scale.
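The boundary-case query itself amounts to feeding two flat, 100%-transmission spectra to the trained inverse path; the sketch below reuses the BidirectionalNet placeholder from earlier and is illustrative only (the model here is untrained).

```python
import torch

# Sketch of the "no nanostructure" query of Fig. 4: two flat spectra
# (~100% transmission in both polarizations) are fed to the inverse path.
n_spectrum, n_material = 200, 2                  # placeholder sizes
flat_spectra = torch.ones(1, 2 * n_spectrum)     # horizontal + vertical, flat
material = torch.zeros(1, n_material)            # placeholder material descriptors

model = BidirectionalNet()                       # stands in for the trained network
with torch.no_grad():
    geometry, _ = model(flat_spectra, material)
# For this case the retrieved edge-presence ("Leg") flags should be 0 (Fig. 4c).
```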

Fig. 4: Deep Learning retrieval of “no nanostructure”, a boundary case for which the DNN has not seen anything close during the training phase.

a Horizontal and b perpendicular polarization spectra. c The network retrieved all of the “Leg” flags as 0

Next, we examined the strength of the inverse predictive approach for sensing applications in which plasmonic nanostructures are used to enhance the light–matter interactions with various chemicals and biomolecules. Organic compounds typically exhibit pronounced resonances across the spectrum from the ultraviolet to the mid-infrared. We show that our trained DNN allows us to find the nanostructure configuration that best interacts with a given molecule with multiple target resonances in the two polarizations. More specifically, we wish to design a nanostructure that is targeted at enhancing the interaction with dichloromethane, an important chemical used in industrial processes. This organic compound exhibits one resonance at ~1150 nm and another at approximately 1400–1500 nm (https://commons.wikimedia.org/wiki/File:Dichloromethane_near_IR_spectrum.png#/media/File:Dichloromethane_near_IR_spectrum.png). Our design goal is a nanostructure that will resonate in an aqueous solution at both of these wavelengths for one polarization and with completely different resonances at the orthogonal polarization, at ~820 nm (matching a Ti:Sapphire femtosecond laser excitation for a pump-probe experiment), 1064 nm and 1550 nm (Fig. 5a, b). With the existing design process, this task would require iterating through different designs using standard FEM or FDTD simulation tools, a process that can be extremely time consuming. The DNN’s inverse solution yields, in a few seconds, the parameters shown in Fig. 5c. We also applied this design approach to the asymmetrical phthalocyanine dimer 1a, a synthetic molecule that has more complex polarization characteristics (Fig. 5d, e) and has potential applications due to its charge transfer properties33. The DNN inverse design for this targeted molecule and polarizations results in the configuration shown in Fig. 5f. This finding demonstrates the capability of our DNN to address various targeted resonances in different polarizations and emphasizes that this approach can be extended to other molecules for sensing in biology, chemistry and materials science.
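In practice, such a design query only requires composing the two target transmission spectra and passing them to the trained inverse network. The sketch below is illustrative: the wavelength grid, dip shapes and depths are assumptions, and trained_gpn stands in for the trained inverse model.

```python
import numpy as np

# Sketch of composing the target response for the dichloromethane design query.
wavelengths = np.linspace(600, 1700, 200)        # nm, placeholder grid

def dip(center, width=40.0, depth=0.5):
    # Lorentzian transmission dip centered on a desired resonance (illustrative)
    return depth / (1 + ((wavelengths - center) / width) ** 2)

# Polarization 1: dichloromethane bands (~1150 nm and ~1400-1500 nm)
target_h = np.clip(1.0 - dip(1150) - dip(1450, width=80), 0, 1)
# Polarization 2: ~820 nm (Ti:Sapphire), 1064 nm and 1550 nm
target_v = np.clip(1.0 - dip(820) - dip(1064) - dip(1550), 0, 1)

# geometry = trained_gpn(target_h, target_v, material_properties)   # inverse query
```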

Fig. 5: Prediction of the nanostructure’s geometry for chemical sensing.

a–c DNN-based design of a gold plasmonic structure targeted at the organic molecule dichloromethane, with different spectral responses for the two polarizations (a, b). On one polarization axis, the structure has two resonances: one at 1150 nm and a broad resonance between 1400 and 1600 nm; on the orthogonal polarization axis, it has three resonances, at approximately 820 nm (matching a Ti:Sapphire femtosecond laser excitation for a pump-probe experiment), 1064 nm and 1550 nm. c Configuration and dimensions of the plasmonic structure found by the DNN. d, e The targeted molecule is the asymmetrical phthalocyanine dimer 1a, a synthetic molecule that has more complex polarization characteristics and has potential applications due to its charge transfer properties. f Configuration and dimensions of the plasmonic structure found by the DNN. In both cases, after prediction of the geometry, COMSOL simulations were performed and showed excellent agreement with the desired spectra. This design approach can be extended to other molecules for biology, chemistry or materials science

Discussion

In conclusion, we introduce a novel deep-learning approach for predicting the geometries of nanostructures based solely on their far-field responses. We have designed, trained and tested the proposed scheme, showing very accurate prediction of the geometry of a complex nanostructure. This approach could be extended to other physical and optical parameters of the host materials and compounds. The approach also effectively addresses the otherwise inaccessible inverse problem of designing a geometry for a desired optical response spectrum and significantly speeds up the direct spectrum prediction for such subwavelength structures. This approach allows the on-demand design of the optical responses of nanostructures and metasurfaces for many applications, such as sensing, imaging and more.

Materials and Methods

Preparation

ITO-covered glass substrates (Sigma Aldrich) were spin-coated with PMMA-A4 polymer for one minute at 7,000 RPM. The electron beam (Raith150) was a 10 kV beam (aperture, 6 mm WD), and the dose was deposited in single-pixel lines. Samples were then developed in MIBK/IPA (1:3) for 1 min and rinsed in isopropanol for 20 s. A 40 nm layer of gold was then evaporated onto the sample with an E-Beam Evaporator (VST evaporator). Lift-off was performed with acetone, followed by a final wash in isopropanol.

Sample characterization

Sample dimensions were verified using an electron microscope, and the samples were optically characterized using an OSL2 Broadband Halogen Fiber Optic Illuminator (Thorlabs) light source and an LPNIR050 (Thorlabs) broadband polarizer. The transmitted light was filtered in an imaging plane by an iris such that only light that passed through the sample was collected and then analyzed with an AQ6370D (Yokogawa) spectrometer.

COMSOL simulation

We performed finite element method (FEM) simulations using the 'Electromagnetic Waves, Frequency Domain' module of the commercial COMSOL 4.3b software. For consistency, the edges were rounded using fillets with a constant radius of 15 nm. We considered geometries based on a five-edge 'H' shape, varying the angle of one of the edges, which edges are present and the edges’ lengths.

The nanostructure is simulated in a homogeneous dielectric medium with a chosen real effective permittivity. To prevent reflections from the far planes, perfectly matched layers (PMLs) with a depth equal to the maximum wavelength were placed at both far ends of the homogeneous medium along the propagation direction of the radiating field.

For the data set used to predict the fabricated samples, the gold nanostructure was modeled with a wavelength-dependent permittivity in the homogeneous medium, and the ITO permittivity was taken as wavelength dependent, with an imaginary part that can be neglected in the measured spectral range. It has been shown that changes in the thickness of a titanium adhesion layer above 40% of the nanostructures’ height do not affect the plasmon resonance. Furthermore, for an Au nanoparticle with a diameter of 10 nm above a graphene layer, the LSPR shift saturates when the separation is >20 nm.

A similar saturation behavior is assumed for the ITO layer. In our case, the ITO thickness is ~100 nm, which is approximately 250% of the nanostructure thickness of ~40 nm.