Terahertz pulse shaping using diffractive surfaces

Abstract

Recent advances in deep learning have been providing non-intuitive solutions to various inverse problems in optics. At the intersection of machine learning and optics, diffractive networks merge wave-optics with deep learning to design task-specific elements to all-optically perform various tasks such as object classification and machine vision. Here, we present a diffractive network, which is used to shape an arbitrary broadband pulse into a desired optical waveform, forming a compact and passive pulse engineering system. We demonstrate the synthesis of various different pulses by designing diffractive layers that collectively engineer the temporal waveform of an input terahertz pulse. Our results demonstrate direct pulse shaping in terahertz spectrum, where the amplitude and phase of the input wavelengths are independently controlled through a passive diffractive device, without the need for an external pump. Furthermore, a physical transfer learning approach is presented to illustrate pulse-width tunability by replacing part of an existing network with newly trained diffractive layers, demonstrating its modularity. This learning-based diffractive pulse engineering framework can find broad applications in e.g., communications, ultra-fast imaging and spectroscopy.

Introduction

Inspired by neural interactions in human brain1, artificial neural networks and deep learning have been transformative in many fields, providing solutions to a variety of data processing problems, including for example image recognition2, natural language processing3 and medical image analysis4. Data-driven training of deep neural networks has set the state-of-the-art performance for various applications in e.g., optical microscopy4,5,6,7,8,9,10, holography11,12,13,14,15,16, and sensing17,18,19,20, among others. Beyond these applications, deep learning has also been utilized to solve inverse physical design problems arising in e.g., nanophotonics and plasmonics21,22,23,24. These advances cover a wide range of engineering applications and have motivated the development of new optical computing architectures25,26,27,28,29,30,31 that aim to benefit from the low-latency, power-efficiency and parallelization capabilities of optics in the design of machine learning hardware. For example, Diffractive Deep Neural Networks (D2NN)32 have been introduced as an optical machine learning framework that uses deep learning methods, e.g., stochastic gradient-descent and error-backpropagation, to train a set of diffractive layers for computing a given machine learning task as the light propagates through these layers. Early studies conducted on this framework showed its statistical inference capabilities, achieving >98% numerical blind testing33,34 accuracy for the classification of the images of handwritten digits. Recently, the D2NN framework has also been extended to harness broadband radiation in order to design spatially-controlled wavelength de-multiplexing systems24; however, this former work did not engineer the spectral phase values at different frequencies of the input radiation and therefore did not report any temporal wave control or pulse shaping.

In parallel to these recent advances at the intersection of optics and machine learning, there has been major progress in optical pulse shaping, including pulse compression for optical telecommunication35 and pulse stretching for chirped pulse amplification36. Dynamic, customizable temporal waveform synthesis has been achieved using time37,38,39 or frequency domain40,41,42 modulation. Among different approaches, the Fourier-transform based configuration43, which relies on conventional optical components such as lenses to establish a mapping between the pixels of an optical modulation device and the spectral components of the input broadband light, is one of the most commonly employed techniques. In various forms of its implementation, the optical modulation device placed at the Fourier plane in between two gratings can be a dynamic component e.g., a spatial light modulator44,45,46,47, an acousto-optic modulator48,49, a movable mirror50 or even a metasurface51, offering engineered dispersion and wavefront manipulation, tailored for different applications.

However, these earlier pulse shaping techniques have restricted utility at some parts of the electromagnetic spectrum, such as the terahertz band, due to the lack of advanced optical components that can provide spatio-temporal modulation and control of complex wavefronts, covering both a broad bandwidth and a high spectral resolution at these frequencies52,53. As a result, direct shaping of terahertz pulses by independent control of the spectral amplitude and phase of the input wavelengths has not been achieved to date; instead, the synthesis of terahertz pulses has been generally performed indirectly through the engineering of the optical-to-terahertz converters or shaping of the optical pulses that pump terahertz sources54,55,56,57,58. Previous work also demonstrated an active device using an external pump-induced inhomogeneous medium to shape input terahertz pulses59.

Here, we demonstrate the use of diffractive networks designed by deep learning to all-optically shape pulses by simultaneously controlling the relative phase and amplitude of each spectral component across a continuous and wide range of frequencies using only trainable diffractive layers, forming a small footprint, compact and passive pulse engineering system. This framework uses a deep learning-based physical design strategy to devise task-specific diffractive systems that can shape various temporal waveforms of interest. Following the digital training stage in a computer, we fabricated the resulting diffractive layers (Fig. 1) and experimentally demonstrated the success of our pulse shaping diffractive networks by generating pulses with various temporal widths using a broadband terahertz pulse as input. Despite using passive diffractive layers, the presented pulse shaping networks offer temporal pulse-width tunability that is experimentally demonstrated by varying the inter-layer distances within a fabricated diffractive network. We also investigated a physical transfer learning approach to show the modularity of the design space provided by our framework. In addition to engineering terahertz pulses, the fundamental design approach that is presented here can be readily adapted to different parts of the electromagnetic spectrum for shaping pulses. We believe that this study extends the engineering and precise control of electromagnetic fields through deep learning-designed diffractive networks into time-domain shaping of pulses, further motivating the development of all-optical machine learning and information processing platforms that can better harness the 4D spatio-temporal information carried by light.

Fig. 1: Schematic of the pulse shaping diffractive network and a photo of the experimental setup.

a 3D printed pulse shaping diffractive network that generates a square pulse with a width of 15.57 ps. b The schematic of the THz-TDS setup used in our experiments. The red line represents the optical path of a 780 nm femtosecond laser, and the blue line represents the terahertz beam. Dashed lines show the input and output apertures of the diffractive network. c The physical system layout of the pulse shaping diffractive network design. The input and output apertures are squares, with edge lengths of 0.8 cm and 0.2 cm, respectively. Gray regions on the aperture planes represent aluminum coating to block light transmission. d The photo of the experimental setup.

Results

Synthesis of arbitrary temporal waveforms

Synthesis of arbitrary temporal waveforms through small footprint and compact systems has been of great interest for various applications in e.g., tele-communications, ultra-fast imaging and spectroscopy, and it represents a challenging inverse design problem. Specifically, it requires accurate control of the complex-valued weights of the spectral components across a wide bandwidth and with high spectral resolution. We addressed this challenging inverse design problem through the training of diffractive networks as shown in Fig. 1c. The forward training model of our diffractive networks formulates the broadband light propagation using the angular spectrum representation of optical waves24. Based on the complex dispersion information of a diffractive material, the thickness of each diffractive feature (i.e., ‘neuron’) of a given diffractive layer is iteratively trained and optimized through the error-backpropagation with respect to a target cost function (see the “Methods” section). After the convergence of this deep learning-based training in a computer, we fabricated the resulting diffractive layers (Fig. 1c) using a 3D-printer to physically form our pulse shaping network as shown in Fig. 1a. This diffractive network was then experimentally tested for its desired/targeted pulse shaping capability using a terahertz time-domain spectroscopy (THz-TDS) setup60 that provides a noise equivalent bandwidth of 0.1–5 THz (Fig. 1b, d).
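As a rough illustration of the forward model described above, the following Python sketch passes a single frequency component through a stack of thin diffractive layers using the angular spectrum method. It is not the authors' training code. The function names, the refractive index n, the extinction coefficient kappa and the use of one equal spacing before, between and after the layers are all hypothetical choices; only the 0.5 mm feature size and the 30 mm layer-to-layer distance are taken from the text.

```python
import numpy as np

C_MM_PER_S = 299_792_458e3  # speed of light in mm/s


def layer_transmission(thickness_mm, freq_hz, n, kappa):
    """Thin-element transmission of a layer relative to air (assumed model)."""
    k0 = 2 * np.pi * freq_hz / C_MM_PER_S        # free-space wavenumber, rad/mm
    phase = k0 * (n - 1.0) * thickness_mm        # extra phase delay versus air
    loss = np.exp(-k0 * kappa * thickness_mm)    # absorption inside the material
    return loss * np.exp(1j * phase)


def propagate(field, freq_hz, pitch_mm, distance_mm):
    """Free-space propagation of a sampled 2D field (angular spectrum method)."""
    wavelength = C_MM_PER_S / freq_hz
    fx = np.fft.fftfreq(field.shape[1], d=pitch_mm)
    fy = np.fft.fftfreq(field.shape[0], d=pitch_mm)
    FX, FY = np.meshgrid(fx, fy)
    arg = 1.0 / wavelength**2 - FX**2 - FY**2
    kz = 2 * np.pi * np.sqrt(np.maximum(arg, 0.0))
    # Evanescent components (arg <= 0) are simply discarded in this sketch.
    H = np.where(arg > 0, np.exp(1j * kz * distance_mm), 0.0)
    return np.fft.ifft2(np.fft.fft2(field) * H)


def forward(field, thicknesses, freq_hz, pitch_mm, spacing_mm, n, kappa):
    """Propagate one frequency through all layers, then to the output plane."""
    for t in thicknesses:
        field = propagate(field, freq_hz, pitch_mm, spacing_mm)
        field = field * layer_transmission(t, freq_hz, n, kappa)
    return propagate(field, freq_hz, pitch_mm, spacing_mm)


# Example: a flat input at 400 GHz through four zero-thickness layers.
field = np.ones((16, 16), dtype=complex)
layers = [np.zeros((16, 16)) for _ in range(4)]
out = forward(field, layers, 400e9, pitch_mm=0.5, spacing_mm=30.0, n=1.7, kappa=0.05)
```

In a broadband model of this kind, the same routine would be evaluated at every frequency in the pulse bandwidth, with n and kappa taken from the measured dispersion of the material, and the thickness maps would be the trainable parameters.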

Each one of our pulse shaping diffractive networks consists of 4 trained layers that process the input terahertz pulse to synthesize a desired temporal waveform over an output aperture of 0.2 cm × 0.2 cm. Based on this system layout and a given input pulse profile to be shaped (Fig. 2b), we trained and fabricated diffractive networks that generate square pulses with different temporal widths. For example, Fig. 2a demonstrates the diffractive layers of a pulse shaping network that was trained to generate a 15.5 ps square pulse by processing the spectrum carried by the input terahertz pulse. Figure 2c demonstrates the time-domain amplitude of the output waveform numerically computed (blue) based on the trained diffractive layers and the corresponding experimentally measured temporal waveform (orange), along with the associated spectral amplitude and phase distributions. The carrier frequency of the desired temporal waveform at the output was a non-learnable, predetermined parameter set to be 0.35 THz to avoid water absorption bands in the terahertz regime (depicted by the red arrows in Fig. 2b). The numerically predicted output waveform (blue) in Fig. 2c indicates that a 4-layer diffractive network can synthesize a square temporal waveform with a pulse width of 15.69 ps without using any conventional optical components, in a compact architecture that spans approximately 250-times the carrier wavelength in the axial direction. The pulse width of the temporal waveform created by the 3D printed diffractive layers at the output aperture is measured as 15.52 ps, closely matching the numerically predicted result (15.69 ps). Similarly, a comparison of the output spectral amplitude profiles for the numerical and experimental results shows a good agreement in terms of the peak locations of the main and side lobes, as well as the relative amplitude carried by each spectral component. On the other hand, an examination of the unwrapped phase profiles (experimental vs. numerical) reveals that the 3D-fabricated, physical diffractive network could not exactly create the sharp phase transitions at the expected spectral locations, but rather generated smoothened transitions. This smoothening contributes to some of the differences observed between the experimentally measured and the numerically calculated time-domain waveforms (Fig. 2c). The power efficiency of this diffractive network was experimentally measured as ~0.51% at the carrier frequency (f0 = 355 GHz), quantified at the output aperture, normalized with respect to the input; here we should emphasize that >70% of the input optical power at the carrier frequency is in fact lost due to absorption within the 3D printed diffractive layers. Therefore, to create our diffractive layers, the selection of a different fabrication material with a much lower loss (e.g., polymers such as poly-methylpentene: TPX)61,62,63 can significantly boost the overall efficiency of these diffractive pulse shaping networks. Other strategies to improve the power efficiency include increasing the output aperture size and introducing additional power-related penalty terms during the training phase of the diffractive network (see the “Discussion” section).
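The main and side lobes mentioned above can be related to a textbook result: an ideal square pulse of width T on a carrier f0 has a sinc-shaped spectral amplitude whose first nulls sit at f0 ± 1/T. The short Python sketch below evaluates that idealized spectrum for the 15.5 ps target and the 0.35 THz carrier quoted in the text. It ignores the finite bandwidth of the real input pulse and the negative-frequency image, and the function names are illustrative only.

```python
import math


def square_pulse_spectrum(freq_hz, carrier_hz, width_s):
    """Normalized spectral amplitude of an ideal square pulse on a carrier."""
    x = math.pi * (freq_hz - carrier_hz) * width_s
    return 1.0 if x == 0 else abs(math.sin(x) / x)


def main_lobe_nulls(carrier_hz, width_s):
    """First spectral nulls on either side of the carrier, at f0 -/+ 1/T."""
    return carrier_hz - 1.0 / width_s, carrier_hz + 1.0 / width_s


low_null, high_null = main_lobe_nulls(0.35e12, 15.5e-12)
print(f"main lobe spans {low_null / 1e9:.1f} to {high_null / 1e9:.1f} GHz")
```

For a 15.5 ps pulse, 1/T is about 64.5 GHz, so a shorter target pulse widens the main lobe and a longer one narrows it.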

Fig. 2: Pulse shaping diffractive network design and output results.

a The thickness profiles of the resulting diffractive layers after deep learning-based training in a computer. These diffractive layers synthesize a square pulse with a width of 15.69 ps over the output aperture for an input pulse shown in b. b Normalized input terahertz pulse measured right after the input aperture (see Fig. 1); in time-domain (left) and spectral domain (right). The red arrows on the measured spectral amplitude profile represent the water absorption bands at terahertz frequencies. c Left: The numerically computed (blue) and the experimentally measured (orange) output pulses in time domain. Top right: The normalized spectral amplitudes corresponding to the numerically computed (blue) and the experimentally measured (orange) pulses. Bottom right: Unwrapped spectral phase distributions computed based on the numerical forward model (blue) and the experimentally measured (orange) pulse.

Supplementary Fig. 1 further illustrates another diffractive network that was designed to create a narrower square pulse at its output aperture. At the end of its deep learning-based training, the numerical forward model converged to the thickness profiles shown in Supplementary Fig. 1a in order to synthesize a 10.96 ps square pulse (blue) illustrated in Supplementary Fig. 1c. When the diffractive layers depicted in Supplementary Fig. 1a were 3D printed and experimentally tested using the setup shown in Fig. 1d, the output pulse waveform was measured to have a temporal width of 11.85 ps (orange curve in Supplementary Fig. 1c), providing a good match to our numerical results, similar to the conclusions reported in Fig. 2.

Beyond fabrication artefacts and misalignments observed in the 3D-printed diffractive networks, the variation of the input terahertz pulse from experiment to experiment is one of the significant contributors to any mismatch between the numerical and experimental output waveforms. The deep learning-based design of the diffractive networks shown in Fig. 2 and Supplementary Fig. 1 relies on a known input terahertz pulse profile that is experimentally measured over the input aperture. To account for uncontrolled variations of the input pulse profile from run to run, we used 5 different experimentally measured input pulse profiles (dashed curves in Supplementary Figs. 2a–b) during the training phase of each diffractive network. In the experimental testing phase, however, the terahertz input pulse (light blue curve in Supplementary Figs. 2a–b) slightly deviated from these input pulse profiles used in the training, causing some distortions in the experimental results shown in Fig. 2 and Supplementary Fig. 1, compared to their numerically computed counterparts for the same diffractive network models (also see Supplementary Fig. 3 and Supplementary Fig. 4).

To shed more light onto this, next we normalized the experimentally measured spectral amplitude profiles depicted in Fig. 2c and Supplementary Fig. 1c, based on the ratio between the average spectral amplitudes carried by the input pulses used in the training phase and the input pulse measured at the experimental testing phase. This simple spectral normalization procedure nullifies the effect of input terahertz source variations from experiment to experiment and provides us an opportunity to better evaluate the accuracy of the complex-valued spectral filtering operation performed by the 3D-fabricated diffractive network. Supplementary Figs. 2c and 2d demonstrate the experimental spectral amplitudes and the corresponding temporal waveforms at the network output before and after this spectral normalization step for the diffractive networks shown in Fig. 2a and Supplementary Fig. 1a, respectively. Following the spectral normalization, the width of the square pulse created by the diffractive network in Supplementary Fig. 1a, for example, decreased from 11.85 ps to 10.49 ps, providing a better match to the 11.07 ps that is predicted by our numerical forward model (Supplementary Fig. 2d). A similar improvement using spectral normalization was also observed for the diffractive network shown in Fig. 2a, almost perfectly matching its numerical counterpart in terms of the square pulse width, achieving 15.71 ps after the normalization step (Supplementary Fig. 2c).
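The spectral normalization step described above can be sketched in a few lines of Python. This is one reading of the procedure, assuming the measured output amplitude at each frequency is multiplied by the ratio of the average training input amplitude to the test input amplitude; the function name and the small floor eps, which guards against division by near-zero values such as those inside water absorption bands, are assumptions and not part of the original method.

```python
import numpy as np


def spectral_normalization(output_amp, test_input_amp, training_input_amps, eps=1e-12):
    """Rescale a measured output spectrum to the average training input spectrum."""
    output_amp = np.asarray(output_amp, dtype=float)
    test_input_amp = np.asarray(test_input_amp, dtype=float)
    # Average the training input spectra frequency by frequency.
    mean_training = np.mean(np.asarray(training_input_amps, dtype=float), axis=0)
    # Ratio between the average training input and the input seen at test time.
    ratio = mean_training / np.maximum(test_input_amp, eps)
    return output_amp * ratio


# Toy check: a fixed spectral filter applied to a test input that differs
# from the training inputs.
filter_amp = np.array([0.2, 1.0, 0.5])
training_inputs = np.array([[1.0, 2.0, 3.0], [3.0, 2.0, 1.0]])
test_input = np.array([1.5, 2.5, 2.0])
measured_output = filter_amp * test_input
normalized = spectral_normalization(measured_output, test_input, training_inputs)
```

In this toy case the normalized spectrum equals the filter applied to the average training input, which is the quantity the numerical forward model predicts.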

These results highlight that experiment-to-experiment variability of our input terahertz pulse profile causes it to deviate from the input pulse profiles used in the training phase of our diffractive network, creating some uncontrolled errors in the output pulse profile, which can be improved significantly after the spectral normalization step, as discussed above. To further explore the pulse shaping capabilities of diffractive networks, next we trained a set of generic diffractive networks that used/assumed a flat input spectrum during their training in order to achieve a desired output waveform; stated differently, a generic diffractive network is trained using an input pulse where all the wavelengths have the same spectral amplitude and phase. To accurately demonstrate the pulse shaping behavior of these generic diffractive designs that were trained with flat spectra, we used spectral normalization based on the input pulse profile, experimentally measured at each run. For example, Supplementary Fig. 5a and 6 show the diffractive layers of a generic pulse shaping network model that was trained to create a 15.5 ps square pulse. Supplementary Fig. 5c reports the time-domain amplitude of the output waveform numerically computed (blue) based on these trained diffractive layers and the experimentally measured temporal waveform (orange) along with the corresponding spectral amplitude and phase distributions. The synthesized pulse shape by the 3D-printed diffractive network closely matches the numerically computed waveform using our forward model, despite the water absorption bands that appear in our experimental results, illustrated by the red arrows in Supplementary Fig. 5b. The power efficiency at the carrier frequency (f0 = 400 GHz) of this diffractive network was experimentally measured as ~0.97%. 
Figure 3 further demonstrates three additional generic pulse shaping diffractive network models that were trained with a flat input spectrum and experimentally tested using our terahertz setup to achieve different square pulses, with pulse widths of 11.25 ps, 13.45 ps, and 16.69 ps, respectively, demonstrating a very good match to their numerical counterparts. The numerically computed peak frequencies for these three different diffractive networks were 399.4 GHz, 396.1 GHz, and 399.4 GHz, which were measured experimentally as 399.1 GHz, 402.2 GHz, and 401.8 GHz, respectively. As we move towards higher optical frequencies beyond 0.6 THz, the experimental spectral amplitude distributions start to deviate from their numerically predicted counterparts. Considering that the maximum material thickness in our model is ~1 mm, at higher optical frequencies corresponding to wavelengths below ~0.5 mm, the light may travel more than 2 wavelengths inside a diffractive feature (depending on the final trained model) which will then violate the thin modulation layer assumption in our forward model contributing to some of the experimental errors observed in Fig. 3. In addition, the size of each diffractive feature corresponding to a unique complex-valued modulation per neuron (see “Methods” section) was chosen to be 0.5 mm due to the limited lateral resolution of our 3D printer. Therefore, for higher frequencies, the light fields are modulated at each diffractive layer with 2D functions sampled at lower spatial rates, which, in return, partially limits the design capabilities of our diffractive networks at those smaller wavelengths of the pulse bandwidth. Furthermore, the uneven surface profile in 3D printing combined with thickness variations induced by fabrication imperfections contribute to some additional sources of experimental errors observed in our results.
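The scale argument in this paragraph can be checked with a few lines of arithmetic. The Python sketch below converts frequency to free-space wavelength and expresses the ~1 mm maximum thickness and the 0.5 mm feature size in units of that wavelength. It uses the vacuum wavelength only; inside the material the wavelength is shorter by the refractive index, which is not modeled here, and the helper names are illustrative.

```python
C_M_PER_S = 299_792_458.0  # speed of light in vacuum


def wavelength_mm(freq_hz):
    """Free-space wavelength in millimetres."""
    return C_M_PER_S / freq_hz * 1e3


def wavelengths_across(length_mm, freq_hz):
    """How many free-space wavelengths fit into a given length."""
    return length_mm / wavelength_mm(freq_hz)


for f_thz in (0.4, 0.6, 1.0):
    f_hz = f_thz * 1e12
    print(
        f"{f_thz:.1f} THz: wavelength {wavelength_mm(f_hz):.3f} mm, "
        f"1 mm thickness = {wavelengths_across(1.0, f_hz):.2f} wavelengths, "
        f"0.5 mm feature = {wavelengths_across(0.5, f_hz):.2f} wavelengths"
    )
```

At 0.6 THz the free-space wavelength is just under 0.5 mm, so a 1 mm feature spans about two wavelengths and a 0.5 mm neuron is about one wavelength wide, consistent with the deviations reported above that frequency.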

Fig. 3: Experimental validation of different generic pulse shaping diffractive networks.

From left to right, the numerically computed (blue) and the experimentally measured (orange) normalized spectral amplitudes are illustrated with the inset plots showing the corresponding unwrapped spectral phase profiles; the numerically calculated (blue) normalized output pulse and the experimentally measured (orange) normalized output pulse are also shown along with the thickness profiles of the diffractive layers resulting from deep learning-based training for synthesizing the desired (ground truth) square pulses with pulse-widths of (a) 10.52 ps, (b) 13.02 ps, and (c) 17.98 ps.

To further demonstrate the design capabilities of our diffractive pulse shaping framework, in addition to the square pulses with various temporal widths reported earlier, we also trained three new diffractive network models that were designed to output (1) a chirped-Gaussian pulse (Supplementary Fig. 7), (2) a sequence of positive and negative chirped Gaussian pulses, one following another (Supplementary Fig. 8), and (3) a sequence of two chirp-free Gaussian pulses (Supplementary Fig. 9). These results report a very good match, both in time and spectral domains, between the target, ground-truth pulse profiles and the corresponding output pulses synthesized by the trained diffractive networks, clearly demonstrating the versatile nature of the presented framework to synthesize arbitrary pulses, engineered through the deep learning-based design of diffractive surfaces.

Pulse width tunability

Next, we demonstrated the temporal width tunability of pulse shaping diffractive networks despite the passive nature of their layers. By changing the axial distance between successive diffractive layers by ΔZ, the temporal width and the peak frequency of the output waveform can be tuned without any further training or a change to the 3D printed diffractive layers. We demonstrated this pulse-width tunability using the 3D printed diffractive network depicted in Supplementary Fig. 5, but a similar tunability also applies to the network models shown in Fig. 3. Since our diffractive networks used 30 mm layer-to-layer distance in their design, we considered a ΔZ range from −10 mm to 20 mm; for instance, when ΔZ is taken as −10 mm, the axial distance between all the successive layers of the diffractive network is set to be 20 mm. Within this axial tuning range, Fig. 4a–h demonstrate the effect of changing this layer-to-layer distance of an already designed/trained diffractive network on the output waveform and its complex-valued spectrum. The results reveal that as the diffractive layers get closer to each other axially, i.e., a negative ΔZ, the pulse-width of the output waveform increases and the peak frequency decreases. For instance, when the axial distance between each diffractive layer of the design shown in Supplementary Fig. 5 is decreased by 5 mm (ΔZ = −5 mm) as shown in Fig. 4d, the peak of the spectral amplitude distribution shifts from 399.4 GHz to 349.1 GHz according to our numerical forward model. The pulse-width of the resulting square pulse at the output aperture was numerically found to be 17.59 ps, suggesting a longer pulse compared to the 15.56 ps synthesized by the original design, ΔZ = 0 mm (Fig. 4d). The experimentally measured pulse width with the same amount of axial change in the layer-to-layer distance of the diffractive network revealed a 17.56 ps pulse after the spectral normalization step, confirming the tunability of our pulse shaping diffractive network and also providing a very good match to our numerical results (Fig. 4).

Fig. 4: Pulse width tunability of diffractive networks.

a Numerically calculated and experimentally measured temporal pulse widths and (b) the corresponding shifts in the center frequency are depicted as a function of the inter-layer distances of a pulse shaping diffractive network that was originally trained for synthesizing a square pulse width of 15.50 ps (Δz = 0 mm, see Supplementary Fig. 5). c–h The numerically computed (blue) and the experimentally measured (orange) normalized spectral amplitudes, with the inset plots showing the experimentally measured temporal waveform (red) when the layer-to-layer distances are changed by (c) Δz = −10 mm, (d) Δz = −5 mm, (e) Δz = 5 mm, (f) Δz = 10 mm, (g) Δz = 15 mm and (h) Δz = 20 mm. The negative (positive) sign indicates that the inter-layer axial distances decrease (increase).

When the layer-to-layer distance is increased, i.e., a positive ΔZ, the output square pulse gets narrower in time domain with an accompanying shift in the peak frequency toward higher values. Figure 4e demonstrates an example of this case with ΔZ = 5 mm, i.e., the distance between each diffractive layer is increased to 35 mm. In this case, the experimentally measured and numerically computed square pulses at the output plane have peak frequencies of 451.4 GHz and 453.1 GHz, with the corresponding pulse-widths of 14.3 ps and 13.97 ps, respectively, once again confirming the tunability of our pulse shaping diffractive networks and demonstrating a very good agreement between the numerical forward model and our experiments. As we further increase ΔZ beyond 10 mm (depicted in Fig. 4f), the time domain pulse continues to get narrower.

Modularity of diffractive pulse shaping network

To further explore methods to alter a given fabricated diffractive network and its output function, next we employed a Lego-like physical transfer learning approach to demonstrate pulse-width tunability by updating only part of a pre-trained network with newly trained and fabricated diffractive layers, showing the modularity of a diffractive pulse shaping network. For this aim, we took the pre-trained network that experimentally synthesized a 15.57 ps square waveform, noted as the original design in Fig. 5a, and further trained only the last diffractive layer to synthesize a new desired output waveform, i.e., a 12.03 ps square pulse, by keeping the first three layers as they are (already fabricated). We experimentally validated this transfer learning approach as shown in Fig. 5b by removing the existing last diffractive layer and inserting a newly trained layer, fabricated using the same 3D printer. Numerical and experimental results revealed very good match to each other for the normalized output spectral amplitude over a wide frequency range, as well as for the normalized output field waveform, generating pulse-widths of 12.21 ps and 13.25 ps, respectively. Next we took an alternative approach: this time, the last two diffractive layers were replaced with new diffractive layers trained to generate 12.03 ps square pulses. As illustrated in Fig. 5c, with the addition of these two new diffractive layers to the already existing first two layers, the resulting new diffractive network successfully demonstrated the synthesis of 12.14 ps and 12.39 ps waveforms at the output aperture for the numerical and experimental waveforms, respectively. The peak frequency of the new network model was calculated to be at 399.4 GHz and it was experimentally measured to be at 399.8 GHz, showing once again a very good match between our numerical forward model and experimental results. 
Overall, the insertion of two newly trained layers, when compared to a single newly trained layer added on top of the existing layers of a 3D-fabricated network, provided us improved performance for achieving the new pulse form that is desired.

Fig. 5: Changing the output temporal waveform of a diffractive network by a physical transfer learning approach.

a The temporal and spectral output distributions (blue dashed box), synthesized by the original diffractive design that was trained to generate a 15.50 ps square pulse. b Replacing the last diffractive layer with another, newly trained diffractive layer to synthesize a 12.03 ps square pulse at the output. The thickness profile of the newly trained diffractive layer is shown, together with the normalized spectral and temporal profiles synthesized by this new diffractive network in green dashed box. c Replacing the last two diffractive layers with newly trained, two diffractive layers to synthesize a 12.03 ps square pulse at the output. The thickness profiles of the newly trained diffractive layers are shown, together with the normalized spectral and temporal profiles synthesized by this new diffractive network in red dashed box.

Discussion

Our results reported in earlier sub-sections demonstrate direct pulse shaping in terahertz part of the spectrum, where a complex-valued spectral modulation function that is trained using deep learning directly acts on terahertz frequencies through a passive diffractive device, without the need for an external pump. The presented learning-based approach can shape any input terahertz pulse through diffraction and is fundamentally different from previous approaches that indirectly synthesize a desired terahertz pulse through optical-to-terahertz converters or shaping of the optical pump that interacts with terahertz sources. This capability of direct pulse shaping in terahertz band enables new opportunities that could not be explored with indirect pulse shaping approaches. For example, precise engineering and synthesis of terahertz pulses with the state-of-the-art methods is either not possible or very hard and costly to achieve, including e.g., pulsed terahertz generation through quantum cascade lasers64,65,66, solid-state circuits67,68, and particle accelerators69. Furthermore, the presented deep learning-based framework is flexible and versatile, and can be used to engineer terahertz pulses regardless of their polarization state, beam shape, beam quality or aberrations of the specific terahertz generation mechanism.

The intrinsic pulse-width tunability of a given diffractive network that is achieved by changing the axial layer-to-layer distance is an interesting feature that we demonstrated numerically and experimentally: Fig. 4a shows various pulse-widths obtained at seven different layer-to-layer distances using an existing network design. As the layer-to-layer distance of a diffractive network design increases, the temporal pulse-width at the output aperture gets smaller, without any further training or fabrication of new diffractive layers. This opens up the opportunity to synthesize new waveforms within a certain time window around the originally designed output pulse. In addition to that, an axial distance change between the existing layers of a diffractive network also shifts the center frequency of the output pulse as shown in Fig. 4b. As the diffractive layers get closer to each other, we observed a red-shift in the center frequency. Another related aspect of this pulse shaping diffractive framework is its modularity to tune the output pulses using a physical transfer learning approach. By training a new layer (or layers) to replace part of an existing, pre-trained diffractive network model, on-demand synthesis of new pulses can be achieved, as demonstrated in Fig. 5b–c. These results highlight some of the unique features of diffractive pulse shaping networks and how they can adapt to potential changes in the desired output pulse patterns.

The presented pulse shaping framework has a compact design, with an axial length of approximately 250 × λ0, where λ0 denotes the peak wavelength. Moreover, it does not utilize any conventional optical components such as spatial light modulators, which makes it well suited for pulse shaping in the terahertz part of the spectrum, where high-resolution spatio-temporal modulation and control of complex wavefronts over a broad bandwidth represent a significant challenge. In addition to being compact and much simpler compared to previous demonstrations of pulse shaping in the terahertz spectrum, our results present the implementation of direct pulse shaping in the terahertz band, where the learned complex-valued spectral modulation function of the diffractive network directly acts on terahertz frequencies for pulse engineering. This capability enables new opportunities: when merged with appropriate fabrication methods and materials, the presented pulse shaping approach can be used to directly engineer terahertz pulses generated through quantum cascade lasers, solid-state circuits and particle accelerators. Another major advantage of this deep learning-based approach is that it is versatile and can be easily adapted to engineer terahertz pulses irrespective of their polarization state, beam quality, and spectral/spatial aberrations.

The experimentally measured power efficiency values reported in our manuscript are ~1%. However, there are various design strategies that can increase the power efficiency of diffractive pulse shaping networks, as detailed in the table in Fig. 6 (also see the “Methods” section). The diffractive networks reported in Fig. 6 were trained to synthesize 15.5 ps square pulses at their output plane. As one can observe in Fig. 6, the power efficiency values of the resulting diffractive models can be increased by more than an order of magnitude by adjusting the training loss function, increasing the output aperture size and using low-absorption materials. For example, as reported in the second column of the table in Fig. 6, when the material absorption is ignored during the testing of a diffractive network model, a 2-fold wider output aperture (i.e., 4 mm) provides a significant improvement in the power efficiency of the pulse shaping networks, reaching 60.37% and 61% for two different network models. On the other hand, if the absorption of our 3D-printing material is taken into account as part of the optical forward model, one can reach an efficiency value of 17.84% by accordingly optimizing the training loss function and using a 4 mm output aperture (see Fig. 6).

Fig. 6: Power efficiency values of pulse shaping diffractive networks trained with different loss functions.

These diffractive pulse shaping networks were trained to generate a 15.5 ps square pulse with output aperture widths of 2 mm (left) and 4 mm (right), with their corresponding loss functions shown in the leftmost column.

By comparing the top and bottom efficiency values for a given training loss function and design strategy reported in Fig. 6, we clearly see that the 3D-printing material used in this work decreases the pulse shaping network efficiency by a factor of 2–5, depending on the design, compared to an ideal, non-absorbing optical material. As an alternative fabrication material for diffractive pulse shaping networks, one can consider low-absorption polymers61,62,63 used in commercially available components designed for THz wavelengths, such as TPX, which exhibits a two orders-of-magnitude smaller absorption coefficient compared to the 3D-printing material used in our work. Various fabrication processes have been developed for such low-absorption polymers70,71, which can be used to precisely control the thickness of these low-loss polymers with relatively high resolution to manufacture pulse shaping diffractive networks with much lower material absorption. To further improve the output efficiency of pulse shaping diffractive networks, anti-reflective (AR) coatings over diffractive surfaces can also be utilized to reduce back-reflections, similar to AR-coated commercial lenses and other optical components.

In conclusion, we presented a modular pulse shaping network that synthesizes various pulse waveforms using deep learning. Precise shaping of the spectral amplitude and phase profile of an arbitrary input pulse over a wide frequency range can be achieved using this platform, which will be transformative for various applications including e.g., communications, pulse compression, ultra-fast imaging and spectroscopy. In addition to direct engineering of terahertz pulses, the presented diffractive pulse shaping network can be utilized in different parts of the electromagnetic spectrum by using appropriate fabrication technologies and materials.

Methods

Terahertz setup

Figure 1 shows the schematic diagram of the terahertz time-domain spectroscopy (THz-TDS) setup that was used to measure the input and output pulse profiles reported in this work. A Ti:sapphire laser (Coherent Mira HP) is used to generate femtosecond optical pulses. The optical beam generated by the laser is split into two parts. One part of the beam is used to pump a high-power plasmonic photoconductive terahertz source to generate terahertz pulses72, which are collimated with off-axis parabolic mirrors and guided to a high-sensitivity plasmonic photoconductive terahertz detector60. The other part of the beam passes through an optical delay line (Newport IMS300LM) and is focused onto the terahertz detector. As a result, an ultrafast signal which is directly proportional to the incident terahertz field is generated within the terahertz detector. The signal is sampled with a 12.5 fs time-resolution over a 400 ps time-window by changing the time delay between the terahertz and optical probe pulses incident on the detector, amplified with a transimpedance pre-amplifier (Femto DHPCA-100), and acquired with a lock-in amplifier (Zurich Instruments MFLI). For each measurement, 10 time-domain traces are collected and averaged. The described THz-TDS setup provides a 90 dB signal-to-noise ratio over a 5 THz noise-equivalent-power bandwidth.

Each one of the pulse shaping diffractive networks consists of 4 trained layers that are separated by 3 cm as illustrated in Fig. 1. The diffractive layers, input and output apertures were fabricated using a 3D printer (Objet30 Pro, Stratasys Ltd.). The fabrication/preparation of each diffractive layer takes approximately 1.5–2 h. A square input aperture (0.8 cm) and an output aperture (0.2 cm) are placed 3 cm from the first diffractive layer and 10 cm from the last diffractive layer, respectively (Fig. 1c). The printed apertures were aluminum coated to prevent any light wave from passing through the regions outside of the aperture. After the design and printing of the diffractive layers, they were placed at their corresponding locations inside a 3D-printed holder that ensures robust alignment between the layers. During the pulse shaping experiments, the diffractive network was directly placed between the terahertz source and detector, coaxial with the terahertz input pulse emanating from the source (Fig. 1b,d). After the alignment of the diffractive network, the output pulse was measured, followed by the measurement of the reference input pulse, which was acquired by placing the same terahertz detector at the input aperture, without any diffractive layers between the source and detector. For generic diffractive networks that were trained with flat input spectra, the measured output pulse spectrum is normalized with respect to the measured reference input pulse and its spectral amplitude is smoothed around the water absorption lines shown in Figs. 3–5 and Supplementary Fig. 5. The measured pulse width at the network output is defined as the width of the time interval over which the envelope of the pulse amplitude is at least 20% of its maximum (see e.g., Figs. 2–5).
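The 20% envelope criterion used for the pulse width can be illustrated with a short sketch. The text does not state how the envelope was extracted; the code below assumes the magnitude of the analytic signal (an FFT-based Hilbert transform), and the function names `envelope` and `pulse_width` as well as the synthetic Gaussian test trace are illustrative only.

```python
import numpy as np

def envelope(x):
    """Envelope of a real 1-D trace as the magnitude of its analytic signal.

    Assumption: FFT-based Hilbert transform; the source does not specify
    how the envelope was obtained.
    """
    n = len(x)
    weights = np.zeros(n)
    weights[0] = 1.0                      # keep the DC bin once
    if n % 2 == 0:
        weights[n // 2] = 1.0             # keep the Nyquist bin once
        weights[1:n // 2] = 2.0           # double the positive frequencies
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.abs(np.fft.ifft(np.fft.fft(x) * weights))

def pulse_width(t, x, threshold=0.2):
    """Width of the interval over which the envelope is >= threshold * max."""
    env = envelope(x)
    above = np.nonzero(env >= threshold * env.max())[0]
    return t[above[-1]] - t[above[0]]

# Synthetic trace (illustrative values): Gaussian envelope with sigma = 5 ps
# on a 0.35 THz carrier, sampled every 50 fs over a 200 ps window.
dt = 50e-15
t = np.arange(-2000, 2000) * dt
sigma = 5e-12
x = np.exp(-t**2 / (2 * sigma**2)) * np.cos(2 * np.pi * 0.35e12 * t)

width_ps = pulse_width(t, x) * 1e12
expected_ps = 2 * sigma * np.sqrt(2 * np.log(5)) * 1e12  # analytic 20% width
print(f"measured {width_ps:.2f} ps, analytic {expected_ps:.2f} ps")
```

For a Gaussian envelope the 20% width has a closed form, which makes this trace a convenient sanity check before applying the same function to measured data.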

Forward model

Our forward model considers the layers of a diffractive network as thin modulation elements that are connected to the next layer through free space propagation. The modulation of neurons at each layer can be modeled as:

$$M^n\left( {x_i,y_i,z_i,\lambda } \right) = A^n\left( {x_i,y_i,z_i,\lambda } \right)\exp \left( {j\varphi ^n\left( {x_i,y_i,z_i,\lambda } \right)} \right),$$
(1)

where M represents the complex transmission/reflection coefficient. The field amplitude, phase, wavelength, and diffractive layer number are denoted by A, ϕ, λ and n, respectively. Free space propagation between each layer is calculated based on the Rayleigh–Sommerfeld formulation of diffraction that models each diffractive feature as the source of a secondary wave:

$$W_i^n\left( {x,y,z,\lambda } \right) = \frac{{z - z_i}}{{r^2}}\left( {\frac{1}{{2\pi r}} + \frac{1}{{j\lambda }}} \right)\exp \left( {\frac{{j2\pi r}}{\lambda }} \right),$$
(2)

where \(r = \sqrt {\left( {x - x_i} \right)^2 + \left( {y - y_i} \right)^2 + \left( {z - z_i} \right)^2}\), \(j = \sqrt { - 1}\) and \(W_i^n(x,y,z,\lambda )\) is the secondary wave generated by the ith neuron on the nth layer at location \((x_i, y_i, z_i)\). Then, we can write the optical field at layer n, at point \((x_i, y_i, z_i)\), as:

$$U^n\left( {x_i,\,y_i,\,z_i,\,\lambda } \right) = M^n\left( {x_i,\,y_i,\,z_i,\,\lambda } \right)\mathop {\sum}\limits_k {U^{n - 1}\left( {x_k,\,y_k,\,z_k,\,\lambda } \right)W_k^{n - 1}\left( {x_i,\,y_i,\,z_i,\,\lambda } \right),\,n \ge 1} .$$
(3)
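Equations (1)–(3) can be sketched numerically for a single wavelength as a matrix of secondary waves followed by an element-wise modulation. This is a minimal NumPy illustration and not the authors' TensorFlow implementation; the helper names (`secondary_wave`, `propagate`, `layer_grid`), the 8 × 8 grid and the random phase-only modulation are assumptions made for the example, and, like Eq. (3) as written, the sum omits any pixel-area weighting.

```python
import numpy as np

def secondary_wave(src_xy, dst_xy, dz, wavelength):
    """Eq. (2) evaluated for every (destination, source) pair.

    src_xy: (Ns, 2) neuron coordinates on layer n-1, dst_xy: (Nd, 2) on layer n,
    dz: axial separation z - z_i. Returns an (Nd, Ns) complex matrix.
    """
    dx = dst_xy[:, None, 0] - src_xy[None, :, 0]
    dy = dst_xy[:, None, 1] - src_xy[None, :, 1]
    r = np.sqrt(dx**2 + dy**2 + dz**2)
    amplitude = (dz / r**2) * (1 / (2 * np.pi * r) + 1 / (1j * wavelength))
    return amplitude * np.exp(2j * np.pi * r / wavelength)

def propagate(u_prev, w, modulation):
    """Eq. (3): sum the secondary waves of layer n-1, then apply M^n."""
    return modulation * (w @ u_prev)

def layer_grid(n_side, pitch):
    """Centered square grid of neuron coordinates, shape (n_side**2, 2)."""
    axis = (np.arange(n_side) - (n_side - 1) / 2) * pitch
    xx, yy = np.meshgrid(axis, axis)
    return np.column_stack([xx.ravel(), yy.ravel()])

# Illustrative numbers: 8 x 8 neurons, 0.5 mm pitch, 3 cm spacing, 1 mm wavelength.
grid = layer_grid(8, 0.5e-3)
w = secondary_wave(grid, grid, dz=30e-3, wavelength=1e-3)
rng = np.random.default_rng(0)
modulation = np.exp(1j * rng.uniform(0, 2 * np.pi, size=len(grid)))  # phase-only M^n
u_prev = np.ones(len(grid), dtype=complex)                           # uniform input field
u_next = propagate(u_prev, w, modulation)
```

In a broadband model the same two steps would be repeated for each sampled wavelength, since both the secondary wave and the modulation depend on λ.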

Network training

During the training of a pulse shaping diffractive network, one of the 5 pulses measured at the input plane (Supplementary Figs. 2a–b) was randomly selected as the input pulse at each iteration of the training model for the diffractive networks reported in Fig. 2 and Supplementary Fig. 1; for the generic diffractive network models reported in Figs. 3–5, Supplementary Fig. 5 and Supplementary Figs. 7–9, however, the input is modeled as a spectrally flat Gaussian beam with varying FWHM values over a wide frequency range (Supplementary Fig. 10) and with a uniform phase profile. The wave propagation is performed for N = 300 discrete frequencies that were uniformly sampled between 3 GHz and 1 THz.

In our wave propagation through the diffractive layers, a 0.5 mm pixel (i.e., diffractive feature) size is assumed based on the lateral resolution of our 3D printer. While a pixel size of 0.5 mm can create all the propagating modes of free space for frequencies below ~300 GHz, it can only excite plane waves over a subset of the k-vectors supported by free space for the spectral components between 300 GHz and 1 THz73. Therefore, diffractive pulse shaping networks would in general benefit from fabrication techniques with better lateral resolution to more accurately control and engineer the complex-valued spectral weights of a given desired pulse.
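The ~300 GHz figure is consistent with a half-wavelength sampling argument: a pixel pitch p can address every propagating free-space mode only when p ≤ λ/2, i.e., for f ≤ c/(2p). The sketch below works through that arithmetic; the half-wavelength criterion and the function names are assumptions made for illustration and are not taken from the text.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s

def cutoff_frequency(pixel_pitch):
    """Highest frequency for which the pitch still satisfies p <= lambda / 2."""
    return C / (2.0 * pixel_pitch)

def max_diffraction_angle_deg(frequency, pixel_pitch):
    """Largest plane-wave angle addressable by the pitch (90 deg = all modes)."""
    wavelength = C / frequency
    return math.degrees(math.asin(min(1.0, wavelength / (2.0 * pixel_pitch))))

pitch = 0.5e-3  # 0.5 mm pixel size from the text
f_cut_ghz = cutoff_frequency(pitch) / 1e9
angle_100ghz = max_diffraction_angle_deg(100e9, pitch)
angle_1thz = max_diffraction_angle_deg(1e12, pitch)
print(f"cutoff ~ {f_cut_ghz:.0f} GHz; max angle at 1 THz ~ {angle_1thz:.1f} deg")
```

Under this assumption, the spectral components near 1 THz can only be steered within a limited cone, which is the restriction on k-vectors described above.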

To calculate the Rayleigh–Sommerfeld integral more accurately, each pixel is oversampled by a factor of two in each lateral direction, so that all 4 elements of the resulting 2 × 2 grid share the same thickness value. The thickness of each pixel, h, is composed of a base height (\(h_{\mathrm{base}}\)) of 0.1 mm, which provides adequate mechanical stiffness to the fabricated diffractive layer, and a trainable modulation height (\(h_{\mathrm{tr}}\)) that is between 0 and 1 mm, i.e.,

$$h = h_{{\mathrm{base}}} + h_{{\mathrm{tr}}}$$
(4)

To confine the modulation height between 0 and 1 mm, we defined \(h_{\mathrm{tr}}\) over an auxiliary training-related variable, \(h_{\mathrm{a}}\), using:

$$h_{{\mathrm{tr}}} = 0.5 \,{\mathrm{mm}} \times \left\{ {1 + \sin \left( {h_{\mathrm{a}}} \right)} \right\},$$
(5)
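Equations (4) and (5) amount to a bounded reparameterization of the layer thickness, which can be sketched as follows. The constant and function names are illustrative; the numerical values are the ones given in the text.

```python
import numpy as np

H_BASE_MM = 0.1    # base height from the text
H_RANGE_MM = 1.0   # trainable modulation range from the text

def thickness_mm(h_a):
    """Map an unconstrained auxiliary variable h_a to a pixel thickness in mm."""
    h_tr = 0.5 * H_RANGE_MM * (1.0 + np.sin(h_a))   # Eq. (5): always in [0, 1] mm
    return H_BASE_MM + h_tr                         # Eq. (4)

h_a = np.linspace(-10.0, 10.0, 2001)   # any real value is admissible
h = thickness_mm(h_a)
h_init = thickness_mm(0.0)             # thickness for a zero-valued h_a
print(f"range: {h.min():.2f} to {h.max():.2f} mm, at h_a = 0: {h_init:.2f} mm")
```

Because the sine is bounded, an optimizer can update the auxiliary variable freely while the thickness stays within the printable range; a zero-valued auxiliary variable corresponds to the middle of that range.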

In its general form, the amplitude and phase modulation of each neuron of a given diffractive layer is a function of the layer thickness, incident wavelength, material extinction coefficient κ(λ) and refractive index n(λ), i.e.,

$$A^n\left( {x,y,z,\lambda } \right) = \exp \left( { - \frac{{2\pi \kappa \left( \lambda \right)h}}{\lambda }} \right)$$
(6)
$$\varphi ^n\left( {x,y,z,\lambda } \right) = \frac{{2\pi h\left( {n\left( \lambda \right) - n_{air}} \right)}}{\lambda }$$
(7)
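As an illustration of Eqs. (6) and (7), the sketch below evaluates the complex modulation of a single neuron. The refractive index and extinction coefficient used here (1.7 and 0.02) are hypothetical placeholders, not the measured dispersion of the 3D-printing material, and the function name is illustrative.

```python
import numpy as np

def layer_modulation(h, wavelength, n, kappa, n_air=1.0):
    """Complex transmission of a neuron of thickness h (same length unit as wavelength)."""
    amplitude = np.exp(-2 * np.pi * kappa * h / wavelength)   # Eq. (6)
    phase = 2 * np.pi * h * (n - n_air) / wavelength          # Eq. (7)
    return amplitude * np.exp(1j * phase)

wavelength_mm = 1.0        # roughly 300 GHz
n_mat, kappa_mat = 1.7, 0.02   # hypothetical material values (assumption)

m_lossy = layer_modulation(0.6, wavelength_mm, n_mat, kappa_mat)
m_lossless = layer_modulation(0.6, wavelength_mm, n_mat, 0.0)

# A thickness of lambda / (n - n_air) adds exactly one full 2*pi phase cycle.
h_full_cycle = wavelength_mm / (n_mat - 1.0)
m_cycle = layer_modulation(h_full_cycle, wavelength_mm, n_mat, 0.0)
```

With κ set to zero the modulation is phase-only, which corresponds to ignoring the material absorption in the forward model.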

The material refractive index n(λ) and the extinction coefficient κ(λ) are defined as the real and imaginary parts of the complex refractive index, \(\tilde n\left( \lambda \right) = n\left( \lambda \right) + j\kappa (\lambda )\), determined by the dispersion of our 3D fabrication material24. Since we have relatively small variations in the extinction coefficient over the frequency band that we utilized in this work, we ignored the material absorption during the training and numerical simulations of diffractive layers.

After the wave propagation through the diffractive layers, light goes through the square output aperture of 2 mm width, which is placed right in front of the hemispherical silicon lens, which is 1.2 cm in diameter. Since the effective aperture of this Si lens was significantly restricted by the output aperture, it was modeled as a uniform slab with a refractive index of 3.4 and 0.5 cm thickness. After the propagation through the Si slab, the coherent integration of the optical waves incident on the active area of the detector was computed to obtain the spectral field amplitude and phase for each frequency. The power efficiency was defined as \(\eta _{f_0} = \frac{{I_{{\mathrm{sensor}},f_0}}}{{I_{{\mathrm{input}},f_0}}}\) for the peak/center frequency (\(f_0\)) of a given diffractive network design, where \(I_{{\mathrm{input}},f_0}\) and \(I_{{\mathrm{sensor}},f_0}\) denote the power within the input and output apertures, respectively.

Our loss function (L) used during the training phase has three components: a temporal loss term (\(L_t\)), which penalizes the mismatch between the target and the output time waveforms, a power loss term (\(L_p\)), and a term for the power surrounding the detector region (\(L_s\)), i.e.,

$$L = \alpha L_t + \beta L_p + L_s$$
(8)

To calculate the temporal loss, \(L_t\), first the output temporal waveform is reconstructed from the spectral field amplitude and phase on the detector area, and it is normalized. Then, the squared difference between the target temporal waveform and the reconstructed output waveform is summed over time:

$$L_t = \mathop {\sum}\limits_t {\left( {f_{{\mathrm{target}}} - f_{{\mathrm{output}}}} \right)^2} ,$$
(9)

where \(f_{\mathrm{target}}\) and \(f_{\mathrm{output}}\) denote the ground-truth, time-domain waveform and the waveform synthesized by the diffractive network model at a training iteration, respectively. For a given diffractive network model, \(f_{\mathrm{output}}\) is computed by propagating the input waves of all the spectral components from the input aperture to the output aperture. Next, the complex-valued wave fields of these different wavelength components are integrated over the sensitive area of the detector to obtain each complex-valued spectral coefficient at the output, which is followed by an inverse Fourier transform operation over the resulting vector. Alternatively, the error term between a target, time-domain pulse, \(f_{\mathrm{target}}\), and the waveform synthesized by the diffractive network, \(f_{\mathrm{output}}\), can directly be computed based on the complex-valued spectral coefficients without any inverse Fourier transform operation. However, in this case, since the error is defined based on the complex-valued target and output functions, two separate error functions must be computed for the real and imaginary parts of the spectral coefficients and these two losses must be combined to compute the final loss term.
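The time-domain route to the temporal loss of Eq. (9) can be sketched as below. This is a NumPy illustration rather than the authors' implementation: it synthesizes the waveform with a direct inverse Fourier sum over the sampled frequencies, and it assumes a unit-peak normalization because the text does not specify how the output waveform is normalized. All function names are illustrative.

```python
import numpy as np

def synthesize_waveform(coeffs, freqs, t):
    """Real waveform from one-sided complex spectral coefficients (direct inverse Fourier sum)."""
    phasors = np.exp(2j * np.pi * np.outer(t, freqs))   # shape (Nt, Nf)
    return np.real(phasors @ coeffs)

def temporal_loss(coeffs, freqs, t, f_target):
    """Eq. (9): sum over time of the squared target/output mismatch."""
    f_out = synthesize_waveform(coeffs, freqs, t)
    f_out = f_out / np.max(np.abs(f_out))   # assumed normalization: unit peak
    return np.sum((f_target - f_out) ** 2)

# N = 300 frequencies between 3 GHz and 1 THz, as in the text.
freqs = np.linspace(3e9, 1e12, 300)
t = np.linspace(-50e-12, 50e-12, 1001)

# Toy spectrum: a single tone, compared against the matching cosine target.
coeffs = np.zeros(300, dtype=complex)
coeffs[100] = 1.0
target = np.cos(2 * np.pi * freqs[100] * t)

loss_matched = temporal_loss(coeffs, freqs, t, target)
loss_shifted = temporal_loss(coeffs * np.exp(1j * np.pi / 2), freqs, t, target)
```

The second call shows that a spectral phase error alone, with unchanged spectral amplitude, is penalized by this loss, which is why the spectral phase has to be controlled for pulse shaping.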

The analytical form of the square pulses used in this work can be written as: \(f_{{\mathrm{target}}}(t) = rect(bt)\cos (2\pi f_0t)\), where \(f_0\) and b represent the carrier frequency and the rectangular pulse-width, respectively. For the Gaussian pulses, however, the analytical form of the target waveform can be written as: \(f_{{\mathrm{target}}}\left( t \right) = \mathop {\sum}\nolimits_{i = 1}^n {C_i} \cos \left( {2\pi f_0(t - t_{0,i})} \right)\exp \left( { - \left( {t - t_{0,i}} \right)^2/(2p_i)} \right)\exp \left( {jq_i(t - t_{0,i})^2} \right)\), where \(t_{0,i}\), \(C_i\), \(p_i\) and \(q_i\) denote the time instant of the peak, magnitude, variance of the low-pass envelope and the instantaneous angular chirpiness, respectively. The number of desired pulses inside a targeted time-window is determined by n. For the examples shown in Supplementary Figs. 7–9, the target time domain waveforms were created by setting these parameters to [\(n = 1\), \(t_0 = 0\), \(C_1 = 1\), \(p_1 = 2.2 \times 10^{-22}\), \(q_1 = 5.76 \times 10^{21}\)]; [\(n = 2\), \(t_{0,1} = 0\), \(t_{0,2} = 27\) ps, \(C_1 = 1\), \(C_2 = 0.5\), \(p_1 = p_2 = 1.38 \times 10^{-23}\), \(q_1 = 6.25 \times 10^{22}\), \(q_2 = -6.25 \times 10^{22}\)]; and [\(n = 2\), \(t_{0,1} = 0\), \(t_{0,2} = 19\) ps, \(C_1 = 1\), \(C_2 = 1\), \(p_1 = p_2 = 4.58 \times 10^{-24}\), \(q_1 = q_2 = 0\)], respectively.
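The two families of target waveforms can be generated directly from their analytical forms, as in the following sketch. Several choices here are assumptions: the carrier frequency of 0.35 THz is a placeholder (the carrier used for these examples is not given in this section), `rect` is taken as the unit-width rectangle so that the pulse lasts 1/b, SI units (s² and s⁻²) are assumed for the p and q values, and the function names are illustrative.

```python
import numpy as np

def square_pulse(t, f0, b):
    """rect(b t) cos(2 pi f0 t), with rect(u) = 1 for |u| <= 1/2 (assumed convention)."""
    rect = (np.abs(b * t) <= 0.5).astype(float)
    return rect * np.cos(2 * np.pi * f0 * t)

def gaussian_pulses(t, f0, t0, c, p, q):
    """Sum of chirped Gaussian pulses; complex-valued whenever any q_i is non-zero."""
    out = np.zeros_like(t, dtype=complex)
    for t0_i, c_i, p_i, q_i in zip(t0, c, p, q):
        tau = t - t0_i
        out += (c_i * np.cos(2 * np.pi * f0 * tau)
                * np.exp(-tau**2 / (2 * p_i))
                * np.exp(1j * q_i * tau**2))
    return out

dt = 12.5e-15
t = np.arange(-4000, 4001) * dt          # -50 ps to +50 ps
f0 = 0.35e12                             # hypothetical carrier frequency

square = square_pulse(t, f0, b=1 / 15.5e-12)
square_duration = np.count_nonzero(np.abs(t) * (1 / 15.5e-12) <= 0.5) * dt

# Second parameter set from the text: two pulses with opposite chirp.
double = gaussian_pulses(t, f0, t0=[0.0, 27e-12], c=[1.0, 0.5],
                         p=[1.38e-23, 1.38e-23], q=[6.25e22, -6.25e22])
peak_first = abs(double[4000])           # sample at t = 0
peak_second = abs(double[6160])          # sample at t = 27 ps
```

Since the chirp factor makes the Gaussian target complex, comparing it with a real synthesized waveform requires a convention (for example taking the real part) that is not spelled out here.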

For the diffractive network designs shown in the last row of the table in Fig. 6, we used a power loss term, Lp, defined as:

$$L_p = \left\{ {\begin{array}{*{20}{c}} { - \log \left( {\frac{\eta }{{\eta _{{\mathrm{th}}}}}} \right),} & {{\mathrm{if}}\,\eta \,\,< \,\,\eta _{{\mathrm{th}}}} \\ {0,} & {{\mathrm{if}}\,\eta \,\,\ge \,\,\eta _{{\mathrm{th}}}} \end{array}} \right.,$$
(10)

where η = \(\frac{{\mathop {\sum}\nolimits_\omega {I_{{\mathrm{sensor}}}} }}{{\mathop {\sum}\nolimits_\omega {I_{{\mathrm{input}}}} }}\). \(I_{\mathrm{input}}\) and \(I_{\mathrm{sensor}}\) denote the power within the input and output apertures for a given wavelength, respectively. For the diffractive network designs shown in the last row of the table in Fig. 6, corresponding to 2 mm and 4 mm output apertures, \(\eta_{\mathrm{th}}\) was selected as 0.07 and 0.08, respectively. For all the remaining designs reported in the manuscript, the power loss term is defined as:

$$L_p = \frac{{\mathop {\sum}\nolimits_\omega {\left( {I_{{\mathrm{target}}} - I_{{\mathrm{sensor}}}} \right)^2} }}{{\mathop {\sum}\nolimits_\omega {I_{{\mathrm{target}}}^2} }},$$
(11)

where \(I_{\mathrm{target}}\) is the total power of the target waveform at a given wavelength within the input aperture, normalized with respect to the power of the input at the center frequency, \(f_0\).
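Both forms of the power loss term, Eqs. (10) and (11), reduce to a few array operations over the sampled frequencies. The sketch below uses NumPy with made-up per-frequency power values; the function names are illustrative.

```python
import numpy as np

def power_loss_threshold(i_sensor, i_input, eta_th):
    """Eq. (10): logarithmic penalty that vanishes once eta reaches eta_th."""
    eta = np.sum(i_sensor) / np.sum(i_input)
    return -np.log(eta / eta_th) if eta < eta_th else 0.0

def power_loss_target(i_target, i_sensor):
    """Eq. (11): normalized squared error between target and detected spectra."""
    return np.sum((i_target - i_sensor) ** 2) / np.sum(i_target ** 2)

# Made-up spectra over 300 frequencies, for illustration only.
i_input = np.full(300, 1.0)
i_low = np.full(300, 0.035)    # eta = 0.035, half of the 0.07 threshold
i_high = np.full(300, 0.10)    # eta = 0.10, above the threshold

loss_low = power_loss_threshold(i_low, i_input, eta_th=0.07)
loss_high = power_loss_threshold(i_high, i_input, eta_th=0.07)
loss_exact = power_loss_target(i_low, i_low)
loss_dark = power_loss_target(i_low, np.zeros(300))
```

The thresholded form only pushes the design until the efficiency target is met and then leaves the temporal loss to dominate, whereas Eq. (11) keeps pulling the detected spectrum towards a specific target spectrum.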

The last component of our loss function which represents the power surrounding the detector aperture is defined as:

$$L_s = \frac{{\mathop {\sum}\nolimits_\omega {I_{{\mathrm{surround}}}} }}{{\mathop {\sum}\nolimits_\omega {I_{{\mathrm{output}}\,{\mathrm{plane}}}} }},$$
(12)

where \(I_{\mathrm{surround}}\) is the total power at a given wavelength within the 5 mm × 5 mm square region that is centered around the output aperture (excluding the output aperture, i.e., it only measures the signal surrounding the output aperture) and \(I_{\mathrm{output\,plane}}\) is the total power at a given wavelength within the output plane.
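Equation (12) can be illustrated with boolean masks over a sampled output plane. The grid size, the 0.25 mm sampling and the function name are assumptions made for this sketch; the 2 mm aperture and the 5 mm × 5 mm surrounding square follow the text.

```python
import numpy as np

def surround_loss(intensity, pixel_mm, aperture_mm=2.0, surround_mm=5.0):
    """Eq. (12) for an intensity array of shape (Nf, Ny, Nx) centered on the aperture."""
    _, ny, nx = intensity.shape
    x = (np.arange(nx) - (nx - 1) / 2) * pixel_mm
    y = (np.arange(ny) - (ny - 1) / 2) * pixel_mm
    xx, yy = np.meshgrid(x, y)
    in_aperture = (np.abs(xx) <= aperture_mm / 2) & (np.abs(yy) <= aperture_mm / 2)
    in_square = (np.abs(xx) <= surround_mm / 2) & (np.abs(yy) <= surround_mm / 2)
    surround = in_square & ~in_aperture          # ring around the aperture only
    return intensity[:, surround].sum() / intensity.sum()

# Case 1: uniform illumination of a 40 mm x 40 mm plane at 3 frequencies.
uniform = np.ones((3, 160, 160))
ls_uniform = surround_loss(uniform, pixel_mm=0.25)

# Case 2: all power confined to the central 2 mm x 2 mm aperture.
focused = np.zeros((3, 160, 160))
focused[:, 76:84, 76:84] = 1.0
ls_focused = surround_loss(focused, pixel_mm=0.25)
```

The term is zero when no power falls in the ring around the aperture, so minimizing it discourages designs that leave strong fields just outside the detector opening.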

The diffractive networks that synthesized 10.58 ps, 10.96 ps, 13.26 ps, 15.56 ps, 15.69 ps, and 17.94 ps square terahertz pulses were trained with \(\frac{\alpha }{\beta }\) ratios of 6500, 500, 4500, 1500, 750,000, and 2500, respectively. For the physical transfer learning approach, an \(\frac{\alpha }{\beta }\) ratio of 8500 was used. For Supplementary Figs. 7 and 9, we used an \(\frac{\alpha }{\beta }\) ratio of 1500, and for Supplementary Fig. 8, we used \(\frac{\alpha }{\beta } = 15000\).

Figure 6 reports a series of diffractive optical network designs that are trained to create a square pulse of 15.5 ps at their output apertures, achieving different levels of power efficiencies. Among these pulse shaping diffractive network models, the \(\frac{\alpha }{\beta }\) ratio was adjusted depending on the size of the output aperture. Specifically, the diffractive networks targeting a 2 mm aperture at the output plane were trained with \(\frac{\alpha }{\beta } = 1500\), and this ratio was reduced to 136 for the diffractive pulse shaping systems with 4 mm wide output apertures. Finally, an \(\frac{\alpha }{\beta }\) ratio of 150 was used for the diffractive optical networks that were trained with the power efficiency loss term described in Eq. (10).

In our training, the Adam optimizer is used with standard error backpropagation, with a learning rate of \(0.8 \times 10^{-3}\) for the pulses reported in Supplementary Figs. 7–9. For the diffractive networks synthesizing 10.96 ps and 15.69 ps square pulses, on the other hand, the learning rate was set to \(10^{-3}\). For the rest of the diffractive network models, \(10^{-4}\) was used as the learning rate. All the trainable parameters were initialized as zero. Our designs used Python (v3.7.3) and TensorFlow (v1.15.0) on a computer that has an Nvidia Titan RTX graphics processing unit, an Intel Core i9 CPU and 128 GB of RAM, running the Windows 10 operating system. MATLAB 2016b is used to convert the designed diffractive layers to a 3D-printable (.stl) file format.

Data availability

All the data and methods needed to evaluate the conclusions in this work are present in the main text and the Supplementary Information. Any other relevant data are available from the authors upon reasonable request.

Code availability

The code used in this work relies on standard, publicly available TensorFlow libraries.

References

  1. 1.

    Cox, D. D. & Dean, T. Neural networks and neuroscience-inspired computer vision. Curr. Biol. 24, R921–R929 (2014).

    CAS  PubMed  Article  Google Scholar 

  2. 2.

    LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).

    CAS  PubMed  PubMed Central  Article  ADS  Google Scholar 

  3. 3.

    Collobert R., Weston J. A unified architecture for natural language processing: deep neural networks with multitask learning. In: Proceedings of the 25th International Conference on Machine Learning. 160–167 (Association for Computing Machinery, Helsinki, Finland, 2008).

  4. 4.

    Litjens, G. et al. A survey on deep learning in medical image analysis. Med. Image Anal. 42, 60–88 (2017).

    PubMed  Article  Google Scholar 

  5. 5.

    Rivenson, Y. et al. Deep learning enhanced mobile-phone microscopy. ACS Photon. 5, 2354–2364 (2018).

    CAS  Article  Google Scholar 

  6. 6.

    Rivenson, Y. et al. Deep learning microscopy. Optica 4, 1437–1443 (2017).

    Article  ADS  Google Scholar 

  7. 7.

    Nehme, E., Weiss, L. E., Michaeli, T. & Shechtman, Y. Deep-STORM: super-resolution single-molecule microscopy by deep learning. Optica 5, 458–464 (2018).

    CAS  Article  ADS  Google Scholar 

  8. 8.

    Ouyang, W., Aristov, A., Lelek, M., Hao, X. & Zimmer, C. Deep learning massively accelerates super-resolution localization microscopy. Nat. Biotechnol. 36, 460–468 (2018).

    CAS  PubMed  Article  Google Scholar 

  9. 9.

    Wang, H. et al. Deep learning enables cross-modality super-resolution in fluorescence microscopy. Nat. Methods 16, 103–110 (2019).

    CAS  PubMed  Article  Google Scholar 

  10. 10.

    Wu, Y. et al. Three-dimensional virtual refocusing of fluorescence microscopy images using deep learning. Nat. Methods 16, 1323–1331 (2019).

    CAS  PubMed  Article  Google Scholar 

  11. 11.

    Rivenson, Y., Zhang, Y., Günaydın, H., Teng, D. & Ozcan, A. Phase recovery and holographic image reconstruction using deep learning in neural networks. Light. Sci. Appl. 7, 17141–17141 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  12. 12.

    Rivenson, Y. et al. PhaseStain: the digital staining of label-free quantitative phase microscopy images using deep learning. Light. Sci. Appl. 8, 23 (2019).

    PubMed  PubMed Central  Article  ADS  CAS  Google Scholar 

  13. 13.

    Wu, Y. et al. Deep learning enables high-throughput analysis of particle-aggregation-based biosensors imaged using holography. ACS Photon. 6, 294–301 (2019).

    CAS  Article  Google Scholar 

  14. 14.

    Sinha, A., Lee, J., Li, S. & Barbastathis, G. Lensless computational imaging through deep learning. Optica 4, 1117–1125 (2017).

    Article  ADS  Google Scholar 

  15. 15.

    Wu, Y. et al. Bright-field holography: cross-modality deep learning enables snapshot 3D imaging with bright-field contrast using a single hologram. Light. Sci. Appl. 8, 25 (2019).

    PubMed  PubMed Central  Article  ADS  CAS  Google Scholar 

  16. 16.

    Wu, Y. et al. Extended depth-of-field in holographic imaging using deep-learning-based autofocusing and phase recovery. Optica 5, 704–710 (2018).

    Article  ADS  Google Scholar 

  17. 17.

    Ballard, Z. S. et al. Deep learning-enabled point-of-care sensing using multiplexed paper-based sensors. npj Digital Med. 3, 1–8 (2020).

    Article  Google Scholar 

  18. 18.

    Holmström, O. et al. Point-of-care mobile digital microscopy and deep learning for the detection of soil-transmitted helminths and Schistosoma haematobium. Glob. Health Action 10, 1337325 (2017).

    PubMed  PubMed Central  Article  Google Scholar 

  19. 19.

    Joung, H.-A. et al. Point-of-care serodiagnostic test for early-stage lyme disease using a multiplexed paper-based immunoassay and machine learning. ACS Nano 14, 229–240 (2020).

    CAS  PubMed  Article  Google Scholar 

  20. 20.

    Veli, M. & Ozcan, A. Computational sensing of staphylococcus aureus on contact lenses using 3D imaging of curved surfaces and machine learning. ACS Nano 12, 2554–2559 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  21. 21.

    Malkiel, I. et al. Plasmonic nanostructure design and characterization via Deep Learning. Light. Sci. Appl. 7, 60 (2018).

    PubMed  PubMed Central  Article  ADS  CAS  Google Scholar 

  22. 22.

    Peurifoy, J. et al. Nanophotonic particle simulation and inverse design using artificial neural networks. Sci. Adv. 4, eaar4206 (2018).

    PubMed  PubMed Central  Article  ADS  CAS  Google Scholar 

  23. 23.

    Liu, D., Tan, Y., Khoram, E. & Yu, Z. Training deep neural networks for the inverse design of nanophotonic structures. ACS Photon. 5, 1365–1369 (2018).

    CAS  Article  Google Scholar 

  24. 24.

    Luo, Y. et al. Design of task-specific optical systems using broadband diffractive neural networks. Light. Sci. Appl. 8, 112 (2019).

    PubMed  PubMed Central  Article  ADS  CAS  Google Scholar 

  25. 25.

    Borhani, N., Kakkava, E., Moser, C. & Psaltis, D. Learning to see through multimode fibers. Optica 5, 960–966 (2018).

    Article  ADS  Google Scholar 

  26. 26.

    Shen, Y. et al. Deep learning with coherent nanophotonic circuits. Nat. Photon. 11, 441–446 (2017).

    CAS  Article  ADS  Google Scholar 

  27. 27.

    Hughes, T. W., Williamson, I. A. D., Minkov, M. & Fan, S. Wave physics as an analog recurrent neural network. Sci. Adv. 5, eaay6946 (2019).

    PubMed  PubMed Central  Article  ADS  Google Scholar 

  28. 28.

    Miscuglio, M. et al. All-optical nonlinear activation function for photonic neural networks [Invited]. Opt. Mater. Express 8, 3851–3863 (2018).

    CAS  Article  ADS  Google Scholar 

  29. 29.

    Bueno, J. et al. Reinforcement learning in a large-scale photonic recurrent neural network. Optica 5, 756–760 (2018).

    Article  ADS  Google Scholar 

  30. 30.

    Sande, G. Vder, Brunner, D. & Soriano, M. C. Advances in photonic reservoir computing. Nanophotonics 6, 561–576 (2017).

    Article  Google Scholar 

  31. 31.

    Hamerly, R., Bernstein, L., Sludds, A., Soljačić, M. & Englund, D. Large-scale optical neural networks based on photoelectric multiplication. Phys. Rev. X 9, 021032 (2019).

    CAS  Google Scholar 

  32. 32.

    Lin, X. et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004 (2018).

    MathSciNet  CAS  PubMed  MATH  Article  ADS  Google Scholar 

  33. 33.

    Li, J., Mengu, D., Luo, Y., Rivenson, Y. & Ozcan, A. Class-specific differential detection in diffractive optical neural networks improves inference accuracy. Adv. Photon. 1, 046001 (2019).

    ADS  Google Scholar 

  34. 34.

    Mengu, D., Luo, Y., Rivenson, Y. & Ozcan, A. Analysis of diffractive optical neural networks and their integration with electronic neural networks. IEEE J. Sel. Top. Quantum Electron. 26, 1–14 (2020).

    Article  Google Scholar 

  35. 35.

    Marin-Palomo, P. et al. Microresonator-based solitons for massively parallel coherent optical communications. Nature 546, 274–279 (2017).

    CAS  PubMed  Article  ADS  Google Scholar 

  36. 36.

    Strickland, D. & Mourou, G. Compression of amplified chirped optical pulses. Opt. Commun. 56, 219–221 (1985).

    Article  ADS  Google Scholar 

  37. 37.

    Vabishchevich, P. P., Shcherbakov, M. R., Bessonov, V. O., Dolgova, T. V. & Fedyanin, A. A. Femtosecond pulse shaping with plasmonic crystals. JETP Lett. 101, 787–792 (2015).

    CAS  Article  ADS  Google Scholar 

  38. 38.

    Rahimi, E. & Şendur, K. Femtosecond pulse shaping by ultrathin plasmonic metasurfaces. J. Opt. Soc. Am. B 33, A1–A7 (2016).

    CAS  Article  ADS  Google Scholar 

  39. 39.

    Szipöcs, R., Ferencz, K., Spielmann, C. & Krausz, F. Chirped multilayer coatings for broadband dispersion control in femtosecond lasers. Opt. Lett. 19, 201–203 (1994).

    PubMed  Article  ADS  PubMed Central  Google Scholar 

  40. 40.

    Supradeepa, V. R., Huang, C.-B., Leaird, D. E. & Weiner, A. M. Femtosecond pulse shaping in two dimensions: towards higher complexity optical waveforms. Opt. Express 16, 11878–11887 (2008).

    CAS  PubMed  Article  ADS  Google Scholar 

  41. 41.

    Weiner, A. M. Femtosecond pulse shaping using spatial light modulators. Rev. Sci. Instrum. 71, 1929–1960 (2000).

    CAS  Article  ADS  Google Scholar 

  42. 42.

    Dugan, M. A., Tull, J. X. & Warren, W. S. High-resolution acousto-optic shaping of unamplified and amplified femtosecond laser pulses. J. Opt. Soc. Am. B 14, 2348–2358 (1997).

    CAS  Article  ADS  Google Scholar 

  43. 43.

    Weiner, A. M. Ultrafast optical pulse shaping: a tutorial review. Opt. Commun. 284, 3669–3692 (2011).

    CAS  Article  ADS  Google Scholar 

  44. 44.

    Yelin, D., Meshulach, D. & Silberberg, Y. Adaptive femtosecond pulse compression. Opt. Lett. 22, 1793–1795 (1997).

    CAS  PubMed  Article  ADS  Google Scholar 

  45. 45.

    Assion, A. et al. Control of chemical reactions by feedback-optimized phase-shaped femtosecond laser pulses. Science 282, 919–922 (1998).

    CAS  PubMed  Article  ADS  Google Scholar 

  46. 46.

    Efimov, A., Moores, M. D., Beach, N. M., Krause, J. L. & Reitze, D. H. Adaptive control of pulse phase in a chirped-pulse amplifier. Opt. Lett. 23, 1915–1917 (1998).

    CAS  PubMed  Article  ADS  Google Scholar 

  47. 47.

    Weiner, A. M., Leaird, D. E., Patel, J. S. & Wullert, J. R. Programmable femtosecond pulse shaping by use of a multielement liquid-crystal phase modulator. Opt. Lett. 15, 326–328 (1990).

    CAS  PubMed  Article  ADS  Google Scholar 

  48. 48.

    Bardeen, C. J. et al. Feedback quantum control of molecular electronic population transfer. Chem. Phys. Lett. 280, 151–158 (1997).

    CAS  Article  ADS  Google Scholar 

  49. 49.

    Hillegas, C. W., Tull, J. X., Goswami, D., Strickland, D. & Warren, W. S. Femtosecond laser pulse shaping by use of microsecond radio-frequency pulses. Opt. Lett. 19, 737–739 (1994).

    CAS  PubMed  Article  ADS  Google Scholar 

  50. 50.

    Zeek, E. et al. Pulse compression by use of deformable mirrors. Opt. Lett. 24, 493–495 (1999).

    CAS  PubMed  Article  ADS  Google Scholar 

  51. 51.

    Divitt, S., Zhu, W., Zhang, C., Lezec, H. J. & Agrawal, A. Ultrafast optical pulse shaping using dielectric metasurfaces. Science 364, 890–894 (2019).

    CAS  PubMed  PubMed Central  Article  ADS  Google Scholar 

  52. 52.

    Hashemi, M. R., Cakmakyapan, S. & Jarrahi, M. Reconfigurable metamaterials for terahertz wave manipulation. Rep. Prog. Phys. 80, 094501 (2017).

    PubMed  Article  ADS  CAS  Google Scholar 

  53. 53.

    Rahm, M., Li, J.-S. & Padilla, W. J. THz wave modulators: a brief review on different modulation techniques. J. Infrared Milli Terahz Waves 34, 1–27 (2013).

    Article  Google Scholar 

  54. 54.

    Danielson, J. R., Amer, N. & Lee, Y.-S. Generation of arbitrary terahertz wave forms in fanned-out periodically poled lithium niobate. Appl. Phys. Lett. 89, 211118 (2006).

  55.

    Stepanov, A. G., Hebling, J. & Kuhl, J. Generation, tuning, and shaping of narrow-band, picosecond THz pulses by two-beam excitation. Opt. Express 12, 4650–4658 (2004).

  56.

    Sato, M. et al. Terahertz polarization pulse shaping with arbitrary field control. Nat. Photon. 7, 724–731 (2013).

  57.

    Keren-Zur, S., Tal, M., Fleischer, S., Mittleman, D. M. & Ellenbogen, T. Generation of spatiotemporally tailored terahertz wavepackets by nonlinear metasurfaces. Nat. Commun. 10, 1778 (2019).

  58.

    Liu, Y., Park, S.-G. & Weiner, A. M. Terahertz waveform synthesis via optical pulse shaping. IEEE J. Sel. Top. Quantum Electron. 2, 709–719 (1996).

  59.

    Gingras, L. & Cooke, D. G. Direct temporal shaping of terahertz light pulses. Optica 4, 1416–1420 (2017).

  60.

    Yardimci, N. T. & Jarrahi, M. High sensitivity terahertz detection through large-area plasmonic nano-antenna arrays. Sci. Rep. 7, 42667 (2017).

  61.

    Cunningham, P. D. et al. Broadband terahertz characterization of the refractive index and absorption of some important polymeric and organic electro-optic materials. J. Appl. Phys. 109, 043505 (2011).

  62.

    Podzorov, A. & Gallot, G. Low-loss polymers for terahertz applications. Appl. Opt. 47, 3254–3257 (2008).

  63.

    Jin, Y.-S., Kim, G.-J. & Jeon, S.-G. Terahertz dielectric properties of polymers. J. Korean Phys. Soc. 49, 513–517 (2006).

  64.

    Burghoff, D. et al. Terahertz laser frequency combs. Nat. Photon. 8, 462–467 (2014).

  65.

    Bachmann, D. et al. Short pulse generation and mode control of broadband terahertz quantum cascade lasers. Optica 3, 1087–1094 (2016).

  66.

    Barbieri, S. et al. Coherent sampling of active mode-locked terahertz quantum cascade lasers and frequency synthesis. Nat. Photon. 5, 306–313 (2011).

  67.

    van der Weide, D. W. Delta-doped Schottky diode nonlinear transmission lines for 480-fs, 3.5-V transients. Appl. Phys. Lett. 65, 881–883 (1994).

  68.

    Assefzadeh, M. M. & Babakhani, A. Broadband oscillator-free THz pulse generation and radiation based on direct digital-to-impulse architecture. IEEE J. Solid-State Circuits 52, 2905–2919 (2017).

  69.

    Abo-Bakr, M. et al. Brilliant, coherent far-infrared (THz) synchrotron radiation. Phys. Rev. Lett. 90, 094801 (2003).

  70.

    Partanen, A. et al. Fabrication of terahertz wire-grid polarizers. Appl. Opt. 51, 8360–8365 (2012).

  71.

    Kitahara, H. et al. Terahertz wave dispersion in two-dimensional photonic crystals. Phys. Rev. B 64, 045202 (2001).

  72.

    Yardimci, N. T., Yang, S.-H., Berry, C. W. & Jarrahi, M. High-power terahertz generation using large-area plasmonic photoconductive emitters. IEEE Trans. Terahertz Sci. Technol. 5, 223–229 (2015).

  73.

    Kulce, O., Mengu, D., Rivenson, Y. & Ozcan, A. All-optical information processing capacity of diffractive surfaces. Preprint at http://arxiv.org/abs/2007.12813.

Acknowledgements

The Ozcan Research Group at UCLA acknowledges the support of Koç Group and HHMI. The Jarrahi Research Group at UCLA also acknowledges the support of NSF.

Author information

Contributions

A.O., M.V., D.M., and Y.R. conceived the research, M.V. and N.T.Y. conducted the experiments, M.V. analyzed the data, and Y.L. and J.L. assisted with the analysis and experiments. All the authors contributed to this manuscript. A.O. and M.J. supervised the research.

Corresponding author

Correspondence to Aydogan Ozcan.

Ethics declarations

Competing interests

A.O., M.V., D.M., and Y.R. are co-inventors of a patent application on the presented method. The remaining authors declare no competing interests.

Additional information

Peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Veli, M., Mengu, D., Yardimci, N.T. et al. Terahertz pulse shaping using diffractive surfaces. Nat Commun 12, 37 (2021). https://doi.org/10.1038/s41467-020-20268-z
