Introduction

Integrating optical components on the silicon platform is appealing due to its compatibility with the complementary metal-oxide-semiconductor (CMOS) technology. This offers interesting perspectives for the mass-production of integrated photonic circuits1. Many devices which are suited for diverse applications from bio-sensing to optical routing were demonstrated2,3. Our attention is specifically devoted to nonlinear optics in the silicon photonics platform since the small waveguide size and the large modal confinement favor a high power density for a low input optical power4. This leads to the realization of integrated photonic devices, e.g., for wavelength conversion or generation of quantum states of light5,6. These exploit third order nonlinearities, since in the dipole approximation second order nonlinearities are inhibited in bulk silicon due to its centrosymmetric crystalline structure7. However, second order nonlinearities, which require a lower threshold power than third order effects, would be advantageous for, e.g., high frequency electro-optical modulation or spontaneous parametric down-conversion. Therefore, many efforts have been paid to induce second order nonlinearities in silicon. The most successful works were based on strained silicon, where a stressing over-layer deposited on the top of the silicon waveguide breaks the silicon bulk centrosymmetry8. The first experiment reported on the observation of a DC electro-optic effect in waveguides stressed by a silicon nitride (SiN) over-layer9. This observation was followed by many others, and strain-induced second order nonlinear coefficients χ(2) up to 340 pmV−1 were reported10,11. Other works, some of which were based on high frequency measurements, questioned the interpretation of these experiments. They demonstrated that frozen charges at the Si/SiN interface determine an abundance of free-carriers in the waveguide, causing a large DC electro-optic effect due to free carrier dispersion12,13,14,15. In particular, an upper limit of 8 pmV−1 to the value of the strain-induced χ(2) was set13. In a very recent work16, a high-frequency experiment was able to demonstrate an electro-optic modulation that is ascribed to a strain-induced χ(2) of the order of 1.8 pm/V. The coefficients connecting strain gradients to the strain-induced χ(2) were fitted from the experiment according to the theoretical model proposed in17. Still no proof that the data are related to the Pockels effect10 and not to the Kerr effect18 is clearly given. Other early works were about second harmonic generation (SHG) in strained silicon waveguides19. By using pump wavelengths above 2 μm, SH was generated in multimodal waveguides. A χ(2) as large as (40 ± 30) pmV−1 was deduced by the generation efficiency without accounting for any phase-matching mechanism. Subsequently, SHG experiments performed on differently stressed waveguides yield indication that not only strain but also charged defects in the cladding could explain the SHG signal20. In fact, charged defects could introduce a DC field EDC that couples to the pump photons through the third order nonlinear coefficient χ(3). This generates an effective nonlinearity \({\chi }_{EFISH}^{(2)}=3{\chi }^{(3)}{E}_{DC}\), which is known as the electric-field-induced SH (EFISH). Finally, SiN possesses a non-negligible χ(2)21,22. So, the observed SHG could be due to the evanescent field of the pump mode that extends in the cladding.

The complexity of the materials, where strain, stress, interfaces and residual doping all play a role, impedes a clear disentanglement of these possible contributions and of their relative importance. This limits the development of second order nonlinear devices based on strained silicon. Therefore, the aim of this work is to confirm the observation of SHG in suitably designed waveguides and to clarify the origin of the nonlinearity. Here, we report on the quantitative measurements of the SHG efficiency in phase-matched silicon waveguides covered by a stressing SiN cladding. Using a pulse propagation modeling, we estimate a χ(2) of 0.5 ± 0.1 pmV1. To clarify its origin, we use a screw-equipped sample holder, which allows inducing a tunable strain on the waveguides. Surprisingly, this does not affect the SHG efficiency. On the other hand, the SHG signal is suppressed when the strained silicon waveguides are irradiated by ultraviolet (UV) light to passivate the charged defects23. These two observations allow us to conclude that, within our experimental resolution, the measured SHG signal is entirely due to EFISH caused by the charged defects. Assuming that the strain-induced nonlinearity is below the noise level of our measurement, we set an upper limit to the strain-induced χ(2) at about 0.05 pmV1.

Second Harmonic Generation Experiment

Experimental set-up

The experimental setup is similar to the one described in24,25 and is shown in Fig. 1(a). A regenerative amplified Ti:Sapphire laser pumps an optical parametric amplifier to obtain high power optical pulses with tunable wavelengths above 2.2 µm, 1 kHz repetition rate and 35 fs pulse duration. The use of a wavelength above 2.2 µm has two reasons. First, it prevents from two-photon absorption (TPA) within silicon. TPA competes with SHG, limiting the maximum achievable pump peak power in the waveguide and generating free carriers, which cause losses both at the pump and the SH wavelengths26. Second, with a >2.2 µm pump, SH is generated above 1.1 µm, where silicon is transparent. A pulse shaping stage allows to lengthen the pump pulse duration to up to 5 ps, which limits the peak power on the waveguide facet and reduces walk-off effects in the waveguide27. A polarization stage controls the pulse polarization, while a Fourier transform infrared spectrometer (FTIR) measures its spectrum. Using a tapered lensed fiber, an average power of up to 100 nW is coupled into the waveguide. The waveguide is mounted on a screw-equipped sample holder like the one described in13,28. This special holder is used to apply an external and tunable mechanical load to the waveguide. A second lensed fiber collects the SHG signal, which is detected by an InGaAs single photon avalanche detector (SPAD). The pump pulse triggers the SPAD, opening a detection window of 2.5 ns and setting the detection limit to about 0.004 fW for 1s of integration time. A scanning monochromator positioned between the fiber and the SPAD allows spectral analysis of the SHG signal.

Figure 1
figure 1

(a) Sketch of the experimental set-up. (b) Waveguide cross section with nominal dimensions. The width w is engineered for each modal combination to achieve phase-matching. (c) Effective refractive index dispersions for the TE1 mode at pump wavelengths (red line) and for the TM3 mode at SHG wavelengths (blue line). In the simulation w = 1 µm. The insets show the mode profiles for the pump and SHG modes. (d) Dependence of the SHG efficiency (in normalized dB scale) on pump wavelength for the different waveguide widths reported in the inset (combination TE1–TM3).

Waveguide engineering

The devices are fabricated on a 6′ silicon-on-insulator (SOI) wafer, with a 243 nm thick device layer and a 3 μm thick buried-oxide (BOX). The waveguides are defined by a 365 nm UV lithography and realized by reactive ion etching. A 140 nm thick SiN cladding layer is conformally deposited via low-pressure chemical vapor deposition (LPCVD), causing a measured tensile stress of 1.25 GPa in the silicon nitride cladding. Different silicon waveguide widths are used. The cross section of the waveguide is sketched in Fig. 1(b).

For SHG, the phase-matching condition requires the pump (np) and the SH (nsh) effective refractive indices to be equal (np = nsh). Due to the index dispersion, the pump and the SH signal propagating in the same modal order do not have equal effective indices. Therefore, we use different modes to get the phase-matching: the pump and the SH signals propagate in different modes which have the same effective modal indices21,29. We investigate two modal combinations: both have the pump in the fundamental transverse electric (TE1) mode, while, in the first case, SH is generated on the third order transverse magnetic (TM3) mode and, in the second case, SH is generated on the fifth order transverse magnetic (TM5) mode. Phase-matching is achieved by changing the waveguide width w (the first type has w ~ 1 µm, the second w ~ 2.3 µm). Figure 1(c) reports the simulated dependence of the effective refractive index versus the pump wavelength λp for the TE1 mode and the SH wavelength λsh for the TM3 mode in a 1 µm wide waveguide. At λp ~ 2378 nm, np = nsh, i.e. phase-matching is satisfied. The insets show the mode profiles for the pump and the SH waves. One can notice that, for equal parity modes the overlap integral is not negligible, which contributes to the generation efficiency. We perform these simulations using a finite element method (FEM) simulation software, taking material refractive index dispersions measured with ellipsometry30. SHG efficiency is proportional to sinc2(ΔβL/2), being L the waveguide length and Δβ = p − βsh = (np − nsh)sh the phase mismatch coefficient31. In Fig. 1(d) we plot the SHG efficiency in a normalized dB scale for the TE1–TM3 combination as a function of λp and for different w, considering L = 2 mm. Phase-matching wavelength strongly depends on w, and larger w requires longer wavelengths.

Second harmonic generation characterization

Figure 2(a) shows experimental SHG power as a function of λp, i.e. the spectrum of the SHG efficiency. The measurement is performed on a 1050 nm wide 4 mm long waveguide, where phase-matching of the TE1–TM3 combination is expected. The SHG signal strongly depends on λp, showing a peak at 2391 nm which corresponds to the phase-matching wavelength.

Figure 2
figure 2

(a) Measured SHG power as a function of λp. On the top: sketch of the setup used in this experiment. The waveguide is 4 mm long and 1050 nm wide. In the inset: SHG power dependence on the pump power. Results are fitted by a straight line in a log-log plot. (b) Spectral analysis of the generated signal performed using the set-up sketched on the top. (c) Dependence of the phase-matching pump wavelength on w. Data refer to the TE1–TM3 (left) and to the TE1–TM5 combination (right). Points with the same color refer to waveguides located nearby on the wafer. Black line is the behavior predicted by FEM simulations, while the gray area comes from simulations with 5% variations on w.

The attribution of the signal to SHG is also confirmed by the quadratic dependence of the SHG power on the pump power, shown in the inset of Fig. 2(a). In the log-log plot a slope of 1.81 ± 0.12 is observed, which almost matches the theoretical expectation of 27. The on-chip pump power is estimated from the incident power knowing the in-coupling losses, which are determined via the cut-back method (9 dB per facet for w = 1 µm, and 4 dB per facet for w = 2.3 µm)32. The on-chip SHG power is determined from the SPAD signal knowing its responsivity at the SH wavelength and the out-coupling losses. These are evaluated analytically from the field profile of the fiber and of the involved modes (15.5 dB per facet for the TM3 mode in the w = 1 µm waveguide, and 21.7 dB per facet for the TM5 mode in the w = 2.3 µm waveguide)33.

SHG is confirmed also by the spectral analysis of the generated signal, shown in Fig. 2(b), where both the pump and the SHG wavelengths are scanned. A clear SHG spectrum centered at one-half of pump wavelength is observed.

Figure 2(c) reports the phase-matching pump wavelength as a function of w, for the TE1–TM3 (left) and for the TE1–TM5 (right) combinations. Measurements referred to the same position in the wafer (same color) show a monotonous dependence on w. The trend agrees with the one predicted by simulations, reported as a straight line. However, data show an offset with respect to the theory. This offset is different between nominally equal waveguides located in different positions of the wafer (different colors). We attribute this offset to local variations of w on the wafer. This is evidenced by the gray area in the figure, corresponding to simulations with 5% variations on w (according to typical fabrication uncertainties). Experimental results fall in the area predicted by simulations. Moreover, waveguides positioned in the same wafer position show almost the same shift with respect to the theory, demonstrating local uniformity of the waveguide geometry on the short-scale distance.

Theoretical model

We estimate the strength of the non-linearity causing SHG by modelling the propagation of the pump and SHG pulses in the waveguide4. The model considers that, besides SHG, other processes affect the generation. Due to the short time duration of the pulses, dispersion is important. Moreover, due to the strong χ(3) of silicon, also self-phase modulation (SPM) and cross-phase modulation (XPM) are significant. The model describes the temporal and spatial evolution of the dimensionless pump and SHG field amplitudes, up and ush, related to the total optical power by Pp |up|2 and Psh |ush|2. The model solves the following coupled equations4:

$$\begin{array}{rcl}\frac{d}{dz}{u}_{p}+\sum _{m\ge 1}\frac{{(i)}^{m-1}{\beta }_{m}}{m!}\frac{{d}^{m}}{d{t}^{m}}{u}_{p} & = & -\,\frac{{\alpha }_{p}}{2}{u}_{p}+i{\gamma }_{p}^{(3)}{P}_{p}{|{u}_{p}|}^{2}{u}_{p}\\ & & +\,2i{\gamma }_{p}^{(2)\ast }\sqrt{{P}_{sh}}{u}_{sh}{u}_{p}^{\ast }\,\exp (\,-\,i{\rm{\Delta }}\beta z),\end{array}$$
(1a)
$$\begin{array}{rcl}\frac{d}{dz}{u}_{sh}+\sum _{m\ge 1}\frac{{(i)}^{m-1}{\beta }_{m}}{m!}\frac{{d}^{m}}{d{t}^{m}}{u}_{sh} & = & -\,\frac{{\alpha }_{sh}}{2}{u}_{sh}+2i{\gamma }_{sh,p}^{(3)}{P}_{p}{|{u}_{p}|}^{2}{u}_{sh}\\ & & +\,i{\gamma }_{sh}^{(2)}\frac{{P}_{p}}{\sqrt{{P}_{sh}}}{u}_{p}^{2}\,\exp (i{\rm{\Delta }}\beta z).\end{array}$$
(1b)

On the left-hand side, spatial and temporal derivatives are present. The coefficients βm are the Taylor expansion terms of the mode propagation constant β4. On the right-hand side there are a loss term (described by loss coefficients αp and αsh), a phase-modulation term (related to the third order nonlinear coefficient γ(3)), and the SHG term (related to the second order nonlinear coefficient γ(2)). Since the SHG power is much weaker than the pump power, in the pump equation we consider only SPM and we neglect XPM, while in the SHG equation we consider only XPM. The XPM and SPM terms are proportional to \({\gamma }_{ij}^{(3)}\) and \({\gamma }_{i}^{(3)}\), defined as:

$${\gamma }_{ij}^{(3)}=\frac{3{\omega }_{i}{n}_{G,i}{n}_{G,j}}{4{\varepsilon }_{0}{A}_{0}{c}^{2}}{{\rm{\Gamma }}}_{ij}^{(3)},\,{\gamma }_{i}^{(3)}={\gamma }_{ii}^{(3)}.$$
(2)

The factor ωi is the i-th pulse frequency, nG,i the group index of the mode at frequency ωi, A0 the waveguide area and c the speed of light in vacuum. The overlap term \({{\rm{\Gamma }}}_{ij}^{(3)}\) is defined by:

$${{\rm{\Gamma }}}_{ij}^{(3)}={A}_{0}\tfrac{{\int }_{{A}_{\infty }}{E}^{\ast }({r}_{\perp },{\omega }_{j}){\chi }^{(3)}\therefore E({r}_{\perp },{\omega }_{i}){E}^{\ast }({r}_{\perp },{\omega }_{i})E({r}_{\perp },{\omega }_{j})dA}{({\int }_{{A}_{\infty }}n{({r}_{\perp },{\omega }_{i})}^{2}|E({r}_{\perp },{\omega }_{i}){|}^{2}dA)({\int }_{{A}_{\infty }}n{({r}_{\perp },{\omega }_{j})}^{2}|E({r}_{\perp },{\omega }_{j}){|}^{2}dA)},$$
(3)

where χ(3) is the third order susceptibility tensor, r the coordinate in the waveguide cross-section plane, E(r, ωi) the mode profile at frequency ωi and n(r, ωi) the refractive index at frequency ωi. Integrals are taken on the transverse plane A. In the SHG term \({\gamma }_{i}^{(2)}\) is given by:

$$\begin{array}{c}{\gamma }_{i}^{(2)}=\frac{{\omega }_{i}{n}_{G,p}\sqrt{{n}_{G,sh}}}{\sqrt{8{\varepsilon }_{0}{A}_{0}{c}^{3}}}{{\rm{\Gamma }}}^{(2)},\\ {{\rm{\Gamma }}}^{(2)}=\sqrt{{A}_{0}}{\textstyle \tfrac{{\int }_{{A}_{{\rm{\infty }}}}{E}^{\ast }({r}_{\perp },{\omega }_{p}){\chi }^{(2)}\,:\,E({r}_{\perp },{\omega }_{sh}){E}^{\ast }({r}_{\perp },{\omega }_{p})dA}{({\int }_{{A}_{{\rm{\infty }}}}n{({r}_{\perp },{\omega }_{p})}^{2}\,|E({r}_{\perp },{\omega }_{p}){|}^{2}dA){({\int }_{{A}_{{\rm{\infty }}}}n{({r}_{\perp },{\omega }_{sh})}^{2}|E({r}_{\perp },{\omega }_{sh}){|}^{2}dA)}^{1/2}}}.\end{array}$$
(4)

where χ(2) is the second order susceptibility tensor. Note that Γ(2) depends on both the pump and SH optical mode profiles, as already underlined.

We solve Equations (1a) and (1b) using the split-step method, dividing the spatial domain in small segments34. The initial condition is the temporal profile of the pump pulse (we use a Gaussian shape) and of the SH pulse (set to zero). In each spatial segment, the solution is evaluated in three steps. First, the equations are solved in the Fourier domain, considering only dispersion. Second, first step solutions are used as initial conditions to solve the equations with SPM, XPM and losses alone. Third, the equations are coupled via the SHG term. This procedure is then iterated until the end of the waveguide.

The unknown parameter of Equations (1a) and (1b) is the tensor χ(2). Therefore, given the experimental SHG power, we aim to find χ(2). However, since the spatial distribution of χ(2) is unknown and is expected to be not constant, χ(2) remains encoded within Γ(2). So, Γ(2) is the parameter that can be evaluated from the model. Γ(2) in general is complex, so the model cannot provide a unique solution. Nevertheless, if Γ(2) is small, the pump pulse is not much affected by the SHG signal. Therefore, neglecting dispersive and phase-modulation effects, the SHG equation shows that ushΓ(2). Consequently, since in the experiment we measure Psh|ush|2, we cannot measure the phase of Γ(2) but only its module. This fact is confirmed by numerically solving Equations (1a) and (1b), from which we find that - for our typical powers - SHG power depends on (2)|, independently on its share between the real and the imaginary parts.

In our experiment, the pump is a TE mode (polarized along x), while the SHG signal is a TM mode (polarized along y). Therefore, we define the effective second order susceptibility \({\chi }_{eff}^{(2)}\):

$$\begin{array}{c}{\chi }_{eff}^{(2)}=|\frac{{{\rm{\Gamma }}}^{(2)}}{K}|,\\ \,K=\sqrt{{A}_{0}}\tfrac{{\int }_{{A}_{\infty }}{E}_{x}^{\ast }({r}_{\perp },{\omega }_{p}){E}_{y}({r}_{\perp },{\omega }_{sh}){E}_{x}^{\ast }({r}_{\perp },{\omega }_{p})dA}{({\int }_{{A}_{\infty }}n{({r}_{\perp },{\omega }_{p})}^{2}|{E}_{x}({r}_{\perp },{\omega }_{p}){|}^{2}dA){({\int }_{{A}_{\infty }}n{({r}_{\perp },{\omega }_{sh})}^{2}|{E}_{y}({r}_{\perp },{\omega }_{sh}){|}^{2}dA)}^{1/2}}.\end{array}$$
(5)

\({\chi }_{eff}^{(2)}\) is actually the second order susceptibility term if χ(2) is spatially constant. If not, \({\chi }_{eff}^{(2)}\) is a quantity related to the strength of the second order nonlinearity. K is a dimensionless parameter related to the overlap between pump and SH optical modes. If χ(2) is constant, conversion efficiency between the two optical modes is proportional to K. The far χ(2) is from being constant, the less the SHG efficiency is related to K.

Nonlinear susceptibility estimation

In Table 1 we report (2)|, estimated from the measured SHG power using our model. (2)| is reported for the TE1–TM3 and the TE1–TM5 combinations. We analyzed waveguides of different L selected from different positions on the wafer. Moreover, within the same modal combination, we performed measurements on different w. The values reported in Table 1 result from this statistical analysis (each value is an average of 64 measurements for the TE1–TM3 combination and 11 measurements for the TE1–TM5 combination). From this estimation we find that (2)| is much stronger for the TE1–TM3 than for the TE1–TM5 combination. The parameters used in the model come from FEM simulations (group indices, modal profiles, propagation constants) and from measurements (refractive indexes, propagation losses). In particular, we use the αp measured by the cut-back method (8 dB/cm for w = 1 µm, 5 dB/cm for w = 2.3 µm). Since we cannot measure propagation losses for high order modes, we approximate αsh ~ αp. In any case, the waveguides considered in this work are very short (between 2 mm and 6 mm), so the effect of a wrong estimation of αsh is not determinant.

Table 1 Estimation of (2)| from the measurements using the model described in this work.

Table 1 reports also |K| for both combinations. This value is calculated from mode-analysis FEM simulations according to Equation (5). |K| is much larger for the TE1–TM3 than for the TE1–TM5 combination, showing that the first is more efficient than the latter for an equal χ(2) value. This fact is confirmed experimentally, since for the TE1–TM5 combination we need a stronger pump power to generate the same SHG power of the TE1–TM3 combination.

In Table 1 we also give \({\chi }_{eff}^{(2)}\) values, estimated by Equation (5) with the measured (2)| and simulated |K| values. Remarkably, the two modal combinations yield comparable \({\chi }_{eff}^{(2)}\) values within the error bars. So, the strong difference in the generation efficiency expressed by (2)| is mainly due to the mode overlap difference, taken into account by |K|.

Role of Strain

Second harmonic generation under external load

The results presented so far do not offer insights on the origin of the nonlinearity causing SHG. In order to investigate the role of strain, we perform the same experiment described previously using the screw-equipped sample holder sketched in Fig. 3(a)13,28. The screw introduces a variable vertical displacement ∆H (load) in the center of the sample (orthogonally to its main plane), while the sample edges are fixed by the mount. The tensile stress introduced by the SiN cladding (1.25 GPa from wafer bow measurements) sets the initial strain conditions which is then increased by using the screw. A detailed modeling and analysis of this kind of experiments are described in28.

Figure 3
figure 3

(a) Sketch of the screw-equipped sample holder. The total width of the sample is 6 mm, and the analyzed waveguide is located at a distance of about 0.9 mm from the center. (b) SHG power (in logarithmic scale) as a function of the pump wavelength for different values of ΔH given in the inset. (c) Phase-matching wavelength as a function of ∆H. (d) Dependence of \({\chi }_{eff}^{(2)}\) on ∆H. Error bars come from repeated measurements.

Figure 3(b) reports the SHG power as a function of λp for different loads applied by the screw. The measurement refers to a waveguide with w = 906 nm and L = 4 mm, where phase-matching is expected between TE1 and TM3 modes. ∆H varies from 0 μm (no load) to 50 μm (almost the sample rupture threshold). SHG is observed for all the loads. However, it is noted that the phase-matching wavelength is blue-shifted with increasing strain, while the SHG magnitude does not change significantly.

A linear dependence of the pump phase-matching wavelength as a function of ∆H is observed (Fig. 3(c)). The maximum phase-matching wavelength shift is about 3.8 nm for ∆H = 50 μm. This strain-induced shift ensures that we are actually varying the strain inside the waveguide. The shift is due to the waveguide deformation and to the photoelastic effect which cause a strain-induced effective refractive index change for the pump (δnp) and for the SHG (δnsh) modes28. The new phase-matching condition requires np + δnp = nsh + δnsh, determining a modification of the phase-matching wavelength if δnp ≠ δnsh.

Figure 3(d) shows the dependence of \({\chi }_{eff}^{(2)}\) on ∆H, where no significant influence on the load increase is observed. This surprising result contradicts the strain as one of the explanations of the observed SHG. Note that the value of \({\chi }_{eff}^{(2)}\) reported in this figure is lower than the mean value of \({\chi }_{eff}^{(2)}\) reported in Table 1. This is due to the large variability of \({\chi }_{eff}^{(2)}\) between different samples, which is taken into account in the statistical analysis reported in Table 1.

Modelling strain inside the waveguide

To model the deformation and photoelastic contributions to the change of the phase-matching wavelength, we perform FEM simulations using the procedure described in28. First, we perform a 3D simulation of the silicon substrate subjected to the screw load. Elastic parameters of Si, SiO2 and SiN are taken from35,36,37. Si and SiO2 photoelastic parameters at the pump and SHG wavelengths are interpolated from38, while SiN photoelastic parameters are not reported in the literature, so we use the same values of SiO2 as we did in28. Si parameters are rotated to align to the waveguide crystallographic directions, considering that it is oriented along the [110] direction35. Figure 4(a) shows a sketch of the simulation domain, as well as the results of this simulation. A crucial role is played by the screw contact area on the back of the sample. Here we use a screw contact diameter of 7 μm, which yields good agreement with experimental results. We use the results of the 3D simulation as prescribed displacements to perform a local 2D simulation of the waveguide cross-section. Figure 4(b) shows the strain distribution inside the waveguide for ∆H = 0 μm and for ∆H = 50 μm, referred to w = 906 nm. In the first case, no load is applied, and the waveguide strain is due to the tensile stress introduced by the cladding. In the second case, we consider the maximum load applied by the screw, showing an increase of about 50% of the average strain inside the waveguide. From the simulations reported in Fig. 4(b), we determine two quantities. First, we determine the mean strain value in the waveguide cross-section, from which we estimate the new size of the deformed waveguide. This, in turn, is used to determine the deformation-induced effective refractive index variations δnp|def and δnsh|def. Second, given the strain distribution in the waveguide and the photoelastic coefficients, we determine the photoelastic-induced effective refractive index variations δnp|ph and δnsh|ph. We show the ∆H dependence of these quantities in Fig. 4(c). Deformation introduces an effective refractive index variation for both the TE1 and the TM3 modes. However, this effect is one order of magnitude weaker than the one introduced by photoelasticity. Moreover, photoelastic variation has opposite sign for the two involved modes, due to their different polarization. Considering δnp = δnp|def + δnp|ph and δnsh = δnsh|def + δnsh|ph we compute the phase-matching wavelength as a function of the load. An example is shown in Fig. 4(d). Results refer to ∆H = 0 μm and ∆H = 50 μm. Since photoelasticity dominates, the pump effective refractive index increases, while the SHG effective index decreases. This determines a shift of the phase-matching of almost 4 nm, which is comparable with the experiment. Note that the simulated phase-matching wavelength shown in Fig. 4(d) is slightly different from the experimental one shown in Figure 3(b) (2293.8 nm against 2288 nm at ∆H = 0 μm). This is due to local variations of the waveguide geometry, as already discussed previously. However, what matters is the agreement between the experimental and the simulated strain-induced phase-matching shift.

Figure 4
figure 4

(a) On the top: sketch of the simulation domain of the 3D FEM simulation used to estimate the strain introduced by the screw. The supports are modelled by the prescribed displacement and the fixed constraint on the top, while the screw displacement is shown by the arrow. On the bottom: results of the simulation when ∆H = 50 μm. The volumetric strain εv is superimposed in color scale on the deformed sample. Displacements are emphasized by a factor of 10. (b) Distribution of the strain tensor element εxx inside the waveguide for ∆H = 0 μm and ∆H = 50 μm. Simulation refers to a 906 nm wide waveguide. (c) Dependence of the effective refractive index variations induced by the photoelastic effect (δnph) and by the waveguide deformation (δndef) on ∆H. (d) Dependence of the effective refractive indexes of the pump (blue line) and SHG (red line) modes on the pump and SHG wavelengths. Simulations are reported for no load (continuous lines) and for ∆H = 50 μm (dashed lines). Strain induces a shift Δλ of the phase-matching condition.

In17, a model to connect the strain gradient components to the strain-induced nonlinear-coefficient \({\chi }_{strain}^{(2)}\) has been introduced. Considering the optical modes involved in the experiment described here, the χ(2) component to be considered is:

$${\chi }_{strain,xxy}^{(2)}(\phi )={{\rm{\Gamma }}}_{xxy,xxy}{\eta }_{xxy}(\phi )+{{\rm{\Gamma }}}_{xxy,yyy}{\eta }_{yyy}(\phi )$$
(6)

being ηmnl = mn/dl the strain gradient component, φ the orientation of the waveguide with respect to the crystallographic axis (in our case φ = 0°), while the coefficients Γijk,mnl depend on the nature of the crystal. In16, these coefficients are fitted to the experimental data for a Si/SiN electro-optic modulator. Considering φ = 0°, they are Γxxy,xxy = −4 × 10−16 m2V-1 and Γxxy,yyy = −5.1 × 10−16 m2V-1. By using these values, the \({\chi }_{strain,xxy}^{(2)}\) map inside the waveguide is calculated, and it is shown in Fig. 5 referred to the cases ∆H = 0 μm and ∆H = 50 μm. The average \({\chi }_{strain,xxy}^{(2)}\) value varies from around 1.75 pmV−1 to about 2 pmV−1, showing an increase of about 14%. However, we cannot directly compare these values with the experimental \({\chi }_{eff}^{(2)}\). In fact, the SHG efficiency is not directly related to the \({\chi }_{strain,xxy}^{(2)}\) map, but to the overlap of the \({\chi }_{strain,xxy}^{(2)}\) map with the optical modes. According to Eq. (4), Γ(2) can be evaluated using \({\chi }^{(2)}={\chi }_{strain,xxy}^{(2)}\), and \({\chi }_{eff}^{(2)}\) can be consequently estimated using Eq. (5). In the case ∆H = 0 μm, and considering the modal combination TE1–TM3 described in this work, we estimate \({\chi }_{eff}^{(2)}\) = 0.13 pmV−1. In the case ∆H = 50 μm, \({\chi }_{eff}^{(2)}\) = 0.20 pmV−1, more than 50% larger than the value at ∆H = 0 μm.

Figure 5
figure 5

Strain-induced χ(2) map in the waveguide calculated according to the model16, and referred to ∆H = 0 μm (a) and ∆H = 50 μm (b).

To conclude, the model proposed in16 applied to our experiment provides a 50% increase of the \({\chi }_{eff}^{(2)}\) as a consequence of the applied load. Surprisingly, in the experiment no significant variation of the SHG efficiency is observed.

Role of Charged Centers

Dangling bonds and UV charge passivation

During the deposition of SiN films on the top of silicon, defects can be formed39,40. The most significant defect is the K center, which is formed by a silicon atom bound to three nitrogen atoms, with a dangling bond. The K center has three different charge states: the positive state K+ (no electrons in the dangling bond), the neutral state K0 (one electron in the dangling bond) and the negative state K (two electrons in the dangling bond)23. The most thermodynamically favorable charge state is the K+ center. In presence of a Si/SiN interface, K+ centers form, which results in a positively charged layer at the Si/SiN interface41. Going back to the waveguide here analyzed, the positively charged K+ defects in the cladding introduce a static field EDC in the waveguide. This field adds up to the optical waves by causing the EFISH process, which can be modelled by an effective second order coefficient \({\chi }_{EFISH}^{(2)}=3{\chi }^{(3)}{E}_{DC}\)7,20. Even though EFISH is a third order process, it yields SHG as an intrinsic bulk χ(2).

To remove the influence of EFISH in our measurements, we passivate the charged defects by a 254 nm UV irradiation. Through this mechanism, the positive center K+ is annihilated and neutralized to the K0 state23. A specific experiment is performed to measure the effect of UV irradiation. A 140 nm thick SiN layer is deposited on a silicon substrate. C-V measurements in a MOS configuration allow estimating the areal charge density σ42. We quantify a charge density reduction from σ ~ 1.7 × 1012 cm−2 to σ ~ 3.1 × 109 cm−2 after 23 hours of UV exposure.

Second harmonic generation after charge passivation

The UV treatment is applied to the strained silicon waveguides. First evidence of the UV treatment is a propagation loss reduction from 8 dB/cm to 4 dB/cm in a 1 μm wide waveguide at a wavelength of 2.3 μm. A similar loss reduction was observed in strip-loaded waveguides, formed by a 27 nm thick SOI patterned by a SiN cladding42. Loss reduction was attributed to the neutralization of the SiN charged layer, causing a reduction of the carrier concentration in silicon, and determining a reduced free carrier absorption.

Figure 6(a) compares the SHG power measured before and after the UV treatment. Notably, a complete suppression of SHG after UV passivation is observed. This indicates that SHG is due to the EFISH process and not to the strain or to the generation in the SiN cladding. As a matter of fact the effect of strain, as well as the generation in the SiN cladding, is below our experimental sensitivity. In our experiments, the noise level sets an upper limit to the nonlinearity of \({\chi }_{eff}^{(2)}\) = 0.05 pm/V, which has to be interpreted as the upper limit of the χ(2) introduced by strain or by the generation in the cladding. We emphasize that this upper limit is much lower than the strain-induced value calculated here according to the model proposed in16,17.

Figure 6
figure 6

(a) SHG power as a function of the pump wavelength before (black line) and after (red line) 23 hours of UV exposure. (b) In red: SHG power as a function of the pump wavelength after 23 hours of UV exposure (the same as panel a). In blue: SHG after applying a displacement ∆H = 25 μm by the screw following the UV exposure. In green: SHG power after removing the screw displacement. The waveguide has w = 906 nm.

More interestingly, we apply a load to the UV passivated waveguide using the screw-equipped sample holder to verify if a strain-induced χ(2) can be seen once that the dominant charge contribution is removed. Indeed, the applied strain yields a SHG peak, measured at a pump wavelength position blue-shifted with respect to the original waveguide. This measurement is reported as the blue line in Fig. 6(b). However, when we remove the load, SHG still persists and shifts back to the pump wavelength observed in the original waveguide (green line in Fig. 6(b)). Therefore, since the SHG peak remains and redshifts after the load removal, it cannot be attributed to the applied strain or to a plastic deformation of the sample. We suggest that the observed SHG peak is due to the re-activation of some K centers as a consequence of the applied load43. In fact, the SHG peak is suppressed by further UV exposition.

Modelling the effect of charges

To validate our hypotheses on the origin of the SHG, we perform FEM simulations to evaluate the charge-induced field EDC inside the waveguide. Given the DC field, we estimate the distribution of \({\chi }_{EFISH}^{(2)}\) in the waveguide, and so the parameter (2)|.

In the simulation, we consider an intrinsic p-type doping concentration of 1015 cm−3 in the waveguide as stated by the wafer supplier for the device layer in the SOI wafer. A positive surface charge density σ = 1.7 × 1012 cm−2 is applied to the top and to the sidewalls of the waveguide. This value is the one measured for the SiN/Si interface in our LPCVD deposited SiN42. For simplicity, in the simulation we assume the charged defects uniformly distributed on the Si/SiN interface, neglecting their extension within silicon nitride. This is however a good approximation, because simulations confirm that the electrical properties within the waveguide are mostly affected by the total charge in the cladding, and not by its distribution.

Figure 7(a) reports the calculated electron and hole distribution in the waveguide cross section for a w = 906 nm waveguide. Simulation shows that the charged defects determine an inversion of the silicon carrier population, with an electron concentration as high as 1019 cm−3 and a hole concentration as low as 105 cm−3. Electrons accumulate close to the positively charged layer, while holes are mainly concentrated near to the uncharged Si/BOX interface. This agrees with previously reported results, where population inversion caused by charges was shown in similar geometries42,44.

Figure 7
figure 7

(a) Logarithm of the electron concentration N and of the hole concentration P, expressed in [cm−3], when a surface charge density σ = 1.7 × 1012 cm−2 is present at the side and top borders of the waveguide. (b) x and y components of the DC field inside the waveguide. (c) \({\chi }_{EFISH}^{(2)}\) induced by the field inside the waveguide. The value of \({\chi }_{EFISH}^{(2)}\) along the dashed black line is shown in panel (d).

Given the carrier distribution inside the waveguide, we estimate the carrier-induced absorption using well-known semi-empirical relations45. Considering the TE1 mode at the wavelength of 2.3 µm in a w = 906 nm waveguide, we estimate a loss reduction of about 4 dB/cm when the charged defects are passivated by UV exposure (σ = 1.7 × 1012 cm−2 is reduced to σ = 3.1 × 109 cm−2). This value agrees with the experiment, where we measure a loss reduction from 8 dB/cm to 4 dB/cm after UV treatment. We ascribe the residual loss of 4 dB/cm to the waveguide sidewall roughnesses.

Using this model, we estimate the electric field EDC distribution inside the waveguide. In Fig. 7(b) we report the x and the y components of EDC. They have their maximum value close to the interface, rapidly decreasing when moving into the waveguide.

Knowing the electric field distribution, we compute the SHG efficiency by EFISH. This process relates the pump frequency ωp, the SH frequency ωsh and the DC field frequency ω = 0. Therefore, we need to know the tensor χ(3)p, ωp, ωsh, 0). Since literature does not report estimations of it, as a first approximation we use the tensor element measured at the pump frequency χ(3)p,ωp,ωp,ωp). Moreover, in our measurements the pump mode is polarized along x, while the SHG mode is directed along y. As reported in Fig. 7(b), EDC has components both along x and y. Therefore, the tensor elements that can origin the SHG process are the \({\chi }_{xxxy}^{(3)}\) (connected to EDC,x) and the \({\chi }_{xxyy}^{(3)}\)(connected to EDC,y). Because of the cubic symmetry of silicon, \({\chi }_{xxxy}^{(3)}\) = 046. Therefore, only EDC,y plays a role and \({\chi }_{EFISH}^{(2)}=3{\chi }_{xxyy}^{(3)}{E}_{DC,y}\). In the range between 1.2 μm and 2.4 μm, the tensor elements \({\chi }_{xxxx}^{(3)}\) and \({\chi }_{xxyy}^{(3)}\) are related by \({\chi }_{xxyy}^{(3)}={\chi }_{xxxx}^{(3)}/2.36\)47. However, even the measurements of \({\chi }_{xxxx}^{(3)}\) at pump frequency are ambiguous, ranging from 0.94 × 10−19 m2 V−2 up to 4.24 × 10−19 m2V−248,49,50. Using an average of these values, we obtain the distribution of \({\chi }_{EFISH}^{(2)}\) reported in Fig. 7(c). In Fig. 7(d) we show \({\chi }_{EFISH}^{(2)}\) evaluated on the cutline passing through the center of the waveguide and directed along y. A strong \({\chi }_{EFISH}^{(2)}\) is present close to the charged layer, rapidly decreasing as we approach the opposite border of the waveguide. The mean value of \({\chi }_{EFISH}^{(2)}\) in the waveguide is −0.3 pmV−1.

Once that the distribution of \({\chi }_{EFISH}^{(2)}\) in the waveguide is known, we evaluate (2)| according to Equation (4) using \({\chi }^{(2)}={\chi }_{EFISH}^{(2)}\). Moreover, we evaluate also \({\chi }_{eff}^{(2)}\) according to its definition in Equation (5). These values are reported in Table 2 and compared with the results evaluated from the experiment. The error bars on the simulated values come from the uncertainty on the tabulated values of \({\chi }_{xxxx}^{(3)}\). For both the modal combinations, simulated values and experimental results are in good agreement considering the simplicity of the model.

Table 2 Comparison of the values of (2)| and \({\chi }_{eff}^{(2)}\) estimated from the modelling of the DC field inside the waveguide with results coming from the experiments.

In this paper, we designed and measured waveguides providing phase-matching on odd parity TM modes. These modes are symmetric along the x direction with respect to the waveguide center. In these cases, EFISH is mediated by EDC,y, which in turn is symmetric. In this way, the integral at the numerator of Eq. (4) is nonzero. On the other hand, the EDC,x field is antisymmetric with respect to the waveguide center. So, it can enable the generation on even parity modes, which are in turn antisymmetric. Recalling that \({\chi }_{xxxy}^{(3)}=0\), in this case the generation is possible only on TE polarized modes, such as TE4, TE6, and so on. We did not design waveguides providing phase-matching with these combinations, so they are not analyzed in this work.

Conclusions

In this work, we reported a study of the SHG process in strained silicon waveguide with a SiN cladding. We measured an effective second-order nonlinear susceptibility of about 0.5 pmV−1. We devoted particular attention to the investigation of the origin of the measured signal. Studying SHG under external load, we demonstrated no significant dependence of the SHG efficiency value on the applied stress. We then performed a UV treatment of the waveguide to passivate the charged defects formed in the cladding. We showed a complete suppression of the SH signal, demonstrating that charges play a crucial role in the measured phenomenon. Using FEM simulation, we calculated the electric field in the waveguide due to the charged defects and from it the effective second order nonlinearity. A remarkable agreement between measured and computed values is observed, supporting the conclusion that the measured SHG is due to the presence of the charged centers in the cladding. We also verified that the strain effect on χ(2) is below the noise level, setting an upper limit of 0.05 pmV−1 to the strain-induced nonlinear coefficient. The suppression of the SH signal after the UV treatment proves also that a possible generation in the SiN cladding is below the experimental noise. These results settle the origin of SHG in strained silicon, showing that SHG is mainly due to electric field effects caused by the positive charges trapped in the cladding. Interestingly, the upper limit to the measured strain-induced \({\chi }_{eff}^{(2)}\) is comparable with the theoretical results of51,52. In these works it was demonstrated that the deformation of the crystalline lattice, induced by the strain gradient, yields low values of χ(2).

Our outcomes offer interesting perspectives, introducing a paradigmatic change in the development of these kind of structures. Till now great effort was done towards increasing the amount of strain inside the waveguide. However, in this work we evidence that the strain has a secondary role, and large nonlinearities can be obtained by increasing the DC fields inside the waveguide. This can be done both by maximizing the amount of charges deposited on the waveguide border, as well as by realizing thinner waveguides. In fact, Fig. 7(b) shows that the induced field rapidly decreases as far as we move from the charged layer. This clearly has the drawback to increase the propagation losses, since we showed that charges cause the increase of the free carriers inside the waveguide. Therefore, a tradeoff condition must be found between the strength of the DC field and the absorption induced by the carriers.

Furthermore, our work shows that SHG efficiency can be controlled by applying UV irradiation. This offers interesting perspectives for the realization of quasi-phase matched (poled) waveguides. Applying a proper photolithographic mask and exposing it to UV light, periodically varying \({\chi }_{EFISH}^{(2)}\) can be introduced along the waveguide propagation direction. Setting the proper poling period, the conversion between fundamental modes can be directly studied, measuring larger conversion efficiencies due to the stronger mode overlap with respect to the intermodal approach. A recent work already showed the possibility to exploit EFISH in silicon by applying DC fields via lateral p-n junctions18. Differently from that work, our suggestion concerns a completely passive device, with no need to apply any external bias.