Introduction

Localized Surface Plasmon Resonance (LSPR) biosensors have been of interest in sensing applications due to their plasmonic properties, small size and small sample requirement. LSPR sensors have a smaller sensing volume than propagating SPR sensors, hence a higher sensitivity per unit volume based on local changes1,2,3. This is attributed to the reduced interfering effects from the bulk RI changes when sensing small molecules. In addition, SPR sensors require a temperature a controller as the effect of bulk refractive index change significantly contributes to the detected signal. In the case of LSPR sensors, the size of the nanostructures is comparable to the size of the small biological molecules, increasing the specificity and reducing the effect of bulk RI change on the measured quantity (a temperature controller is not needed). We previously introduced the projection method that directly measures the effective refractive index with improved signal to noise ratio (SNR) for LSPR sensors. The effective refractive index is influenced by the adlayer thickness and the change in bulk refractive index4. Although the projection method provides a direct measurement for the effective bulk RI change, it cannot be used to distinguish between the effects due to biomolecular adlayer and those due to the bulk RI change, and it was affected by systematic errors and noise. This paper provides a novel method to mitigate these effects and yield quantitative information about binding evens (adlayer thickness and bulk RI change).

Most of surface plasmon resonance (SPR) sensors deploy a reference channel to reduce the interfering effects due to the bulk refractive index (RI) change5. However, several methods have been previously proposed for self–referencing SPR sensors, including an integration measurement of reflected intensity from arbitrarily distributed sensing and referencing spots6. A self referencing SPR platform was previously introduced based on a microcapillary, where the SPR mode was used as a sensing mode and the Fabry-Perot (FP) mode was used as a ref.7. Although the system has employed the low sensitivity of the FP mode to temperature variations, other factors may affect the sensing results, including mass transport. Another approach based on a fiber optic particle plasmon resonance was proposed, employing two plastic-silica fiber optics (with core and cladding diameters of 400 μm and 430 μm, respectively) as sensing and reference channels8. The metal nanoparticles were anchored on an unclad sensing fiber of 20 mm sensing window, whereas the other unclad fiber optic was used as a reference channel. This platform was shown to reduce the interfering effects, but it does not provide quantitative information about the binding molecules.

U-shaped nanostructures have been previously used in differentiating between specific and nonspecific binding by extracting information from two or more LSPR modes9. The study assumed that specific binding mostly occurs at the metal surface, and the nonspecific binding occurs at a distant location (on the substrate), and ignored any non-specific binding that may occur at the metal surface. However, the method requires repetitive simulation in order to determine the sensitivity matrix based on other biological samples, which is a practical limitation of the method. Alternatively, a model that distinguishes bulk RI and adlayer thickness changes is a more practical solution, as it can decouple the effects associated with them. The sensor response–at each resonance–is related to the adlayer thickness and bulk RI changes,and the effects can be determined by solving the two equations (corresponding to the number of resonances). This linear response model has been previously employed for an SPR sensor with a dielectric overlayer to excite two resonances10, an SPR sensor based on simultaneous excitation of short and long range SPR modes11,12, a dual-resonance SPR sensor with different penetration depths13, simultaneous excitation of transverse and longitudinal modes of nanorods14, and a sensor based on surface plasmon resonance and plasmon waveguide resonance15. This model, however, requires a sensitivity matrix (including the adlayer and bulk RI sensitivities for both resonances) with a low condition number to avoid any numerical errors, which may not be the case for many sensing platforms. We have improved this method by applying the MLE method to a set of results based on three-resonance nanorod structures, estimating the adlayer thickness and the bulk RI change16. Although the accuracy and precision have been improved with self-referencing capabilities, the method remains valid only for extremely low adlayer thickness (<l d /10). This article presents a method based on the maximum likelihood estimation (MLE) technique alongside a nonlinear response model to improve the accuracy and precision of the estimated quantities. This indicates a considerable improvement in the RI resolution and the limit of detection and sensing dynamic range (<l d /2). Herein the proposed method is applied to nanocrescent structures as they can be fabricated using cost-effective fabrication methods and they feature a multiple resonance absorption spectrum17. However, the method can be applied to any multiple resonance sensor.

Nonlinear Model for Sensor Response

The electromagnetic field decays exponentially from the surface of the nanostructures. If the maximum EM field is located at z = 0, it decays with a factor of exp(−z/l d ) along the z direction, where l d is the electromagnetic (EM) decay length, see Fig. 1

Figure 1
figure 1

Schematic illustrating the refractive index distribution around a single nanocrescent.

Since the intensity is the square of the electric field strength, it decays with a factor of exp(−2z/l d ) with respect to the nanostructure surface (z). This suggests the following for the effective refractive index

$${n}_{eff}={n}_{a}\mathrm{[1}-\exp (-2d/{l}_{d})]+{n}_{B}\,\exp (-2d/{l}_{d})$$
(1)
$$={n}_{B}+({n}_{a}-{n}_{B}\mathrm{)[1}-\exp (-2d/{l}_{d})]$$
(2)

Accordingly, the effective refractive index along the z direction is weighted by using the same decay factor exp(−2d/l d ) as shown in Fig. 1. The effective refractive index can be obtained by integrating the refractive indices along z direction as follows18

$${n}_{eff}=(2/{l}_{d}){\int }_{0}^{\infty }n(z)\exp (-\,2z/{l}_{d})dz$$

where n(z) changes along the z direction as

$$n(z)=(\begin{array}{cc}{n}_{a}, & {\rm{if}}\,0 < z < d,\\ {n}_{B}, & {\rm{if}}\,d < z < \infty \mathrm{.}\end{array}$$
(3)

where n a and n B are the analyte and bulk refractive indices, respectively. The resonance shift can be determined by

$${\rm{\Delta }}\lambda ={S}_{B}\,({n}_{eff}-{n}_{B})$$

where S B is the bulk RI sensitivity. From equation (2), substituting for n eff , the following is obtained for the resonance shift

$${\rm{\Delta }}\lambda ={S}_{B}\,({n}_{eff}-{n}_{B})\equiv {S}_{B}\,({n}_{a}-{n}_{B}\mathrm{)[1}-\exp (-2d/{l}_{d})]$$
(4)

This can be used to estimate the the adlayer thickness d and bulk RI change Δn B . First, the bulk RI sensitivity S B is measured based on bulk RI change for ethanol solutions of known refractive indices. The nanocrescents support three resonances, but only two resonances are used here. The base solution is injected to obtain the sensing baselines for these resonances, and the shifts in the resonance wavelengths Δλ i are tracked in real time as the biological sample is introduced.

The maximum response can be obtained when a thick adsorbate layer is reached d → ∞, and equation (4) becomes

$${\rm{\Delta }}{\lambda }_{max}={S}_{B}({n}_{a}-{n}_{B})$$
(5)

Dividing equation (4) by equation (5), the following is obtained

$$\frac{{{\rm{\Delta }}}_{\lambda }}{{\rm{\Delta }}{\lambda }_{max}}=\mathrm{[1}-\exp (-2d/{l}_{d})]$$
(6)

Equation (6) can be rearranged in order to obtain an expression that is linear in adlayer thickness as follows

$$\mathrm{ln}(1-\frac{{\rm{\Delta }}\lambda }{{\rm{\Delta }}{\lambda }_{max}})=\frac{-2}{{l}_{d}}\,d$$
(7)

For a nanostructure with multiple resonances, the electromagnetic decay length is dependent on the resonance wavelength, and the bulk RI sensitivity. Considering a two-resonance system and including the errors associated with the measured data, equation (7) can be written as follows

$${y}_{i}={C}_{{d}_{i}}\,d\,\pm {\varepsilon }_{{y}_{i}}$$
(8)

where y i  = ln(1−Δλ i λi,max), \({C}_{{d}_{i}}\) is a coefficient related to the electromagnetic field decay length for the ith resonance calculated as \(-2/{l}_{{d}_{i}}\), and \({\varepsilon }_{{y}_{i}}\) represents the error due to noise which is directly related to the variance of the measured quantities R y over the measured shifts in resonance wavelengths. In matrix notation

$${\boldsymbol{y}}=[\begin{array}{c}{y}_{1}\\ {y}_{2}\end{array}],\,{{\boldsymbol{C}}}_{{\boldsymbol{d}}}=[\begin{array}{c}{C}_{d1}\\ {C}_{d2}\end{array}],\,and\,{{\boldsymbol{R}}}_{{\boldsymbol{y}}}=[\begin{array}{cc}{R}_{y11} & {R}_{y12}\\ {R}_{y12} & {R}_{y22}\end{array}]$$
(9)

where R12 represents the covariance accounting for correlated noise. Considering a multiple resonance response, y1, y2, ..., y i , and a normal distribution for the noise effect, the likelihood of obtaining these measured quantities (y i ) given the true value (\(\hat{d}\)) and the variance R y can be obtained by multiplying the normal distributions \({\mathscr{N}}\) of these estimates. In matrix notation this can be expressed as follows

$$P({y}_{1},\mathrm{...}{y}_{i}|{C}_{d1}\hat{d},\mathrm{...}{C}_{di}\,\hat{d},\,{{\boldsymbol{R}}}_{{\boldsymbol{y}}})=\prod _{i}{\mathscr{N}}({\boldsymbol{y}}|{{\boldsymbol{C}}}_{{\bf{d}}}\,\hat{d},{{\boldsymbol{R}}}_{{\boldsymbol{y}}})\equiv \frac{1}{{\mathrm{(2}\pi )}^{i\mathrm{/2}}|{{\boldsymbol{R}}}_{{\boldsymbol{y}}}{|}^{\mathrm{1/2}}}\exp [-\frac{1}{2}{({\boldsymbol{y}}-{{\boldsymbol{C}}}_{{\bf{d}}}\hat{d})}^{T}{{\boldsymbol{R}}}_{{\bf{y}}}^{-{\bf{1}}}({\boldsymbol{y}}-{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\,\hat{d})]$$
(10)

According to the MLE, the maximum likelihood of estimating the true value is obtained when the derivative of the above likelihood with respect to the true value approaches zero19. For simplicity, we obtain the log of the above likelihoods as follows

$$ln\,P({\boldsymbol{y}}|{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d},\,{{\boldsymbol{R}}}_{{\boldsymbol{y}}})=-\frac{i}{2}ln\mathrm{(2}\pi )-\frac{1}{2}ln|{{\boldsymbol{R}}}_{{\boldsymbol{y}}}|-\frac{1}{2}{({\boldsymbol{y}}-{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d})}^{T}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}({\boldsymbol{y}}-{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d})$$
(11)

Now, the true value can be estimated such that the derivative of the log likelihood with respect to this true value is equal to zero19.

$$\begin{array}{rcl}\frac{\partial }{\partial \hat{d}}\{\,ln\,P({\boldsymbol{y}}|{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d},{{\boldsymbol{R}}}_{{\bf{y}}})\} & \equiv & \frac{\partial }{\partial \hat{d}}({{\boldsymbol{y}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{\boldsymbol{y}}-{{\boldsymbol{y}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{{\boldsymbol{C}}}_{{\bf{d}}}\hat{d}-{{\boldsymbol{C}}}_{{\boldsymbol{d}}}^{{\boldsymbol{T}}}{\hat{d}}^{T}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{\boldsymbol{y}}+{{\boldsymbol{C}}}_{{\boldsymbol{d}}}{\hat{d}}^{T}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d})\\ & = & 0\Rightarrow -{{\boldsymbol{y}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{{\boldsymbol{C}}}_{{\boldsymbol{d}}}-{{\boldsymbol{C}}}_{{\boldsymbol{d}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{\boldsymbol{y}}+2\,{{\boldsymbol{C}}}_{{\boldsymbol{d}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{{\boldsymbol{C}}}_{{\boldsymbol{d}}}\hat{d}=0\end{array}$$

This can be solved for the estimate \(\hat{d}\)

$$\hat{d}={({{\boldsymbol{C}}}_{{\boldsymbol{d}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{{\boldsymbol{C}}}_{{\boldsymbol{d}}})}^{-1}({{\boldsymbol{C}}}_{{\boldsymbol{d}}}^{{\boldsymbol{T}}}{{\boldsymbol{R}}}_{{\boldsymbol{y}}}^{-{\bf{1}}}{\boldsymbol{y}})$$
(12)

In the case of the dual-resonance nanocrescents (i = 2), the adlayer thickness can be estimated as follows

$$\widehat{d}=\frac{({C}_{d1}{R}_{y11}^{-1}+{C}_{d21}{R}_{y12}^{-1}){y}_{1}+({C}_{d2}{R}_{y22}^{-1}+{C}_{d1}{R}_{y12}^{-1}){y}_{2}}{{C}_{d1}^{2}{R}_{y11}^{-1}+{C}_{d2}^{2}{R}_{y22}^{-1}+2\,{C}_{d1}\,{C}_{d2}{R}_{y12}^{-1}}$$
(13)

If the noise is uncorrelated (Ry12 = 0), the estimate becomes

$$\widehat{d}=\frac{({C}_{d1}{R}_{y11}^{-1}){y}_{1}+({C}_{d2}{R}_{y22}^{-1}){y}_{2}}{{C}_{d1}^{2}{R}_{y11}^{-1}+{C}_{d2}^{2}{R}_{y22}^{-1}}$$
(14)

Likewise, the RI change can be estimated with an improved accuracy using the MLE method. From equation (4), the shifts in the resonance wavelengths are given by

$${\rm{\Delta }}{\lambda }_{i}={C}_{{n}_{i}}\,{\rm{\Delta }}n\pm {\varepsilon }_{{\lambda }_{i}}$$

where Δn = n a  − n B ±Δn B , and \({C}_{ni}={S}_{i}(1-{e}^{-2\hat{d}/{l}_{di}})\) represents the sensitivity coefficient for the ith resonance, and \(\hat{d}\) is the value estimated by equation (13) or (14). \({\varepsilon }_{{\lambda }_{i}}\) is the error due to noise, represented by the variance associated with the ith measured wavelength shift. The estimated \(\widehat{{\rm{\Delta }}n}\) that maximizes the likelihood of the above probability assuming a correlated noise can be obtained using the MLE

$$\widehat{{\rm{\Delta }}n}=\frac{({C}_{n1}{R}_{{\lambda }_{1}}^{-1}+{C}_{n2}{R}_{{\lambda }_{12}}^{-1}){{\rm{\Delta }}}_{{\lambda }_{1}}+({C}_{n2}{R}_{{\lambda }_{2}}^{-1}+{C}_{n1}{R}_{{\lambda }_{12}}^{-1}){\rm{\Delta }}{\lambda }_{2}}{{C}_{n1}^{2}{R}_{{\lambda }_{1}}^{-1}+{C}_{n2}^{2}{R}_{{\lambda }_{2}}^{-1}+2\,{C}_{n1}\,{C}_{n2}{R}_{{\lambda }_{12}}^{-1}}$$
(15)

For an uncorrelated system (\({R}_{{\lambda }_{12}}\mathrm{=0}\)), this simplifies the above equation to

$$\widehat{{\rm{\Delta }}n}=\frac{{C}_{n1}{R}_{{\lambda }_{11}}^{-1}{\rm{\Delta }}{\lambda }_{1}+{C}_{n2}{R}_{{\lambda }_{22}}^{-1}{\rm{\Delta }}{\lambda }_{2}}{{C}_{n1}^{2}{R}_{{\lambda }_{11}}^{-1}+{C}_{n2}^{2}{R}_{{\lambda }_{22}}^{-1}}$$
(16)

Simulated Results and Validation of the Estimation Method

This section compares the nonlinear–MLE method with the established linear response model based on FEM simulated results, and presents a FEM evaluation for the accuracy based on each method with respect to deviated resonance wavelengths (simulating the effect of noise on each resonance wavelength).

The simulated results show distinct values for the sensitivity to bulk RI and adlayer thickness changes. Inspecting both modes in terms of the adlayer and bulk RI sensitivities can provide an insight into the sensing efficiency for each mode. It is evident that the adlayer thickness, the adlayer refractive index n a and the bulk refractive index n B contribute to the effective refractive index n eff [Equation (1)]. Therefore, the adlayer sensitivity (S d ) can be given by20

$$\frac{\partial \lambda }{\partial d}=\frac{\partial \lambda }{\partial {n}_{eff}}\,\frac{\partial {n}_{eff}}{\partial d}$$
(17)

where ∂λ/∂d and ∂λ/∂n eff are the adlayer sensitivity S d and bulk RI sensitivity S B , respectively. This equation suggests that the adlayer sensitivity is determined by the bulk RI sensitivity and the rate at which the adlayer thickness contributes to the effective refractive index n eff . Using equation (17) we can define the adlayer sensing efficiency η of a given biosensor as follows

$$\eta =\frac{\partial {n}_{eff}}{\partial d}\equiv \frac{{S}_{d}}{{S}_{B}}$$
(18)

This determines how effectively a given biosensor can detect changes in adlayer thickness. Referring to Figs 2 and 3, the first mode (1200 nm) exhibits a lower bulk RI sensitivity, comparing to that associated with the second mode ~1700 nm (325.25 nm/RIU versus 787.35 nm/RIU). These modes yield adlayer sensitivities of 1.47 and 2.2, respectively. Therefore, the calculated adlayer sensing efficiency η is 4.5 × 10−3 RIU/nm and 2.8 × 10−3 RIU/nm. In other words, changing the adlayer thickness by 1 nm would alter the effective refractive index by 4.5 × 10−3 RIU based on the first mode and 2.8 × 10−3 RIU in the case of the second mode, although the second mode features a higher bulk RI sensitivity than that of the first mode. We can now consider the established linear response model for comparison, the LM method relates each resonance wavelength shift to the changes in RI and thickness of an adsorbed biological material by the following relationship

$$[\begin{array}{c}{\rm{\Delta }}{\lambda }_{1}\\ {\rm{\Delta }}{\lambda }_{2}\end{array}]=\mathop{\underbrace{[\begin{array}{cc}{S}_{{B}_{1}} & {S}_{{d}_{1}}\\ {S}_{{B}_{2}} & {S}_{{d}_{2}}\end{array}]}}\limits_{{\bf{S}}}[\begin{array}{c}{\rm{\Delta }}n\\ d\end{array}]$$
(19)
Figure 2
figure 2

(a) Resonance shift versus bulk RI change associated with the first mode supported by exciting the nanocrescent structures with a normal incident, horizontally polarized, plane wave. (b) the simulated sensor response due to increasing the adlayer thickness. The results from the nonlinear model are shown on the graph. The FEM model used the following parameters: n B  = 1.3, n a  = 1.4. This mode is located at 1100 nm when n B  = n a  = 1.

Figure 3
figure 3

(a) Resonance wavelength shift versus bulk RI change associated with the second mode. (b) The simulated sensor response due to changing the adlayer thickness. The results from the nonlinear model are shown on the graph. The mode is located at 1450 nm in air (n B  = n a  = 1).

The application of the linear model is valid only if: (i) the sensitivity matrix (S) is not singular, and (ii) the normalized sensitivity matrix (S′) is well conditioned21. The normalized sensitivity matrix is calculated as follows

$${\boldsymbol{S}}^{\prime} =[\begin{array}{cc}\frac{{S}_{{B}_{1}}}{\sqrt{{S}_{{B}_{1}}^{2}+{S}_{{B}_{2}}^{2}}} & \frac{{S}_{{d}_{1}}}{\sqrt{{S}_{{d}_{1}}^{2}+{S}_{{d}_{2}}^{2}}}\\ \frac{{S}_{{B}_{2}}}{\sqrt{{S}_{{B}_{1}}^{2}+{S}_{{B}_{2}}^{2}}} & \frac{{S}_{{d}_{2}}}{\sqrt{{S}_{{d}_{1}}^{2}+{S}_{{d}_{2}}^{2}}}\end{array}]$$
(20)

This matrix meets the above conditions if it has a non-zero determinant and low condition number.

$${\boldsymbol{S}}=\mathop{\underbrace{[\begin{array}{cc}325.25 & 1.02\\ 787.38 & 1.51\end{array}]}}\limits_{|{\boldsymbol{S}}|=-\,366.25\ne 0}\Rightarrow {\boldsymbol{S}}^{\prime} =\mathop{\underbrace{[\begin{array}{cc}0.38 & 0.55\\ 0.92 & 0.84\end{array}]}}\limits_{\kappa ({\boldsymbol{S}}^{\prime} \mathrm{)=10.49 < 100}}$$

From equation (19), the adlayer thickness and bulk RI change can be calculated as

$$[\begin{array}{c}{\rm{\Delta }}n\\ d\end{array}]={{\boldsymbol{S}}}^{-{\bf{1}}}[\begin{array}{c}{\rm{\Delta }}{\lambda }_{1}\\ {\rm{\Delta }}{\lambda }_{2}\end{array}]$$
(21)

The nonlinear–MLE method is now employed to estimate the bulk RI change and adlayer thickness–that were used in the simulation–by adding uncertainties to the simulated shifts of the resonance wavelengths (\(\pm {\sigma }_{{\lambda }_{i}}\)) such as \({\rm{\Delta }}{\lambda }_{i}/{\sigma }_{{\lambda }_{i}}\mathrm{=10}\), as shown in Fig. 4(a). The resonance wavelength shifts are used to determine the corresponding values for ln(1−Δλλ max ), which are then used in equation (14) to determine the adlayer thickness, as shown in Fig. 4(b). The adlayer thickness is then used to determine the sensitivity coefficient, C n  = S[1−exp(−2d/l d )], to estimate the bulk RI change using equation (16). The estimated adlayer thickness and bulk RI change are shown in Fig. 4(c). The results obtained based on the LM are also shown in Fig. 4(b,c). The nonlinear-MLE method revealed the following for the estimates: d = 5.95 nm, Δn B ≈ 0, with 0.47 nm and 1.4 × 10–3 RIU uncertainties, respectively. Under the same conditions, the linear response model revealed 5.5 nm and 1.3 × 10−3 for the estimated adlayer thickness and bulk RI change with uncertainties of 1.8 nm and 6 × 10−3, respectively. This suggests that the nonlinear–MLE can improve the accuracy of the estimated adlayer thickness by one order of magnitude, and the precision by a factor of 4, as shown in Fig. 4.

Figure 4
figure 4

(a) Probability density function representation of the calculated shift in the first and second resonances, with a 10 dB signal to noise ratio (SNR = \({\rm{\Delta }}{\lambda }_{i}/{\sigma }_{{\lambda }_{i}}\mathrm{=10}\)) [\({\sigma }_{{\lambda }_{1}}\mathrm{=0.85}\) nm, \({\sigma }_{{\lambda }_{2}}\mathrm{=1.3}\) nm]. The following parameters were used in the simulation: n B  = 1.33, n a  = 1.43, d = 6 nm. (b) The estimated adlayer thickness, and (c) the estimated bulk RI change based on the linear model (blue) and the MLE method (red). The precision for the MLE results: σ d  = 0.09 nm, σ n  = 1.6  × 10−4 RIU. The precision of the LM: σ d  = 0.25 nm, σ n  = 6.14 × 10−4 RIU.

Figure 5 compares both methods in terms of accuracy and precision based on adlayer thickness ranging from 6 nm to 25 nm. The percentage error in the estimated adlayer thickness ranges from 0.83–1.96% based on the nonlinear–MLE method, and 8.3–71.6%, based on the linear response model. This indicates that the nonlinear–MLE method can improve the results by a factor of 36 as compared to those based on the linear response model, when the adlayer thickness approaches \(\sim {l}_{d}/2\). The error in the estimated RI change tends to be negligible based on the nonlinear–MLE method and increases drastically based on the linear response model, as shown in Fig. 5(b). The nonlinear–MLE method and the linear response model reveal 5 × 10−3 and 1.5 × 10−2 RIU uncertainties when the adlayer thickness approaches 25 nm, as shown Fig. 5(b).

Figure 5
figure 5

(a) Error in the estimated adlayer thickness based on the nonlinear–MLE method (red bars) and the linear response model (blue bars). The following parameters were used in the FEM model: n a  = 1.43, n B  = 1.33, and d = {6 nm, 15 nm, 25 nm}. The shifts in the resonance wavelengths were determined, and each resonance was added uncertainty \({\sigma }_{{\lambda }_{i}}\), such that each \({\rm{\Delta }}{\lambda }_{i}/{\sigma }_{{\lambda }_{i}}\mathrm{=10}\) (SNR = 10). The error was then determined as the difference between the true and estimated values. (b) The error associated with estimated Δn based on the same simulated results (resonance wavelength shifts) used in (a).

Experimental Results

This section provides the experimental results based on the Bioten-Streptavidin binding events. The samples were prepared according to the established protocol9,22. Before being used in the sensing experiments, the fabricated nanocrescent structures were cleaned by DI water and ethanol solution, blown dry with nitrogen,and plasma treated to remove any biological contaminant. The substrate was then functionalized by biotin-hpdp, and streptavidin solution was prepared in tris-buffer according to the established surface chemistry protocol9,22. The bulk RI sensitivity at each resonance was first determined based on resonance shift against RI change due to changing the concentration in ethanol solution. The sensitivities to adlayer thickness were then determined based on the measured results by correcting the simulated counterparts based on the measured bulk RI sensitivities. Since the adayer sensitivity is related to the bulk RI sensitivity by S d  = S B (n a n B )/l d . The corrected adlayer sensitivity \({S}_{d}^{{\prime} }\) can be determined as follows

$${S}_{d}^{{\prime} }={S}_{d}\,S{^{\prime} }_{B}/{S}_{B}$$
(22)

where S d is the simulated adlayer sensitivity and S B and \({S}_{B}^{{\prime} }\) are the simulated and measured buk RI sensitivities, respectively. The bulk RI sensitivity is used along the EM decay length in equations (14) and (16) to estimate the adlayer thickness and bulk RI change, respectively, based on the nonlinear–MLE method. This section also compares these results to those obtained based on the linear model. The linear model uses the bulk and adlayer sensitivity factors in equation (19) to estimate the adlayer thickness and bulk RI change.

Figure 6(a) shows the shifts in the resonance wavelengths, based of the measured extinction curves for streptavidin biotin sensing experiments. These shifts were translated into the logarithmic normalized quantity, ln(1 − Δλλ max ), and used in equations (14) and equation (16) to estimate the adlayer thickness and bulk RI change, as shown in Fig. 6(b,c). Now the linear model is used based on the same measured data, shown in Fig. 6(a) in estimating the adlayer thickness and bulk RI change using the sensitivity factors in equation (19). Figure 6(c) shows the results based on the linear model. The limit of detection and bulk RI resolution can be determined based on the standard deviation of the estimated adlayer thickness and bulk RI change, respectively. Based on these results, the nonlinear-MLE method improves the accuracy and precision as compared with the linear model. The limit of detection for adlayer thickness is reduced by a factor of 4, and the bulk RI resolution is improved by a factor of 22 based on the bulk RI change. Table 1 compares between the proposed nonlinear-MLE method and the linear model based on the estimated adlayer thickness and measured sensing characteristics.

Figure 6
figure 6

(a) Shifts in resonance wavelengths (λ1 = 1200 nm, λ2 = 1700 nm) with respect to streptavidin-bioten binding. (b) Adlayer thickness (left y-axis) and bulk RI change (right y-axis) estimated by the nonlinear-MLE method. (c) Estimated adlayer thickness (left y-axis) and the bulk RI change (right y-axis) based on the linear response model. The sequence on the graphs indicates the order of introducing the solutions to the nanocrescents [1]: Tris buffer solution [2], streptavidin, and [3] Tris buffer solution.

Table 1 Estimated adlayer thickness and sensing characteristics based on the nonlinear-MLE method and the linear model.

Discussion

The paper presented a statistical method, combining the the MLE method with a nonlinear model, for extracting the adlayer thickness and bulk RI change with improved accuracy. The method not only provided a quantitative information about the binding events, but also improved the precision of LSPR sensors. The paper adopted a nonlinear model for the sensor response, to adlayer thickness change, which reflects the real scenario of typical sensing experiments. The model follows a similar trend to that based on the association constant in chemical sensors.

The nonlinear model uses the EM field decay length and sensitivity for each resonance to estimate the adlayer and bulk RI change, whilst the linear model uses the sensitivity to bulk RI change and the adlayer sensitivity that needs to be recalculated (corrected) for other target biomolecules. The latter represents a substantial disadvantage of the linear model. In contrast to the linear model, the nonlinear model avoids the redundant calculation of the sensitivity to adlayer thickness for various biomolecular adlayer. Based on the simulated and measured result, and compared to the linear model, the nonlinear–MLE method improved the bulk RI resolution by two orders of magnitude (6 × 10−5 RIU vs 1.3 × 10−3 RIU). The method also achieves 75% improvement in the limit of detection based on the adlayer thickness (0.06 nm vs 0.24 nm uncertainty in the estimated adlayer thickness). In this paper, we considered the nanocrescent structures, but the proposed method can be applied to a wide range of different structures and various sensing scenarios.

Methods

The finite element method (FEM) was used to calculate the sensor response to bulk RI and adlayer thickness changes and validate the MLE method based on the nonlinear model. Periodic boundary conditions were enforced along the sides of a hexagonal simulation domain created with commercial COMSOL Multiphysics simulation package, as shown in Fig. 7(a). The refractive index for gold was obtained from Johnson and Christy23.The nanocrescents and the adlayer were discretized using a tetrahedral mesh, and triangular mesh was used with the rest of the domain. The transmission efficiency was obtained from the scattering parameter S21, as input and output ports were assigned to the bottom and top surfaces of the simulation domain, shown in Fig. 7(a).

Figure 7
figure 7

(a) Simulation layout used in COMSOL Multiphysics to model periodic nanocrescent structure of a hexagonal lattice. (b) Experimental set-up to measure the transmission spectra for the fabricated structure. The inset shows a SEM image for the fabricated nanocrescents (50 nm in thickness and 90 nm in height with 20° wedge angle).

The nanostructures were fabricated on a cyclic olefin polymer (COP) substrate by the nanoimprint lithography method and reactive ion etching as explained in a previous work17. A scanning electron microscope (SEM) image for the fabricated structures is shown in the inset of Fig. 7(b).

The fluidic channel used in the experiments was fabricated using replica moulding method as explained in16,24. Cary 5000 spectrometer was used in the set-up, illustrated in Fig. 7(b), to measure the extinction curves for the nanocrescents while injecting the ethanol and streptavidin solutions.