Introduction

Since 1960s, a wire-grid polarizer (WGP) has been considered as an optical element to discriminate polarization components:1 if an electric field of a light wave oscillates in parallel to the metallic wire-grids (TE polarization), it is largely absorbed, while it is transmitted if the electric field is perpendicular to the direction of wire-grids (TM polarization). Superb polarimetric performance combined with inherent structural planarity has made a WGP attractive in applications such as spectropolarimetry2,3 and projection displays4. Effects of a WGP were investigated in conjunction with a rotating platform5 and on the flexible6,7,8,9 and rough surface10. Thermal properties and plasmonic enhancement of a WGP were also studied11,12,13. On the other hand, a WGP was implemented using DNA-nanoparticle composites14. For the planarity of the structure, a WGP has been integrated to various electronic and photonic devices including a CMOS and CCD imaging sensor15,16,17,18,19, contact lens20, fiber-optics21, light-emitting diodes22, liquid crystal23,24,25,26, optical isolators27,28, a photodiode29, photoelectrochemical solar cells30, and semiconductor laser31.

Use of a WGP has been particularly beneficial for imaging polarimetry because complete polarization contents of a scene may be measured simply by changing the orientation of wire-grids on a pixelated platform in a flexible way32,33,34,35,36,37,38. Many design issues of WGPs have been explored in imaging polarimetry, for example, non-uniformity in the polarimetric performance depending on the incident scene vector39 and effect of finite pixel size40. WGP was integrated into a diffractive optical element41. One of the issues that have not been fully addressed and therefore we intend to address in this work is to understand the dispersive effects on the imaging performance which may appear in addition to typical spherical and chromatic aberration of imaging optics when WGPs are integrated for use in imaging polarimetry. Understanding the dispersive effects is critical to the design of WGPs because the polarimetry like any other imaging applications is performed in a broadband spectrum. More importantly, light dispersion can have an extremely severe and harmful effect on diffractive optical devices including metamaterials as recently reported for far-field applications42,43, which may limit practical application in imaging. In this sense, this study has a broad implication far greater than mere suggestion of a novel WGP design for imaging polarimetry.

Model and method

Numerical model for WGP

Wire-grids were assumed to be made of gold on a BK7 substrate in air ambience, as shown in the schematic of Fig. 1. Additional optics that are typically used in imaging polarimetry have been omitted for simplification. Incident scene is illuminated in the visible waveband (λ = 400 ~ 700 nm). Refractive index of BK7 substrate (ns) and gold (nm) was obtained as ns = 1.5308 and nm = 1.4684 + j1.9530 for λ = 400 nm44. At λ = 700 nm, ns = 1.5131 and nm = 0.1310 + j4.0624. The fill factor ff is defined as the ratio of wire-grid width to the period Λ. Wire-grids are assumed to be rectangular in profile (width ffΛ and thickness h) and infinite along the y axis. Wire-grid periods were varied from Λ = 100 nm ~ 1 μm in a step of 100 nm for a fill factor ff = 0.10, 0.25, 0.50, 0.75, and 0.90. Thickness of wire-grids is fixed at 200 nm.

Figure 1
figure 1

Schematic illustration of simplified imaging polarimetry based on a WGP. Incidence scene is imaged onto a pixelated detector through an imaging lens and a WGP mounted on a substrate. The scene undergoes dispersive effects of the WGP. The illustration only shows on-axis scene incidence for convenience. Inset figure represents off and on-axis scene incidence in cross-section. With respect to the principal axis, an on-axis scene is incident with an angle up to \({\theta }_{on}^{max}\), while the incident angle for an off-axis scene is between \({\theta }_{off}^{min}\) and \({\theta }_{off}^{max}\). Image of mandrill created by the US Department of State and available under the CC PDM 1.0 license.

Model for imaging polarimetry

In the imaging polarimetry model, a WGP is placed at the focal length (f) of an imaging convex lens (f-number f/#). For the convenience of analysis, we have assumed the imaging lens to be cylindrical, as shown in Fig. 1, with an incident angle for off and on-axis scene ranging from 0° to 60° as well as corresponding values of numerical aperture (NA) and f-number listed in Table 1. Spherical and chromatic aberration of the imaging lens has been disregarded in order to focus on the dispersive effects of a WGP. The aberration would further exacerbate the dispersive effects due to a WGP. The size of a WGP and an aperture, which is determined by the imaging lens, is denoted as DWGP and DL. For generality, we have assumed that a WGP is not directly integrated into an imaging detector. For this consideration, the thickness of a WGP substrate is set to be ds = 500 μm. The WGP-to-detector distance is s: s = 0 if the WGP operates in contact with the detector. A WGP and a detector are assumed to be aligned at the center in the lateral plane. The diameter of an Airy pattern (DAiry) produced by the aperture is given as:

$${D}_{Airy}\approx 2.44\lambda \frac{f}{D}=2.44\lambda (f/\#)=1.22\frac{\lambda }{{\rm{NA}}}=1.22\,\frac{\lambda }{n\,\sin \,\theta }.$$
(1)

f is the imaging focal length and D is the aperture diameter. DAiry is listed in Table 1 with respect to f-#. As is implied in Eq. (1) and Table 1, use of a high NA imaging lens reduces the point-spread function, thus enhances imaging resolution, with a large angle of scene incidence. This, however, may be accompanied by adverse effects of higher-order diffraction. On the contrary, a lens with a small NA can avoid higher-order excitation, despite degradation of resolution.

Table 1 Airy diameter for λ = 400 and 700 nm (unit: μm), numerical aperture (NA), and maximum on-axis angle for scene incidence (\({\theta }_{on}^{max}\)), as the f-number of an imaging lens is varied.

Generation of an object and image acquisition were performed on MatlabTM, i.e., spectral component images were generated in 512 × 512 pixels in a step of 10 nm from λ = 400 to 700 nm (total 31 images) by mapping RGB combinations in the CIE 1931 xy chromaticity space. Particle swarm optimization was used to find a specific single RGB combination. Each of the spectral component images was processed with ER(λ) between TE and TM polarization and combined to produce an image acquired by imaging polarimetry. Dispersive artifacts were then applied to the images.

Numerical methods and performance metrics

We used rigorous coupled-wave analysis (RCWA) with 60 spatial harmonic orders to calculate optical characteristics of a WGP. For the assessment of polarimetric performance, transmittance (T) of TM and TE polarized light was measured in the far-field. Extinction ratio (ER) was used as a ratio of reflectance of TM polarization to that of TE polarization, i.e., ER = TTM/TTE. Out of the several factors contributing to non-uniform polarimetric performance of a WGP, we have focused on the angular off-axis and the dispersive effects, which we have quantified with normalized transmittance and extinction ratio. If we consider light rays making minimum and maximum incident angle, \(\tan \,{\theta }_{on}^{min}\)= 0° and \(\tan \,{\theta }_{on}^{max}\)= DL/2 f = 1/2 f/# for on-axis scene incidence. For off-axis incidence, \(\tan \,{\theta }_{off}^{min}\)= (1 ‒ DWGP/DL)/2 f/# and \(\tan \,{\theta }_{off}^{max}\)= (1 + DWGP/DL)/2 f/#. Insights on the performance non-uniformity due to off-axis imaging of the scene can be obtained by comparing T and ER at \({\theta }_{on}^{min}\) and \({\theta }_{off}^{max}\), respectively, as the best and the worst-case scenario. We have therefore defined off-axis non-uniformity in transmittance (NToff) and extinction ratio (NERoff) as

$$N{T}_{off}=\frac{{T}_{TM}({\theta }_{off}^{max},\,\lambda )-{T}_{TM}({\theta }_{on}^{min},\,\lambda )}{{T}_{TM}({\theta }_{off}^{max},\,\lambda )+{T}_{TM}({\theta }_{on}^{min},\,\lambda )}$$
(2)
$$NE{R}_{off}=\frac{ER({\theta }_{off}^{max},\,\lambda )-ER({\theta }_{on}^{min},\,\lambda )}{ER({\theta }_{off}^{max},\,\lambda )+ER({\theta }_{on}^{min},\,\lambda )}.$$
(3)

Similarly, dispersive effects may be evaluated in the visible waveband by wavelength-dependent non-uniformity in transmittance (NTλ) and extinction ratio (NERλ) defined as

$$N{T}_{\lambda }=\frac{{T}_{TM}({\theta }_{off}^{max},\,\lambda =400\,nm)-{T}_{TM}({\theta }_{off}^{max},\,\lambda )}{{T}_{TM}({\theta }_{off}^{max},\,\lambda =400\,nm)+{T}_{TM}({\theta }_{off}^{max},\,\lambda )}$$
(4)
$$NE{R}_{\lambda }=\frac{ER({\theta }_{off}^{max},\,\lambda =400\,nm)-ER({\theta }_{off}^{max},\,\lambda )}{ER({\theta }_{off}^{max},\,\lambda =400\,nm)+ER({\theta }_{off}^{max},\,\lambda )}.$$
(5)

The non-uniformity ratios defined in Eqs. (25) becomes ±1 for the largest non-uniformity and approaches 0 for uniform performance. By the definition, NTλ = NERλ = 0 at λ = 400 nm.

Results and discussion

General characteristics based on the grating equation

By the conservation of momentum, grating equation specifies the direction of light propagation of diffractive optical devices in ambient and substrate modes, i.e.,

$$\sin \,{\theta }_{out}^{m}=\,\sin \,{\theta }_{in}+m\frac{\lambda }{\Lambda }$$
(6)
$$\sin \,{\theta }_{out}^{m}=\,\sin \,{\theta }_{in}+m\frac{\lambda }{{n}_{s}(\lambda )\Lambda }.$$
(7)

Here, m is an integer representing the mode number. θin and θoutm denote the angle of light incidence and an outgoing wave corresponding to the m-th mode. θin is not a function of λ under the assumption of no aberration of the imaging lens. The results suggest excitation of higher-order modes at normal incidence for λ ≥ 264 nm. For inclined light incidence, the wire-grid period Λ at which higher-order diffraction kicks in becomes even shorter.

Figure 2 presents TTM, TTE, and ER, which in effect shows dispersion relations between momentum in the x-axis and energy in the y-axis. The momentum is normalized by the free-space wave number k0 = 2π/λ. A few notes are worth a mention. First, higher-order modes appear as light momentum is increased with a larger angle of incidence. Higher-order modes also become more evident with a longer wire-grid period. Such a trend is particularly strong for TM polarization in TTM and ER. With TE polarized light, while higher-order modes also exist, overall transmittance is too much suppressed for the higher-order modes to emerge visibly.

Figure 2
figure 2

TTM, TTE, and ER for wire-grid period Λ = 200, 400, 600, 800, and 1000 nm. kx/k0 = sin(θin) in the x-axis (k0: free-space wave number). Wavelength λ in the y-axis was varied in 400 ~ 700 nm. It was assumed that ff = 0.50 and height of 200 nm. Higher-order diffraction components are shown for Λ > 200 nm.

Effect of off-axis scene components in WGP-based imaging polarimetry is presented in Fig. 3(a,b). Because polarimetric performance of a WGP depends on light incidence, the presence of off-axis scenes creates significant non-uniformity in the performance, as described earlier39. Figure 3(a,b) address the dynamics due to scene incidence with an angular spread given by the finite NA. The results suggest that the deviation in the imaging plane can be extremely high: in terms of NToff and NERoff, the non-uniformity is shown to reach up to |NToff | max = 0.93211 at Λ = 400 nm and |NERoff | max = 0.93624 at Λ = 600 nm. In other words, a scene component at a specific angle of incidence may dominate or be suppressed in an image acquired after a WGP.

Figure 3
figure 3

Non-uniformity ratios regarding the effects of off-axis scene components: (a) NToff and (b) NERoff. Dispersive effects presented by (c) NTλ and (d) NERλ. Data were collected at every 10 nm of wavelengths, i.e., Δλ = 10 nm, for WGPs with ff = 0.50 and height of 200 nm. For other wavelength points, the data were interpolated by cubic splines. Off-axis and wavelength-dependent non-uniformity ratios are calculated for various wire-grid grating periods (Λ) of a WGP between Λ = 100 and 1000 nm (see the legend).

Dispersive effects on non-uniformity and image misregistration

We now address effects of wavelength-dependent dispersion. Dispersive effects may be manifested in two ways that are potentially inter-related. First, dispersion may also cause non-uniformity in the polarimetric performance that is additional to the effect of off-axis scene components, as shown in Fig. 3(c,d), i.e., Fig. 3(c,d) address spectral performance variation in reference to λ = 400 nm to measure how far the performance quantities diverge as the wavelength becomes longer in the visible waveband for the range of wire-grid period Λ = 100 ~ 1000 nm. Although dispersive non-uniformity appears to be less significant than that caused by angle-dependent off-axis light incidence, it is still quite high, i.e., |NTλ | max = 0.84935 at Λ = 400 nm and |NERλ | max = 0.90139 at Λ = 300 nm. This suggests that specific color components in a scene may be more emphasized in imaging polarimetry, depending on the structure of a WGP. Figure 3 also shows that the non-uniformity associated with dispersion of WGP is slightly less severe than that due to off-axis scene components, e.g., |NTλ | max/|NToff | max = 0.91 if compared based on maxima. Interestingly, dispersive non-uniformity observed in Fig. 3(c,d) is quite prominent at a short wire-grid period, i.e., NTλ and NERλ can be significantly large at Λ = 100 nm. In other words, imaging polarimetry using a WGP with extremely fine wires, while it may minimize off-axis non-uniformity, may not be able to avoid dispersive non-uniformity in the performance.

Secondly, dispersive effects may produce image misregistration in the imaging plane. Schematic illustration is provided in Fig. 4(a). For simplicity, we consider zeroth and higher-order effect separately. The zeroth-order effect of dispersion can be understood by the Snell’s law, i.e.,

$${n}_{s}(\lambda )\sin \,{({\theta }_{out}^{m=0})}_{12}=\,\sin \,{\theta }_{in}.$$
(8)
Figure 4
figure 4

(a) Schematic illustration of higher-order excitation in imaging polarimetry. The illustration shows dispersive lateral spread: \(\Delta {{\boldsymbol{t}}}^{{\boldsymbol{m}}=0}\) and \(\Delta {{\boldsymbol{t}}}^{{\boldsymbol{m}}=1}\) for the zeroth and +1st-order mode. \(\Delta {{\boldsymbol{t}}}_{01}\) for the overall dispersive spread. For convenience, lateral spreads are only shown for on-axis light incidence. (b) Dispersive lateral spread \(\Delta {{\boldsymbol{t}}}^{{\boldsymbol{m}}=0}\) without higher-order modes. Calculated dispersive spread: (c) −1st and (d) +1st-order. Wavelength was sampled at every 10 nm in the range of 400–700 nm. The lateral spread was obtained for a WGP with ff = 0.50, period Λ = 1000 nm, and height of 200 nm.

Equation (8) describes light refraction into the glass substrate, where subscript ()12 denotes propagation from air ambience into glass substrate. Upon the second refraction at the bottom surface, the propagation angle of the outgoing wave becomes identical to that of an incident light, i.e., \({\theta }_{out}^{m=0}={\theta }_{in}\). Because of dispersion, a lateral spread arises in proportion to the thickness of a WGP substrate, which serves as a measure of image misregistration among various spectral components. Assuming that an imaging detector is in contact with a WGP, the refracted light becomes spectrally spread with a spread

$$\Delta {t}^{m=0}=t|\tan \,{\theta }_{out}^{m=0}(\lambda )-\,\tan \,{\theta }_{out}^{m=0}(\lambda =400\,nm)|.$$
(9)

Figure 4(b) presents \(\Delta {t}^{m=0}\) with respect to wavelength λ and incident angle θin. Assuming no higher-order diffraction, and with a typical size of a pixel of an imaging detector on the order of 10 ~ 100 μm, it is suggested that \(\Delta {t}^{m=0}\) should not cause a significant problem. However, potential imaging misregistration between images corresponding to various spectral components may arise at the edges of a pixel. This issue would particularly be critical, considering recent emergence of imaging detectors with extremely small pixels, e.g., a CCD image sensor with submicron pixels was reported45,46. With a pixel of 1-μm size, the results in Fig. 4(b) suggest misregistration on the order of six pixels.

Once higher-order modes are excited, they serve as noise for the most part. For example, it is well-known that higher-order modes suffer from relatively low polarimetric extinction. In this work, for the higher-order modes, we have defined the lateral spread in the worst case as

$$\Delta {t}_{0m}=t|\tan \,{\theta }_{out}^{m}(\lambda )-\,\tan \,{\theta }_{out}^{m=0}(\lambda =400\,nm)|.$$
(10)

which includes higher-order diffraction in addition to the effect of chromatic dispersion. For this reason, we emphasize that the overall dispersive misregistration can be greatly amplified for the higher-order modes and the deviation be potentially significant depending on the orders, as clearly shown in Fig. 4(c,d). Therefore, the result implies requirement of higher-order diffraction components to be either fully filtered or suppressed using WGPs with a sufficiently short Λ.

Dispersive effects in Stokes polarimetry

Before we present diattenuation-based dispersive effects in imaging polarimetry, it is also of interest to explore the effect on more general Stokes polarimetry. For convenience, we assume single-point acquisition in the far-field with unpolarized light incidence. The results are shown in Fig. 5 where changes of polarization states are described on the Poincaré sphere when the angle of incidence (θin) as well as the wavelength (λ) is varied from θin = 0 ~ 60° in the visible waveband.

Figure 5
figure 5

Variation of polarization states after transmission through a WGP is shown in a red line on the Poincaré sphere for (a) θin = 0°, (b) 15°, (c) 30°, (d) 45°, and (e) 60°. Incident light wavelength ranges from λ = 400 to 700 nm with a 50-nm step in the visible waveband.

It is quite clear that the polarization states may vary drastically due to light dispersion. Interestingly, the spectral variation of polarization tends to increase with a high angle of incidence due both to the difference in phase change that TE or TM polarization undergoes with respect to the incidence angle and also to the changes in the relative amount between the components polarized in parallel and orthogonal to the wire-grids47,48,49. If we quantify the variation of polarization states with a sum of arc lengths (Lp) measured between points on the Poincaré sphere (initially from U = 1), dLp/dθin = 0.5795 and dLp/dλ = 1.185. More importantly, we emphasize that the polarization state varies with light wavelength and may thus be affected significantly by the spectral content of an incident scene. The scene-based variation of polarization states may be additive to the dispersive effects that were presented previously. For more explicit understanding of dispersive effects in imaging polarimetry, however, we only consider diattenuation properties of a WGP in the following section.

Imaging polarimetry with dispersive effects

Under the conditions without dispersive artifacts, polarimetric performance is represented by ER. Now, performance non-uniformity and image misregistration due to light dispersion degrades overall polarimetric content in a pixel. Figure 6 presents images with and without dispersive artifacts, i.e., an image was initially converted into an object by mapping RGB combinations in the CIE 1931 xy chromaticity space. Here, we have used a baboon image, which is one of the test images often employed in the image processing community. Because the lateral spread due to wavelength dispersion is exacerbated in the presence of higher-order diffraction, the worst-case performance occurs at a large angle of incidence with a long wire-grid period (Λ) greater than light wavelength. For the simulation of dispersive effects, therefore, the wire-grid period of WGP and incident angle were assumed to be Λ = 1000 nm and θin = 60° as the worst case. The baboon image as an object was acquired through a WGP as 50:50 TM and TE component. A TM image was obtained as transmitted through a WGP, while TE image intensity was proportionately reduced with ER(λ). The TM and TE component images were then combined to form a final ideal image without dispersion in Fig. 6(a). This is compared with an image in the presence of dispersive effects corresponding to the zeroth-order and also the case including higher-order diffraction modes, respectively shown in Fig. 6(b,c). For the visual convenience, higher-order images were normalized by the peak intensity of the (−1)st TM image. In this illustration of an example, the detector sensitivity was assumed to be independent of wavelengths. With wavelength dependence, the contrast between ideal and dispersive images can be even starker.

Figure 6
figure 6

Dispersive effects simulated on images acquired in imaging polarimetry. (a) Ideal image without dispersion. Images that suffer from (b) zeroth-order and (c) 1st-order dispersion. Insets demonstrate loss of clarity as a result of dispersive effects in (b) vs. no dispersion in (a). Image of mandrill created by the US Department of State and available under the CC PDM 1.0 license.

The comparison of the magnified images laden with dispersive artifacts in the inset of Fig. 6(b) to that of an ideal image presented in Fig. 6(a) emphasizes that the dispersion induced by a WGP degrades the quality of images acquired in imaging polarimetry and the management of dispersion can therefore be crucial to the performance. Although we have assumed a simple WGP with wire-grids aligned in a single direction, a pixelated WGP with multiple orientations would incur dispersive effects much more complicated than are presented in Fig. 6. It is thus suggested that the results should be reflected in the design principles of imaging polarimetry and can be critical in applications of a WGP that operates in a broadband spectrum at a large angle of incidence using an imaging lens with a high NA, and when the pixel of an imaging detector is small comparable with the diffraction limit50,51,52. For example, higher-order diffraction modes need to be strictly suppressed, e.g., using a WGP with sufficiently fine wire-grids or employing filters in the light paths. To remove or reduce zeroth-order dispersive effects, a WGP is best integrated directly with a detector or in the closest proximity. A WGP mounted in an optical plane that is relayed in a 4 f imaging system to an image plane would cause more severe artifacts because of the distance between a WGP and a detector. This may be compensated using dispersive devices, e.g., gratings, in the optical path.

Concluding remarks

In summary, we have investigated effects of dispersion in imaging polarimetry based on WGPs. Mainly, there are mainly two effects: dispersion aggravates performance non-uniformity, which is less severe than one that is caused by off-axis imaging. In addition, dispersive lateral spread produces image misregistration in the detector. The spread is expected to be smaller than or on the order of 10 μm in typical imaging conditions. This may be significantly worse when higher-order diffraction components are excited and can add systemic noise to the acquired images. The results therefore emphasize the need to consider dispersive artifacts for the design of WGP-based imaging polarimetry.