Introduction

In nanoscale conductors that have channel lengths comparable to the mean free path of carriers, transport approaches the ballistic limit. A ballistic model should also describe the transport properties of well-designed MOSFETs with channel lengths approaching or below the mean scattering length λ. On the other hand, there is a marked difference between the current-voltage (ID − VD) characteristics from a purely ballistic model, such as the one proposed by Natori1, and experimental data. Although complex models can fit experimental data more precisely2, the simplicity of a fewer-parameter model based on a ballistic transport formalism is desirable.

In Natori’s model for ballistic transistors, MOSFET current-voltage (ID − VD) relations were derived based on Landauer’s formula for current through a ballistic conductor. In this work, we extend this transport model by including two physically meaningful parameters to predict device data accurately. They are transmission probability T and drain-channel coupling parameter Δ. Δ captures the effects of the longitudinal electric field in the channel and represents the fraction of the drain Fermi level that appears on the source end. The inclusion of Δ allows us to match the experimental ON-resistance (RON) in the low bias regime, which is the central result of our work. We provide physical justifications for these parameters, which hitherto had not been incorporated in a ballistic model to our knowledge.

Additionally, the threshold voltage (VT) is either obtained from device characterization techniques or allowed to vary to account for VT variation with device length. Here, we use either a two-parameter fit, by fixing VT, or a three-parameter fit, where all three parameters are allowed to vary, to characterize experimental data. By allowing VT to vary, we can account for the effects of the halo implants on the channel length, for example. We relax this requirement when comparing devices over a narrower range of length where the VT variation is small.

To support our model, we fabricated n-channel MOSFETs with gate lengths L ranging from 50 nm to 3 µm as we discuss in Methods.

We provide a detailed length dependence of the parameters we examined, including T and Δ. Using these parameters, we predict the minimum ON-resistance (RON) that can be achieved theoretically by setting T = 1. This value is substantially higher than that predicted from a purely ballistic model from others. We also model our data using the simplest long-channel model. By comparing the two models, we calculate the demarcation length at which one model becomes more appropriate over the other.

Most importantly, we find that T can be assumed to be independent of drain bias. A bias independent T is counter to the conclusions from the virtual source (VS) model, which has been successful in modeling nanoscale MOSFETs2,3,4,5. Here, we provide justifications for using a fixed T and discuss the assumptions used in each model that determine their respective values for T.

Ballistic Transport

Landauer’s formula

In nanoscale transistors, where the channel length becomes comparable to or less than the mean free path of carriers, we enter the quasi-ballistic or pure ballistic regime of transport. Here, parameters like the mobility and mean free path start to lose the physical significance that they held in the diffusive transport regime. In a ballistic transistor, scattering events are ideally zero. Landauer emphasized the role of contacts in determining the current through such a conductor. The maximum conductance, in this case, is limited by the contacts and is given by Gc = 2q2/h per conduction mode, where q is elementary charge and h is Planck’s constant. Thus, the resistance is finite even though there is no scattering in the conductor. The minimum resistance arises because the current is carried by many transverse modes in the contacts but by only a few modes or subbands inside the conductor. This requires a redistribution of the current among the current carrying modes at the interface that leads to a finite contact resistance6. It can also be thought of as an energy penalty that an electron has to pay in order to be funneled into the narrow channel from contacts that are comparatively wide. Here, we distinguish between this contact resistance, an intrinsic parameter that sets the ultimate resistance of a ballistic device, and series resistance, an extrinsic effect that arises from the physical size and nature of the contact to the source/drain regions of a MOSFET.

In this report, we describe a new ballistic transistor model with the fewest fitting parameters reported today. The important insight we provide is that the net occupancy of transmission modes is reduced from the original Natori’s model due to a finite longitudinal electric field in the channel, which is captured in the parameter Δ. In the Appendix, we describe the origin of Δ in greater detail.

Natori’s Model for Ballistic Transistors

In Landauer’s formalism (Eq. 1) for ballistic transport7, the current through any conductor is proportional to the probability T(E) that an electron can transmit through it and the quasi-Fermi levels that determine the occupancy of right- and left-moving carriers. In a conductor with multiple subbands, the current also depends on the number of modes M(E), which depends on energy E. Hence, the difference in the Fermi functions f1 − f2 determines the net transmission modes.

$${I}_{D}=\frac{2q}{h}\int dE\,T(E)M(E)({f}_{1}-{f}_{2})$$
(1)

Using Landauer’s approach, K. Natori1 derived the current-voltage (ID − VD) characteristics of a ballistic MOSFET. Natori’s model was developed for purely ballistic transistors that have channel lengths L < λ, where λ is the mean scattering length, and for perfect gate coupling. Even by setting T < 1, however, the model does not represent experimental data. Figure 1 shows the Natori’s model with T < 1 to match the maximum current in our L = 60 nm device, which we show again in section IV with our modified model. In Fig. 1, it is clear that the ON-resistance (RON) in the low bias regime is unrealistically too small compared to the experimental data. The goal of the present effort is to include physically relevant parameters to the original Natori’s model to match experimental ID − VD characteristics.

Figure 1
figure 1

Data from L = 60 nm, W = 1 µm nMOSFET where the gate voltage is stepped from 0 to 1.2 V in 0.3 V increment. Natori’s model (squares) with T < 1 is used to estimate device data (solid). Natori’s model predicts an unrealistically low RON at low drain bias. The same device data is modeled in section IV using the quasi-ballistic model described in the text.

The original model, however, serves as a benchmark for the highest current that could be achieved for a given technology node. For example, the degree of ballisticity that one can extract from it is an extremely useful parameter for assessing the scaling potential of transistors. Here, by extending the original model, we can better match to experimental data and provide new insights into the nature of ballistic transport in highly scaled MOSFETs.

Eq. 1 clearly shows the competition between T(E) and M(E)(f1 − f2). To determine them, one must first realize that these parameters are local, as discussed in more detail in the Appendix. Therefore, although Eq. (1) can be determined at any point along the channel, we apply it at a point that allows us to gain the most physical insight. This point is at the top of the band near the source, as we discuss below. At the top of the source, we must carefully define the quasi-Fermi levels which may be different from those of the source/drain contacts, as we discuss in the Appendix in more detail. When these parameters are properly accounted for, we show that a simple two-parameter quasi-ballistic model for nanoscale MOSFETs can be developed that matches experimental data. Our results show that T(E) need not depend on the bias, unlike the results from previous work5,8,9,10,11,12. We show that this discrepancy fundamentally arises from the use of non-local parameters to determine the net occupancy of M(E).

A Two-Parameter Ballistic Model

In this work, we present a simple yet physical modification to the ballistic transistor model to match experimental data. This is accomplished by extending Natori’s ballistic model to include the effects of channel transmission (T) and drain-channel coupling parameter Δ. These modifications are supported by the physics of the device that we present. In this section, we discuss the origin of these two parameters in detail. We show that T and Δ are closely linked. Depending on the use of either local or nonlocal quasi-Fermi levels, a bias independent or a bias dependent T will emerge.

Transmission (T)

In a quasi-ballistic MOSFET, the channel length is comparable to the mean free path λ of carriers in the channel. Thus, carrier scattering is significantly reduced but is not entirely absent. Most importantly, mobility no longer becomes a valid parameter. Instead, transmission (T) captures the effect of these scattering processes on the drain current. The most natural place to assess the value for T is at the peak of the band diagram8, as we show in Fig. 2. This is because the carrier density there is controlled by the gate and the lateral field is small. This region is close to the source and in near equilibrium with the source quasi-Fermi level. Thus the source quasi-Fermi level determines the occupancy of the right-moving carriers. Since the net current is due to the difference in the right- and left-moving carriers, one must also determine the occupancy of the left-moving carriers, which we discuss below.

Figure 2
figure 2

(a) Band profile along the channel of a quasi-ballistic transistor depicting the effective quasi-Fermi level Efd′ for the left-moving electrons at the source end. The factor Δ measures the fraction of Efd that appears at the top of the source. (b) The full dispersion relation at the top of the band showing that the reflected carriers have equal states as the transmitting carriers to which to reflect, which implies that 1-T is bias independent.

In our model, T is largely independent of drain voltage and energy, although this assumption is not always found in the literature5,9,10. This discrepancy in T arises because it enters in the Landauer’s model as a product with M(E). Our model captures the imperfect coupling of the gate to the channel. In such cases, a longitudinal field develops due to the drain bias, and the drain quasi-Fermi level necessarily has to vary along the channel, as discussed in the Appendix. In other words, (f1 − f2) must be defined locally, not from those of the source/drain contacts as is done typically. For example, in the model of ref.5, the drain quasi-Fermi level is assumed to be constant along the channel and was used to model the current at the top of the source. This is the same assumption used in Natori’s original work. When the drain quasi-Fermi level is used throughout the channel, however, one underestimates the occupancy of the left-moving modes at the source-end. Since the total current is the difference in the right- and left-moving carriers, the overall current is larger than if one allowed the drain quasi-Fermi level to vary along the channel. As stated earlier, a varying quasi-Fermi level is necessary due to the longitudinal field and requires one to define the quasi-Fermi levels locally. When the drain quasi-Fermi level is used throughout the channel, however, a bias dependent T is needed to match experimental data to compensate for a reduced number of left-moving modes5. Typically, T has to start out small and increase with the drain bias VD. In essence, since the product T(E)M(E) enters into the Landauer model, we have shifted the bias dependence to the occupancy of M(E) while previous models have incorporated the bias dependence in the parameter T. Clearly, the two approaches result in a completely different physics of nanoscale MOSFETs, even though both may reproduce experimental data.

Here, we show that device data can be modeled accurately using a model that does not require T to vary with VD. In the simplest form, T is given by Eq. (2).

$$T=\frac{\lambda }{\lambda +L}$$
(2)
$${I}_{D}/W=T{I}_{0}({F}_{1/2}(u)-{F}_{1/2}(u-{v}_{d}))$$
(3)

Eq. (3) is the ID − VD characteristics given by Natori (Eq. 22 in ref.1), modified by T and vd. (We note that vd needs to be normalized to the thermal voltage before using it in Eq. (3)). The first term in (3) captures the current in the channel due to right-moving carriers and the second term captures the current due to left-moving carriers. W is the width of the device and I0 is determined by material parameters. F1/2 is the half order Fermi integral that can be evaluated numerically. The difference in the two F1/2 functions accounts for the nonequilibrium carrier density in the channel under bias. As stated earlier, the most natural place to assess the transport is at the top of the conduction band, as shown in Fig. 21,4,13. Here vd, instead of the actual drain bias VD, is used to determine the occupancy of the left-moving carriers at the top of the source. We define vd empirically as a fraction Δ of the drain bias VD.

T is a fitting parameter that we extract for different length devices. Due to scattering, the ON current of a MOSFET is less than the ballistic ON-current by the fraction T < 1 that depends on the relative values of the channel length L and λ. We assume that the mean free path λ is a constant for a given transistor node. In addition, the ON-current is also reduced in the linear region because we have a larger number of left-moving carriers due to the use of vd instead of the drain voltage VD.

When Δ = 1, the Fermi level of the drain contact determines the difference f1 − f2 in Eq. (1), which is valid only in the case of perfect gate coupling. This is the origin of the unrealistically small RON predicted in the original Natori’s model, as we show in Fig. 1 and discussed elsewhere14. We show below that by including Δ < 1, we can achieve more realistic RON values.

Effective Drain-Quasi Fermi Level (Δ)

In our model, we use Δ as a fitting parameter to account for the longitudinal electric field in the channel. This field modifies the quasi-Fermi level of the left-moving carriers at the top of the source. Since the electrical field is small there, the right moving carriers can be assumed to be in near equilibrium with the source contact. This allows us to use the source quasi-Fermi level Efs to determine the occupancy by the right-moving carriers.

The population of the left-moving carriers at the peak, however, cannot be determined by Efd of the drain contact. To elucidate the dynamics at the peak, we superimpose the dispersion relation (E vs k) and the conduction band profile under bias in Fig. 2. The difference in the quasi-Fermi levels Efs and Efd due to a bias on the drain VD creates an imbalance in the populations of right- and left-moving electrons throughout the channel. In a purely ballistic channel with no longitudinal electric field in the channel, Efd would be used to demarcate the quasi-Fermi level of the left-moving electrons throughout the channel, defined as Efd = EfS − qVD. In most MOSFETs where the gate control is not perfect, a potential energy profile takes on the form depicted in Fig. 2(a). To account for the longitudinal field, we define Efd′ that results in a larger occupancy of the left-moving modes at the top of the source. Here, Efd′ must be above Efd, as depicted in Fig. 2, and determines the population of the left-moving carriers at the top of the band. Even if Efs is not strictly in equilibrium with the source contact, the band bending implies that Efd′ > Efd at the top of the band6.

Just as Efd needs to be defined locally, so must T(E). To elucidate, we show the full effective mass dispersion relationship at the peak of the band in Fig. 2(b). Since the lateral field at the top of the source is small, the right- and left-moving carriers are expected to be well defined by local quasi-Fermi levels Efs and Efd′, respectively. As indicated by the arrows, the number of states that are available for the reflected carriers (1 − T(E)) are equal to the transmitting states for all bias conditions. The preceding argument holds irrespective of whether Efd′ or Efd is used. Therefore, we can see that as long as the gate keeps the carrier density and the barrier height fixed at the peak, T(E) is expected to be largely independent of VD. In fitting to device data, we show that a single value for T(E) can accurately predict the ID − VD curves for a given channel length device.

We now estimate Efd′ to model experimental data. From Fig. 2, we have elected to use \({v}_{d}={\Delta }\cdot {V}_{D}\) where Δ is a constant determined by fitting to experimental data. Thus, we have Efd′ = −q vd. We expect Δ to vary with transistor geometry; the more control the gate exerts on the channel, the closer Δ will approach 1. The use of a constant Δ is further justified by fitting to device data. We show that Δ is a fundamental parameter of our MOSFETs, rather than one that is affected by extrinsic parasitic effects.

In addition to the quasi-ballistic devices, we measured long-channel MOSFETs. Because of the wide variation in the length we examined, a fair amount of threshold voltage variation with channel length is observed across the devices. Thus, initially, VT is allowed to vary as a fitting parameter. Since this variation is negligible for devices with similar lengths, VT is later considered to be fixed for those devices. The variation in VT is due to processing steps incorporated to reduce short-channel effects. For example, with decreasing channel length, VT roll-off can occur. To counter this, source/drain extensions and halo implants are added. These steps are responsible for the length dependence of VT with channel length. For devices with L < 100 nm, we can fix VT without affecting the fits.

Results

To support our model, we fabricated n-channel planar MOSFETs at SUNY-Poly’s 300 mm fabrication facility. The devices have channel lengths that range from L = 50 nm to 3 µm, with width W = 1 µm for all devices. The MOSFETs have an equivalent SiO2 gate thickness of 2 nm. We determined the fitting parameters using a least-squares minimization.

R-squared Goodness of Fit

In Fig. 3(a,b), we show the results from L = 1 µm and 60 nm MOSFETs. For both devices, we fit the data using a simple long-channel model given in Eq. (4) and the modified ballistic model discussed in section III. We, however, do not expect the long-channel parameters we extract for the L = 60 nm device to be physical. Similarly, we do not expect the parameters from the quasi-ballistic model to be valid for the L = 1 µm device. This exercise, however, will help to assess the length scale at which one model is more accurate over the other.

$${I}_{D}/W=\frac{{C}_{ox}{\mu }_{e}}{L}[({V}_{G}-{V}_{T}){V}_{D}-{V}_{D}^{2}/2]$$
(4)
Figure 3
figure 3

ID − VD characteristics of n-channel MOSFETs for (a) L = 1 µm and (b) L = 60 nm. The gate voltage is stepped from 0 to 1.2 V in 0.3 V increment. For each device, we use both a long-channel model and the quasi-ballistic model to fit the data. To treat both models on an equal footing, we allow VT to vary. (a) From the long-channel model, we find µe = 151 cm2/Vs and VT = 0.23 V; from the quasi-ballistic model, we have T = 0.05 and VT = 0.44 V. (b) From the long-channel model, we have µe = 46 cm2/Vs VT = 0.19 V; from the quasi-ballistic model, we have T = 0.22 and VT = 0.38 V.

For the long-channel Eq. (4), VT and electron mobility µe are fitting parameters. Beyond the linear region, we assume a constant current given by the saturation current determined from Eq. 4 in the usual way. We treat the two models on an equal footing by initially allowed VT to vary for both models, although we expect significant discrepancies in the results from the two models and expect some of the parameters to be not physical.

We confirmed a cross-over in the R-squared goodness-of-fit in Fig. 4, which is plotted for the two models as a function of channel length. A cross-over in R-squared values suggests that there is a length scale below which the ballistic description is the more appropriate one.

Figure 4
figure 4

R-squared goodness of fit vs. length for long channel model (red) and quasi-ballistic model (black) treated equally by allowing VT to vary. A crossover is seen near L = 0.5 µm.

For the device with L = 1 µm in Fig. 3(a), both models result in a good fit, provided that VT is allowed to vary. However, the long-channel model is the correct model for this channel length and results in a better fit as measured by the R-squared goodness-of-fit shown in Fig. 4. The long-channel mobility we extract is µe = 151 cm2/Vs. The quasi-ballistic model for this length device gives a reasonably good fit, albeit with a wrong VT. The threshold voltage is over 0.2 V above that of the long channel model, which is the correct one as determined by direct measurement (see the figure captions for the extracted parameters).

For the L = 60 nm device in Fig. 3(b), we observe that the quasi-ballistic model provides a far better fit compared to the long-channel model. The long-channel model requires unreasonably low values for both µe and VT, which are not physical15,16.

In order to determine the channel length below which the quasi-ballistic model would be more appropriate, both the long channel and the quasi-ballistic fits were performed for all channel lengths and the goodness of fit parameter, R-squared, was plotted as a function of length for both fits. This is shown in Fig. 4. We see from this figure that a crossover in R-squared occurs around L = 500 nm, although this is not the demarcation length for quasi-ballistic transport since we allowed VT to vary. Indeed, we expect each model to overestimate the range in which the underlying physics is valid because we allowed VT to vary. However, there is a clear confirmation that the modified quasi-ballistic model is a better one for highly scaled devices below a certain channel length. The devices near the crossover length at 500 nm can be viewed as neither being quasi-ballistic nor purely diffusive.

We thus conclude that the addition of only two parameters to the purely ballistic model from Natori can dramatically improve the ID − VD characteristics for nanoscale transistors, which we demonstrate for the first time.

Experimental Determination of T and Δ

To better support the quasi-ballistic model, we examine the length dependence of transmission T. In Fig. 5, we fix VT and examine devices over a narrower range of length. From the plot of transmission probability vs. length of all device lengths ranging from 60 nm to 0.5 µm, it was observed that only the devices with L < 100 nm followed the trend given by Eq. (2). To extract the transmission probability, the threshold voltage was fixed to 0.4 V, which was determined experimentally for these devices, and a two-parameter fit was performed. Fixing VT did not affect the quality of the fit for devices with L < 100 nm.

Figure 5
figure 5

Transmission probability T vs. Length determined by fitting the ID − VD curves to the modified Natori’s model. The solid curve is a fit to Eq. (2), yielding λ = 19 nm.

From the fit, we extract λ = 19 nm. This implies that the quasi-ballistic model would be more applicable below 100 nm. Therefore, according to the trend in transmission probability with length, a more precise estimate of the demarcation length below which the devices enter the quasi-ballistic regime of transport is around 100 nm. This demarcation length is further supported by other analysis we provide in Figs 6 and 7.

Figure 6
figure 6

Extracted Δ vs length (L) from the fit for the devices examined in Fig. 5.

Figure 7
figure 7

Experimental channel resistance (RON) measured at VD = 0.04 V and VG = 1.2 V, and purely ballistic RON by setting T = 1 at the same bias. The solid curve is the expected transition to the purely ballistic RON as L is reduced below λ = 19 nm.

Our estimate for λ is significantly higher than those estimated previously17. This is consistent with the fact that in our model the inversion charge that contributes to left-moving carriers persist to higher VD. The result is that fewer carriers contribute to the net current at a given VG compared to earlier models. Thus, they must be moving with longer mean scattering length.

We plot the extracted Δ in Fig. 6 for the devices shown in Fig. 5. The striking result is that Δ is largely independent of channel length. Therefore, the main parameter that changes with length is T. The results of Fig. 6 strongly suggests that one cannot attribute Δ to an external series resistance, one possible extrinsic parameter that could affect Δ. This is because a series resistance would create a length-dependent voltage drop and cause Δ to decrease with reduced channel length (i.e., with more current), not one that is constant with length as seen in Fig. 6. Although we cannot completely rule out the effects from source and drain series resistance, their effects appear to be minimal.

Δ is expected to be independent of length if it is an intrinsic parameter of the device. This is because Δ also represents the carrier concentration near the source, which would be primarily determined by the gate, as we discuss in the Appendix. There, we show that in a quasi-ballistic conductor Δ and T are the same. In a nanoscale MOSFET, the two parameters are decoupled since the gate also influences Δ.

Ballistic Contact Resistance

We further demonstrate the significance of T with length by extrapolating the purely ballistic resistance value by setting T = 1. In Fig. 7, the black data points represent the channel resistance, RON, of the measured devices. RON is the output resistance obtained experimentally from the ID − VD data at a low value of VD (0.04 V) and VG = 1.2 V for L < 100 nm devices. Extrapolating this linear graph to L = 0 should give us the contact resistance in the limit of no scattering. This would be proportional to the quantum of resistance (1/Gc) or the contact resistance that arises due to the difference in the number of modes available between the contacts and the channel, modified by the longitudinal field. The contact resistance will be the lowest achievable resistance no matter how advanced the technology for this gate voltage. This extrapolated value is about 170 Ω-µm, or about 85 Ω-µm per contact.

The theoretical minimum value of RON can also be calculated for each of the four devices by setting the transmission probability to unity (T = 1). These are represented by the red data points in Fig. 7. They are largely independent of length and average to a value of 154 Ω-µm (or 77 Ω-µm per contact), very close to the value obtained experimentally. The similarity in the two values provides important implications on the limit of RON that can be achieved for our planar devices.

We note that the theoretical value of contact resistance reported in ref.18 is about half of the value we report. The implications of the larger value we observe are as follows: The inclusion of the drain-channel coupling parameter Δ has the effect of reducing net carriers that transmit, resulting in a larger theoretical resistance as we show experimentally. Δ is a function of transistor geometry. We expect Δ to be smaller in a planar geometry compared to a FinFET geometry. This is because the gate control of the channel is superior in a FinFET19,20,21,22, resulting in a flatter band profile than the one shown in Fig. 2. Therefore, we would expect the theoretical RON to be smaller for a FinFET resulting from a larger Δ. For the planar geometry, the implication from Fig. 7 is that the transition to the purely ballistic RON will take place at L ~ λ, as depicted by the solid curve near L = 19 nm.

Finally, we note that including the drain-induced barrier lowering (DIBL) in our model does not qualitatively change the outcome of the results, although the fit to that data improves somewhat. DIBL primarily affects the current in the saturation regime and does little to affect RON, as was noted previously18. Including the DIBL parameter adds a small slope to the current in the saturation regime, which is not present in the current model. Our model with DIBL will be the subject of future work.

Conclusion

We report on a two-parameter quasi-ballistic quantum transport model for nanoscale transistors that extends Natori’s original ballistic model. The two parameters are T and Δ, the transmission probability and drain-channel coupling, respectively. The central feature of our work is the inclusion of Δ, which reduces the net carriers that transmit in the channel and accounts for the spatially varying channel potential. We show that our model can dramatically improve the fit to device data, which we validate by fabricating n-channel MOSFETs of varying lengths. The length scaling of both T and Δ suggests that these are intrinsic parameters of the device, not ones determined by extrinsic effects.

A length dependent analysis extracted from the quasi-ballistic model reveals significant implications regarding the fundamental limit to conductance in a ballistic transistor and the mean free path for carriers in quasi-ballistic MOSFETs. The length scale for the applicability of the model is estimated to be 100 nm and less. We extract a mean scattering length of 19 nm at room temperature for our planar devices.

Using the length dependent study, we show that the theoretical RON is significantly larger than the value one would predict for a device with perfect gate control. The larger RON arises whenever there is a transverse electric field along the channel, which reduces net modes that transmit compared to a device with no longitudinal electric field in the channel.

Methods

The devices were fabricated in SUNY Poly’s 300 mm wafer fabrication facility using industry standard bulk 65 nm process flow with self-aligned extension, halo and source-drain implants setting back from the channel with dual spacers. We fabricated CMOS devices using heavily doped dual polysilicon gates in shallow trench isolation (STI). We employed self-aligned silicide formation on gate and source-drain electrodes and used Cu contacts to minimize the parasitic series resistance. The measured effective oxide thickness EOT was 2 nm.