Introduction

Spin-transfer torque magnetoresistive random access memory (STT-MRAM)1,2,3,4,5,6 has been developed for a wide range of applications, such as buffer memory7,8, IoT9,10/AI11,12, and automobile13,14,15/space16,17, by replacing the existing SRAM, DRAM18,19, and Flash memories. The development of magnetic tunnel junction (MTJ), a storage component of STT-MRAM, requires tailoring its performance to such application requirements. MTJ is engineered mainly in two directions: for SRAM-like and Flash-like applications, where fast speed and long data retention are critical, respectively. In addition, like other electronics components, MTJ development has always come along with a challenge in scalability20. The MTJ consists of a thin tunnel barrier sandwiched by two ferromagnetic layers (free and reference layers) and engineering of the stack design enables MTJ performance to meet requirements in a variety of applications13,21,22,23. Because of superiority in writing efficiency24 and scalability20 as well as the compatibility with the CMOS integration process, currently developed MTJs are based on CoFeB/MgO material system with a perpendicular magnetic anisotropy, which originates from interfacial anisotropy at the CoFeB/MgO interface25. To improve MTJ performance, the interfacial anisotropy is enhanced by capping a CoFeB free layer with another MgO layer26 and/or inserting an MgO layer into a CoFeB free layer27. These so-called interfacial-anisotropy MTJs have demonstrated scaling down to 1X nm26,27,28,29 and have been commercialized with a diameter of several tens of nanometers by semiconductor foundries30,31,32. However, scaling beyond 10 nm (single nanometer) is challenging with the interfacial-anisotropy MTJ due to insufficient retention performance28. To address this issue, the MTJ that relies on not only interfacial anisotropy but also perpendicular shape anisotropy by making the CoFeB free layer thick at ultra-small scale has been proposed and its basic operation was demonstrated down to 3.8 nm in diameter33. However, the so-called shape-anisotropy MTJ33,34, while showing high retention performance at ultrasmall scale, is difficult to achieve fast STT switching due to increasing switching current at a shorter timescale as described later in detail.

In this Article, we report single-nanometer MTJs using a multilayered ferromagnetic structure as a free layer, whose performance can be tailored for both speed- and retention-critical applications. The multilayered ferromagnetic structure uses the conventional CoFeB/MgO material system without resorting to new materials but consists of multiple CoFeB/MgO layers. This enables independent control of shape and interfacial anisotropies, which determine retention and/or switching performance. We demonstrate sub-ten-ns STT switching and over-10-year retention individually in single-nanometer MTJs with different stack designs.

Results

Design of CoFeB/MgO multilayers as a free layer

The CoFeB/MgO multilayers consist of multiple CoFeB/MgO layers, i.e., [CoFeB(tCoFeB)/MgO]N, where each CoFeB thickness tCoFeB and the repetition N of CoFeB/MgO layers can be varied (Fig. 1a). This enables independent control of interfacial and shape anisotropies, determining factors for retention and switching speed performance as explained later. Multiple CoFeB layers in the free layer are strongly coupled through a magnetostatic interaction that becomes evident by shrinking the MTJ sizes and behaves as a single ferromagnet35 while exchange coupling that was reported to exist in previous studies27,36 may have some effect in our system as well. The total thickness of the multiple CoFeB layers accounts for shape anisotropy; increasing tCoFeB enhances the total anisotropy. Note that each CoFeB thickness is limited to 15 nm, beyond which may cause an incoherent reversal37,38 while coherent magnetization reversal is confirmed with 15-nm CoFeB layer39. On the other hand, interfacial anisotropy is effectively enhanced by increasing N40, and the contribution of interfacial anisotropy becomes more significant than that of shape anisotropy by reducing tCoFeB. In the following, we show a design principle using the CoFeB/MgO multilayers to tailor its performance specialized for retention or speed.

Fig. 1: Concept of single-digit nanometer MTJs for various applications.
figure 1

a Stack structure of studied MTJ with schematic illustration of design principle. bd TEM images of studied MTJs for (b) DBL(15) (ref. 33), (c) QUAD(5), and (d) OCTA(2). The white line in Fig. 1b, 1c, and 1d is a 10 nm scale bar. e Summary of coercive field μ0HC for studied MTJs. f R-H curve for OCTA(2) with D = 2.0 nm. TEM image of DBL(15) in (b) is reproduced from ref. 33. Copyright 2018 Author(s), licensed under a CC BY license. A part of data in (e) is reproduced with permission from ref. 35. Copyright 2020 IEEE; ref. 40. Copyright 2021 IEEE.

Retention property is measured as thermal stability factor Δ, which, assuming coherent reversal in a ferromagnet, is expressed as

$$\varDelta =\frac{E}{{k}_{{\rm{B}}}T}=\frac{{K}_{{\rm{eff}}}t}{{k}_{{\rm{B}}}T}\frac{\pi {D}^{2}}{4},$$
(1)
$${K}_{{\rm{eff}}}=\frac{{M}_{{\rm{S}}}{H}_{K}^{{\rm{eff}}}}{2}={K}_{{\rm{b}}}+\frac{{K}_{{\rm{i}}}}{t}-{\rm{\delta }}N\frac{{M}_{{\rm{S}}}^{2}}{2{\mu }_{0}},$$
(2)

where E is an energy barrier between two magnetization states; kB the Boltzmann constant; T the absolute temperature; Keff the effective magnetic anisotropy energy density; D the free-layer diameter; HKeff the effective magnetic anisotropy field; Kb the bulk (magnetocrystalline) anisotropy energy density; Ki the interfacial anisotropy energy density; t the ferromagnet thickness; δN the difference in the dimensionless demagnetization coefficients, or the shape anisotropy coefficient, between the perpendicular and in-plane orientations; MS the spontaneous magnetization; and μ0 the permeability in free space. As in Eq. (1), Δ is proportional to KefftD2. Since the required Δ is determined by applications and D scales down with technology nodes, one can only adjust Kefft. As Kb in the CoFeB/MgO material system is negligibly small25, increasing Δ needs the enhancement of interfacial and/or shape anisotropies, corresponding to the second and third terms in Eq. (2), respectively.

The MTJ for retention-critical applications does not require high-speed operation and switching within several tens of ns is acceptable. On the other hand, it requires a high retention property at a wide range of temperatures; for example, automobile applications demand 10-year retention at −40 to 150°C41, and embedded memory applications, where the program code is loaded in the chip before soldering, require data retention during 260°C reflow process corresponding to 10-year retention at 225°C42,43. This means that the retention property has to be insensitive to temperature. According to previous works, shape-anisotropy MTJ shows smaller changes in E and coercivity with T compared to interfacial-anisotropy MTJ because Keff of the former (latter) is mainly governed by MS (Ki), which is relatively T-insensitive (sensitive) for the CoFeB/MgO systems44,45,46,47. Therefore, the free layer needs to be thick so that temperature-insensitive shape anisotropy can be dominant.

For the speed-critical applications, switching voltage VC at a short pulse duration τp typically less than 10 ns is required, while the retention requirement is relaxed to several seconds to months. At short τp, or in a precessional regime1,4,48,49,50, where switching is determined by the dispersion of the initial magnetization angle due to the thermal fluctuation and the transferred total angular momentum with the negligible role of thermal fluctuation51, VC, at which switching probability PSW is 0.5, increases in inversely proportional to τp as1,48,49,50,52

$${V}_{{\rm{C}}}={V}_{{\rm{C}}0}\left\{1+\left(\frac{{\tau }_{{\rm{D}}}}{{\tau }_{{\rm{p}}}}\right)\frac{\mathrm{ln}\left[4\varDelta /\mathrm{ln}2\right]\,}{2}\right\},$$
(3)
$${V}_{{\rm{C}}0}=\frac{2\alpha e\varDelta {k}_{{\rm{B}}}T}{{\rm{\hslash }}{g}_{{\rm{STT}}}}\frac{4}{\pi {D}^{2}}\,{RA},$$
(4)
$${\tau }_{{\rm{D}}}=\frac{1+{\alpha }^{2}}{\alpha \gamma {\mu }_{0}{H}_{K}^{{\rm{eff}}}},$$
(5)

where VC0 is intrinsic critical voltage, α the damping constant, e the elementary charge, the Dirac constant, gSTT the STT efficiency, γ the gyromagnetic ratio, and τD the characteristic relaxation time. τD determines the switching time scale in this regime4 and should be lowered to suppress the VC increases with reducing τp. This can be achieved by increasing HKeff [see Eq. (5)]4,53, and in this regard, MTJs with shape anisotropy, whose Δ mostly relies on t, is unfavorable, as they have smaller Keff. Therefore, lowering t and increasing Keff, i.e., increasing the interfacial (or bulk) anisotropy, are the design principles for fast switching. In addition to switching speed, we have confirmed that t should be thinner than 20 nm for STT switching in our shape-anisotropy MTJs (see Supplementary Information S1 for STT switching in shape-anisotropy MTJs). Note that increasing α also shortens τD, but there is a trade-off as VC0 is proportional to α (see Supplementary Information S2 for calculation of τD with different α values).

Fabrication of single-nanometer MTJs with CoFeB/MgO multilayers

Following the design guide presented in the previous section, we fabricate the MTJs with various tCoFeB and N using the multilayered ferromagnetic structure as a free layer. The stack structures fabricated in this work are summarized in Table 1. Here, MTJs with N = 1, 2, 3, and 4 are referred to as DBL, QUAD, HEXA, and OCTA, respectively35,40. We also fabricate the MTJ with a 15-nm CoFeB free layer capped by a Ta layer (SGL(15)) for comparison. See Methods for more information about MTJ fabrication. Note that the contribution of shape (interfacial) anisotropy becomes less (more) dominant at t < D. Fig. 1b–d shows transmission electron microscopy (TEM) images for DBL(15), QUAD(5), and OCTA(2), respectively. Bright layers and dark layers in between bright layers correspond to MgO and CoFeB layers, respectively. Small MTJs down to 2.0 nm in diameter are fabricated for various structures listed in Table 1. Fig. 1e shows coercivity μ0HC with respect to D for studied structures and Fig. 1f shows MTJ resistance R as a function of external magnetic field H for OCTA(2) with D = 2.0 nm. For further measurements to investigate the thermal stability factor Δ and STT switching property, we choose devices whose HC is close to the average so that they represent typical device properties for each structure. Regarding tunnel magnetoresistance (TMR) ratio, we confirmed that the TMR ratio decreases with increasing N, possibly due to parasitic resistance from the inserted MgO layer in the multilayer40. As demonstrated in previous works27,29, optimizing the MgO layer could address this issue.

Table 1 Information of studied samples.

Single-nanometer MTJs with high data retention capability

We study retention property in the fabricated MTJs based on the stack design principle described in the previous section. First, we characterize Δ at RT by measuring switching probability under a magnetic field. The obtained Δ at RT is summarized in Fig. 2a. Δ is enhanced by increasing N from SGL(15) to DBL(15) to QUAD(7.5) even with the same total CoFeB thickness and also by increasing tCoFeB from QUAD(2) to QUAD(5) and QUAD(7.5) even with the same N, indicating the positive effects of the interfacial and shape anisotropies. Δ of QUAD(5) and QUAD(7.5) is comparable with that of DBL(15) and DBL(20), respectively. This means that inserting an MgO layer enables to reduce the required thickness by about 5 nm, which is beneficial for fast STT switching as described earlier35,37,38. For the MTJs with total CoFeB thickness below 10 nm or with interfacial anisotropy enhanced, such as QUAD(2), HEXA(2), and OCTA(2), Δ over 50 is obtained even below 5 nm40.

Fig. 2: Evaluated retention properties for studied MTJs.
figure 2

a Obtained Δ with respect to D in MTJs for studied MTJs. b Temperature dependence of Δ for QUAD(5) and QUAD(7.5) with D ~ 7 nm. Curves are calculation based on power law scaling relationship between E and MS with obtained n values (see Methods for more details). The inset shows an enlarged figure. c Summary of exponent n for DBL(15), QUAD(5), QUAD(7.5), and interfacial-anisotropy MTJ for comparison. The error bar shows the standard error of fitting Eq. (7) to the experimental results. A part of data in (a) and (b) is reproduced with permission from ref. 35. Copyright 2020 IEEE; ref. 40. Copyright 2021 IEEE. A part of data in (b) and (c) is reproduced with permission from ref. 44. Copyright 2021 AIP Publishing LLC.

We then measure the temperature dependence of Δ in QUAD(5) and QUAD(7.5) with D ~ 7 nm (Fig. 2b) to examine high-temperature capability. For comparison, we also plot Δ for DBL(15) with D = 8.4 nm from the previous study44. Δ over 70 at 150 oC is obtained in QUAD(7.5)35. From the temperature dependence of Δ, we can obtain scaling exponent n, which characterizes temperature dependence of Δ or E44,54 (see Methods for more information). Note that the smaller the n values are, the weaker the temperature dependence of Δ or E becomes, which is preferable for high-temperature applications. Fig. 2c summarizes n values for various structures. For comparison, we also plot n for DBL(15) and interfacial-anisotropy MTJ with a free layer of CoFeB(2)/MgO (DBL(2)) from previous studies44,54. Increasing total tCoFeB or decreasing N makes the n values smaller, i.e., temperature insensitive, and close to 2 as a result of the decrease (increase) in the contribution of interfacial (shape) anisotropy. We then calculate the temperature dependence of Δ (curves in Fig. 2b) using modified Eq. (8) (see Methods for more details). At RT, the Δ of QUAD(5) is higher than that of DBL(15), but because DBL(15) has a smaller n, the Δ of DBL(15) is higher than that of QUAD(5) when T is above 150°C. While n of 2 is expected from the origin of shape anisotropy, n values related to interfacial anisotropy tend to be larger than three55 due to device-fabrication-induced variation in magnetic properties at the vicinity of the device edge56. It is noteworthy that the impact of device fabrication is observed in the interfacial-anisotropy MTJs but not in the shape-anisotropy MTJ, suggesting that the bulk of a CoFeB layer has less impact on fabrication than a CoFeB/MgO interface39,44. The obtained results prove that tailoring MTJs to emphasize the shape anisotropy is effective for retention-critical applications.

Single-nanometer MTJs with high-speed capability

Next, we investigate STT switching with voltage pulse for various MTJ structures in the single-nanometer regime. Fig. 3a, b shows R as a function of pulsed voltage V for different pulse duration in representative MTJs with shape or interfacial anisotropies enhanced: SGL(15) with D = 8.5 nm and HEXA(2) with D = 4.5 nm, whose Δ are comparable (34 and 37, respectively). Switching is achieved at 1 μs and 100 ns in both structures. At shorter pulse duration, however, while HEXA(2) shows switching for both anti-parallel (AP) to parallel (P) states and vice versa at 10 ns, SGL (15) does not show switching from AP to P states. This can be explained by the increase in switching voltage in the precessional regime described earlier.

Fig. 3: STT switching measurement for SGL(15) with D = 8.5 nm and HEXA(2) with D = 4.5 nm.
figure 3

MTJ resistance R as a function of V: (a) for SGL(15) with D = 8.5 nm and (b) for HEXA(2) with D = 4.5 nm. Insets in (a) and (b) show R-H curves for the MTJ devices. c VC with respect to pulse duration for different structures. Curves correspond to fitting curves using Eq. (3). d, e VC for P to AP and AP to P switching (VC_P and VC_AP, respectively) normalized by that at 1 μs, respectively. Curves are normalized fitting curves obtained from (c). A part of data in (c), (d), and (e) is reproduced with permission from ref. 40. Copyright 2021 IEEE.

We then compare the trends of STT switching with respect to pulse duration between various structures. Fig. 3c shows VC as a function of pulse duration for the MTJs presented in Fig. 3a, b (results for other MTJs are shown in Supplementary Information S3). Bidirectional switching with τp < 10 ns is obtained for MTJs with D < 5 nm. To focus on the VC increase at shorter pulse durations, VC is normalized by the one at 1 μs for P to AP (AP to P) switching and plotted as a function of τp in Fig. 3d, e. While VC does not change significantly in the range of 1 μs to 100 ns for both structures, the VC increase at shorter pulse durations is more moderate in HEXA(2), where the interfacial anisotropy is dominant. We evaluate HKeff for studied structures from switching probability measurement under a magnetic field and summarize in Supplementary Information S4. We confirm that HKeff becomes larger by enhancing interfacial anisotropy as in HEXA(2) and OCTA(2). We then extract τD by fitting Eq. (3) to the experimental results in Fig. 3c, where Δ in Fig. 2a is used as a fixed parameter and τD is used as a fitting parameter. Note that the results obtained in our sample are well fitted by Eq. (3) with pulse durations up to 1 μs, even though Eq. (3) is valid in the precessional regime with a pulse duration up to about 10 ns according to previous studies1,48,57. The extracted τD is summarized with respect to μ0HKeff (from Fig. S4 in Supplementary Information S4) for various structures in Fig. 4. The τD decreases with increasing HKeff in consistent with Eq. (5), and fitting (Solid curve) yields α = 0.0139 ± 0.0013, which reasonably agrees with previous studies25,58. Note that the α value should be similar among studied structures. According to a previous study58, α of the MgO/CoFeB/MgO system stays constant regardless of CoFeB thickness. We also evaluate N dependence of α using VNA-FMR in the multilayer and confirm that the increase in N does not affect α so much as well (see Supplementary Information S5). Most of the MTJs with shape anisotropy enhanced (red symbols) show larger τD and this fact explains the increase in VC with decreasing τp below around 100 ns shown in Fig. 3c–e (see also Supplementary Information S3). On the other hand, short τD is obtained for those with interfacial anisotropy enhanced (blue symbols), explaining the high-speed switching shown in Fig. 3a–c (see also Supplementary Information S6). The obtained results prove that tailoring MTJs to emphasize the interfacial anisotropy is effective for fast STT switching.

Fig. 4: Summarized τD as a function of μ0HKeff for each structure.
figure 4

Curve corresponds to Fitting using Eq. (5). The error bar shows the longest and shortest differences between both τD (for P to AP switching and AP-to-P switching), including standard errors, obtained by fitting using Eq. (3). Samples with shape (interfacial) anisotropy enhanced are indicated as red (blue) symbols.

Discussion

This work uses a CoFeB/MgO material system with the multilayered ferromagnetic structure to tailor device performance to applications requiring high data retention or high speed in a single-nanometer regime. Here we discuss how to improve device performance further in the proposed structures.

For Flash-like, or retention-critical, applications, one needs to maintain high retention property even at high temperatures. We demonstrate Δ over 70 at 150oC by enhancing shape anisotropy in multilayered ferromagnetic structure, which is enough for a single device. However, to satisfy the read error rate of 10−9 for 109-bit system44, Δ needs to be over 80 at 85oC, 150oC, and 225oC for consumer, automobile, and solder-reflow process, respectively. Further increasing tCoFeB and/or MS by increasing Fe concentration in CoFeB layer54 may achieve the required Δ.

For SRAM-like, or speed-critical, applications, one should suppress a rapid increase in write voltage at shorter τp for reliable high-speed switching. In our results, even though we achieve sub-ten-ns switching, we see a sizable increase in the switching voltage at the pulse duration shorter than 10 ns. Further VC reduction in the precessional regime at the ultra-small scale can be expected by combining multilayered ferromagnetic structure and ultra-thin CoFeB layers59,60. Here we calculate STT switching voltage beyond 1 ns using Eqs. (3)–(5) considering a free layer of various materials/structures, shown in Fig. 5 (see Methods for the parameters used). For comparison, we also show calculated VC for QUAD(2) and HEXA(2) (red curves). In all the free layers, we assume D = 7.5 nm and α = 0.005. All the proposed free layers are expected to show smaller VC than that of QUAD(2) and HEXA(2) owing to further enhancement of interfacial anisotropy. We especially reveal that the ultra-thin CoFeB layer, i.e., [CoFeB(0.66)/MgO]3 (dashed blue curve) has the potential to achieve VC of less than 1 V even at 1 ns. Another way for reliable high-speed switching is to introduce other materials with large Kb and low MS such as L10-MnAl61. In our calculation, L10-MnAl (green curve) also has the potential to achieve VC of less than 1 V even at 1 ns. Note that one needs to control α to avoid VC0 increase carefully. Apart from material/stack development of the free layer, introducing a double-spin MTJ structure62,63 could also be another possibility to achieve reliable switching below 10 ns. Furthermore, this concept can be applied to devices operated with a spin-orbit torque (SOT)64,65,66. Since our approach is to increase the precessional frequency, the proposed stack could also be applied to three-terminal MTJs controlled by SOT65,66 or a combination of STT and SOT67,68,69,70,71.

Fig. 5: Design of the free layer toward further high-speed operation.
figure 5

a Calculated VC with respect to pulse duration for different structures using Eqs. (3) to (5). The parameters for HEXA(0.66), HEXA(0.9), and MnAl are based on the previous studies58,60 (see Methods). b 1/τp vs. VC, replotted from (a) to see a difference in structures easily. The used parameters are shown in Methods.

In this study, we show CoFeB/MgO-based MTJ technology in the single-nanometer regime by introducing a multilayered ferromagnetic structure, which enables controlling interfacial and shape anisotropy independently at ultrafine scale. We show that emphasizing shape anisotropy allows sufficient thermal stability at high temperatures, whereas emphasizing interfacial anisotropy enables fast STT switching at 10 ns or below at ultrafine scale, which satisfies the requirement for Flash-like or SRAM-like applications. We also propose MTJ stacks that allow even faster switching with a lower voltage than studied MTJ stacks. This work provides design guideline to fabricate ultra-small MTJs used in a wide variety of applications.

Methods

Sample preparation

The MTJ stacks were deposited on 3-inch thermally oxidized Si wafers by dc/rf magnetron sputtering system. The stacks are, from the substrate side, the followings: Ta(5)/Ru(10)/Ta(15)/Pt(5)/[Co(0.4)/Pt(0.4)]6/Co(0.4)/Ru(0.4)/Co(0.4)/[Pt(0.4)/Co(0.4)]2/Ta(0.2)/CoFeB(1)/MgO(1.0)/(Co0.25Fe0.75)75B25(15.5)/Ta(5)/Ru(5) for SGL(15); Ta(5)/Ru(10)/Ta(tTa)/Pt(5)/[Co(0.4)/Pt(0.4)]6/Co(0.4)/Ru(0.4)/Co(0.4)/[Pt(0.4)/Co(0.4)]2/Ta(0.2)/CoFeB(1)/MgO(tMgO)/[(Co0.25Fe0.75)75B25(tCoFeB)/MgO(0.93)]N/Ru(5) with (tTa, tMgO, tCoFeB, N) = (15, 1.0 or 0.96, 15, 1), (15, 0.94, 5, 2), and (20, 0.94, 7.5, 2) for DBL(15), QUAD(5), and QUAD(7.5), respectively; and Ta(5)/Ru(10)/Ta(20)/Pt(5)/[Co(0.3)/Pt(0.4)]6/Co(0.6)/Ru(0.4)/Co(0.6)/[Pt(0.4)/Co(0.3)]2/W(0.2)/CoFeB(1)/MgO(tMgO)/[(Co0.25Fe0.75)75B25(2)/MgO(0.93)]N/Ru(5) with (tMgO, N) = (0.98, 2), (0.94, 3), and (0.93, 4) for QUAD(2), HEXA(2), and OCTA(2), respectively. The numbers in the parentheses in the stacks are nominal thicknesses in nm. To suppress the increase in RA with increasing N and achieve as low RA as possible for STT switching, the MgO barrier thickness between the free and the reference layers was varied for each structure. Note that the impact of the increases in N on RA should be small; according to a previous study on DBL structure72, the contribution of the top MgO layer to the total RA is less than 1 Ωμm2. The stacks were processed into circular MTJs with various diameters by electron beam lithography, reactive ion etching, and multistep Ar ion milling33. After Ar ion milling, the MTJ was covered ex-situ with a SiN passivation layer using chemical vapor deposition. The processed wafers were post-annealed for 1 h (3 h) at 300oC for SGL(15), DBL(15), QUAD(5), and QUAD(7.5) (QUAD(2), HEXA(2), and OCTA(2)). MTJ diameters were electrically determined from the resistance (R)-area (A) product (RA) and the resistance of the parallel state of MTJs. RA of the studied MTJs is summarized in Table 1. We used DBL(15) with RA = 5.4 (3.1) Ωμm2 for the evaluation of Δ (STT switching). We used OCTA(2) with RA = 1.1 and 1.7 (1.0) Ωμm2 for STT switching (TEM image shown in Fig. 1d and R-H curve shown in Fig. 1f).

Thermal stability measurement

We evaluated Δ and HKeff from switching probability measurement using a pulse magnetic field51. We applied the pulse magnetic field Hpulse 50 times. The measurements were carried out at temperatures from RT up to around 190oC35,44. We fitted the following equation to the experimental results with Δ, HKeff, and a shift field Hshift as fitting parameters,

$${P}_{{\rm{SW}}}^{{\rm{P}}({\rm{AP}})}=1-\exp \left\{-\frac{{\tau }_{{\rm{p}}H}}{{\tau }_{0}}\exp \left[-\varDelta {\left(1\mp \frac{{H}_{{\rm{pulse}}}-{H}_{{\rm{shift}}}}{{H}_{K}^{{\rm{eff}}}}\right)}^{2}\right]\right\}$$
(6)

where τpH is the pulse duration (= 1 s) of Hpulse, and τ0 is the inverse of the attempt frequency (assumed to be 1 ns in this study).

Extraction of scaling exponent n

We extracted n in the same way as in ref. 44. From the temperature dependence of E (ΔkBT) and MS, we obtained n as a slope of the following power law relationship:

$$\frac{E\left(T\right)}{E\left({T}^{* }\right)}={\left\{\frac{{M}_{{\rm{S}}}\left(T\right)}{{M}_{{\rm{S}}}\left({T}^{* }\right)}\right\}}^{n}$$
(7)

where T* is a normalizing temperature (RT in this study). We calculated the temperature dependence of MS using Bloch law, MS(T) = MS0(1 – aT3/2), where MS0 is MS at 0 K and a is a material dependent parameter, providing MS0 of 1.59 T and a of 2.47 × 10−5 K−3/2.

Calculation of Δ as a function of T

We calculated the temperature dependence of Δ (curves in Fig. 2c) using modified Eq. (7) as follows:

$$\varDelta \left(T\right)=\varDelta \left({T}^{* }\right)\,\cdot \,\left(\frac{{T}^{* }}{T}\right)\,\cdot \,{\left\{\frac{{M}_{{\rm{S}}}\left(T\right)}{{M}_{{\rm{S}}}\left({T}^{* }\right)}\right\}}^{n}$$
(8)

where Δ(T*) is Δ at normalizing temperature (RT in this study). We took Δ from Fig. 2b into account Eq. (8). For the calculations, we used n values of 2.27, 3.44, and 2.58 for DBL(15), QUAD(5), and QUAD(7.5), respectively.

STT switching

We used a source meter to apply DC voltage and measure MTJ resistance for DC measurement. For high-speed measurement, we applied pulse voltage via rf port of bias-T and then measured MTJ resistance by source meter with reading voltage = 30 mV via dc port of bias-T. We measured switching probability with a pulse magnetic field.

The parameters used for the calculation of V C

We calculated VC shown in Fig. 5 under an assumption of either [CoFeB/MgO]N multilayered ferromagnetic structure or L10-MnAl. The parameters used for calculation were as follows:

  1. (1)

    [CoFeB(2)/MgO]2 or 3 (QUAD(2) or HEXA(2))

    MS = 1.38 T, α = 0.005, P = 0.4, Kb = 0 J/m3. Ki = 3.48 and 4.66 mJ/m2 40 for QUAD(2) and HEXA(2), respectively.

  2. (2)

    [CoFeB(0.66 or 0.9)/MgO]3 (HEXA(0.66) or HEXA(0.9))

    MS = 1.70 T, α = 0.005, P = 0.4, Kb = 0 J/m3, and Ki = 4.44 mJ/m2 59.

  3. (3)

    L10-MnAl

    MS = 0.60 T, α = 0.005, P = 0.4, Kb = 8.5 × 105 J/m3, and Ki = 0 mJ/m2 61.