Limited accuracy of conduction band effective mass equations for semiconductor quantum dots

Effective mass equations are the simplest models of carrier states in a semiconductor structures that reduce the complexity of a solid-state system to Schrödinger- or Pauli-like equations resempling those well known from quantum mechanics textbooks. Here we present a systematic derivation of a conduction-band effective mass equation for a self-assembled semiconductor quantum dot in a magnetic field from the 8-band k · p theory. The derivation allows us to classify various forms of the effective mass equations in terms of a hierarchy of approximations. We assess the accuracy of the approximations in calculating selected spectral and spin-related characteristics. We indicate the importance of preserving the off-diagonal terms of the valence band Hamiltonian and argue that an effective mass theory cannot reach satisfactory accuracy without self-consistently including non-parabolicity corrections and renormalization of k · p parameters. Quantitative comparison with the 8-band k · p results supports the phenomenological Roth-Lax-Zwerdling formula for the g-factor in a nanostructure.

replaced by derivatives (momentum operators in the coordinate representation). However, at the same time, the band edges and other parameters become position-dependent via their dependence on local composition and strain. This leads to a Schrödinger-like equation with a position-dependent effective mass and Landé factor and the ordering between the momenta and these two parameters starts to play a role. This ordering is arbitrary if the effective mass model is introduced as a heuristic extension of the bulk formulas. Arguments based on solvable models of abrupt interfaces 25,26 lead to the conclusion that the only physically correct ordering in a non-homogenous medium is the most common "k(1/m*)k". This conclusion has been formally verified by rigorous derivation from the exact multi-band envelope function equations 2,10 .
In this work we would like to address a more practical question of the achievable quantitative accuracy of the cb effective mass theories for quantum dot (QD) systems. Thus, the goal of this paper is to rigorously derive a family of effective mass Hamiltonians for a cb electron in a strained, inhomogeneous system by applying a systematic series of approximations to the Löwdin partitioning of the 8-band k · p Hamiltonian in the envelope function approximation and to validate their predictions against the results of the 8-band model with respect to the energy spectrum and Landé factors. By effective mass Hamiltonians we understand cb Hamiltonians for a carrier in an external magnetic field, obtained by a unitary transformation (partitioning) that eliminates the coupling between the conduction and valence bands of the original Hamiltonian, that contain quadratic (in momentum k) kinetic energy terms and may contain cubic spin-orbit couplings. We will show that reproducing the results from the 8-band theory by an effective mass equation is possible with limited precision and the form of the equation required to achieve the best available accuracy is much more complicated than the usual Schrödinger-or Pauli-like equation for the envelope function. On the other hand, the Landé g factor of the nanostructure can be reasonably estimated by the simple semi-phenomenological Roth-Lax-Zwerdling formula.

The starting point: 8-band k · p model
The 8-band k · p Hamiltonian in the envelope function approximation is defined in the block notation as 3 where the blocks refer in the standard way to the cb (6c), the j = 3/2 valence band (vb, 8v) and the j = 1/2 (spin-orbit split-off) vb (7v) and are explicitly given by 3 where P′ and Q are couplings to higher conduction bands; σ i are Pauli matrices; J i are matrices of the j = 3/2 representation of angular momentum; T i are matrix representations of a vector operator between j = 1/2 and j = 3/2 states, i.e., , with the matrix elements of the spherical components T q (1) given in terms of the Clebsch-Gordan coefficients 〈j 1 j 2 ; m 1 m 2 |jm〉 by the Wigner-Eckart theorem, q m 2/3 3/2, ; 1, 1/2, , for m = ±1/2, m′ = −3/2, …, 3/2; and T ij = T i J j + T j J i . The system is placed in an axial magnetic field. In numerical calculations we use gauge-invariant discretization scheme 13 for the covariant derivative.
The material parameters used in our k · p calculations are given in Table 1. In order to avoid ′ < A 0 c , which would break the ellipticity condition, we rescale E P to obtain ′ = A 1 c , which gives 27 E P = (m 0 /m* − 1)E 0 (E 0 + Δ 0 )/ (E 0 + 2Δ 0 /3). Due to inconsistency of the reported values 3,28 , we calculate q using the perturbative formula 29 and are 14 band k · p parameters 3 . We account for the strain within a continuous elasticity approach 30 . Piezoelectric field in the system is calculated up to the second order in the polarization 31 with the parameters taken from ref. 32 .

Derivation of the effective mass Hamiltonian
The essence of the method 24,33 is to perturbatively decouple the group of states of interest from all the other states by using a canonical transformation T = e S , with an anti-hermitian operator S, in order to obtain a transformed Hamiltonian = ∼ † H THT , in which the inter-band terms (treated as a perturbation) are approximately eliminated. We will use a modified version of the van Vleck quasi-degenerate perturbation theory 34 as presented in ref. 35 . The Hamiltonian is split into its block-diagonal and block-off-diagonal parts (coupling states within a single group of states and between the two groups of states, respectively), H = H (d) + H (od) . The operator S is required to have null matrix elements within the groups (that is, it has to be block-off-diagonal). The group of states of interest here are the cb states. Since we start from the 8-band k · p theory, the other group of states are vb states. We define the superoperator Ŝ representing the adjoint action of S on the algebra of operators: . We therefore take into account only the linear term in Eq. (5) and write The neglected corrections are O(k 3 ) and would lead to quartic terms in ∼ H (d) , which is beyond the effective mass approximation, even with spin-orbit terms. Quantitatively, the linear truncation according to Eq. (6) amounts to neglecting corrections of relative magnitude ΔE/E 0 , where ΔE is the interband energy separation due to confinement. Taking ΔE as the excitation energy in the direction of strongest confinement (~200 meV) one gets a rough estimate of 20% for the error due to truncation.
The remaining, block-diagonal part of the transformed Hamiltonian is now written as where H 0 is assumed to be diagonal in a certain basis |α, i〉 (where α denotes a group of states and i are individual states within these groups), and is selected in such a way that the remaining parts are in some sense small. ′ ∼ H c and ′ ∼ H v denote the cb and vb blocks of ∼ H, respectively, with the corresponding parts of H 0 subtracted. In the problem at hand, we choose for H 0 a diagonal Hamiltonian which is constant and proportional to unity within each of the 6c, 8v and 7v bands and approximately represents the band edges in a strained QD. We will denote the respective energy values by α E , α = 6c, 8v, 7v. One can understand them as the average band edges corrected by hydrostatic strain (while the splitting between heavy and light holes within the 8v band is not included). The operator S is written as S = S cv + h.c., where S cv denotes one of the two off-diagonal blocks (a 2 × 6 matrix in the standard k · p matrix notation). The effective band gaps are denoted by where the entries correspond to the 6 valence bands of the 8-band k · p model. Then, from Eq. (6), one finds Note that the arbitrariness of choosing the diagonal Hamiltonian H 0 is removed here, as the subtracted energies (the operator Δ ) are added back to the remaining part of the Hamiltonian. Equation (7) has a structure of a Sylvester equation but the operators appearing here are not finite-dimensional matrices. One can treat this equation as a matrix one, in the sense of the block notation over the subbands, but then the problem of non-commutativity of the matrix elements appears (due to non-commutativity of k with position-dependent quantities), precluding the application of standard algebraic methods for solving this equation. In order to overcome this difficulty, we expand the operators S cv and H cv in powers of k, where the coefficients C (1) and C (2) are defined by comparison with the explicit form of Eqs (2d) and (2e). Then, upon rearrangement of terms one gets from Eq. (7) in the subsequent (formal) orders in k The non-commutativity problem persists since ′ ∼ H c and ′ ∼ H v contain k-dependent terms, while X (n) are position-dependent. Returning to Eq. (6) one can see that the terms of ∼ H (d) linear and quadratic in k generate corrections to S on the order of k 2 and k 3 , respectively. According to Eq. (4), these corrections generate terms O(k 3 ) and O(k 4 ) in the effective mass Hamiltonian for the cb. The latter are beyond the usual effective mass approximation, while the former correspond to spin-orbit terms but (by a simple perturbation argument) appear with a coefficient ∼ ∼ . P C E / 06 k 2 g 2 nm 3 · meV (using InAs parameters), which is two orders of magnitude smaller than the Dresselhaus coefficient for InAs, α D = 27 nm 3 · meV. It appears, therefore, that the kinetic part of ~′ H v can be discarded in the derivation of a usual effective mass equation in the parabolic band approximation, that is, one with O(k 2 ) kinetic terms and the relevant O(k 3 ) spin-orbit corrections. However, as we will see below, including non-parabolicity effects at this point improves accuracy of the modeling of a self-assembled QD. Therefore, it seems reasonable to keep the kinetic part of ′ ∼ H c and ′ ∼ H v . In order to obtain a solvable system of equations, we therefore propose to self-consistently replace the k-dependent terms by their averages in the eigenstate of interest.
In our derivation, Eqs (8a)-(8c) depend on the blocks of the transformed Hamiltonian ∼ H rather than on the initial Hamiltonian H hence, together with Eq. (4), they form a system that cannot be solved in a closed form. Clearly, in the leading order one could replace ′ ∼ H c and ′ ∼ H v by the original blocks ~′ H c and ~′ H v (the correction is O(k 2 ), yielding corrections that are formally O(k 4 ) in the resulting cb Hamiltonian). However, as we will see, including at least some corrections to these blocks improves the accuracy of the equation. It is known that the major corrections to the cb and vb Hamiltonians resulting from the decoupling procedure are the renormalization of the cb electron mass and of the Luttinger parameters, respectively. Therefore, we propose to take into account these strong effects only and to use, in place of ′ ∼ H c and ′ ∼ H v , the cb and vb blocks of the original 8-band Hamiltonian but with the position-dependent renormalized parameters, is the local band gap, including the hydrostatic strain-induced shift. Note that these band-decoupling corrections are O(k 2 ), so the distinction between , is only important if the k-dependent terms are included self-consistently, as proposed above.
In the approximations proposed here, Eqs (8a)-(8c) are a system of usual Sylvester equations that can be solved iteratively. A solution can be obtained in a closed, analytical form using the general method of ref. 36 . However, the form of the solution simplifies considerably if one discards the contribution of the spin-dependent part of ′ ∼ H c to the operator S (these terms remain included to the leading order in H 6c6c that is part of H (d) , see Eqs (2a) and (4)). These terms are very small compared to any other energy scales in the problem, hence their contribution to S is negligible. Within this approximation, one has  χ ′ = ′ ∼ × H c c 2 2 , where χ ′ c is a scalar function of position and  × 2 2 is a 2 × 2 unit matrix. Then, the solution to the above system of equations can be obtained trivially.
, one finds 35 ~= ( od) or, in the leading order, . Then, the correction to the cb Hamiltonian up to the order k 3 can be decomposed into two parts. The first one, which we will denote by ∼ H (2) , is formally quadratic in k and is proportional to C C j l (1) (1) in our notation. This contribution yields corrections to the electron effective mass and Landé factor. The second , is of third order in k and contains terms proportional to C C jl n (2) (1) . It includes the Dresselhaus spin-orbit term. Thus, the resulting transformed Hamiltonian can be written as = . From Eq. (4) in the linear approximation one finds [ , ] h c

Interpretation, Verification and Discussion
In this Section, we first present various approximations to the effective mass Hamiltonian and relate the equations resulting from some of these approximations to the common form of the effective mass Hamiltonian, written in terms of the effective mass tensor and the g-factor given by the Roth formula. Next we define a series of approximation for which we present quantitative comparison of the predictions from the effective mass theory with the results from the 8-band k · p Hamiltonian.

Interpretation of the effective mass equation.
We restrict the discussion to the quadratic term [Eq. (10)] that determines the fundamental properties of the energy spectrum.
In order to simplify notation, we define the 2 × 6 matrices  where the two components of the direct sum correspond to the j = 3/2 (hh and lh) and j = 1/2 (spin-orbit split-off) subbands of the vb. Then, by direct inspection of Eqs (2d) and (2e) one finds jn n . The second-order correction to the effective mass Hamiltonian is then where we use the fact that j c is a number and commutes with the matrices ∼ i  and . It might seem that the above Hamiltonian confirms the correctness of the particular (BenDaniel-Duke 37 ) ordering of the operators in the kinetic term. This is not the case: a simple manipulation of the terms in Eq. (12) under assumption P = const allows one to rewrite the second-order correction in the equivalent form~~~T where a different (Gora-Williams-Bastard [38][39][40] ) ordering appears in the kinetic term. Interestingly, the correcting terms in Eqs (12) and (13) involve only spatial derivatives of cb and vb parameters, respectively. It is hence clear that the two orderings are, in a sense, dual and each of them corresponds to neglecting terms in the Hamiltonian, related to spatial modulation of either the conduction or valence band. As we will see explicitly below, these additional terms are essential to correctly reproduce the Rashba spin-orbit interaction. Further analytical insight into the somewhat unusual form of our effective mass Hamiltonian ∼ H is hindered by the need to invert the "band gap operator" , which produces rather intransparent and intractable formulas (and, in practice, is performed numerically). One might expect that neglecting the off-diagonal elements of , from which this difficulty stems, is a good approximation, since these elements are rather small compared to the band gap. The operator  in this approximation will be denoted by ∼ . Since it is diagonal its inverse powers are found trivially. Let us denote its diagonal elements by (E hh , E lh , E lh , E hh , E so , E so ). They can be interpreted as the offset of the local edges of the three valence subbands with respect to the cb edge at a given point in space. We split the T D T ∼ − † j n l matrix into a symmetric and asymmetric part,

T D T TD T T D T
Using the explicit forms of the matrices  j one finds up to the linear order in η . Thus, the symmetric part is spin-independent and contributes to the kinetic part of the effective mass Hamiltonian in the well-known way via the electron effective mass tensor; indeed, the first term in Eq. (12) combined with the kinetic term in H 6c6c [Eq. The second term yields Thus, the symmetric component of the first term in Eq. (12) yields the usual kinetic energy of the effective mass theory (Eq. (15)) in the commonly used ordering "k(m*) −1 k" (BenDaniel-Duke 37 ordering). This is, however, corrected by the second term (Eq. (16)) and one could as well start from Eq. (13) and arrive at an equivalent Hamiltonian with Gora-Williams-Bastard [38][39][40] ordering and with an H v -dependent correction term instead of Eq. (16). The asymmetric part of H (2) can be interpreted most easily starting form the alternative form of the Hamiltonian given in Eq. (13). The asymmetric contribution to the first term in Eq. (13) Here, ĝ differs from g (Roth) by including strain effects as well as anisotropy resulting from the hh-lh splitting in the nanostructure. The Roth-Lax-Zwerdling formula has the advantage that the fundamental band gap E g , instead of being calculated at some level of approximation to the k · p theory, can be taken from experiment [42][43][44] . While the standard Roth-Lax-Zwerdling formula is linked to the 8-band k · p Hamiltonian via a diagonal and parabolic approximations, the full effective mass Hamiltonian accounts also for non-parabolicity corrections and the full structure of the vb.
The asymmetric part of the second term in Eq. (13) is a generalized Rashba term. To see this, assume that the spatial variation of ′ H v results from an external electric field , hence j v v 6 6 . Then, taking only the strain-independent terms in the second term in Eq. (13) and using Eq. (17)  which is the standard Rashba spin-orbit term 3 . The full Eq. (13) yields the Rashba term generalized to arbitrary inhomogeneity of the valence band and includes corrections due to inhomogeneous strain. The same effects are accounted for (although not so explicitly) by Eq. (12).
Quantitative assessment. In the following, we compare the accuracy (with respect to the 8-band k · p results) of various approximations to the effective mass equation derived above. Our systematic approach allows us to discuss the possible approximate equations in the uniform framework, according to the level of approximation made to the  operator.
(1) As the very first step, obviously inaccurate but included for completeness, we apply the bulk approximation without strain, in which the operator  is taken diagonal and without the kinetic (k 2 ) terms, hh lh lh hh so so with local bad gaps calculated from unstrained bulk values, E hh = E lh = E 0 , E so = E 0 + Δ 0 , where the band edges are interpolated according to the local composition (see Table 1). (2) The first reasonable approximation is to correct the above procedure by accounting for band-edge shifts due to local strain, which is done by including the diagonal strain-dependent matrix elements in H v and H c . Thus, the  operator is still given by Eq.
where the local value of strain is used and parameters are interpolated according to the local composition. In this way, the major correction to the band gap (which is a crucial factor determining the effective mass and g-factor) is included in the model. This will be referred to as bulk approximation with strain. (3) The value used for the band gap in the previous approach is still a crude approximation since the actual energy spacing between the states in the conduction and valence bands in a QD are affected by spatial confinement. A common approach, taken in many cases [42][43][44] to estimate the electron g-factor when interpreting experimental data, is to use the measured value of the fundamental transition energy E g (exp) as the effective band gap value for a given system. In our numerical study this corresponds to replacing the fundamental band edge offset in the operator  (together with its strain correction) by the splitting between the top vb and bottom cb states obtained from the full 8-band calculation, while using the bulk values of the spin-orbit splitting Δ 0 and the Kane parameter P interpolated according to composition. The  operator is still given by the diagonal form of Eq. (21) but now 0 , with a constant E g (exp) and a local composition-dependent Δ 0 . We will refer to this as semi-phenomenological approximation. For the g-factor, a further simplification along this line is achieved by introducing constant values of Δ 0 obtained by averaging the position-dependent values weighted by the squared wave function, which yields an explicit effective Roth-Lax-Zwerdling formula for the g-factor of a nanostructure, that is, Eq. (20) with the effective values of the parameters. (4) In the off-diagonal approximation, we include the full structure of the vb Hamiltonian H v , hence the full form of , but still neglect the k-dependent terms. In block notation, As a test of these approximations, we study the low-energy part of the spectrum of a self-assembled QD. We consider two models of a lens-shaped, self-assembled InAs/GaAs QD. In both models the QD has 24 nm diameter and 4.2 nm height but they differ in the composition profile: While the first model assumes a uniform composition with 100% InAs inside the QD, the second one has a more realistic trumpet-shape profile of InAs/GaAs composition with the InAs content defined by 45 where we took C b = 0.4, C t = 0.8, r p = 0.9 nm and z p = 1.4 nm. In both cases the QD is placed on a 0.6 nm wetting layer (WL), which in the case of the first dot contains 100% and in the latter one 40% InAs. The two figures of merit that we will investigate here are the energy splitting ΔE sp between the ground ("s-shell") electron state and the lowest excited ("p-shell") state at zero magnetic field (Fig. 1) and the ground state g-factor, extracted from the leading order (linear) term of the Zeeman splitting of the ground-state doublet at low magnetic fields (Fig. 2). In both cases we show in the plots the values obtained using only the first, usual term in Eq. (12) (points) and those from the full equation, including also the non-standard second term (crosses).
As shown by the results in Fig. 1, the effective mass methods typically fail to reproduce the s-p shell splitting which, in fact, belongs to the most fundamental quantitative characteristics of a QD system. What matters here is the renormalization of the k · p parameters according to Eq. (9), hence the self-consistent equation (method (6) from our series of approximations) is able to produce a result that is closest to the correct value. For the uniform QD, Fig. 1(a), the disagreement is still at the level of 30%, while for the more realistic model with the trumpet-shape composition profile Fig. 1(b), the effective mass result is much more exact. The 30% discrepancy is consistent with the estimate of the error induced by the linear truncation in Eq. (6). The much higher precicision in the case of the smooth trumpet-shape composition may be due to the fact that terms of higher order in k in an envelope-function theory for an inhomoheneous system are not just "non-parabolic" but also induce higher order derivatives of system parameters, to which the uniform QD model with abrupt compositional boundaries is obviously much more sensitive. One can understand why approximation (6) is particularly suited for relatively correct modeling of this particular spectral feature by noting that the s-p shell splitting is related to the in-plane excitation of the QD, for which the value of the effective mass plays a crucial role. The renormalization described by Eq. (9)  correction to this parameter. On the contrary, including the non-parabolicity correction, either by self-consistently adding the average k 2 terms in the  operator, as in the approach (5), or by using a phenomenological value of E g , as in the approach (3), mostly reflects the impact of the strong confinement in the growth direction, which shifts the lowest shells rigidly, hence does not affect the s-p splitting considerably. It is found also that including the full matrix structure of H v in  (approximation (4)) does not bring any clear advantage by itself. In addition, it turns out that correcting the effective mass equation by the second term in Eq. (12) has an opposite effect on the two models: it decreases the accuracy for the homogeneous QD, while it improves it (to a larger degree) for the trumpet-shape model. We were not able to find a plausible explanation of this fact. Interestingly, the correction stemming from this term is roughly the same for a given QD structure in each approximation.
The accuracy of various approximations to the effective mass equation is different when the g-factor is considered (Fig. 2). Here the approximations (4-6), where the full matrix structure of the vb is included, yield much more accurate results than the diagonal approximations (1) and (2). As one can see in Fig. 2, the self-consistent model (6), with self-consistently renormalized parameters, again provides the most accurate results in the systematic series of approximations. The discrepancy is about 60% for the uniform QD model (Fig. 2a) while the result for the model with a trumpet-shape composition profile (Fig. 2b) is overestimated by a factor of 2. In the case of the g-factor, including the part of non-parabolicity that is captured by our self-consistent approach improves the accuracy to some extent but is not critical. Also the correction provided by the second term in Eq. (12) is of much less importance than it was for the spectrum, with a larger relative contribution in the trumpet-shape model. In both models, it turns out to lower the accuracy. The large error in the values of the g-factor apparently exceeds the estimate of the effect of truncating Eq. (6). In fact, however, the numerical values should be referred to the bulk value of −15, which is nearly entirely compensated by confinement and strain effects. Therefore, the mismatch is actually of only a few percent.  It seems interesting that accurate systematic modeling of the Zeeman splitting in an effective mass equation requires the full form of the operator . This means that the standard Eq. (19) is not very useful if the g-factor is calculated by Löwdin perturbation theory including only the diagonal contributions to the vb structure (in particular, the hh-lh splitting induced by the axial-strain-related b v terms): for our two structures the approach (2) overestimates the Zeeman splitting by an order of magnitude. We were able to trace this back to the important role played by the off-diagonal terms proportional to the b v deformation potential in the correct alignment of valence bands: including axial strain contributions only in the diagonal terms produces wrong vb alignment (in terms of band splitting and the sign of the band offsets between the QD and the barrier). As a consequence, completely neglecting these subtle details of the vb, as done in the semi-phenomenological approximation (3), leads to a result which is only slightly worse than the most exact one when compared to the full 8-band model. Here the lh-hh splitting and band offsets are completely washed out by setting a single constant value for the fundamental band gap and only the spin-orbit (Γ 7 ) band splitting is position dependent. The result becomes even more accurate if the latter is also replaced by a single number obtained from spatial averaging of the standard bulk values (black triangles in Fig. 2). Thus, our numerical results support the commonly used and very convenient Roth-Lax-Zwerdling formula for estimating the electron g-factor.

Conclusions
In this paper we have proposed a systematic derivation of a cb effective mass equation from the 8-band k · p Hamiltonian. We have shown that this derivation develops into a systematic series of approximations that differ in the way the vb is represented in the final equation. Possible approximations range from using a set of fixed band gap parameters to a full matrix structure with self-consistent non-parabolicity corrections and parameter renormalization. We have assessed the accuracy of the approximations in calculating selected spectral and spin-related characteristics for a self-assembled QD system within two models of the composition profile.
We have shown that a quantitatively correct description of the lowest sector of the electron spectrum, which involves intraband dynamics and therefore relies primarily on the accurate modeling of the effective mass, requires a self-consistent renormalization of the Hamiltonian parameters that goes beyond the second order Löwdin perturbation. The accuracy is also improved by accounting for cb non-parabolicity by self-consistently including terms of higher order in the electron momentum. When studying the Zeeman splitting of the ground electron level we found that the most accurate value of the g-factor is obtained within the systematic scheme only after including the full structure of the vb Hamiltonian in the equation. Here, again, including non-parabolicity corrections improves the accuracy of the result. Surprisingly, the values obtained from the resulting rather complicated equation can be reproduced by a simple diagonal model with a fixed value of the fundamental band gap, which supports the use of the effective Roth-Lax-Zwerdling formula for a nanostructure. The accuracy of the latter is remarkable, given its simplicity. The effective mass equations reproduce 8-band k · p value of the electron g-factor for the QD ground state within a factor of 2, which may seem disappointing. One should take into account, however, that the value of the g-factor for bulk InAs is about −15, which is nearly entirely compensated by confinement and strain effects. The absolute mismatch, which is on the order of 0.1, is a tiny fraction of the original value (that is, the compensation is quantitatively reproduced) but yields a large relative error when compared to the very small final value.
In general, we have shown that the effective mass equation offers a limited accuracy when modeling the lowest-energy sector of the electron spectrum in a self-assembled QD, unless it is extended to a rather complicated, inconvenient and computationally expensive form. In particular, an equation that is rigorously derived from the k · p theory strictly up to order k 2 (which may correspond to the most usual notion of the effective mass theory) quantitatively fails in all respects. The more accurate self-consistent equation proposed in this paper is not particularly transparent and does not even allow a separation into kinetic and Pauli terms. On the other hand, we have shown that the electron g-factor can be estimated by a very simple version of the effective mass theory, thus justifying the phenomenological Roth-Lax-Zwerdling formula for a nanostructure. Our derivation highlights the correspondence between the BenDaniel-Duke and Gora-Williams-Bastard orderings of the kinetic term, both emerging as different approximations within a general scheme, and yields a generalized Rashba coupling including inhomogeneity and strain effects. This confirms once more that the effective mass equation, even though not perfectly accurate, can be very useful from the conceptual point of view and often provides valuable physical insight.
Data availability. The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.