Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics

Yordanov, Vasil

doi:10.1038/s41598-024-56582-5

Download PDF

Article
Open access
Published: 18 March 2024

Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics

Vasil Yordanov¹

Scientific Reports volume 14, Article number: 6507 (2024) Cite this article

492 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

In this paper, we present a stochastic approach to relativistic quantum mechanics. We formulate the three fundamental principles of this theory and derive the Dirac equations based on them. This approach enables us to bring more insight into the nature of Dirac’s spinors. Furthermore, we provide a physical interpretation of the stochastic optimal control theory of quantum mechanics.

Quantum Mechanics can be understood through stochastic optimization on spacetimes

Article Open access 27 December 2019

Information theoretical limits for quantum optimal control solutions: error scaling of noisy control channels

Article Open access 10 December 2022

Gravitational friction from d’Alembert’s principle

Article Open access 26 June 2023

Introduction

The connection between stochastic optimal control theory and quantum mechanics is profound, with a long history^1,2. Its roots can be traced back to the pioneering works of Fürth^3,4, Fényes⁵ and stochastic mechanics of Nelson⁶, and it is even linked to Feynman’s path integral formulation of quantum mechanics^7,8. Later in the 1980s, Yasue⁹ formulated the variational principle in stochastic mechanics, while Guerra and Morato¹⁰ applied stochastic control theory. Despite its rich history, the theory continues to evolve and expand^11,12,13,14.

Stochastic optimal control theory of quantum mechanics attempts to give a realistic, objective description of physical events in the microphysical world aiming to bridge the gap between classical and quantum interpretations. Notably, it represents physical observables through stochastic processes instead of self-adjoint operators¹⁵.

Unlike traditional quantum mechanics, which postulates the Schrödinger equation, this theory derives it from three underlying principles:

1.
Particles move as Brownian particles in four-dimensional spacetime, influenced by an external random spacetime force⁶.
2.
The stochastic movement of the particle turns its classical action integral into a stochastic variable.
3.
Nature tries to minimize the expected value for the action, in which the particle’s velocity is consider to be a control parameter of the optimization.

The principles formulated above are not new in this paper. However, attributing them to a single author would not be correct, as they have developed over time with contributions from various scientists^{9,10,12,16,17}.

Building on these fundamental principles, the theory allows us not only to derive the Schrödinger equation but also to explain the Born rule and the operator substitution rule¹².

There are a variety of approaches ¹⁸ to the derivation of the Dirac equation, but none of them is based on stochastic optimal control theory. Although there have been other investigations that aim to construct a stochastic optimization theory for relativistic quantum mechanics^{12,19,20,21,22}, none of them have succeeded in deriving the Dirac equation or in explaining the nature of the Dirac spinor. In this paper, we extend the stochastic optimal control theory of quantum mechanics to also derive the Dirac equation²³ and to provide new insights into the nature of Dirac’s four-component spinor $\psi $.

$$\begin{aligned} \gamma ^\mu (i \hbar \partial _\mu - e A_\mu ) \psi = m c \psi , \end{aligned}$$

(1)

where $\gamma ^\mu $ are 4x4 Dirac matrixes with anticommutation rules: $\{ \gamma ^\mu , \gamma ^\nu \} = 2 g^{\mu \nu } {\mathbb {I}}_4$, here the ${\mathbb {I}}_4$ is the identity four-by-four matrix, and $g^{\mu \nu }$ is the metric tensor of spacetime.

We will take advantage of the standard notation for contravariant and covariant tensors, as well as the Einstein summation convention.

Stochastic optimal control theory of quantum mechanics

Let us express the previously mentioned fundamental principles of Stochastic Optimal Control Theory of quantum mechanics in mathematical terms.

The stochastic equation of motion of the Brownian particle²⁴ is:

$$\begin{aligned}dx_\mu =u_\mu ds+ \sigma _\mu dW_\mu , \quad \mu =0..3, \end{aligned}$$

(2)

where $x_\mu $ denote the spacetime coordinates ${{\textbf {x}}}$ of the particle, and $u_\mu $ represents the components of the four-velocity ${\textbf{u}}$. In the context of applying optimal control theory, the velocity $u_\mu $ is referred to as the Markov control policy. The $W_\mu $ terms are standard independent Wiener processes, and $\sigma _\mu $ represent the diffusion coefficients.

Note that in Eq. (2), we consider diffusions not only in three spatial dimensions but also allow the time variable to undergo a diffusion process. This assumption puts the time dimension on equal footing with the spatial dimensions and is in line with the principles of special relativity, which combines space and time into a single entity of spacetime. The concept of temporal diffusion was introduced in^12,20.

The action is postulated to be the minimum of the expected value of the stochastic action:

$$\begin{aligned} S({\textbf{x}}_i, {\textbf{u}}(\tau _i \rightarrow \tau _f))= \min _{\textbf{u}(\tau _i \rightarrow \tau _f)} \left\langle \int _{\tau _i}^{\tau _f} ds\, {\mathcal {L}}({\textbf{x}}(s), {\textbf{u}}(s),s) \right\rangle _{{\textbf{x}}_i}, \end{aligned}$$

(3)

where ${\mathcal {L}}({\textbf{x}}(s), {\textbf{u}}(s), s)$ is the Lagrangian of the test particle, which is a function of the control policy ${\textbf{u}}(s)$ and the four-coordinates ${\textbf{x}}(s)$ at proper time s. Note that s need not be the proper time; it can be any monotonic function representing the progress of the world point along the particle’s world line (see²⁵, Chapter 7.10). The subscript ${\textbf{x}}_i$ on the expectation value indicates that the expectation is taken over all stochastic trajectories that start at ${\textbf{x}}_i$.

The task of optimal control theory, as described by Bellman²⁶, is to find the control ${\textbf{u}}(s)$ for the time interval $\tau _i< s < \tau _f$, often denoted as ${\textbf{u}}(\tau _i \rightarrow \tau _f)$, that minimizes the expected value of the action $S({\textbf{x}}_i, {\textbf{u}}(\tau _i \rightarrow \tau _f))$.

We introduce the optimal action for any intermediate proper time $\tau $ such that $\tau _i< \tau < \tau _f$:

$$\begin{aligned} J(\tau , {\textbf{x}}_\tau )=\min _{{\textbf{u}}(\tau \rightarrow \tau _f )} \left\langle \int _{\tau }^{\tau _f} ds\, {\mathcal {L}}(s, {\textbf{x}}_s, {\textbf{u}}_s) \right\rangle _{ {\textbf{x}}_\tau }, \end{aligned}$$

(4)

in stochastic optimal control theory, this function is commonly referred to as the “cost-to-go” function, and the use of the symbol “J” as notation for it is a common convention.

By definition, the action $S({\textbf{x}}_i, {\textbf{u}}(\tau _i \rightarrow \tau _f))$ is equal to the cost-to-go function $J(\tau _i, {\textbf{x}}_{\tau _i})$ at the initial proper time and spacetime coordinate:

$$\begin{aligned} S({\textbf{x}}_i, {\textbf{u}}(\tau _i \rightarrow \tau _f))=J(\tau _i, {\textbf{x}}_{\tau _i}). \end{aligned}$$

(5)

In following papers^27,28,29,30 one can find the derivation of the Hamilton–Jacobi–Bellman (HJB) equation, which is a partial differential equation for $J(\tau , {\textbf{x}})$ with boundary conditions $J(\tau _f, {\textbf{x}}) = 0$. The final result for this equation is:

$$\begin{aligned} -\partial _\tau J(\tau , {\textbf{x}})=\min _u \left( {\mathcal {L}}(\tau , {\textbf{x}}, {\textbf{u}}) + u^\mu \partial _\mu J(\tau , {\textbf{x}}) + \frac{1}{2} \sigma ^{\mu } \sigma ^\mu \partial _{\mu \mu } J(\tau , {\textbf{x}}_\tau ) \right) . \end{aligned}$$

(6)

The optimal control at the current ${\textbf{x}}$, $\tau $ is given by:

$$\begin{aligned} u({\textbf{x}}, \tau ) = \arg \min _{\textbf{u}} \left( {\mathcal {L}}(\tau , {\textbf{x}}, {\textbf{u}}) + u^\mu \partial _\mu J(\tau , {\textbf{x}}) \right) . \end{aligned}$$

(7)

To describe a quantum system, one must start with a given Lagrangian for the system and then solve the Eq. (6).

Covariant relativistic Lagrangians

In Eq. (6), we employ the following Lorentz-invariant Lagrangian for a single particle in an electromagnetic field, as detailed in²⁵.

$$\begin{aligned} \Lambda = {\tilde{\sigma }} m c \sqrt{{\tilde{\sigma }} u_\mu u^\mu } + q A_\mu u^\mu , \end{aligned}$$

(8)

where q represents the charge of the particle and $A_\mu $ denotes the 4-vector potential. The symbol ${\tilde{\sigma }}$ indicates the sign convention for the metric tensor: it takes the value of $+1$ for the metric with diagonal elements $(1,-1,-1,-1)$ and $-1$ for the metric $(-1,1,1,1)$, as elaborated in³¹.

The components of the four-velocity of the particle are related to the speed of light by the equation:

$$\begin{aligned} u_\mu u^\mu = {\tilde{\sigma }} c^2. \end{aligned}$$

(9)

This relation, referred to as the “weak equation” by Dirac, allows us to treat $u^\mu $ as unconstrained quantities until all differentiation operations have been carried out, at which point we impose the condition of Eq. (9) (see²⁵ Chapter 7.10). This will be the approach we employ as we seek to minimize the expected value of the stochastic action.

Derivation of Dirac equation

Let us find the minimum of the expected value of the stochastic action using the Lagrangian (8). Let us substitute this Lagrangian in (6):

$$\begin{aligned} -\partial _\tau J(\tau , {\textbf{x}})=\min _u \left( {\tilde{\sigma }} mc \sqrt{{\tilde{\sigma }} u_\mu u^\mu } + u^\mu \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu \right) + \frac{1}{2} \sigma ^\mu \sigma ^\mu \partial _{\mu \mu } J(\tau , {\textbf{x}}) \right) . \end{aligned}$$

(10)

If we take derivative with respect to $u^\mu $ we can find the optimal control policy $u^\mu $ at which the above expression has minimum:

$$\begin{aligned} {\tilde{\sigma }} c \frac{2 {\tilde{\sigma }} u^{\min}_\mu }{2 \sqrt{{\tilde{\sigma }} u^{\min}_\mu u^\mu _{\min}}} + \frac{1}{m} \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right) = 0. \end{aligned}$$

(11)

If we use the weak condition (9) then we derive:

$$\begin{aligned} u^{\min}_\mu + \frac{1}{m} \left( \partial _\mu J(\tau , \textbf{x}) + q A_\mu (\tau , {\textbf{x}}) \right) = 0. \end{aligned}$$

(12)

For short in the rest of this paper we will denote with $u_\mu $ the optimal control policy and will skip the superscript of $u^{\min}_\mu $.

$$\begin{aligned} u_\mu = - \frac{1}{m} \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right) \end{aligned}$$

(13)

We want to linearise Eq. (10) with respect to $u_\mu $. For this purpose we will use similar approach that Dirac²³ has used to linearise the relativistic energy. The idea of the linearization of the Lagrangian in HJB equation is the main idea in this paper and we will see that this idea will lead us to the derivation of Dirac equation. We write:

$$\begin{aligned} {\tilde{\sigma }} m c \sqrt{{\tilde{\sigma }} u_\mu u^\mu } = m c \gamma ^\mu u_\mu , \end{aligned}$$

(14)

here, the $\gamma ^\mu $ are the Dirac matrices²³. Note that since $\gamma ^\mu $ is a four-by-four matrix, we should assume that the left-hand side of this equation is multiplied by a four-by-four matrix whose square is the identity matrix.

Since the four-velocity is optimal, we can then remove the minimization function from (10) and use (14) to obtain:

$$\begin{aligned} -\partial _\tau J(\tau , {\textbf{x}})=mc \gamma ^\mu u_\mu + u^\mu \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right) + \frac{1}{2} \sigma ^\mu \sigma ^\mu g_{\mu \mu } \partial ^\mu \partial _\mu J(\tau , {\textbf{x}}), \end{aligned}$$

(15)

here, we have raised the index of the derivative in the last term and assumed summation over $\mu $ in this term.

In equation (15), $\gamma ^\mu $ is a four-by-four matrix. To align the dimensions of each term, we assume multiplication by a four-by-four matrix whose square is the identity matrix. To find the appropriate matrix, we will consider the example of a particle at rest with four-velocity (c, 0, 0, 0). In this case, the equation simplifies to: $-\partial _\tau J(\tau , \textbf{x})=mc^2 \gamma ^0 - m c^2$. It is clear now that we need to multiply the scalar terms by $\gamma ^0$. If we use the identities $\gamma ^0 \gamma ^0={\mathbb {I}}_4$, ${\mathbb {I}}_4 {\mathbb {I}}_4={\mathbb {I}}_4$, and multiply both sides of (15) by $\gamma ^0$, then we obtain:

$$\begin{aligned}- \partial _\tau J(\tau , {\textbf{x}}) {\mathbb {I}}_4= mc \gamma ^\mu u_\mu \gamma ^0 + u^\mu {\mathbb {I}}_4 \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right) {\mathbb {I}}_4 + \frac{1}{2} \sigma ^\mu \sigma ^\mu g_{\mu \mu } \partial ^\mu \partial _\mu J(\tau , {\textbf{x}}) {\mathbb {I}}_4. \end{aligned}$$

(16)

Now we will introduce the following notation:

$$\begin{aligned} \epsilon _r = {\left\{ \begin{array}{ll} 1 &{} \quad \text {if } r=1,2 \\ -1 &{}\quad \text {if } r=3,4 \end{array}\right. }. \end{aligned}$$

(17)

Let us also denote $\gamma ^0_{(r)}$ and ${\mathbb {I}}^{(r)}_4$, where $r=1,2,3,4$, as the $r^{\text{th}}$ column of the $\gamma ^0$ matrix and the identity matrix ${\mathbb {I}}_4$, respectively. It is obvious that $\gamma ^0_{(r)} = \epsilon _r {\mathbb {I}}^{(r)}_4$ for $r=1,2,3,4$. We will consider Eq. (16) as four independent equations, one corresponding to each column of the matrix.

$$\begin{aligned}- \partial _\tau J(\tau , {\textbf{x}}) {\mathbb {I}}^{(r)}_4 &= \epsilon _r mc \gamma ^\mu u_\mu {\mathbb {I}}^{(r)}_4 + u^\mu {\mathbb {I}}^{(r)}_4 \left( \partial _\mu J(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right) {\mathbb {I}}^{(r)}_4 + \\ & \quad + \frac{1}{2} \sigma ^\mu \sigma ^\mu g_{\mu \mu } \partial ^\mu \partial _\mu J(\tau , {\textbf{x}}) {\mathbb {I}}^{(r)}_4, \quad r=1,2,3,4 \end{aligned}$$

(18)

We should note that if the first term in Eq. (14) was not linearized, then we would have four identical equations for $r=1,2,3,4$. Another observation is that we can rewrite $\partial _\mu J(\tau , {\textbf{x}}) {\mathbb {I}}^{(r)}_4=\partial _\tau (J(\tau , {\textbf{x}}) {\mathbb {I}}^{(r)}_4)=\partial _\tau \varvec{J}^{(r)}(\tau , {\textbf{x}})$ and $mc \gamma ^\mu u_\mu {\mathbb {I}}^{(r)}_4=mc \gamma ^\mu (u_\mu {\mathbb {I}}^{(r)}_4)=mc \gamma ^\mu \varvec{u}^\mu _{(r)}$. This allows us to treat both $\varvec{J}$ and the $\varvec{u}_\mu $ as four-component column vectors. Then Eq. (18) can be written as:

$$\begin{aligned}- \partial _\tau \varvec{J}^{(r)}(\tau , {\textbf{x}}) &= \epsilon _r mc \gamma ^\mu \varvec{u}^{(r)}_\mu + \varvec{u}^\mu \left( \partial _\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) + q A_\mu (\tau , {\textbf{x}}) \right)+ \\ & \quad + \frac{1}{2} \sigma ^\mu \sigma ^\mu g_{\mu \mu } \partial ^\mu \partial _\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}), \quad r=1,2,3,4. \end{aligned}$$

(19)

As Nelson proposed in⁶, the variance of the Brownian motion of the particle is equal to the ratio of Planck constant to the mass of the particle. That is why we express the square of the diffusion coefficient in (2) to be proportional to the $\frac{\hbar }{m}$.

$$\begin{aligned} g_{\mu \mu } \sigma ^\mu \sigma ^\mu = - \frac{2 i \epsilon _r \hbar }{m}, \quad \mu =0..3 \end{aligned}$$

(20)

In the original idea proposed by Nelson, the imaginary unit, the factor 2, and the temporal diffusion coefficient were missing from the equation above. Later, we will demonstrate that the introduction of the imaginary unit and the factor 2 in the equation above is necessary for the linearization of Eq. (15). This serves as another key contribution of the current paper, explaining the existence of the imaginary unit in quantum mechanics equations. However, it is important to note that the imaginary diffusion coefficient will result in a complex stochastic Eq. (2), as we will show in Sect. “Diffusion coefficients of the stochastic motion”.

If we substitute Eqs. (20) and (13) into (19) we obtain:

$$\begin{aligned}- \partial _\tau \varvec{J}^{(r)}(\tau , {\textbf{x}}) &= - \epsilon _r mc \gamma ^\mu \frac{1}{m} \left( \partial _\mu \varvec{J}^{(r)}(\tau , \textbf{x}) + q A_\mu \right) - \frac{i \epsilon _r \hbar }{m} \partial ^\mu \partial _\mu \varvec{J}^{(r)}(\tau , {\textbf{x}})- \\&\quad - \frac{1}{m} \left( \partial ^\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) + q A^\mu \right) \left( \partial _\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) + q A_\mu \right) , \quad r=1,2,3,4. \end{aligned}$$

(21)

Expanding the last term of the above equation we obtain:

$$\begin{aligned}- \partial _\tau \varvec{J}^{(r)}(\tau , {\textbf{x}}) &= - \epsilon _r mc \gamma ^\mu \frac{1}{m} \left( \partial _\mu \varvec{J}^{(r)}(\tau , \textbf{x}) + q A_\mu \right) - \frac{i \epsilon _r \hbar }{m} \partial ^\mu \partial _\mu \varvec{J^{(r)}}(\tau , {\textbf{x}}) - \\&\quad - \frac{1}{m} \partial ^\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) \partial _\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) - 2 \frac{1}{m} \partial ^\mu \varvec{J}^{(r)}(\tau , {\textbf{x}}) q A_\mu - \frac{1}{m} q^2 A^\mu A_\mu , \, r=1,2,3,4. \end{aligned}$$

(22)

Let us introduce a new function ${\tilde{J}}(\tau , {\textbf{x}})$:

$$\begin{aligned} \varvec{J}^{(r)}(\tau , {\textbf{x}}) = -i \epsilon _r \hbar {\tilde{J}}(\tau , {\textbf{x}}) {\mathbb {I}}^{(r)}_4, \quad r=1,2,3,4. \end{aligned}$$

(23)

If we use this new function in Eq. (13), it will make the equation asymmetric with respect to $\epsilon _r$. That is why let us denote $q=\epsilon _r e$ and rewrite Eq. (13) as:

$$\begin{aligned} \varvec{u}^{(r)}_\mu = - \frac{1}{m} \left( -i \epsilon _r \hbar \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) + \epsilon _r e A_\mu (\tau , {\textbf{x}}) \right) {\mathbb {I}}^{(r)}_4, r=1,2,3,4. \end{aligned}$$

(24)

This result is quite interesting because it tells us that the last two equations in the initial set of Eq. (19) with $r=3,4$ are for particles with the opposite sign of the charge compared to the first two equations with $r=1,2$.

Substituting Eq. (23) into Eq. (22), and then multipling both sides by m, we obtain:

$$\begin{aligned}i \hbar m \partial _\tau {\tilde{J}}(\tau , {\textbf{x}}) &= m c \gamma ^\mu \left( i \hbar \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) - e A_\mu \right) + \\ & \quad + \hbar ^2\left( \partial ^\mu {\tilde{J}}(\tau , {\textbf{x}}) \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) + \partial ^\mu \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) \right) + 2 i \hbar \partial ^\mu {\tilde{J}}(\tau , {\textbf{x}}) e A_\mu - e^2 A^\mu A_\mu . \end{aligned}$$

(25)

Using (Hopf-Cole) logarithmic transformation, defined by ${\tilde{J}}(\tau , \textbf{x})=log(\phi (\tau ,{\textbf{x}}))$, we can linearize the HJB equation through the following calculation:

$$\partial ^\mu {\tilde{J}}(\tau , {\textbf{x}}) \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) + \partial ^\mu \partial _\mu {\tilde{J}}(\tau , {\textbf{x}}) = \frac{\partial ^\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} \frac{\partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} + \partial ^\mu \left( \frac{\partial _\mu \phi (\tau , \textbf{x})}{\phi (\tau , {\textbf{x}})} \right) = \frac{\partial ^\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} \frac{\partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} + \frac{\partial ^\mu \partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} - \frac{\partial ^\mu \phi (\tau , {\textbf{x}}) \partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})^2} = \frac{\partial ^\mu \partial _\mu \phi (\tau , \textbf{x})}{\phi (\tau , {\textbf{x}})}.$$

(26)

Now, it becomes clear that the presence of the imaginary unit in the diffusion coefficient (20) and the imaginary unit in (23) are a consequence of the need to linearize the HJB equation.

The explicit equation for $\phi (\tau ,{\textbf{x}})$ is:

$$\begin{aligned} \varvec{\phi }^{(r)}(\tau ,{\textbf{x}}) = \phi (\tau ,{\textbf{x}}) {\mathbb {I}}^{(r)}_4=e^{-\frac{\epsilon _r}{i \hbar } J(\tau , \textbf{x})} {\mathbb {I}}^{(r)}_4, \quad r=1,2,3,4. \end{aligned}$$

(27)

later we will discuss what is the physical meaning of this function.

The above substitution transforms the HJB into a linear equation:

$$\begin{aligned}i \hbar m \frac{\partial _\tau \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} = m c \left( i \hbar \gamma ^\mu \frac{\partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} - e \gamma ^\mu A_\mu \right) + \hbar ^2 \frac{\partial ^\mu \partial _\mu \phi (\tau , \textbf{x})}{\phi (\tau , {\textbf{x}})} + 2 e A^\mu i \hbar \frac{\partial _\mu \phi (\tau , {\textbf{x}})}{\phi (\tau , {\textbf{x}})} - e^2 A^\mu A_\mu . \end{aligned}$$

(28)

Multiplying both sides of Eq. (28) by $\phi (\tau , {\textbf{x}})$, finally we obtain:

$$\begin{aligned}i \hbar m \partial _\tau \phi (\tau , {\textbf{x}})& = m c \left( i \hbar \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) - e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}) \right) + \hbar ^2 \partial ^\mu \partial _\mu \phi (\tau , {\textbf{x}})+ \\ & \quad + 2 e A^\mu i \hbar \partial _\mu \phi (\tau , {\textbf{x}}) - e^2 A^\mu A_\mu \phi (\tau , {\textbf{x}}). \end{aligned}$$

(29)

We are going to prove that Eq. (29) is equivalent to:

$$\begin{aligned} i \hbar m \partial _\tau \phi (\tau , {\textbf{x}}) = \left( i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - mc \right) \left( - i \hbar \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) + e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}) \right) . \end{aligned}$$

(30)

After expanding the brackets we obtain:

$$\begin{aligned}i \hbar m \partial _\tau \phi (\tau , {\textbf{x}})&= mc \left( i \hbar \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) - e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}) \right) + \hbar ^2 \gamma ^\nu \partial _\nu \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) + \\&\quad + i \hbar \gamma ^\nu \partial _\nu \left( e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}) \right) + e \gamma ^\nu A_\nu i \hbar \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) - e \gamma ^\nu A_\nu e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}). \end{aligned}$$

(31)

To simplify the above equation we will use the following relations:

$$\begin{aligned}\gamma ^\mu \partial _\mu \gamma ^\nu \partial _\nu = \partial ^\mu \partial _\mu \qquad \gamma ^\mu \partial _\mu \gamma ^\nu A_\nu = \partial _\mu A^\mu \qquad \gamma ^\mu A_\mu \gamma ^\nu A_\nu = A^\mu A_\mu \end{aligned}$$

(32)

To prove that $\gamma ^\mu B_\mu \gamma ^\nu B_\nu = B^\mu B_\mu $ lets sum two times $\gamma ^\mu B_\mu \gamma ^\nu B_\nu $ and take one half of this sum. Also let us use the fact that we can change the name of summation indexes, and the property $\gamma ^\mu \gamma ^\nu + \gamma ^\nu \gamma ^\mu = g^{\mu \nu }$: $\gamma ^\mu B_\mu \gamma ^\nu B_\nu = \frac{1}{2}( \gamma ^\mu B_\mu \gamma ^\nu B_\nu + \gamma ^\nu B_\nu \gamma ^\mu \partial _\mu ) = \frac{1}{2}( \gamma ^\mu \gamma ^\nu + \gamma ^\nu \gamma ^\mu ) B_\mu B_\nu = \frac{1}{2} 2 g^{\mu \nu } B_\mu B_\mu = B^\mu B_\mu $.

$$\begin{aligned}i \hbar m \partial _\tau \phi (\tau , {\textbf{x}})&= mc \left( i \hbar \gamma ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) - e \gamma ^\mu A_\mu \phi (\tau , {\textbf{x}}) \right) + \hbar ^2 \partial ^\mu \partial _\mu \phi (\tau , {\textbf{x}}) +\\&\quad + i \hbar e ( \partial _\mu A^\mu ) \phi (\tau , {\textbf{x}}) + i \hbar e A^\mu \partial _\mu \phi (\tau , {\textbf{x}}) + e A^\mu i \hbar \partial _\mu \phi (\tau , {\textbf{x}}) - e^2 A^\mu A_\mu \phi (\tau , {\textbf{x}}). \end{aligned}$$

(33)

If we use the Lorenz gauge condition³² $\partial _\mu A^\mu =0$, the Eq. (33) will turn into Eq. (29), which proves that both Eqs. (29) and (31) are equivalent.

Let us introduce the four-component column vector function:

$$\begin{aligned} \varvec{\psi }(\tau , {\textbf{x}}) = - \frac{i \hbar }{mc} \gamma ^\mu \partial _\mu \varvec{\phi }(\tau , {\textbf{x}}) + \frac{e}{m c} \gamma ^\mu A_\mu \varvec{\phi }(\tau , {\textbf{x}}), \end{aligned}$$

(34)

the factor $\frac{1}{m c}$ is introduced to ensure that $\varvec{\psi }(\tau , {\textbf{x}})$ has dimensionless units.

If we divide the Eq. (30) by mc, then we obtain the following equation for any r:

$$\begin{aligned} \frac{i \hbar }{c} \partial _\tau \varvec{\phi }(\tau , {\textbf{x}}) = \left( i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - mc \right) \varvec{\psi }(\tau , {\textbf{x}}). \end{aligned}$$

(35)

It should be noted that the Dirac equation is stationary equation when one set the partial derivative with respect to proper time to zero.

$$\begin{aligned}\left( i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - mc \right) \varvec{\psi }({\textbf{x}}) = 0 \end{aligned}$$

(36)

Where $\varvec{\psi }({\textbf{x}})$ is the stationary solution that does not depend on the proper time. As it can be seen from (36) that $\varvec{\psi }({\textbf{x}})$ is the Dirac spinor, which from (34) becomes:

$$\begin{aligned} \varvec{\psi }({\textbf{x}}) = - \frac{1}{m c} i \hbar \gamma ^\mu \partial _\mu \varvec{\phi }({\textbf{x}}) + \frac{1}{m c} e \gamma ^\mu A_\mu \varvec{\phi }({\textbf{x}}). \end{aligned}$$

(37)

Note that derivation of Dirac equation presented in this paper does not derive the equation with the negative mass $-m$. Even some recent theoretical attempt to recover the idea of negative mass³³, until now there is no experimental evidence of particle with negative mass.

Dirac spinor and the wave function

In the previous section, we expressed the Dirac spinor in Eq. (37) as a function of $\phi ({\textbf{x}})$, which is a stationary solution of the HJB equation, and its derivative $\partial _\mu \phi ({\textbf{x}})$.

As any other solution $\phi (\tau , {\textbf{x}})$ of HJB equation, the stationary solution $\phi ({\textbf{x}})$ should also satisfy the Eq. (27):

$$\begin{aligned} \varvec{\phi }^{(r)}({\textbf{x}})=\phi ({\textbf{x}}) {\mathbb {I}}^{(r)}_4=e^{-\frac{\epsilon _r}{i \hbar } J({\textbf{x}})} {\mathbb {I}}^{(r)}_4, r=1,2,3,4. \end{aligned}$$

(38)

The function $\phi ({\textbf{x}})$ is identical as the wave function in the path integral formulation of quantum mechanics proposed by Feynman³⁴. That is why $|\phi ({\textbf{x}})|^2$ is the probability density of the system to be in spacetime position ${\textbf{x}}$. This conclusion aligns with the work of Papiez¹⁶. Note that here we do not give probabilistic physical interpretation of function $\phi (\tau , {\textbf{x}})$, because this function is object of the stochastic optimal control theory, and it is not an object of path integral formulation of quantum mechanics. The connection of both theories is via the stationary solution of HJB.

Proper time in Dirac equation

In his paper, Fock employed a representation of the Dirac spinor (see Chapter 37-2, Section II, in³⁵) that appears strikingly similar to the one we have derived in Eq. (37), as per our application in stochastic optimal control.

Fock was looking for a solution for the Dirac equation represented in the form $\psi =(i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - m c) \Psi $, where $\Psi $ is required to satisfy a second-order equation $\hbar ^2 \Lambda \Psi = 0$. Here $\Lambda $ is an operator defined as $\Lambda = \frac{1}{\hbar ^2}(i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - m c) (i \hbar \gamma ^\mu \partial _\mu - e \gamma ^\mu A_\mu - m c)$. A solution of the equation for $\Psi $ can be looked for in the form of a definite integral, $\Psi =\int _C {F d \tau }$, over the auxiliary variable $\tau $ taken between certain fixed limits (or along a specific contour in the complex $\tau $-plane). It is obvious that this equation would be satisfied if one imposes on F the equation, called by Fock the proper time Dirac equation, $\frac{\hbar ^2}{2 m} \Lambda F = i \hbar \frac{\partial F}{\partial \tau }$ with the choice of contour C such that $\int _C {\frac{\partial F}{\partial \tau }} d \tau = F |_{\partial C} = 0$.

In this paper, we introduce a representation of the Dirac spinor as $\psi =-\frac{1}{mc}(i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu ) \phi ({\textbf{x}})$, detailed further in Eq. (37). The corresponding second-order equation that $\phi ({\textbf{x}})$ must satisfy is presented as $\hbar ^2 \Lambda \phi ({\textbf{x}}) = 0$. Here, the operator $\Lambda $ is defined by $\Lambda =\frac{1}{\hbar ^2}\left( i \hbar \gamma ^\nu \partial _\nu - e \gamma ^\nu A_\nu - mc \right) \left( - i \hbar \gamma ^\mu \partial _\mu + e \gamma ^\mu A_\mu \right) $, as elaborated in Eq. (30). Similar to Fock’s approach, we consider representing $\phi (x)$ in the form $\int _C {\phi (\tau , {\textbf{x}}) d \tau }$, with the condition that $\int _C {\frac{\partial \phi (\tau , {\textbf{x}})}{\partial \tau }} d \tau = \phi (\tau , \textbf{x}) |_{\partial C} = 0$. However, within the framework of stochastic optimal control, the parameter $\tau $ is an invariant parameter used to trace the progress of the system point in configuration space. Consequently, such a contour C for integration is not feasible. The feasible approach is to regard $\phi ({\textbf{x}})$ as a stationary solution of the time-dependent second-order differential equation $\frac{\hbar ^2}{2 m} \Lambda \phi (\tau , {\textbf{x}}) = i \hbar \partial _\tau \phi (\tau , {\textbf{x}})$, as outlined in Eq. (29).

Even though we cannot directly apply Fock’s technique to the stochastic optimal control case, we can still identify Eq. (33) or the equivalent Eq. (29) as the proper time Dirac equation. This is because, by definition, the variable $\tau $ in these equations represents the proper time.

Physical interpretation of the stationary solution of HJB equation

To accurately interpret the stationary solution of the Hamilton–Jacobi–Bellman (HJB) equation and the associated proper time Dirac equation as derived in Eq. (29), a detailed analysis of this equation is essential. One important observation is that we need to solve the HJB equation reverse in time. This means that, given the boundary condition at the future time moment $\tau _f$ and spacetime coordinate $x_{\tau _f}$, we aim to find the solution of this equation for an earlier time moment $\tau $ and coordinate $x_\tau $. This time-reversal approach is typical for the dynamic programming²⁶, as incorporated in the derivation of the HJB equation.

It is required that $x_\tau $ be a four-vector of real numbers. However, there is no requirement for the four-coordinates corresponding to times greater than $\tau $ to be real numbers, see the discussion in¹⁶; they can be a four-vector of complex numbers. This implies that no matter what the complex four-coordinates and four-velocities of the particle are at the time moments greater than $\tau $ when it moves backward in time, it should arrive at the initial time $\tau $ with the real four-coordinate $x_\tau $.

As far as $\tau _f$ being any arbitrary final moment in time, we can choose $\tau _f$ to be sufficiently large so that when the particles propagate backward in time, their motion becomes stationary and ceases to depend on time.

In all physical theories, the aim is to identify an equation that describes how a system evolves over time. Essentially, this means determining the state of the system at successive time moments based on its state at previous moments. The stochastic optimal control theory of quantum mechanics also allows for this traditional approach, as demonstrated in this paper through the derivation of the Dirac equation. However, the theory also offers an alternative approach. To determine the wave function for every spacetime coordinate $x_\tau $, we must choose some future proper time $\tau _f$. We then backtrack all possible complex stochastic trajectories of the particle to the present time $\tau $ and real four-coordinates $x_\tau $, aiming to find the optimal trajectories that minimize the expected value of the action. It appears that the current state of the system is determined by all possible optimal future trajectories of the particle.

Plane wave solutions of the Dirac equation

Let us consider a particle with mass m that moves with velocity $u_\mu $ in free space, where $A_\mu =0$.

If we multiply both sides of Eq. (13) by m, we find that the particle momentum is given by $p_\mu = - \partial _\mu J(x)$. A solution to this equation is $J(x) = - p \cdot x$. Substituting J(x) into Eq. (38), we obtain the equation for a plane wave, which describes the particle: $\varvec{\phi }^{(r)}(x) = e^{-\frac{i \epsilon _r}{\hbar } p \cdot x} {\mathbb {I}}^{(r)}_4, r=1,2,3,4$. We can calculate the derivative $\partial _\mu \varvec{\phi }^{(r)}(x) = \frac{\epsilon _r}{i \hbar } p_\mu e^{- \frac{i \epsilon _r}{\hbar } (p \cdot x)} {\mathbb {I}}^{(r)}_4, r=1,2,3,4$ and then substitute this result into Eq. (37).

$$\begin{aligned} &\varvec{\psi }^{(r)}(x) = - \frac{\epsilon _r}{m c} \gamma ^\mu p_\mu e^{- \frac{i\epsilon _r}{\hbar }( p \cdot x)} {\mathbb {I}}^{(r)}_4, \quad r=1,2,3,4 \end{aligned}$$

(39)

Now we will prove that $\varvec{\psi }^{(r)}(x), r=1,2,3,4$, satisfies the Dirac Eq. (36).

$$\begin{aligned}- (i \hbar \gamma ^\mu \partial _\mu - m c) \frac{\epsilon _r}{m c} \gamma ^\mu p_\mu e^{- \frac{i \epsilon _r}{\hbar }( p \cdot x)} {\mathbb {I}}^{(r)}_4 = (\gamma ^\mu p_\mu - \epsilon _r m c) e^{- \frac{i \epsilon _r}{\hbar }( p \cdot x)} \epsilon _r {\mathbb {I}}^{(r)}_4 = 0 \end{aligned}$$

(40)

The last equation equals zero due to Eqs. (9) and (14). This demonstrates that $\varvec{\psi }^{(r)}(x)$ from Eq. (39) are indeed solutions of the Dirac Eq. (36).

Diffusion coefficients of the stochastic motion

In Sect. “Derivation of Dirac equation”, we derived the Dirac equation by linearizing the stochastic HJB equations, necessitating the use of complex diffusion coefficients (refer to Eq. (20)). The works of Papiez^16,20 and Lindgren et al.¹² also employ optimal control theory in conjunction with complex diffusion coefficients. In contrast, earlier variational approaches to deriving stochastic mechanics^17,36,37 do not rely on complex numbers but utilize two control parameters: the forward ($b_{+}=v_\rho + u_\rho $) and backward ($b_{-}=v_\rho - u_\rho $) velocities in the variational procedures.

In their work on Complex Mechanics³⁸, Yang et al. prove that the current velocity $v_\rho $ and the osmotic velocity $u_\rho $, as defined in stochastic mechanics of Nelson, are equivalent to the real part and negative imaginary part, respectively, of the complex velocity evaluated on the real axis. Thus, one can consider the complex diffusion coefficient and related complex stochastic equations of motion as a theoretical construct to simplify the derivation of quantum mechanics equations from stochastic optimal control theory, or it can be viewed as an intrinsic aspect of reality, subject to interpretation.

It should be noted that, in contrast to non-relativistic Complex Mechanics as discussed in Yang et al.³⁸ and other works utilizing a complex diffusion coefficient $\nu =-i \hbar / m$^3,7,12,16, in the relativistic case the diffusion coefficient is two times larger, $\nu =-2 i \epsilon _r \hbar / m$ (see Eq. (20)). This increase is a consequence of the linearization performed on the kinetic term of the Lagrangian (see Eq. (14)). Note also that the sign of complex diffusion coefficient is opposite for the equations $r=1,2$ and $r=3,4$ that describes particles with opposite changes as per Eq. (24).

If we take the square root of both sides of Eq. (20), we obtain the following equation for the diffusion coefficient:

$$\begin{aligned} \sigma ^\mu =\pm \sqrt{\frac{2 g_{\mu \mu } \epsilon _r i \hbar }{m}},\quad \mu =0..3. \end{aligned}$$

(41)

If we use the identity $\sqrt{\pm i}=\frac{1}{\sqrt{2}}(1\pm i)$, we finally obtain:

$$\begin{aligned} \sigma ^\mu =\pm \ (1 + \epsilon _r g_{\mu \mu } i) \sqrt{\frac{\hbar }{m}},\quad \mu =0..3. \end{aligned}$$

(42)

If we rewrite the equation of stochastic motion (2) using Eq. (42) we obtain:

$$\begin{aligned} d x_\mu =u_\mu ds + (1 + \epsilon _r g_{\mu \mu } i){\sqrt{\frac{\hbar }{m}}} dW_\mu , \quad \mu =0..3. \end{aligned}$$

(43)

The presence of complex diffusion coefficients and complex four-velocity in the stochastic equation of motion (2) necessitates that the particle’s four-coordinates also be complex-valued. Taking the real and imaginary parts of the four-coordinates allows us to formulate the equations of stochastic motion (2) for both real and imaginary spacetime coordinates.

$$\begin{aligned} {\text {Re}}(d x_\mu )= & {} {\text {Re}}(u_\mu ) ds + \sqrt{\frac{\hbar }{m}} dW_\mu , \quad \mu =0..3 \end{aligned}$$

(44)

$$\begin{aligned} {\text {Im}}(d x_\mu )= & {} {\text {Im}}(u_\mu ) ds + \epsilon _r g_{\mu \mu } \sqrt{\frac{\hbar }{m}} dW_\mu , \quad \mu =0..3 \end{aligned}$$

(45)

The coordinates of the test particles, both the real and imaginary parts, can be treated separately as coordinates in Minkowski spacetime in the same way as it is done in³⁸.

Conclusion

The present work derives the Dirac equation from the stochastic optimal control theory and provides representation of Dirac spinor using the solution of Hamilton–Jacobi–Bellman (HJB) equation.

We proposed a linearization of the Lagrangian in the HJB equation, which yields four independent equations. The last two of which are for particles with opposite charges.

It has been demonstrated that in the case of relativistic theory, the complex diffusion coefficient is twice as large as in the non-relativistic theory, with an opposite sign for the oppositely charged particle.

We also proposed a physical interpretation of the proper time Dirac equation in terms of stochastic optimal control theory that leads to the observation that the current state of the system is determined by all possible optimal future trajectories of the particle.

Even though the stochastic mechanics has some difficulties³⁹ in explaining the locality (see Chapter 23 in¹⁵) and entanglement (see Chapter 10 in⁴⁰) the present paper makes one step further of explaining other feature of the theory that was not revealed until now.

Data availability

All data generated or analysed during this study are included in this published article and its supplementary information files.

References

Beyer, M. & Paul, W. On the stochastic mechanics foundation of quantum mechanics. Universe 7(6), 166 (2021).
Article CAS ADS Google Scholar
Kuipers, F. Stochastic mechanics: The unification of quantum mechanics with Brownian motion. Stoch. Mech. (2023).
Fürth, R. Über einige beziehungen zwischen klassischer statistik und quantenmechanik. Zeitschrift für Physik 81(3), 143–162 (1933).
Article ADS Google Scholar
Peliti, L., Paolo Muratore-Ginanneschi, R. & fürth’s,. paper “on certain relations between classical statistics and quantum mechanics” [“über einige beziehungen zwischen klassischer statistik und quantenmechanik”, zeitschrift für physik, 81 143–162]. Eur. Phys. J. H48(1–19), 2020 (1933).
Fényes, I. Eine wahrscheinlichkeitstheoretische begründung und interpretation der quantenmechanik. Zeitschrift für Physik 132(1), 81–106 (1952).
Article MathSciNet ADS Google Scholar
Nelson, E. Derivation of the Schrödinger equation from Newtonian mechanics. Phys. Rev. 150, 1079–1085 (1966).
Article CAS ADS Google Scholar
Comisar, G. G. Brownian-motion model of nonrelativistic quantum mechanics. Phys. Rev. 138, B1332–B1337 (1965).
Article MathSciNet ADS Google Scholar
Wang, M. S. Stochastic mechanics and Feynman path integrals. Phys. Rev. A 37, 1036–1039 (1988).
Article MathSciNet CAS ADS Google Scholar
Yasue, K. Quantum mechanics and stochastic control theory. J. Math. Phys. 22(5), 1010–1020 (1981).
Article MathSciNet ADS Google Scholar
Guerra, F. & Morato, L. M. Quantization of dynamical systems and stochastic control theory. Phys. Rev. D 27, 1774–1786 (1983).
Article MathSciNet ADS Google Scholar
Bacciagaluppi, G. A conceptual introduction to Nelson’s mechanics. In Endophysics, Time, Quantum and the Subjective (eds Buccheri, R. et al.) 367–388 (World Scientific, Singapore, 2005).
Chapter Google Scholar
Lindgren, J. & Liukkonen, J. Quantum mechanics can be understood through stochastic optimization on spacetimes. Sci. Rep. 9(1), 19984 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Yang, C.-D. & Cheng, L.-L. Optimal guidance law in quantum mechanics. Ann. Phys. 338, 167–185 (2013).
Article MathSciNet CAS ADS Google Scholar
Albeverio, S. et al. Some Connections Between Stochastic Mechanics, Optimal Control, and Nonlinear Schrödinger Equations 505–534 (Springer, Cham, 2023).
Google Scholar
Nelson, E. Quantum Fluctuations. Princeton Series in Physics (Princeton University Press, Berlin, 1985).
Book Google Scholar
Papiez, L. Stochastic optimal control and quantum mechanics. J. Math. Phys. 23(6), 1017–1019 (1982).
Article MathSciNet ADS Google Scholar
Pavon, M. Hamilton’s principle in stochastic mechanics. J. Math. Phys. 36(12), 6774–6800 (1995).
Article MathSciNet ADS Google Scholar
Simulik, V. & Zayats, T. M. The variety of approaches to the problem of the derivation of Dirac equation. Sci. Herald Uzhhorod Univ. Ser. Phys. 45, 92–103 (2019).
Article Google Scholar
Blaquière, A. From the main equation to the Klein–Gordon equation. J. Optim. Theory Appl. 27(1), 71–87 (1979).
Article MathSciNet Google Scholar
Papiez, L. Stochastic optimal control quantization of a free relativistic particle (1981).
Gaveau, B., Jacobson, T., Kac, M. & Schulman, L. S. Relativistic extension of the analogy between quantum mechanics and Brownian motion. Phys. Rev. Lett. 53, 419–422 (1984).
Article MathSciNet CAS ADS Google Scholar
Kuipers, F. Stochastic quantization of relativistic theories. J. Math. Phys. 62(12), 122301 (2021).
Article MathSciNet ADS Google Scholar
Dirac, P. A. M. & Fowler, R. H. The quantum theory of the electron. Proc. R. Soc. Lond. Ser. A Contain. Pap. Math. Phys. Charact. 117(778), 610–624 (1928).
ADS Google Scholar
Øksendal, B. Stochastic Differential Equations: An Introduction with Applications Vol. 82 (Springer, New York, 2000).
Google Scholar
Goldstein, H., & Poole, C. Classical mechanics. Pearson education Asia, Delhi, 3rd edn. Includes subject and author index, selected bibliography (2002).
Bellman, R. The theory of dynamic programming. Oper. Res. 2(3), 275–285 (1954).
Google Scholar
Kappen, H. J. Path integrals and symmetry breaking for optimal control theory. J. Stat. Mech. Theory Exp. 2005(11), P11011–P11011 (2005).
Article MathSciNet Google Scholar
Kappen, H. Optimal control theory and the linear Bellman equation. In Neural Networks (2011).
Fleming, W.H., & Soner, H.M. Controlled Markov Processes and Viscosity Solutions. Stochastic Modelling and Applied Probability. Springer, New York, NY, 2 edn. Number of Pages: XVII, 429 (2006).
Yordanov, V. Derivation of the stochastic Hamilton–Jacobi–Bellman equation. https://arxiv.org/abs/2312.04581 (2023).
Brizard, A.J. On the proper choice of a Lorentz-covariant relativistic Lagrangian. arXiv:0912.0655 (2009).
Lorenz, L. Xxxviii on the identity of the vibrations of light with electrical currents. Lond. Edinb. Dublin Philos. Mag. J. Sci. 34(230), 287–301 (1867).
Article Google Scholar
Debergh, N., Petit, J.-P. & D’Agostini, G. On evidence for negative energies and masses in the Dirac equation through a unitary time-reversal operator. J. Phys. Commun. 2(11), 115012D (2018).
Article Google Scholar
Feynman, R. P. Space-time approach to non-relativistic quantum mechanics. Rev. Mod. Phys. 20, 367–387 (1948).
Article MathSciNet ADS Google Scholar
Faddeev, L. D., Khalfin, L. A. & Komarov, I. V. V. A. Fock—Selected Works: Quantum Mechanics and Quantum Field Theory 1st edn, Vol. 584 (CRC Press, Berlin, 2019).
Google Scholar
Zambrini, J. C. Variational processes and stochastic versions of mechanics. J. Math. Phys. 27(9), 2307–2330 (1986).
Article MathSciNet ADS Google Scholar
Yang, J. M. Variational principle for stochastic mechanics based on information measures. J. Math. Phys. 62(10), 102104 (2021).
Article MathSciNet ADS Google Scholar
Yang, C.-D. & Han, S.-Y. Extending quantum probability from real axis to complex plane. Entropy 23, 210 (2021).
Article MathSciNet PubMed PubMed Central ADS Google Scholar
Nelson, E. Review of stochastic mechanics. J. Phys. Conf. Ser. 361(1), 012011 (2012).
Article Google Scholar
Faris, W. G. (Eds.) Diffusion, Quantum Theory, and Radically Elementary Mathematics. (MN-47). Princeton University Press (2006).

Download references

Acknowledgements

I wish to express my gratitude to Lachezar Simeonov for the valuable discussions and insights that greatly contributed to the research presented in this paper. I would also like to thank Professor Asen Pashov and Lachezar Georgiev for their very useful remarks and comments, which are gratefully acknowledged.

Author information

Authors and Affiliations

Faculty of Physics, Sofia University, 5 James Bourchier Blvd., 1164, Sofia, Bulgaria
Vasil Yordanov

Authors

Vasil Yordanov
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

V.Y. is the only author of the manuscript. V.Y. wrote the manuscript text. V.Y. reviewed the manuscript.

Corresponding author

Correspondence to Vasil Yordanov.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yordanov, V. Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics. Sci Rep 14, 6507 (2024). https://doi.org/10.1038/s41598-024-56582-5

Download citation

Received: 17 September 2023
Accepted: 08 March 2024
Published: 18 March 2024
DOI: https://doi.org/10.1038/s41598-024-56582-5

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Derivation of Dirac equation from the stochastic optimal control principles of quantum mechanics

Subjects

Abstract

Similar content being viewed by others

Quantum Mechanics can be understood through stochastic optimization on spacetimes

Information theoretical limits for quantum optimal control solutions: error scaling of noisy control channels

Gravitational friction from d’Alembert’s principle

Introduction

Stochastic optimal control theory of quantum mechanics

Covariant relativistic Lagrangians

Derivation of Dirac equation

Dirac spinor and the wave function

Proper time in Dirac equation

Physical interpretation of the stationary solution of HJB equation

Plane wave solutions of the Dirac equation

Diffusion coefficients of the stochastic motion

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Comments

Search

Quick links

Subjects

Abstract

Similar content being viewed by others

Quantum Mechanics can be understood through stochastic optimization on spacetimes

Information theoretical limits for quantum optimal control solutions: error scaling of noisy control channels

Gravitational friction from d’Alembert’s principle

Introduction

Stochastic optimal control theory of quantum mechanics

Covariant relativistic Lagrangians

Derivation of Dirac equation

Dirac spinor and the wave function

Proper time in Dirac equation

Physical interpretation of the stationary solution of HJB equation

Plane wave solutions of the Dirac equation

Diffusion coefficients of the stochastic motion

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links