Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Statistical Tests for Force Inference in Heterogeneous Environments

## Abstract

We devise a method to detect and estimate forces in a heterogeneous environment based on experimentally recorded stochastic trajectories. In particular, we focus on systems modeled by the heterogeneous overdamped Langevin equation. Here, the observed drift includes a "spurious” force term when the diffusivity varies in space. We show how Bayesian inference can be leveraged to reliably infer forces by taking into account such spurious forces of unknown amplitude as well as experimental sources of error. The method is based on marginalizing the force posterior over all possible spurious force contributions. The approach is combined with a Bayes factor statistical test for the presence of forces. The performance of our method is investigated analytically, numerically and tested on experimental data sets. The main results are obtained in a closed form allowing for direct exploration of their properties and fast computation. The method is incorporated into TRamWAy, an open-source software platform for automated analysis of biomolecule trajectories.

## Introduction

Random walks are encountered throughout biology and other domains of science, and so is the associated inverse problem of inferring their properties from experimental data. Random walkers can be considered probes of their environment, and their recorded trajectories thus contain information on the properties of both the walker and its environment. In the context of biophysics, the random walkers are typically colloidal particles or biomolecules, but, in a general context, they may, for example, represent the motion along an abstract coordinate of a chemical reaction or the fluctuating price of a stock asset. Transport of biomolecules within cells1, conformational dynamics of proteins and RNA molecules2, diffusion of proteins on the DNA3, dynamics of nanosized objects in the cytosol4, dynamics of receptors in neurons5,6,7, complex random walks in mixed biological environments8,9,10,11, bacteria performing chemotaxis12,13, immune-cell dynamics14,15,16, and directionally persistent cell movement17 are all examples of cases where biologically relevant information can be extracted from recorded stochastic trajectories.

Empirical systems featuring biomolecule random walks are typically characterized by high heterogeneity, so the inverse problem often translates into inferring the properties of the heterogeneous environment from the trajectories of tracer molecules18,19,20,21,22,23,24,25,26,27,28,29,30,31. A paradigmatic model for random walks in such systems is the heterogeneous overdamped Langevin equation (OLE):

$$d{{\bf{X}}}_{t}=\frac{{\bf{f}}\left({{\bf{X}}}_{t}\right)}{\gamma \left({{\bf{X}}}_{t}\right)}dt+\sqrt{2D\left({{\bf{X}}}_{t}\right)}\circ d{{\bf{W}}}_{t},$$
(1)

which describes the continuous-time dynamics underlying a discrete-time recorded random walk22. Here Xt is the tracer’s position at time t, f(X) is the force acting on it in the point X, D(X) is its diffusivity, γ(X) is the viscous friction coefficient, and Wt is a Gaussian zero-mean continuous-time white noise process with uncorrelated increments and unit variance32. Owing to its simplicity, the OLE (1) is a popular model for biological random walks, providing an effective mesoscopic description of the dynamics22. As is often the case for models of biological systems, the OLE is empirically postulated rather than derived from the first principles, since this derivation is complex and requires taking into account many factors, such as the heterogeneity of the environment composition, presence of boundaries33 and hydrodynamic properties34, as well as possible noise correlations35,36. We refer the interested reader to (i) references37,38 for an in-depth discussion of the derivation of the OLE and for a microscopic model of crowded environments, to (ii) references39,40,41 for some approaches to the derivation of the equation of motion in media featuring diffusivity or temperature gradients, and to (iii) references35,36,42 for a discussion and experimental measurements of the extent to which Brownian noise is truly uncorrelated.

When the diffusivity D(x) varies in space, Eq. (1) is only well-defined after a convention for calculating the (stochastic) integral of the noise term has been defined43. Two well-known examples are the Itô and Stratonovich conventions. Each convention leads to a different extra "spurious” drift term, which is proportional to the diffusivity gradient41,43,44. This feature of the OLE is known as the Itô-Stratonovich dilemma41,45,46,47,48,49. For an experimental illustration of the presence of two physically different components see50,51.

Any stochastic convention can be used in the OLE to statistically describe a given experimental random walk. However, they are not equivalent to an external observer attempting to interpret the parameters of the random walk. Indeed, the two components of the drift in the OLE — the spurious force and the non-diffusive force — have a different physical nature. The spurious force is proportional to the diffusivity gradient, hence its value changes when the diffusivity, viscosity, or the temperature of the system change in space (time) or across systems. On the contrary, the non-diffusive component of the drift does not depend on the diffusivity and represents specific or non-specific interactions. The spurious force does not contribute to the equilibrium (Boltzmann) distribution.

Spurious forces are due to the interactions of the tracked particle with the surrounding thermal bath, while the non-diffusive forces represent its interactions with other objects or fields45. The separation of interactions between these two groups naturally depends (i) on the scale on which the system is analyzed and (ii) on which parts of the environment are included into the thermal bath. In Sect. 3.7 below, we show how in the same simulated system, the drift due to non-diffusive forces on the microscopic scale is perceived as a spurious force on the mesoscopic scale, when the contribution of individual interacting partners can no longer be identified.

For practical applications, it is thus important to develop a method allowing to distinguish between diffusive and non-diffusive forces or at least to develop a test allowing to confirm the presence of the non-diffusive forces on a given scale. The need for such approaches is further emphasized by the inaccessibility of the equilibrium distributions and of the exact boundary conditions at the nanometer scale in numerous biological setups.

Since the seminal work of Bachelier52, the inverse problem of drift and diffusivity inference from random walks has been attracting attention53,54, especially in financial applications55,56. In this article, we address a more specific problem of distinguishing between diffusive and non-diffusive forces, since the value of the latter is generally unknown. Although the spurious force is proportional to the diffusivity gradient λ D(X), it includes an unknown proportionality factor λ. It is known that for physical systems in equilibrium, described by Boltzmann distribution, λ = 1, but its value is not known in general for out-of-equilibrium systems41,42,44,47,48,49,50,57,58. Each value of λ represents specific symmetries of transition probabilities in these systems46,58,59.

Our goal is hence two-fold: (i) to develop a statistical test for the presence of non-diffusive forces, and (ii) to infer the posterior distribution for the intensity of the non-diffusive forces while taking into account all possible contributions of the "spurious” forces as well as experimental localization errors and motion blur. The method we introduce here is statistically robust to changes in the spurious force contribution in the OLE due, for example, to changes of the diffusivity or viscosity. We validate our approach on numerical trajectories and demonstrate its efficiency on experimental data.

## The Itô-Stratonovich Dilemma for the Inverse Problem

In this section we give a brief review of the Itô-Stratonovich dilemma45. Numerous discussions of the dilemma have been focused on choosing the appropriate integral convention for the forward problem of integrating the OLE in a particular system. In contrast, we here focus on how the dilemma affects the inverse problem of inferring the underlying physical parameters of a model from recorded data. To underline the generality of the problem, we rewrite the OLE (1) in the form of a general stochastic differential equation (SDE):

$$d{{\bf{X}}}_{t}={\bf{a}}({{\bf{X}}}_{t})dt+b({{\bf{X}}}_{t})\circ d{{\bf{W}}}_{t},$$
(2)

where a and b are differentiable functions of Xt. We will refer to a and b as the drift and diffusivity respectively.

The integral of Eq. (2) is defined as the limit of Riemann sums

$${{\bf{X}}}_{T}={{\bf{X}}}_{0}+\mathop{lim}\limits_{N\to \infty }\left[\mathop{\sum }\limits_{i=0}^{N-1}a({{\bf{X}}}_{{\xi }_{i}})({t}_{i+1}-{t}_{i})+\mathop{\sum }\limits_{i=0}^{N-1}b({{\bf{X}}}_{{\xi }_{i}})({{\bf{W}}}_{{t}_{i+1}}-{{\bf{W}}}_{{t}_{i}})\right],$$
(3)

where each point ξi is chosen in the interval [titi+1]. The standard conventions — Itô, Stratonovich-Fisk and Hänggi-Klimontovich — correspond to ξi = ti, ξi = (ti + ti+1)/2 and ξi = ti+1 respectively41,46. More generally, ξi can be set to any point ξi = ti + λ(ti+1 − ti) within the [titi+1] interval. This allows one to rewrite Eq. (2) with any convention λ in the Itô form60,61:

$$d{{\bf{X}}}_{t}=\alpha ({{\bf{X}}}_{t},\lambda )dt+b({{\bf{X}}}_{t})d{{\bf{W}}}_{t},$$
(4)

where the total drift α is the sum of a and the spurious drift λb(X) b(X):

$$\alpha ({\bf{X}},\lambda )\equiv {\bf{a}}({\bf{X}})+\lambda b({\bf{X}})\nabla b({\bf{X}}).$$
(5)

From the perspective of the forward problem, Eq. (5) shows that the often arbitrary choice of the value of λ influences the value of the drift α when a and b are fixed, — this is the essence of the Itô-Stratonovich dilemma45. In the context of the inverse problem, one is given fixed values of α and b estimated from the recorded trajectories, so different choices of λ result in different estimates of the non-diffusive drift a. If the chosen λ does not agree with its true value in the empirical system, the resulting estimate of a becomes biased.

We emphasize that we do not address here the forward problem, i.e. the question of finding the correct λ for a given system41,45,46,47,48,49. The correct λ values are often inaccessible in real biological systems. Instead, we aim to solve the inverse problem of whether non-diffusive forces are observed in the system and to infer their values if the appropriate value of λ cannot be determined. It is an inverse problem with an uncertainty in the underlying physical model. This ambiguity in λ may stem, for example, from the lack of a priori knowledge about the out-of-equilibrium fluxes in the system, noise correlations or the particle density distribution. In all cases, the method developed below allows one to obtain estimates of the non-diffusive forces and to circumvent the Itô-Stratonovich dilemma by marginalizing over all possible λ values. The estimates are robust to changes in the spurious force contribution in the OLE.

Above, we have formulated the main question of this paper from a physical point of view as that of inferring non-diffusive forces, when the correct λ is unknown. It is interesting to note that the same question can also be asked from a purely statistical point of view: Given the OLE, does there exist a value of 0 ≤ λ ≤ 1 that would allow to describe the given system with zero non-diffusive forces (a = 0)? This would allow to describe the same system with fewer parameters (D and λ instead of Dλ, a), thus minimizing the description length among all the descriptions proposed by the OLE family62,63.

From this point of view, the Bayes factor developed below is a Bayesian analog of the difference in the description lengths between the models with a ≠ 0 and a = 0 for the given data. It evaluates how much more efficient the non-diffusive-force description is, as compared to the spurious-force-only description of the same data. If the spurious-force description is preferred, as a byproduct, one can calculate the value of λ that provides the most efficient description of the data.

## The Bayesian Approach

Our goal is to discriminate between the following two nested hypotheses:

• H0: The only forces present are spurious forces due to heterogeneous diffusivity (the null hypothesis).

• H1: There are other, non-diffusive, forces acting on the random walker in addition to the spurious forces.

We use the Bayes factor to decide between these hypotheses64.

### The Bayes factor

According to Bayes’ rule65, the posterior probability Pr(Hi | T) of a hypothesis Hi given data T is

$${\rm{\Pr }}({H}_{i}| T)=\frac{p(T| {H}_{i})\pi ({H}_{i})}{p(T)}.$$

Here, T is a trajectory, $$T\equiv {\{{{\rm{r}}}_{i}\}}_{i=1}^{n}$$, or a set of trajectories; p(T | Hi) is the marginal likelihood for the data T to be observed under the hypothesis Hi; π(Hi) is the prior probability of Hi; and p(T) is the probability to observe T under either hypothesis. For the two competing hypotheses H1 and H0, the ratio of their posterior probabilities reads

$$\frac{{\rm{\Pr }}({H}_{1}| T)}{{\rm{\Pr }}({H}_{0}| T)}=\frac{p(T| {H}_{1})}{p(T| {H}_{0})}\frac{\pi ({H}_{1})}{\pi ({H}_{0})}.$$

The first fraction on the right-hand side is called the Bayes factor for H1 over H064:

$$K\equiv \frac{p(T| {H}_{1})}{p(T| {H}_{0})}.$$

Each marginal likelihood p(T | Hi) is calculated by marginalizing the corresponding conditional likelihood p(T | θiHi) over all model parameters:

$$p(T| {H}_{i})=\int d{\theta }_{i}\ p(T| {\theta }_{i},{H}_{i})\pi ({\theta }_{i}| {H}_{i}).$$

For H0, the likelihood p(T | θ0H0) thus depends on 3 parameters: θ0 = {b2, g, λ}, where b2 is the diffusivity and g ≡ b is the diffusivity gradient. For H1, the likelihood p(T | θ1H1) additionally includes the drift a, so θ1 = {b2, g, λ, a}. Note that we treat g as independent from b2, which allows us to obtain the results in the analytical form. This assumption is further discussed in Appendix A1.

### Likelihood

The likelihood p(T | θiHi) is obtained as the fundamental solution of the Fokker-Planck equation corresponding to the OLE (2). However, it cannot in general be obtained analytically. Instead, one can approximate it locally by assuming that α and b are constant within small spatial domains20,21. In this case, the likelihood of observing a set of displacements {Δr} inside a given domain is21:

$$p(\{\Delta {\rm{r}}\}| \alpha ,{b}^{2})={(2\pi {b}^{2}\Delta t)}^{-nd/2}\exp \left(-\frac{n{\left(\overline{\Delta {\bf{r}}}-\alpha \Delta t\right)}^{2}+nV}{2{b}^{2}\Delta t}\right).$$
(6)

Here the mean displacement $$\overline{\Delta {\rm{r}}}\equiv {\sum }_{i=1}^{n}\Delta {{\rm{r}}}_{i}/n$$ and the biased sample variance $$V\equiv {\sum }_{i=1}^{n}| \Delta {{\rm{r}}}_{i}-\overline{\Delta {\rm{r}}}{| }^{2}$$/n are the sufficient statistics of the model65, and d is the number of dimensions. The equations below are valid for d = 1 and d = 2, but the framework can also be extended to d = 3.

Note that calculations would be similar if one relaxed the approximation of the locally constant values of α and b. Computations would be performed numerically but the analytical explorations such as those of Appendix A3 would not be possible. Meanwhile, the assumption of bin independence is paramount to the presented method.

### Priors

The likelihood (6) belongs to the exponential family65. Therefore, a natural choice for the prior is a conjugate prior for the parameters a and b2. Among other advantages, conjugate priors provide a closed form of the posterior distribution. We furthermore assume a factorized form for the prior distributions for λ and the diffusivity gradient g:

$$\pi (a,\lambda ,{\rm{g}},{b}^{2}| {H}_{1})\approx \pi (a,{b}^{2}| \lambda ,{H}_{1})\ \pi (\lambda )\ \pi ({\rm{g}}).$$

We have no a priori information available about the true value of λ other than that 0 ≤ λ ≤ 1, so we use the flat prior π(λ) ≡ 1. The diffusivity gradient prior is approximated by a delta function $$\pi ({\rm{g}})\equiv \delta ({\rm{g}}-\widehat{{\rm{g}}})$$ centered around its maximum a posteriori (MAP) value $$\widehat{{\rm{g}}}$$. Details of $$\widehat{{\rm{g}}}$$ estimation are given in Appendix A1.

Under H1, the full conjugate prior is then (cf. (6)):

$$\pi ({\bf{a}},\lambda ,{\rm{g}},{b}^{2}| {H}_{1})\equiv {A}_{d}\ {\left({b}^{2}\Delta t\right)}^{-d{n}_{\pi }/2}\exp \left(-\frac{{n}_{\pi }{((a+\lambda {\rm{g}})\Delta t-{\mu }_{\pi })}^{2}+{n}_{\pi }{V}_{\pi }}{2{b}^{2}\Delta t}\right)\delta ({\rm{g}}-\widehat{{\rm{g}}}),$$
(7)

where $${A}_{d}\equiv {({n}_{\pi }/(2\pi ))}^{d/2}{({n}_{\pi }{V}_{\pi }/2)}^{m(d)}\Delta {t}^{d+1}/\Gamma (m(d))$$; m(d) ≡ d(nπ − 1)/2 − 1; μπ, Vπ and n are the parameters of the prior (called hyper-parameters).

The models H0 and H1 are nested models. In such case, it is common practice to obtain the H0 prior by integrating the H1 prior over a64:

$$\pi (\lambda ,{\rm{g}},{b}^{2}| {H}_{0})={A}_{d}{\left(\frac{2\pi }{{n}_{\pi }\Delta {t}^{2}}\right)}^{d/2}{\left({b}^{2}\Delta t\right)}^{\frac{d(1-{n}_{\pi })}{2}}\exp \left(-\frac{{n}_{\pi }{V}_{\pi }}{2{b}^{2}\Delta t}\right)\delta ({\rm{g}}-\widehat{{\rm{g}}}).$$
(8)

We further set the hyper-parameters to maximally favor the null model. More specifically, nπ acts as an effective number of prior observations. The least constraining prior is obtained by setting nπ = 4 for 1D data and nπ = 3 for 2D, which are the minimal number of observations, for which the prior is proper (normalized). Furthermore, we center the prior on zero force by setting μπ = λgΔt. The remaining hyper-parameter Vπ defines the prior distribution for the diffusivity. Sensitivity of the results to u ≡ Vπ/V is explored in Appendix A2.

### Model evidence and the Bayes factor

The evidence for the H1 and H0 models is the central ingredient in the Bayes factor. Given the likelihood p({Δr} | α(a, λ), b2) (Eq. (6)) and prior π(a, λ, g, b2 | H1) (Eq. (7)), the evidence for H1 is calculated by marginalizing p({Δr} | α(a, λ), b2)π(a, λ, g, b2 | H1) over all the parameters θ1 = {ab2gλ}. This gives

$$p(\{\Delta {\rm{r}}\}| {H}_{1})=\frac{{A}_{d}{C}_{d}}{{(n+{n}_{\pi })}^{d/2}}{\int }_{0}^{1}d\lambda {(nV+{n}_{\pi }{V}_{\pi }+\frac{n{n}_{\pi }}{n+{n}_{\pi }}{(\overline{\Delta {\bf{r}}}-\lambda {\rm{g}}\Delta t)}^{2})}^{-\kappa (d)},$$
(9)

where $${C}_{d}={2}^{\kappa (d)}\Gamma (\kappa (d)){(2\pi )}^{\frac{d(1-n)}{2}}\Delta {t}^{-d-1}$$ and κ(d) ≡ d(n + nπ − 1)/2 − 1.

For H0, the likelihood p({Δr} | α(λ), b2) is given by Eq. (6) with α(λ) = λb b, and the prior π(λ, g, b2 | H0) by Eq. (8). Marginalization of p({Δr} | α(λ), b2)π(λ, g, b2 | H0) over θ0 = {b2, g, λ} gives

$$p(\{\Delta {\rm{r}}\}| {H}_{0})=\frac{{A}_{d}{C}_{d}}{{n}^{d/2}}{\int }_{0}^{1}d\lambda {(nV+{n}_{\pi }{V}_{\pi }+n{(\overline{\Delta {\bf{r}}}-\lambda {\rm{g}}\Delta t)}^{2})}^{-\kappa (d)}.$$
(10)

Expressions (9) and (10) let us finally calculate the marginalized Bayes factor KM, which takes into account all possible values for the unknown parameter λ. For comparison, we also provide the Bayes factor K(λ) for fixed-λ inference procedures (Itô, Stratonovich or Hänggi), which is calculated in the same manner:

$${K}^{{\rm{M}}}={\eta }^{d}\frac{{\int }_{0}^{1}d\lambda {\left[v+{\eta }^{2}{({\zeta }_{{\rm{t}}}-\lambda {\zeta }_{{\rm{sp}}})}^{2}\right]}^{-K(d)}}{{\int }_{0}^{1}d\lambda {\left[v+{({\zeta }_{{\rm{t}}}-\lambda {\zeta }_{{\rm{sp}}})}^{2}\right]}^{-K(d)}},\quad K(\lambda )={\eta }^{d}{\left[\frac{v+{\eta }^{2}{({\zeta }_{{\rm{t}}}-\lambda {\zeta }_{{\rm{sp}}})}^{2}}{v+{({\zeta }_{{\rm{t}}}-\lambda {\zeta }_{{\rm{sp}}})}^{2}}\right]}^{-K(d)}.$$
(11)

All the integrals appearing in Eqs. (911) are 1D integrals that are numerically evaluated using the trapezoid rule.

The natural parameter combinations appearing in Eq. (11) are: (i) $${\zeta }_{{\rm{t}}}\equiv \overline{\Delta {\rm{r}}}/\sqrt{V}$$, the signal-to-noise ratio for the total force in a single displacement; (ii) $${\zeta }_{{\rm{sp}}}\equiv \widehat{{\rm{g}}}{\Delta }_{t}/\sqrt{V}$$, the signal-to-noise ratio for the spurious force in a single displacement; (iii) $$\eta \equiv \sqrt{{n}_{\pi }/(n+{n}_{\pi })}$$, the relative strength of the prior compared to the observed data; (iv) v ≡ 1 + nπVπ/(nV), a weighted ratio of jump variances in the prior and in the data.

Figure 1A plots the marginalized Bayes factor KM (11) as a function of ζsp, and of the component of the total force ζ parallel to ζsp. The lowest values of KM are achieved in the region 0 ≤ ζ/ζsp ≤ 1. The value of KM changes relatively little within this region but grows rapidly at its boundary. The absolute minimum of KM is achieved for ζsp = 0 and ζ = 0 with $$\min \ {K}^{{\rm{M}}}=\min \ K(\lambda )={\eta }^{d}{[(v+{\eta }^{2}{{\zeta }_{t\perp }}^{2})/(v+{{\zeta }_{t\perp }}^{2})]}^{-\kappa }$$. A mathematical analysis of Eq. (11) is provided in Appendix A3, where it is shown that non-diffusive forces cannot in principle be detected in certain intervals of ζsp, ζt regardless of the number of collected data points. Appendix A4 extends the Bayes factors (11) to the experimentally relevant case with localization errors and motion blur.

### Force posterior

When H1 is met, we can infer the value of the non-diffusive force by marginalizing the force posterior over all possible values of λ:

$$p({\zeta }_{a}| T)\propto {\int }_{0}^{1}d\lambda {\left[v+\frac{{\eta }^{2}}{1-{\eta }^{2}}{\zeta }_{a}^{2}+{({\zeta }_{{\rm{t}}}-{\zeta }_{a}-\lambda {\zeta }_{{\rm{sp}}})}^{2}\right]}^{-K(d)-d/2},$$
(12)

where a signal-to-noise ratio for the force ζa ≡ aΔt/$$\sqrt{V}$$ was introduced. Figure 1B plots an example force posteriors obtained with the marginalized method and with fixed-λ inference schemes. The wider marginalized method posterior takes into account all possible λ values. Appendix A5 demonstrates that in contrast to the fixed-λ posteriors, the marginalized posterior is in general non-symmetric.

### Numerical results

The performance of the marginalized method was investigated on simulated trajectories. Random trajectories were simulated in a 2D box with periodic boundary conditions, a uniform total force, and a triangular diffusivity profile along the x axis (Fig. 2A,B). Other simulation parameters are given in Appendix A6. For each trial, the simulated trajectories were then analyzed using the TRamWAy software platform66 and following a procedure similar to the one used in reference20,28 and consisting of (i) individual spatial tessellation in each trial; (ii) assignment of recorded displacements to spatial domains; (iii) inference of ζsp and ζt in each domain; (iv) calculation of the Bayes factor in each domain.

The marginalized Bayes factor $${\widehat{K}}^{{\rm{M}}}$$, inferred in each domain, was then plotted against its expected value KM to test the accuracy of the method (Fig. 2C,D). The figure shows good correspondence between the inferred Bayes factor and the expected Bayes factor. 95% confidence intervals (CIs) show the extent of the deviation of the results from the true values due to the stochastic nature of the simulated trajectories.

### Microscopic model of heterogeneous diffusivity

The next simulation was performed with two goals: (i) to illustrate how spurious forces may originate from crowding at the molecular scale, and (ii) to illustrate a case, wherein our developed statistical test successfully indicates the absence of non-diffusive forces. For this purpose, we simulated free diffusion of particles with no microscopic drift within a square region with periodic boundaries and with impenetrable immobile beads evenly spaced on a square lattice (Fig. 3A), similar to schemes suggested in67,68,69. The microscopic diffusivity of the particle was the same throughout the system. A spatial variation in the radii of the immobile beads created a spatial variation in the effective diffusivity on a much larger “mesoscopic” scale, where each analysis bin included ~100 small beads (Fig. 3B). As a result, recordings at the mesoscopic scale exhibit a diffusivity gradient (Fig. 3C), which contributes to the drift observed on the same scale (Fig. 3D). Note that at long time scales, the system is in physical equilibrium, although particles experience a stationary non-zero drift. Simulation details and parameters are provided in Appendix A7.

The diffusivity gradient contribution to the drift is the spurious force, its exact value depends on λ. Assuming the value of λ is unknown, one can use the Bayes factor test developed above to estimate the a posteriori likelihood of that the observed drift is due to non-spurious forces (Fig. 3E). In our simulation, the inferred Bayes factors were small ($${{\rm{\log }}}_{10}K < -1$$) in most parts of the region, supporting the claim that only spurious forces were present (Fig. 3F). Statistical noise in several bins resulted in weaker evidence, which did not let us draw statistically significant conclusions in those zones. These results confirm the capacity of the method to detect spurious forces. Its capacity to detect non-spurious forces will be illustrated in the next section.

This simulation captures one possible microscopic mechanism behind the observation of a diffusivity gradient on the mesoscopic scale in biological systems. However, note that the homogeneous composition required for a uniform microscopic diffusivity is probably achievable in the biological systems only on the molecular scale (10−9 m and smaller). On this scale though, it is not clear whether the diffusivity itself is well-defined, since by definition it is the result of the action of millions of individual molecules and Fick’s law describes an intrinsically mesoscopic phenomenon.

Other microscopic mechanisms for the diffusivity gradient include (i) confinement, wherein it was shown that the diffusivity in a homogeneous system changes with the distance to a wall33,41,50,70, (ii) corralled motion71, (iii) hydrodynamic coupling to other objects in the medium34,72, (iv) temperature gradients44,73,74, or (v) intermittent trapping75.

## Applications

The developed method was tested on two experimental systems. The first one was a well-controlled setup of a bead in the optical tweezers. The second one was a complex biological process of HIV virion assembly in a T cell76, where the OLE is potentially only an approximation to the true biomolecule dynamics (ignoring inertial effects, colored noise or memory of the previous states).

### Optical tweezers

Optical tweezers combine physical trapping of the bead with local laser heating of the medium, leading to a heterogeneous diffusivity field. Therefore, the heating effect and the ensuing spurious forces may interfere with the inferred trapping potential. Figure 4A–C compares the results of Bayes factor calculations for the same system subjected to three different laser powers. The tessellation procedure was designed to assign the same number of jumps to each domain. In all 3 cases, the particle is confined and the Bayes factor favors the presence of forces ($${{\rm{\log }}}_{10}{K}^{{\rm{M}}} > 1$$) in a large number of domains, which form a connected region. With the decrease of the laser power, the confinement at the center of the trap becomes more shallow, so that the statistical test only detects confining forces on the trap border.

### Assembly of HIV-1 Virus-Like particles

The HIV virus-like-particle (VLP) assembly experiments that provided the data are described in reference77. The VLPs derive from the human immunodeficiency virus type 1 (HIV-1), but are immature and deprived of envelope proteins. One of their main components is the group-specific antigen (Gag) protein. It is a viral structural protein produced by the virus that anchors and oligomerizes at the plasma membrane of the host T cells, eventually assembling into a VLP76. In the experiments, the HIV-1 Gag precursor was genetically modified to contain a photoactivable fluorescent tag mEOS2 protein. It allowed to record VLP assembly in human CD4 T cells using single-particle tracking photoactivated localization microscopy (sptPALM)77,78. Several VLPs can assemble in parallel in the same observation region.

The TRamWAy software platform was used to tessellate the observation region and infer maps of diffusivities (Fig. 4G) and drift20,66. The Bayes factor map was then computed. The localization uncertainty was σL = 30 nm, requiring corresponding corrections to the Bayes factor (Appendix A4). Inference results and Bayes factors for a 2 μm × 2 μm zone of one T-cell membrane are shown in Fig. 4D–F.

Plots of the trajectories, the density of the recorded points and the diffusivity (Fig. 4D,E,G) indicate that there are two regions of interest (ROI) in the data set. However, the plots of the diffusivity gradient and the drift (Fig. 4H,I) suggest that the two parameters are of the same scale, hence it is not a priori clear, whether the localization of the particles is due to non-diffusive or spurious forces. Only the calculation of the Bayes factors for these regions allowed us to confirm that it is not solely due to heterogeneities in the diffusivity but rather to non-spurious forces ($${{\rm{\log }}}_{10}(K)\gg 1$$, Fig. 4F).

Some other individual domains in Fig. 4F bear evidence of a force with rather high Bayes factors ($${{\rm{\log }}}_{10}(K)\ge 1$$). In such a complex system, the high K values in these individual domains may stem from local membrane activity, failed capsid assembly77 or be false detection. In the rest of the region, the Bayes factor is $$| {{\rm{\log }}}_{10}(K)| < 1$$ meaning neither of the models is favored at the chosen level of statistical significance.

As demonstrated in the simulation of Sect. 3.7, the results of any inference procedure depend on how the spatial scale on which the analysis is performed, corresponds to the internal scale of the observed system. An illustration of this fact for the VLP data can be seen in Fig. 5. Here, our statistical test was applied to the same VLP data set on three different spatial scales. When the bins are much larger than the typical structures present in the biological system (Fig. 5A, 0.5 μm), the interactions are averaged out and our statistical test confirms the absence of interactions or is inconclusive. On the scale of the structures (Fig. 5B, 0.25 μm), one may identify the potential regions of interest, but is unable to resolve their internal structure. When the bins are smaller than the regions of interest (Fig. 5C, 0.05 μm), given enough data, the internal structure of the regions can be resolved. At even smaller scales (not shown), when few points are available per bin, one starts losing the connectivity of the regions of interest, and the statistical tests becomes inconclusive or (by design) favor the model with only spurious forces (H0). We suggest that one should aim for a scale smaller than the scale of the analyzed structure, but maintain enough points per bin to reach statistically significant conclusions.

## Discussion

In this paper, we introduced a method to address the inverse problem for the spatially heterogeneous OLE that is robust in regards to changes in the spurious-force contribution. We leveraged Bayesian inference and Bayesian model comparison to account for the uncertainty in the values of the spurious force caused by a heterogeneous diffusivity field. The method provides a test for the presence of non-diffusive forces and returns the values of the non-diffusive forces and diffusivity.

The marginalized posterior takes into account the error in the inferred forces due both to stochastic errors and to possible spurious forces when the true value of λ is unknown. The expression for the Bayes factor was derived in a closed form, allowing for identification of natural parameters associated with the dynamics, namely, the signal-to-noise ratios for the total force (drift) and spurious forces, ζt and ζsp, and the relative strength of the localization uncertainty $$4{\sigma }_{L}^{2}$$/(nV). Interestingly, we showed that under some configurations, the discrimination between active and spurious forces is impossible without introducing additional assumptions.

As for any statistical method, a prerequisite for our method is that one observe the trajectories on the "right” spatial and temporal scales, which depend on the individual system. In particular, the spatial tessellation employed here should be constructed on the appropriate spatial scale, i.e. finer than the spatial heterogeneities of interest and coarse enough to provide sufficient measurements in each mesh domain (as discussed above). Another condition required by our method is that the number of points per bin be >4 in 1D and >3 in 2D, which are the equivalent numbers of points contained in the prior. Otherwise, due to the choice of μπ, the model with only spurious forces is likely to always be favored. Our experience with the method indicates that for the biological data we tested, n ≥ 20 typically provides a reasonable compromise between the spatial and statistical resolution.

The VLP example demonstrated successful utilization of the method for the detection of biological activity. The test was applied in an unsupervised way, which makes it useful for automatic analysis of single-molecule dynamics. In general, however, the results may depend on the spatial meshing. For the VLP data set, we had the advantage of a priori knowing the characteristic spatial scale of the underlying biological processes77. In a general case, one may need to sample multiple spatial scales in an attempt to optimize the detection. An optimal mesh scale in this case can be seen as a trade-off between increasing statistical significance (by getting more data per domain) and increasing resolution (by reducing the domain size).

Potential ways to circumvent this fundamental trade-off of spatial versus statistical resolution could be to regularize the inference of the diffusivity and drift fields20,79 or to cluster the regions with similar Bayes factor values based on a certain rule. However, the former approach induces correlations between the results inferred in different domains making analytical calculations intractable and hindering interpretation of the results. The main difficulty with the latter approach consists in defining the appropriate clustering criterion and in accounting for how the uncertainty in the individual Bayes factors propagates to the Bayes factors of the clusters.

One should also keep in mind that the validity of the main result (11) relies on the assumption that the diffusivity b is smooth enough, so that the gradient b exists on the spatial scale on which the system is experimentally probed. Additionally, we stress that α and b are mesoscopic quantities and their values may change depending on the analyzed scale80. In practice, the choice of the spatial and temporal resolutions for the analysis is limited by the particular experimental setup and the properties of the biological system.

The Bayesian approach that we proposed here is general and not limited to the OLE equation. The ambiguity of the stochastic integration is encountered in numerous other scientific fields involving stochastic equations with multiplicative heterogeneous noise. The effect is usually ignored and an arbitrary standard convention is used. The marginalized method allows us to avoid arbitrarily choosing an integral convention in the absence of system-specific information, therefore providing more robust results.

The marginalized method code is available as a module of the open source project TRamWAy81 and the microscopic crowding simulation code is available at82. Two Jupyter notebooks are provided as illustration of the module interface66.

## References

1. 1.

Wachsmuth, M., Waldeck, W. & Langowski, J. Anomalous Diffusion of Fluorescent Probes inside Living Cell Investigated by Spatially-Resolved Fluorescence Correlation Spectroscopy. J. Mol. Biol. 298, 677–689, ISSN: 00222836 (2000).

2. 2.

Best, R. B. & Hummer, G. Diffusion Models of Protein Folding. Phys. Chem. Chem. Phys. 13, 16902, ISSN: 1463-9076 (2011).

3. 3.

Vestergaard, C. L., Blainey, P. C. & Flyvbjerg, H. Single-Particle Trajectories Reveal Two-State Diffusion-Kinetics of hOGG1 Proteins on DNA. Nucleic Acids Res. 46, 2446–2458, ISSN: 1362-4962 (2018).

4. 4.

Etoc, F. et al. Non-Specific Interactions Govern Cytosolic Diffusion of Nanosized Objects in Mammalian Cells. Nat. Mater. 17, 740–746, ISSN: 1476-1122 (2018).

5. 5.

Dahan, M. et al. Diffusion Dynamics of Glycine Receptors Revealed by Single-Quantum Dot Tracking. Science 302, 442–5, ISSN: 1095-9203 (2003).

6. 6.

Choquet, D. Linking Nanoscale Dynamics of AMPA Receptor Organization to Plasticity of Excitatory Synapses and Learning. J. Neurosci. Off. J. Soc. Neurosci. 38, 9318–9329, ISSN: 1529-2401 (2018).

7. 7.

Schneider, R. et al. Mobility of Calcium Channels in the Presynaptic Membrane. Neuron 86, 672–9, ISSN: 1097-4199 (2015).

8. 8.

Thapa, S., Lomholt, M. A., Krog, J., Cherstvy, A. G. & Metzler, R. Bayesian Analysis of Single-Particle Tracking Data Using the Nested-Sampling Algorithm: Maximum-Likelihood Model Selection Applied to Stochastic-Diffusivity Data. Phys. Chem. Chem. Phys. PCCP 20, 29018–29037, ISSN: 1463-9084 (2018).

9. 9.

Grebenkov, D. S., Metzler, R. & Oshanin, G. Towards a Full Quantitative Description of Single-Molecule Reaction Kinetics in Biological Cells. Phys. Chem. Chem. Phys. PCCP 20, 16393–16401, ISSN: 1463-9084 (2018).

10. 10.

Javanainen, M., Martinez-Seara, H., Metzler, R. & Vattulainen, I. Diffusion of Integral Membrane Proteins in Protein-Rich Membranes. J. Phys. Chem. Lett. 8, 4308–4313, ISSN: 1948–7185 (2017).

11. 11.

Norregaard, K., Metzler, R., Ritter, C. M., Berg-Sørensen, K. & Oddershede, L. B. Manipulation and Motion of Organelles and Single Molecules in Living Cells. Chem. Rev. 117, 4342–4375, ISSN: 1520-6890 (2017).

12. 12.

Masson, J.-B., Voisinne, G., Wong-Ng, J., Celani, A. & Vergassola, M. Noninvasive Inference of the Molecular Chemotactic Response Using Bacterial Trajectories. Proc. Natl. Acad. Sci. 109, 1802–1807, ISSN: 0027-8424 (2012).

13. 13.

Wong-Ng, J., Melbinger, A., Celani, A. & Vergassola, M. The Role of Adaptation in Bacterial Speed Races. PLoS Comput. Biol. 12 (ed. Rao, C. V.) e1004974, ISSN: 1553-7358 (2016).

14. 14.

Sarris, M. et al. Inflammatory Chemokines Direct and Restrict Leukocyte Migration within Live Tissues as Glycan-Bound Gradients. Curr. Biol. CB 22, 2375–82, ISSN: 1879-0445 (2012).

15. 15.

Sarris, M. & Sixt, M. Navigating in Tissue Mazes: Chemoattractant Interpretation in Complex Environments. Curr. Opin. Cell Biol. 36, 93–102, ISSN: 1879-0410 (2015).

16. 16.

Fricke, G. M., Letendre, K. A., Moses, M. E. & Cannon, J. L. Persistence and Adaptation in Immunity: T Cells Balance the Extent and Thoroughness of Search. PLoS Comput. Biol. 12, e1004818, ISSN: 1553-7358 (2016).

17. 17.

Maiuri, P. et al Actin Flows Mediate a Universal Coupling between Cell Speed and Cell Persistence. Cell 161, 374–86, ISSN: 1097-4172 (2015).

18. 18.

Cocco, S. & Monasson, R. Reconstructing a Random Potential from Its Random Walks. EPL Europhys. Lett. 81, 20002, ISSN: 0295-5075 (2008).

19. 19.

Best, R. B. & Hummer, G. Coordinate-Dependent Diffusion in Protein Folding. Proc. Natl. Acad. Sci. 107, 1088–1093, ISSN: 0027-8424 (2010).

20. 20.

El Beheiry, M., Dahan, M. & Masson, J.-B. InferenceMAP: Mapping of Single-Molecule Dynamics with Bayesian Inference. Nat. Methods 12, 594–595, ISSN: 1548-7091 (2015).

21. 21.

Masson, J.-B. et al. Inferring Maps of Forces inside Cell Membrane Microdomains. Phys. Rev. Lett. 102, 1–4, ISSN: 00319007 (2009).

22. 22.

Hozé, N. & Holcman, D. Statistical Methods for Large Ensembles of Super-Resolution Stochastic Single Particle Trajectories in Cell Biology. Annu. Rev. Stat. Appl. 4, 189–223, ISSN: 2326-8298 (2017).

23. 23.

Chang, J. C., Fok, P.-W. & Chou, T. Bayesian Uncertainty Quantification for Bond Energies and Mobilities Using Path Integral Analysis. Biophys. J. 109, 966–974, ISSN: 00063495 (2015).

24. 24.

Neuman, K. C. & Nagy, A. Single-Molecule Force Spectroscopy: Optical Tweezers, Magnetic Tweezers and Atomic Force Microscopy. Nat. Methods 5, 491–505, ISSN: 1548-7091 (2008).

25. 25.

Lang, M. J., Fordyce, P. M., Engh, A. M., Neuman, K. C. & Block, S. M. Simultaneous, Coincident Optical Trapping and Single-Molecule Fluorescence. Nat. Methods 1, 133–139, ISSN: 1548-7091 (2004).

26. 26.

Li, T. P. & Blanpied, T. A. Control of Transmembrane Protein Diffusion within the Postsynaptic Density Assessed by Simultaneous Single-Molecule Tracking and Localization Microscopy. Front. Synaptic Neurosci8, 19, ISSN: 1663-3563 (2016).

27. 27.

Sungkaworn, T. et al. Single-Molecule Imaging Reveals Receptor-G Protein Interactions at Cell Surface Hot Spots. Nature 550, 543–547, ISSN: 0028-0836 (2017).

28. 28.

Remorino, A. et al. Gradients of Rac1 Nanoclusters Support Spatial Patterns of Rac1 Signaling. Cell Rep. 21, 1922–1935, ISSN: 22111247 (2017).

29. 29.

Granik, N. et al. Single-Particle Diffusion Characterization by Deep Learning. Biophysical Journal 117, 185–192, ISSN: 00063495 (2019).

30. 30.

Cherstvy, A. G., Thapa, S., Wagner, C. E. & Metzler, R. Non-Gaussian, Non-Ergodic, and Non-Fickian Diffusion of Tracers in Mucin Hydrogels. Soft Matter 15, 2526–2551, ISSN: 1744-6848 (2019).

31. 31.

Muñoz-Gil, G., Garcia-March, M. A., Manzo, C., Martín-Guerrero, J. D. & Lewenstein, M. Single Trajectory Characterization via Machine Learning. New J. Phys. 22, 013010, ISSN: 1367-2630 (2020).

32. 32.

Knight, F. Essentials of Brownian Motion and Diffusion Series Title: Mathematical Surveys and Monographs. ISBN: 978-0-8218-1518-2 (American Mathematical Society, Providence, RI, 1981).

33. 33.

Lançon, P., Batrouni, G., Lobry, L. & Ostrowsky, N. Brownian Walker in a Confined Geometry Leading to a Space-Dependent Diffusion Coefficient. Phys. Stat. Mech. Its Appl. 304, 65–76, ISSN: 03784371 (2002).

34. 34.

Crocker, J. C. Measurement of the Hydrodynamic Corrections to the Brownian Motion of Two Colloidal Spheres. The Journal of Chemical Physics 106, 2837–2840, ISSN: 0021-9606, 1089-7690 (1997).

35. 35.

Franosch, T. et al. Resonances Arising from Hydrodynamic Memory in Brownian Motion - The Colour of Thermal Noise. Nature 478, 8–11, ISSN: 0028-0836 (2011).

36. 36.

Berg-Sørensen, K. & Flyvbjerg, H. The Colour of Thermal Noise in Classical Brownian Motion: A Feasibility Study of Direct Experimental Observation. New J. Phys. 7, 38–38, ISSN: 1367-2630 (2005).

37. 37.

Holcman, D. & Schuss, Z. 100 Years after Smoluchowski: Stochastic Processes in Cell Biology. J. Phys. A: Math. Theor. 50, 093002, ISSN: 1751-8113, 1751-8121 (2017).

38. 38.

Sancho, J. M. Brownian Colloidal Particles: Ito, Stratonovich, or a Different Stochastic Interpretation. Phys. Rev. E. 84, 062102 (2011).

39. 39.

Van Kampen, N. G. Diffusion in Inhomogeneous Media. Journal of Physics and Chemistry of Solids 49, 673–677, ISSN: 0022-3697 (1988).

40. 40.

Jayannavar, A. M. & Mahato, M. C. Macroscopic Equation of Motion in Inhomogeneous Media: A Microscopic Treatment. Pramana - J. Phys. 45, 369–376, ISSN: 0973-7111 (1995).

41. 41.

Lau, A. W. C. & Lubensky, T. C. State-Dependent Diffusion: Thermodynamic Consistency and Its Path Integral Formulation. Phys. Rev. E - Stat. Nonlinear Soft Matter Phys. 76, ISSN: 15393755, pmid: 17677426 (2007).

42. 42.

Kupferman, R., Pavliotis, G. A. & Stuart, A. M. Itô versus Stratonovich White-Noise Limits for Systems with Inertia and Colored Multiplicative Noise. Phys. Rev. E. 70, 036120, ISSN: 1539-3755 (2004).

43. 43.

Van Kampen, N. G. Itô versus Stratonovich. J. Stat. Phys. 24, 175–187, ISSN: 15729613 (1981).

44. 44.

Yang, M. & Ripoll, M. Drift Velocity in Non-Isothermal Inhomogeneous Systems. The Journal of Chemical Physics 136, 204508, ISSN: 0021-9606, 1089-7690 (2012).

45. 45.

Van Kampen, N. G. Stochastic Processes in Physics and Chemistry 3rd (North-Holland Personal Library, Amsterdam, 1992).

46. 46.

Sokolov, I. Ito, Stratonovich, Hänggi and All the Rest: The Thermodynamics of Interpretation. Chem. Phys. 375, 359–363, ISSN: 03010104 (2010).

47. 47.

Farago, O. & Grønbech-Jensen, N. Langevin Dynamics in Inhomogeneous Media: Re-Examining the Itô-Stratonovich Dilemma. Phys. Rev. E 89, 013301, ISSN: 1539-3755 (2014).

48. 48.

Farago, O. & Grønbech-Jensen, N. On the Connection between Dissipative Particle Dynamics and the Itô-Stratonovich Dilemma. J. Chem. Phys. 144, 084102, ISSN: 0021-9606 (2016).

49. 49.

Regev, S., Grønbech-Jensen, N. & Farago, O. Isothermal Langevin Dynamics in Systems with Power-Law Spatially Dependent Friction. Phys. Rev. E 94, 012116, ISSN: 2470-0045 (2016).

50. 50.

Volpe, G., Helden, L., Brettschneider, T., Wehr, J. & Bechinger, C. Influence of Noise on Force Measurements. Phys. Rev. Lett. 104, 170602 (2010).

51. 51.

Brettschneider, T., Volpe, G., Helden, L., Wehr, J. & Bechinger, C. Force Measurement in the Presence of Brownian Noise: Equilibrium-Distribution Method versus Drift Method. Phys. Rev. E. 83, 041113 (2011).

52. 52.

Bachelier, L. Théorie de la spéculation. Ann. Sci. École Norm. Sup. 17, 21–86, ISSN: 0012-9593, 1873-2151 (1900).

53. 53.

Friedrich, R., Peinke, J., Sahimi, M. & Reza Rahimi Tabar, M. Approaching Complexity by Stochastic Methods: From Biological Systems to Turbulence. Physics Reports 506, 87–162, ISSN: 03701573 (2011).

54. 54.

Kleinhans, D. Estimation of Drift and Diffusion Functions from Time Series Data: A Maximum Likelihood Framework. Phys. Rev. E 85, 026705, ISSN: 1539-3755, 1550-2376 (2012).

55. 55.

The Random Character of Stock Market Prices (ed. Cootner, P. H.) 3rd. OCLC: 1067931365 (MIT Pr., Cambridge, Massachusetts, 1970).

56. 56.

Parkinson, M. The Extreme Value Method for Estimating the Variance of the Rate of Return. J. Bus. 53, 61–65 (1980).

57. 57.

Hänggi, P. Stochastic Processes. 1. Asymptotic Behavior and Symmetries. Helvetica Phys. Acta 51 (1978).

58. 58.

Klimontovich, Y. L. Ito, Stratonovich and Kinetic Forms of Stochastic Equations. Physica A: Statistical Mechanics and its Applications 163, 515–532, ISSN: 0378-4371 (1990).

59. 59.

Klimontovich, Y. Nonlinear Brownian motion. Uspekhi Fizicheskikh Nauk 164, 811, ISSN: 0042-1294, 1996-6652 (1994).

60. 60.

Volpe, G. & Wehr, J. Effective Drifts in Dynamical Systems with Multiplicative Noise: A Review of Recent Progress. Rep. Prog. Phys. 79, 53901, ISSN: 00344885 (2016).

61. 61.

Kloeden, P. E. & Platen, E. Numerical Solution of Stochastic Differential Equations Corr. 3rd print. Applications of Mathematics 23, 636 pp. ISBN: 978-3-540-54062-5 (Springer, Berlin; New York, 1999).

62. 62.

Balasubramanian, V. In Advances in Minimum Description Length: Theory and Applications (eds. Grünwald, P. D., Myung, I. J. & Pitt, M. A.) 81–98, ISBN: 978-0-262-07262-5 (The MIT Press, Cambridge, MA, 2005).

63. 63.

Rissanen, J. Stochastic Complexity and Modeling. Ann. Stat. 14, 1080–1100, ISSN: 0090-5364 (1986).

64. 64.

Kass, R. E., Raftery, A. E., Association, S. & Jun, N. Bayes Factors. J. Am. Stat. Assoc. 90, 773-795, ISSN: 0162–1459 (1995).

65. 65.

Gelman, A., Carlin, J. B., Stern, H. S. & Rubin, D. B. Bayesian Data Analysis 2nd (Chapman & Hall/CRC, Boca Raton, FL, 2004).

66. 66.

TRamWAy. TRamWAy Project https://github.com/DecBayComp/TRamWAy (2018).

67. 67.

Machta, J. & Zwanzig, R. Diffusion in a Periodic Lorentz Gas. Phys. Rev. Lett. 50, 1959–1962 (1983).

68. 68.

Holcman, D., Hoze, N. & Schuss, Z. Narrow Escape through a Funnel and Effective Diffusion on a Crowded Membrane. Phys. Rev. E 84, 021906 (2011).

69. 69.

Chakraborty, I. & Roichman, Y. Two Coupled Mechanisms Produce Fickian, yet Non-Gaussian Diffusion in Heterogeneous Media. http://arxiv.org/abs/1909.11364 (2019).

70. 70.

Brenner, H. The Slow Motion of a Sphere through a Viscous Fluid towards a Plane Surface. Chemical Engineering Science 16, 242–251, ISSN: 0009-2509 (1961).

71. 71.

Niehaus, A. M. S., Vlachos, D. G., Edwards, J. S., Plechac, P. & Tribe, R. Microscopic Simulation of Membrane Molecule Diffusion on Corralled Membrane Surfaces. Biophysical Journal 94, 1551–1564, ISSN: 00063495 (2008).

72. 72.

Batchelor, G. K. Brownian Diffusion of Particles with Hydrodynamic Interaction. J. Fluid Mech. 74, 1–29, ISSN: 1469-7645, 0022-1120 (1976).

73. 73.

Duhr, S. & Braun, D. Why Molecules Move along a Temperature Gradient. Proceedings of the National Academy of Sciences 103, 19678–19682, ISSN: 0027-8424, 1091-6490 (2006).

74. 74.

Bringuier, E. & Bourdon, A. Colloid Thermophoresis as a Non-Proportional Response. J. Non-Equilib. Thermodyn. 32, 221–229 (2007).

75. 75.

Metzler, R., Jeon, J.-H., Cherstvy, A. G. & Barkai, E. Anomalous Diffusion Models and Their Properties: Non-Stationarity, Non- Ergodicity, and Ageing at the Centenary of Single Particle Tracking. Phys. Chem. Chem. Phys. 16, 24128–24164, ISSN: 1463-9076, 1463-9084 (2014).

76. 76.

Freed, E. O. HIV-1 Assembly, Release and Maturation. Nat. Rev. Microbiol. 13, 484–496, ISSN: 17401534 (2015).

77. 77.

Floderer, C. et al. Single Molecule Localisation Microscopy Reveals How HIV-1 Gag Proteins Sense Membrane Virus Assembly Sites in Living Host CD4 T Cells. Sci. Rep. 8, 16283, ISSN: 2045-2322 (2018).

78. 78.

Manley, S. et al. High-Density Mapping of Single-Molecule Trajectories with Photoactivated Localization Microscopy. Nat. Methods 5, 155–157, ISSN: 1548-7091 (2008).

79. 79.

Laurent, F. et al. Mapping Spatio-Temporal Dynamics of Single Biomolecules in Living Cells. (submitted) (2019).

80. 80.

Zwanzig, R. Diffusion in a Rough Potential. Proc. Natl. Acad. Sci. 85, 2029-2030, ISSN: 0027-8424 (2006).

81. 81.

Serov, A. S. Bayes Factor Calculations Module for the TRamWAy Project https://github.com/DecBayComp/TRamWAy/tree/master/tramway/inference/bayes_factors (2018).

82. 82.

Serov, A. S. Microscopic Crowding Simulation and Analysis Code https://github.com/Alexander-Serov/simLattice (2019).

## Acknowledgements

We thank Aleksandra Walczak, Vincent Hakim, Bassam Hajj, Mathieu Coppey and Maxime Dahan (†) for helpful discussions. This study was funded by the Institut Pasteur, L’ Agence Nationale de la Recherche (TRamWAy, ANR-17-CE23-0016), the INCEPTION project (PIA/ANR-16-CONV-0005, OG), and the programme d’investissement d’avenir supported by L’ Agence Nationale de la Recherche ANR-19-P3IA-0001. The funding sources had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

## Author information

Authors

### Contributions

A.S., F.L., C.L.V. and J.-B.M. designed the research, performed the simulations, analysed the results and wrote the paper. C.Fl., C. F. and D. M. performed the experiments on VLPs, K.P. and N.W. performed the experiments with optical tweezers.

### Corresponding authors

Correspondence to Alexander S. Serov or Christian L. Vestergaard or Jean-Baptiste Masson.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Serov, A.S., Laurent, F., Floderer, C. et al. Statistical Tests for Force Inference in Heterogeneous Environments. Sci Rep 10, 3783 (2020). https://doi.org/10.1038/s41598-020-60220-1

• Accepted:

• Published: