A digital twin approach for experimental acoustic hologram optimization

Fushimi, Tatsuki; Tagami, Daichi; Yamamoto, Kenta; Ochiai, Yoichi

doi:10.1038/s44172-024-00160-0

Download PDF

Article
Open access
Published: 11 January 2024

A digital twin approach for experimental acoustic hologram optimization

Tatsuki Fushimi ORCID: orcid.org/0000-0003-3944-0014^1,2,
Daichi Tagami³,
Kenta Yamamoto³ &
…
Yoichi Ochiai^1,2,4

Communications Engineering volume 3, Article number: 12 (2024) Cite this article

1728 Accesses
1 Citations
20 Altmetric
Metrics details

Subjects

Abstract

The need for the accurate generation of acoustic holograms has increased with the prevalence of the use of acoustophoresis methods such as ultrasonic haptic sensation, acoustic levitation, and displays. However, experimental results have shown that the actual acoustic field may differ from the simulated field owing to uncertainties in the transducer position, power and phase, or from nonlinearity and inhomogeneity in the field. Traditional methods for experimentally optimizing acoustic holograms require prior calibration and do not scale with the number of variables. Here, we propose a digital twin approach that combines feedback from experimental measurements (such as a microphone and an optical camera) in the physical setup with numerically obtained derivatives of the loss function, using automatic differentiation, to optimize the loss function. This approach is number of transducers times faster and more efficient than the classical finite difference approach, making it beneficial for various applications such as acoustophoretic volumetric displays, ultrasonic haptic sensations, and focused ultrasound therapy.

Acoustic hologram optimisation using automatic differentiation

Article Open access 16 June 2021

Generating Bessel beams with broad depth-of-field by using phase-only acoustic holograms

Article Open access 27 December 2019

Improving ultrasound images with elevational angular compounding based on acoustic refraction

Article Open access 23 October 2020

Introduction

The acoustic hologram is a two-dimensional encoding of a three-dimensional acoustic field and encodes the complex wavefront via amplitude and phase specification at each point in the field. Recent advances in mid-air ultrasonics, such as ultrasonic haptic sensation^1,2, acoustic levitation^3,4,5, and acoustic streamings^6,7, along with display technologies^8,9,10,11, have heightened the demand for precise acoustic holograms capable of generating multiple foci from a single device. A number of acoustic hologram optimization techniques have been proposed, including Gerchberg-Saxton^12,13,14, Eigensolver and Tikhonov-regularization², machine learning methods^15,16, direct solvers¹⁷, and greedy-type solvers¹⁸. In 2021, we demonstrated an automatic differentiation approach to acoustic hologram optimization^19,20 and exhibited good accuracy with the application of automatic differentiation and the Adam optimizer in simulation.

While these numerical approaches use simulated values to optimize acoustic fields, a number of experimental results suggest that the acoustic field in reality is offset from the numerically simulated field^3,8,21,22,23. These offsets could emerge from simple uncertainties in the transducer position, power, and phase, or could emerge from non-linearity, inhomogeneity, or the existence of other scatterers in the field. Recent advances in computational modeling have begun to enable the inclusion of complex nonlinear fields produced by acoustic holograms²⁴, or complex fields with scatterers in the field^9,25. However, it is still computationally expensive and cumbersome to include nonlinearity, and experimental deviations are susceptible to minor changes in the environment. This renders the attainment of experimentally accurate gradients; a challenging endeavor.

Some attempts have been made to experimentally optimize acoustic holograms^8,23,26. For example, the offset of equilibrium points could be addressed by calibrating the focal points with the equilibrium position of the levitated particles²⁶, or by the combination of a gradient descent algorithm and experimentally obtained finite differences⁸. While these optimizers are effective in achieving their targets, they require prior calibration, or experimental finite differences that do not scale well with the number of variables. These experimental deviations are known to cause performance degradation in the practical applications of acoustic holograms^8,22,24, and there is an increasing need for better and more efficient approaches to optimize acoustic holograms in experiments. This will ultimately help to improve the haptic quality in ultrasonic tactile displays, improve the graphic generation capabilities of acoustophoretic volumetric displays, and improve the positioning accuracy in the potential application of acoustic levitation. Where the current system needs to accept the experimental deviation or collect a significant amount of data to collect the deviation; this method has the potential for application in practice.

Herein, we propose a digital twin approach for optimizing the acoustic holograms, as shown in Fig. 1, with the aim of minimizing the difference between the target and experimental states with a minimal number of measurements per step. A digital twin serves as a comprehensive virtual model designed to accurately emulate a corresponding physical entity. Recently, this concept has been extended to the field of acoustics as a tool to enhance structural health monitoring in engineering systems^27,28,29. Experimental measurements in situ (physical setup) can be fed back into the loss function of the optimizer, and the digital model of the experimental setup is then used to obtain the gradient of the loss function with respect to each variable using automatic differentiation. Because the gradient of the loss function is approximated numerically from the digital model, there is no need for the experimental finite difference algorithm. Thus, the optimizer will complete its optimization at least the “number of variables” times faster than the classical finite difference approach. Digital twin optimization is highly beneficial in phased array transducers (PAT) applications because the number of transducers is in the magnitude of 10²–10³ (i.e. up to 10³ times faster). Considering the fact that the optimization is performed iteratively, this causes a significant enhancement of the performance and efficiency. Such methods that are used to connect experimental to digital models have been proven to be effective in machine learning³⁰ and optics³¹; the application of this method in the acoustic hologram could be beneficial in the practical application of acoustics. In addition, we present insights into digital twin optimization such as “experimental optimization of iteratively calculated variables” and “design principles of loss functions in the experimental optimization with digital twin”, which are highly relevant in machine learning and the optics field as well.

**Fig. 1: In-situ optimization with digital twin.**

The core of the optimization algorithm is the Diff-PAT; an acoustic hologram optimization method based on automatic differentiation demonstrated by Fushimi et al.¹⁹. The initial guess of the acoustic hologram is updated iteratively using the Adam optimizer based on the differentiated loss value with respect to each phase of the transducers. Given the loss function, L_t(θ_t), the Adam optimizer iteratively updates the initial guess of the variables (θ_t) by

$${\theta }_{t}={\theta }_{t-1}-\alpha \cdot \frac{{\hat{m}}_{t}}{\sqrt{{\hat{v}}_{t}}+\epsilon }$$

(1)

where θ is the optimization variable, and subscript t is the step number. ${\hat{v}}_{t}=\frac{{v}_{t}}{1-{\beta }_{2}^{t}}$, ${\hat{m}}_{t}=\frac{{m}_{t}}{1-{\beta }_{1}^{t}}$, ${v}_{t}={\beta }_{2}\cdot {v}_{t-1}+(1-{\beta }_{2})\cdot {g}_{t}^{2}$, m_t = β₁ ⋅ m_t−1+(1−β₁) ⋅ g_t, and ${g}_{t}=\frac{\delta {L}_{t}({\theta }_{t-1})}{\delta \theta }$. Here, α is the step size/learning rate, β₁ = 0.9, β₂ = 0.999, and ϵ = 1 × 10⁻⁷ are exponential decay rates for the moment estimates. We adjusted the learning rates depending on each application, and the used value was specified within each case.

As shown above, the Adam optimizer only uses the derivative of the loss function (L_t(θ_t−1)) to update the parameters. Naturally, the question arises on “how can it know which way to descend to in the experiment when it only uses a gradient that is numerically obtained?” In a nutshell, we design the loss function such that “the experimentally obtained states carry over to the derivative of the loss function”, and when it does, “the gradient near the target state is steep”. Thus, when the loss function is properly designed; the Adam optimizer determines the minima at the target state in the experiments.

For example, a loss function could be specified as ${L}_{t}({\theta }_{t-1})={(T-{f_{{\rm {exp}}}}({\theta }_{t-1}))}^{2}$, where T is the target value (i.e. target acoustic pressure, phase, or equilibrium position), and subscript “exp” means experimentally obtained. We define the gradient to be given by $\frac{\delta {L}_{t}}{\delta {\theta }_{t}}=-2(T-{f_{{\rm {exp}}}}({\theta }_{t-1}))\frac{\delta {f}_{{\rm {num}}}({\theta }_{t-1})}{\delta \theta }$ with subscript “num” denoting a numerically obtained value. Thus, the experimental optimum point naturally becomes the destination of the Adam optimizer. The optimization target can be anything that can be physically measured and predicted in in-situ optimization. In this study, we demonstrate the optimization of the (i) acoustic pressure and (ii) equilibrium position to demonstrate the relevance of digital twin optimization in PAT and acoustic holograms in general. Finally, we discuss the design of the loss function in the discussion section.

First, we describe digital twin optimization for the acoustic pressure field. We use PAT as described in the “Methods” section, and it takes phase-only acoustic holograms (operates in phase-only A mode hologram). As in Fushimi et al.²⁰, we begin by defining a suitable loss function for the phase-only (A:i), amplitude-only (A:ii), and phase and amplitude (A:iii) optimization:

$$L({\phi }_{t})=\left[{({A}_{{\rm {c}}}\cos {\phi }_{{\rm {c}}}-{A}_{{\rm {p}}}^{{\rm {exp}}}\cos ({\phi }_{{\rm {p}}}^{{\rm {exp}}}))}^{2}+{({A}_{{\rm {c}}}\sin {\phi }_{{\rm {c}}}-{A}_{{\rm {p}}}^{{\rm {exp}}}\sin ({\phi }_{{\rm {p}}}^{{\rm {exp}}}))}^{2}\right],$$

(2)

where A_c and ϕ_c are the target pressure amplitude and phase, and ${A}_{{\rm {p}}}^{{\rm {exp}}}(x,{x}_{t},{\phi }_{t})=| {p}_{{\rm {in}}}(x,{x}_{t},{\phi }_{t})| +G({p}_{{\rm {exp}}}-| {p}_{{\rm {in}}}(x,{x}_{t},{\phi }_{t})| )$ and ${\phi }_{{\rm {p}}}^{{\rm {exp}}}=\arg ({p}_{{\rm {in}}}(x,{x}_{t},{\phi }_{t}))+G({\phi }_{{\rm {exp}}}-\arg ({p}_{{\rm {in}}}(x,{x}_{t},{\phi }_{t})))$ are the substituted pressure amplitude and phase, respectively. The substituted pressure amplitude allows the automatic differentiation package (in this case TensorFlow) to track the gradient of the function, whereas the inside of function G() is untracked by the package (achieved by e.g. tf.stop_gradient() in TensorFlow). We note that this process does not act as mathematical operators, and only serves a functional purpose to introduce foreign variables to the computational graph of TensorFlow. As in Fushimi et al.²⁰, A_c = 1 was set for A:i and ϕ_c = 0 was set for A:ii.

The experimentally obtained values (such as p_exp and ϕ_exp) are obtained using a calibrated pressure microphone (B&K Type 4138-A-015, pressure sensitivity ${p}_{{\rm {sens}}}^{{\rm {mic}}}=1.0\,{{{{{{{\rm{mV}}}}}}}}\,{{{{{{{{\rm{Pa}}}}}}}}}^{-1}$), as detailed in the “Methods” section. Twenty target phases and amplitudes were set with a constant focal point, x = (0, 0, 0.04) m. The phase linearly increased from 0 to 2π, and the amplitude increased linearly from 10% to 90% of p_max. p_max was set as the pressure amplitude with a single focus point in the numerical simulation. The maximum iteration number was set to 100. The experimental measurements and optimizations were repeated three times (S = 3) to obtain the mean and standard deviation of the pressure and amplitude. The statistical analysis was performed on Matlab R2022a. The mean phase was obtained by ${\phi }_{{\rm {mean}}}=\arctan \left(\frac{\mathop{\sum }\nolimits_{s}^{S}\sin ({\phi }_{{\rm {meas}}}^{s})}{\mathop{\sum }\nolimits_{s}^{S}\cos ({\phi }_{{\rm {meas}}}^{s})}\right)$ where ϕ_meas is the measured phase.

We can also apply the in-situ digital optimization for the equilibrium position of the levitated particle. While the acoustic pressure field can be calculated easily using Huygens’ approach, the determination of the equilibrium position requires the balancing of the acoustic radiation force and gravity. This calculation requires a root-finding algorithm that iteratively updates its guess. Here, we use the single-axis acoustic levitator (2 SonicSurface arrays separated by a distance of 0.215 m). The target equilibrium shape was set as a circle with radius r = 3 mm, where nth focal point is calculated by ${x}_{f}^{t,n}=(0,r\sin (\pi +\frac{2\pi n}{29}),0.0119+r+r\cos (\pi +\frac{2\pi n}{29}))$, with n ∈ {0, 1, …, 28}. While it is possible to optimize the equilibrium position from the acoustic hologram as in pressure field optimization, the levitation conditions are not necessarily guaranteed for all possible phase combinations. Thus, the optimization variable was constrained to known stable solutions using a single focal point and twin trap⁴:

$${\phi }_{t}={\phi }_{{\rm {focal}}}+{\phi }_{{\rm {twin}}},$$

(3)

where ${\phi }_{{\rm {focal}}}=-\frac{2\pi {f}_{0}}{{c}_{0}}[{\rm {d}}({{{{{{{{\boldsymbol{x}}}}}}}}}_{{{{{{{{\boldsymbol{f}}}}}}}}}^{{{{{{{{\boldsymbol{t}}}}}}}},{{{{{{{\boldsymbol{n}}}}}}}}},{{{{{{{{\boldsymbol{x}}}}}}}}}_{{{{{{{{\boldsymbol{t}}}}}}}}})-{\rm {d}}({x}_{o},{{{{{{{{\boldsymbol{x}}}}}}}}}_{{{{{{{{\boldsymbol{f}}}}}}}}}^{{{{{{{{\boldsymbol{t}}}}}}}},{{{{{{{\boldsymbol{n}}}}}}}}})]$, and ϕ_twin are 0 and π for the bottom and upper arrays, respectively. x₀ is the origin (0,0,0) of the array which is the centre of the 16 by 16 PAT, and on the surface level of the transducers. The single focus is calculated such that the acoustic signal from each transducer meets at the focal point simultaneously. As in a previous study⁸, the target shape was specified using the focal points, and the equilibrium positions were recorded (waited 2.5 s after sending the commands to PAT for the steady state) using optical methods as described in the “Methods” section.

For a spherical particle in the Rayleigh regime (r << λ), the acoustic radiation force was calculated using Gor’kov^32,33:

$$\left[\begin{array}{c}{F}_{x}\\ {F}_{y}\\ {F}_{z}\end{array}\right]=-\frac{4\pi }{3}{a}^{3}\bigtriangledown \left[\frac{1}{2}{{{{{{{\rm{Re}}}}}}}}[\,{f}_{1}]{\kappa }_{0}\langle {p}_{{\rm {in}}}{(x,{\phi }_{t})}^{2}\rangle -\frac{3}{4}{{{{{{{\rm{Re}}}}}}}}[\,{f}_{2}]{\rho }_{0}\langle {v}_{{\rm {in}}}{(x,{\phi }_{t})}^{2}\rangle \right]$$

(4)

where ${f}_{1}=1-\tilde{\kappa }$ and ${f}_{2}=\frac{2(\widetilde{\rho }-1)}{2\widetilde{\rho }}$. $\tilde{\kappa }=\frac{{\kappa }_{p}}{{\kappa }_{0}}$ and $\widetilde{\rho }=\frac{{\rho }_{{\rm {p}}}}{{\rho }_{0}}$. Subscript 0 and p represent the surrounding media and particle property, and $\kappa =\frac{1}{\rho {c}^{2}}$ where ρ and a are the density and radius of the sphere, respectively. ${v}_{{\rm {in}}}(x,{\phi }_{t})=\Big| \bigtriangledown \left(\frac{{p}_{{\rm {in}}}}{{\rho }_{0}\omega i}\right)\Big|$ is the acoustic velocity field. Moreover, the particle properties were arbitrarily set as a = 0.7 mm, ρ_p = 40 kg m⁻³, and c_p = 900 m s⁻¹.

To determine the numerical equilibrium position, a root-finding algorithm (Newton gradient descent) was used²⁶:

$${{{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\bf{e}}}}}}}}}}^{n+1}={{{{{{{{{\bf{x}}}}}}}}}_{{{{{{{{\bf{e}}}}}}}}}}^{n}-{J}^{-1}\left[{F}_{x},{F}_{y},{F}_{z}^{g}\right]$$

(5)

where $J=\left[\begin{array}{c}\frac{\delta {F}_{x}}{\delta x}\frac{\delta {F}_{x}}{\delta y}\frac{\delta {F}_{x}}{\delta z}\\ \frac{\delta {F}_{y}}{\delta x}\frac{\delta {F}_{y}}{\delta y}\frac{\delta {F}_{y}}{\delta z}\\ \frac{\delta {F}_{z}^{{\rm {g}}}}{\delta x}\frac{\delta {F}_{z}^{{\rm {g}}}}{\delta y}\frac{\delta {F}_{z}^{{\rm {g}}}}{\delta z}\end{array}\right]$, ${F}_{z}^{{\rm {g}}}={F}_{z}-mg$, x_e is the Jacobian matrix, total z force, and equilibrium position, respectively. The root-finding algorithm was executed until the delta between the current and the previous step was below 0.1 mm.

At this point, the numerical model can be integrated into the digital twin; however, this model is computationally very expensive. Inspecting the calculated equilibrium position reveals that the mapping from the focal to equilibrium point is simple²⁶, and as simple as two sets of polynomial functions. To fit the polynomial functions to the equilibrium position, the equilibrium positions in the region of interest (ROI); −λ ≤ y ≤ λ and −λ + r_c ≤ z ≤ λ + r_c were calculated with a step size of $\frac{\lambda }{5}$. r_c = 0.119 is the vertical offset of the numerical simulation to set the ROI. Then, the polynomial function was fitted to the data set using the Matlab curve fitting toolbox (version 3.7); ${x}_{e}^{y}={a}_{0}+{a}_{1}{x}_{f}^{y}+{a}_{2}{x}_{f}^{z}$ where a₀,a₁ and a₂ are 7.367 × 10⁻¹², 0.9981, and −6.962 × 10⁻¹⁰, respectively. ${x}_{e}^{z}={b}_{0}+{b}_{1}{x}_{f}^{y}+{b}_{2}{x}_{f}^{z}+{b}_{3}{({x}_{e}^{y})}^{2}+{b}_{4}{x}_{e}^{y}{x}_{e}^{z}$ where b₀, b₁, b₂, b₃, b₄ are −1.524 × 10⁻⁴, −2.934 × 10⁻⁸, −1.000, −1.143 × 10⁻², and −2.343 × 10⁻⁷. The r-squared goodness of fit was 0.999 for both cases.

This significantly simplifies the numerical model, guarantees stability within the ROI, and reduces the number of optimization variables. Similarly to the pressure field optimization, the loss function was set as $L=\sqrt{{({y}_{{\rm {c}}}-{y}_{{\rm {p}}}^{{\rm {exp}}})}^{2}+{({z}_{{\rm {c}}}-{z}_{{\rm {p}}}^{{\rm {exp}}})}^{2}}$ where y_c and z_c are the target positions in the y and z axes. ${y}_{{\rm {p}}}^{{\rm {exp}}}={x}_{e}^{y}({x}_{f}^{y},{x}_{f}^{z})+G({y}^{{\rm {exp}}}-{x}_{e}^{y}({x}_{f}^{y},{x}_{f}^{z}))$ and ${z}_{p}^{{\rm {exp}}}={x}_{e}^{z}({x}_{f}^{y},{x}_{f}^{z})+G({z}^{{\rm {exp}}}-{x}_{e}^{z}({x}_{f}^{y},{x}_{f}^{z}))$ are the substituted experimental equilibrium points. The maximum iteration number was set to 25.

Results and discussion

Pressure field optimization

The results for the A:i, A:ii, and A:iii optimizations are as shown in Fig. 2a, b, and c, d, respectively. As shown in Fig. 2a–d, the phased array perfectly achieves target-optimized states in optimal conditions, i.e. numerical simulation (red crosses). The performance of the optimizer in the numerical simulation is evaluated by the mean square of error (MSE) for both phases and amplitude. The MSE phases are 3.16 × 10⁻¹³ and 3.17 × 10⁻¹³ for A:i and A:iii, respectively. The amplitude error is also low with 1.79 × 10⁻⁶ and 2.50 × 10⁻⁶ for A:ii and A:iii, respectively.

**Fig. 2: Comparison of pressure field with various test conditions.**

The numerically optimum solution works well for target phase optimization in experiments, and the experimental value closely achieves the target as shown in Fig. 2a and c. The experimental phase error, measured in MSE are 5.43 × 10⁻³, and 4.81 × 10⁻³ for A:i and A:iii, respectively. The employment of the experimental optimization reduces the error to 3.23 × 10⁻⁴ and 3.08 × 10⁻³, respectively, for A:i and A:iii. However, because the numerical optima performs well in the first place, the improvement is minor and the error reduction rate (measured by the ratio between experimental and optimized MSE) is only approximately 16.8 and 1.56 for A:i and A:ii, respectively.

However, the numerically obtained optimum solution does not apply well for amplitudes in the experimental condition as shown in Fig. 2b and d. The pressure amplitude error measured in MSE is 1.41 × 10⁵ and 1.46 × 10⁵ for A:ii and A:iii, respectively, in the experiment. By the employment of experimental optimization, the pressure error reduces to 89.2 and 423 for A:ii and A:iii, respectively. The A:iii optimizer consistently performs worse than the counterparts such as A:i or A:ii, and this is attributed to the fact that the loss function is more complex than optimizing for either parameter. The error reduction rate is 1580 and 345 times for A:ii and A:iii, respectively. The results are summarized in Table 1.

Table 1 Summary of phase and amplitude performance measured in mean squared error (MSE).

Full size table

One of the potential causes of the experimental deviation is the nonlinearity of the field. Nonlinearity of the field is not only an issue in mid-air ultrasound but also from measurements to the practical application of audible acoustics in mid-air^34,35. The generation of higher harmonics has been discussed as a potential issue by Andrade et al.²¹ and it has also been reported to cause issues in underwater acoustics²⁴. Figure 3 shows the measured nonlinearity from the non-optimized field, and Fig. 3a shows that the second harmonics generation (F2) grows as the target amplitude increases. However, the total harmonic distortion ($\frac{\sqrt{({F}_{2}^{2})}}{{F}_{1}}$) decreases with the increased target amplitude. Thus, while the nonlinear effects are present, it does not fully explain the experimental deviation.

**Fig. 3: Investigation of nonlinear effects in pressure field optimization.**

Despite the nonlinearity and unknown cause of the experimental deviation, the digital twin optimizer still determines the acoustic holograms for the desired outcome. The in-situ optimization of the pressure amplitude is directly applicable in HCI applications (ultrasonic haptic sensation, displays, acoustic streaming), medical applications where the nonlinear and complex media is present in the propagation media, or additive manufacturing where scattering conditions are constantly changing. In-situ optimization can easily be scaled to implement multi-point optimization, and the benefit of in-situ optimization is enhanced with the number of optimization variables and targets. In such cases, the experimental measurements may still become the bottleneck in the optimization process, and efficient measurement methods based on optics (for e.g. schileren³⁶ or the laser Doppler vibrometer³⁷) may be better suited for faster optimization.

Equilibrium point optimization

The results are as shown in Fig. 4, and the unoptimized focal point does not approximate the equilibrium position (root mean square error (RMS) error of 0.442 and 0.154 mm); the optimization process is required. Digital twin optimization was performed with the Adam optimizer (learning rate = 5 × 10⁻⁴), with the initial solution set as the target focal point (q = 0). For subsequent optimization (q ≤ 1), the initial guess was set to the optimized focal points from the last target. The optimizer was iterated for 25 steps, and the results are as shown in Fig. 4a. After the experimental optimization, the RMS error dropped to 0.105 and 0.057 mm for the y and z axes, respectively, significantly improving the positioning accuracy of the acoustic levitator, as shown in Fig. 4b.

**Fig. 4: Results of equilibrium position optimization.**

Where previous methods²⁶ required a calibration map (which typically requires many hours to measure), this in-situ optimizer achieves equivalent performance (RMS of 0.11 and 0.030 mm for horizontal and vertical axes, respectively, for methods with prior calibration²⁶) without the calibration map. Thus, this could be used to improve the image quality in acoustophoretic volumetric displays or improve specimen positioning accuracy for diagnostics/analytic purposes^38,39,40.

Designing the loss function

In-situ optimization has been demonstrated previously in machine learning and optical systems; however, the design of the loss function itself has not been discussed in depth. The design of the loss function is the most critical in achieving a successful experimental optimization with the digital twin. This is because we do not identify any derivatives via experiments. The loss function needs to be designed such that the experimental values are passed on to the optimizer; otherwise, the optimizer will only find the numerical optima. This could be a potential pitfall for the future of experimental optimization with the digital twin, and we will present an example where such a design may be critical.

We set a toy maximization problem where f_ideal(x) = −x², and the experimental deviated function, f_exp(x) = −(x−10)² (see the section “Data availability” for the codes). A simple loss function to obtain the maxima of the function is; L₁(x) = −f_ideal(x), L₂ = −f_exp(x). Trivially, the solution is x = 0, and x = 10 for the ideal and experimental case, respectively. We then convert the loss function to include the experimental data and preserve the automatic differentiated value; L₃ = −(f_ideal(x) + G(f_exp(x) −f_ideal(x))). We solved the L₃ with the Adam optimizer (learning rate = 0.05, iteration number of 3000), and obtained the mean optimal points by repeating the optimization 100 times with random initial values between -2.5 and 22.5 (i.e. center at 10).

When such optimization is performed, the function converges only to the numerical optima (x = 0). This is because the differentiated loss function does not carry any information regarding the experimental states, and cannot descend to the experimental maxima. Thus, for a function to be maximized, the loss function needs to have a steep gradient near the maxima, and still have a term f(x) when differentiated.

One such function is $\frac{\delta {L}_{4}}{\delta x}=\frac{{f}^{{\prime} }(x)}{f(x)}$, or ${L}_{4}(x)=\log f(x) = \log \left(\right.{f}_{{{{{{\rm{ideal}}}}}}}(x)+G({f}_{{{{{\rm{{exp}}}}}}}(x)-{f}_{{{{{{\rm{ideal}}}}}}}(x))$. This is not a perfect maximization function, because (1) the value does not reach the optima when it starts from the left-hand side of optima, (i.e. x ≤ 10 the returned solution has a mean of x = 0.300, with a s.t.d. of 1.22, number of instances 54/100) and (2) the solution could be NaN out depending on the initial value; however, the solution improves to a mean of 8.32 (std: 0.491, number of instances 41/100), given a good initial guess (x ≥ 10). Further study is required to identify more suitable loss functions, but this knowledge should be helpful in the future applications of in-situ optimization.

Measurements methods

The acoustic pressure field can be measured via a wide range of methods, from the classical use of a pressure microphone, optical methods (LDV³⁷, Schlieren³⁶, PIV⁴¹), to thermal methods^12,42. If calibration is properly performed, experimentally determined values from these methods can be directly substituted into the digital twin workflow. Thus, non-contact and remote measuring methods could be employed in the future to minimize the effect of scattering from the microphones. However, if not calibrated, it is more challenging to implement it back into the digital twin workflow, and its functions may be limited to ‘minimize’ or ‘maximize’ the pressure field with respect to the normalized values.

Conclusion

In conclusion, we presented an in-situ optimization method of the acoustic hologram with a digital twin. This optimizer obtained the experimental measurements and optimized the hologram using experimental measurements and numerical gradients. We demonstrated two approaches for the measurements (microphone, and camera), and two approaches for the modeling (i.e. direct numerical model, and polynomial approximation). Both methods were successful in improving the performance of the hologram, and up to 1580 times improvements were recorded in terms of pressure amplitude optimization. Furthermore, the experimental optimization was performed by measuring once per step, which significantly reduced the number of measurements needed in the experimental setup. This optimization method, along with the design philosophy for the loss function will be directly helpful in improving the performance of the practical application of PAT such as acoustophoretic volumetric displays, ultrasonic tactile displays, and mid-air acoustic levitation.

Methods

Pressure calculation

The complex pressure at a specific point (x) generated by PAT is calculated by

$${p}_{{\rm {in}}}(x,{x}_{t},{\phi }_{t})=\mathop{\sum }\limits_{t=1}^{T}\frac{{P}_{0}}{d({{{{{{{\boldsymbol{x}}}}}}}},{{{{{{{{\boldsymbol{x}}}}}}}}}_{{{{{{{{\boldsymbol{t}}}}}}}}})}D(\eta ){{\rm {e}}}^{j(kd({{{{{{{\boldsymbol{x}}}}}}}},{{{{{{{{\boldsymbol{x}}}}}}}}}_{{{{{{{{\boldsymbol{t}}}}}}}}})+{\phi }_{t})},$$

(6)

where P₀ is the transducer power at 1 m, d(x, x_t) is the Euclidean distance between the transducer position (x_t) and the specified position (x). $D(\eta )=\frac{2{J}_{1}(kr\sin \eta )}{kr\sin \eta }$ is the directivity function for a piston source. $k=\frac{2\pi {f}_{0}}{{c}_{0}}$ is the wavenumber, with f = 40 kHz and c₀ = 341 m s⁻¹.

We employed a phased array made of 256 transducers of a 1 cm diameter, operating at 40 kHz (Manorshi, MSO-P1040H07T, P₀ = P_vV_a, where P_v = 0.31 Pa V⁻¹ at 1 m, and V_a = 5 V is the actuated voltage) and we referred to SonicSurface for details regarding the signal generation for each transducer)⁴³. The transducers are arranged in a 16 × 16 square flat grid. A field-programmable gate array (FPGA) (EP4CE6E22C8N—ALTERA IV Core Board, Waveshare) generates the control signals multiplexed into 8 channels per output pin; shift registers (74HC595, TI) demultiplex the pin signal into 8 channels, and the channels get amplified by drivers (MIC4127 from MT) up to a 20 peak-to-peak voltage. The signals to be generated are sent by a computer to the FPGA using UART at 230,400 bps, enabling the update of the emission phases at 190 times per second. The phase resolution was 32 divisions per period. The transducer power coefficient was measured by taking the average of 10 transducers, and the microphone was oriented such that the microphone pointed towards the PAT.

Experimental pressure field measurements

The calibrated microphone was connected to the conditioning amplifier (B&K Type 2690), and the output voltage was recorded using the USB oscilloscope (TiePie Handyscope HS5). The captured data were converted from voltage to pressure amplitude based on the calibration data, and the FFT was obtained to determine the amplitude and phase at the fundamental frequency (40 kHz). The reference for the phase was set as the clock signal from the FPGA board. The microphone was attached to the XYZ stage (Controller: OptoSigma SHOT-304GS, Stages: OptoSigma OSMS20-85, OSMS26-100, OSMS26-100) to accurately control the position of the microphone, as shown in the Supplementary Fig. S1. The stage commands were sent via Serial communication (baud rate = 9600) using the pyOptoSigma package (https://github.com/ken1row/PyOptoSigma). The optimization scheme was implemented in Python (ver 3.10.7) and codes to fully recreate the setup were made available as shown in the “Data availability” section (TensorFlow version 2.10.0). The Adam optimizer was used, the learning rate was 0.05, and the optimizer was iterated 100 times. Experimental measurements and optimizations were repeated three times to obtain the average performance, its standard deviation and mean squared error.

Experimental measurement of equilibrium position

The experiment was conducted on top of an optical table (Thorlabs B90120A, SDP90120), and the equilibrium position was captured by a USB-C high-speed camera (Photron INFINICAM UC-1) with a Nikon F-to-C Mount Adaptor (Kenko Tokina) and a single-focus lens (Tamron SP AF180mm F/3.5Di), as shown in Supplementary Fig. S2. We used a parallel light source (LED Tempo, IPS-FPP150150-IF15) for background illumination, which provides uniform light intensity across its entire area. The camera then captured the particle’s silhouette to accurately determine its position. A CMM-stylus (RENISHAW A-5000-7557) was attached to the aforementioned XYZ stages to obtain both the pixel-to-mm conversion rate (1.408 × 10⁻⁵ mm pix⁻¹) and datum point. The camera was operated through Python SDK (pypuclib (https://github.com/infinicam/pypuclib)), and the equilibrium position and camera calibration was identified using the hough circle transform on OpenCV (version 4.6.0).

Utilization of generative AIs in manuscript

The authors employed OpenAI’s ChatGPT (GPT-4) for the generation of certain sections of the manuscript and abstracts. Subsequently, we meticulously reviewed and verified the output to ensure its accuracy and relevance to the subject matter.

Data availability

The data that support the findings of this study (experimental data) are openly available in Zenodo (https://doi.org/10.5281/zenodo.10065462) and in supplementary information.

Code availability

All codes used in this study (experimental codes, numerical optimizer, experimental data) are openly available in Github (https://github.com/DigitalNatureGroup/insitu_optimization) and in Zenodo (https://doi.org/10.5281/zenodo.10065462). Please refer to “README.md” file in both repositories for the details regarding the package prerequisite.

References

Hoshi, T., Takahashi, M., Iwamoto, T. & Shinoda, H. Noncontact tactile display based on radiation pressure of airborne ultrasound. IEEE Trans. Haptics 3, 155–165 (2010).
Article Google Scholar
Long, B., Seah, S. A., Carter, T. & Subramanian, S. Rendering volumetric haptic shapes in mid-air using ultrasound. ACM Trans. Graph. 33, 1–10 (2014).
Article Google Scholar
Seah, S. A., Drinkwater, B. W., Carter, T., Malkin, R. & Subramanian, S. Dexterous ultrasonic levitation of millimeter-sized objects in air. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 61, 1233–1236 (2014).
Article Google Scholar
Marzo, A. et al. Holographic acoustic elements for manipulation of levitated objects. Nat. Commun. 6, 8661 (2015).
Article Google Scholar
Ochiai, Y., Hoshi, T. & Rekimoto, J. Three-dimensional mid-air acoustic manipulation by ultrasonic phased arrays. PLoS ONE 9, 97590 (2014).
Article Google Scholar
Hasegawa, K., Qiu, L., Noda, A., Inoue, S. & Shinoda, H. Electronically steerable ultrasound-driven long narrow air stream. Appl. Phys. Lett. 111, 064104 (2017).
Article Google Scholar
Norasikin, M. A., Plasencia, D. M., Memoli, G. & Subramanian, S. Sonicspray: a technique to reconfigure permeable mid-air displays. In ISS 2019—Proc. 2019 ACM International Conference on Interactive Surfaces and Spaces 113–122 (Association for Computing Machinery, 2019).
Fushimi, T., Marzo, A., Drinkwater, B. W. & Hill, T. L. Acoustophoretic volumetric displays using a fast-moving levitated particle. Appl. Phys. Lett. 115, 64101 (2019).
Article Google Scholar
Hirayama, R., Christopoulos, G., Plasencia, D. M. & Subramanian, S. High-speed acoustic holography with arbitrary scattering objects. Sci. Adv. 8, 7614 (2022).
Article Google Scholar
Ochiai, Y., Hoshi, T. & Rekimoto, J. Pixie dust: graphics generated by levitated and animated objects in computational acoustic-potential field. ACM Trans. Graph. 33, 85 (2014).
Article Google Scholar
Morales, R., Marzo, A., Subramanian, S. & Martínez, D. Leviprops: animating levitated optimized fabric structures using holographic acoustic tweezers. In Proc. 32nd Annual ACM Symposium on User Interface Software and Technology 651–661 (Association for Computing Machinery, 2019).
Melde, K., Mark, A. G., Qiu, T. & Fischer, P. Holograms for acoustics. Nature 537, 518–522 (2016).
Article Google Scholar
Marzo, A. & Drinkwater, B. W. Holographic acoustic tweezers. Proc. Natl Acad. Sci. USA 116, 84–89 (2018).
Article Google Scholar
Plasencia, D. M., Hirayama, R., Montano-Murillo, R. & Subramanian, S. Gs-pat: high-speed multi-point sound-fields for phased arrays of transducers. ACM Trans. Graph. 39, 1–12 (2020).
Article Google Scholar
Lin, Q. et al. A deep learning approach for the fast generation of acoustic holograms. J. Acoust. Soc. Am. 149, 2312–2322 (2021).
Article Google Scholar
Lee, M. H., Lew, H. M., Youn, S., Kim, T. & Hwang, J. Y. Deep learning-based framework for fast and accurate acoustic hologram generation. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 69, 3353–3366 (2022).
Article Google Scholar
Li, J., Lv, Z., Hou, Z. & Pei, Y. Comparison of balanced direct search and iterative angular spectrum approaches for designing acoustic holography structure comparison of balanced direct search and iterative angular spectrum approaches for designing acoustic holography structure. Appl. Acoust. 175, 107848 (2020).
Article Google Scholar
Suzuki, S., Fujiwara, M., Makino, Y. & Shinoda, H. Radiation pressure field reconstruction for ultrasound midair haptics by greedy algorithm with brute-force search. IEEE Trans. Haptics 14, 914–921 (2021).
Article Google Scholar
Fushimi, T., Yamamoto, K. & Ochiai, Y. Acoustic hologram optimisation using automatic differentiation. Sci. Rep. 11, 12678 (2021).
Article Google Scholar
Fushimi, T., Yamamoto, K. & Ochiai, Y. Target acoustic field and transducer state optimization using diff-pat. AIP Adv. 11, 125007 (2021).
Article Google Scholar
Andrade, M. A. B., Ramos, T. S., Okina, F. T. A. & Adamowski, J. C. Nonlinear characterization of a single-axis acoustic levitator. Rev. Sci. Instrum. 85, 045125 (2014).
Article Google Scholar
Prisbrey, M. & Raeymaekers, B. Ultrasound noncontact particle manipulation of three-dimensional dynamic user-specified patterns of particles in air. Phys. Rev. Appl. 10, 034066 (2018).
Article Google Scholar
Paneva, V., Fleig, A., Plasencia, D. M., Faulwasser, T. & Müller, J. Optitrap: optimal trap trajectories for acoustic levitation displays. ACM Trans. Graph. 41, 1–14 (2022).
Article Google Scholar
Sallam, A. & Shahab, S. On nonlinear effects in holographic-modulated ultrasound. Appl. Phys. Lett. 121, 204101 (2022).
Article Google Scholar
Polychronopoulos, S. & Memoli, G. Acoustic levitation with optimized reflective metamaterials. Sci. Rep. 10, 4254 (2020).
Article Google Scholar
Fushimi, T., Marzo, A., Hill, T. L. & Drinkwater, B. W. Trajectory optimization of levitated particles in mid-air ultrasonic standing wave levitators. In 2018 IEEE International Ultrasonics Symposium (IUS) 1–9 (2018).
Ji, T. & Mohamad Nor, N. Deep learning-empowered digital twin using acoustic signal for welding quality inspection. Sensors 23, 2643 (2023).
Article Google Scholar
Sternini, S., Bottero, A., Kuperman, W. Acoustic digital twin for passive structural health monitoring. JASA Express Lett. 2, https://doi.org/10.1121/10.0009610 (2022).
Lin, C., Zhang, J., Xu, L. & Peng, H. Application of acoustic digital twin model for fault monitoring of heavy duty gearbox. In 2023 14th International Conference on Mechanical and Intelligent Manufacturing Technologies (ICMIMT). 279–287 (IEEE, 2023)
Wright, L. G. et al. Deep physical neural networks trained with backpropagation. Nature 601, 549–555 (2022).
Article Google Scholar
Peng, Y., Choi, S., Padmanaban, N. & Wetzstein, G. Neural holography with camera-in-the-loop training. ACM Trans. Graph. (SIGGRAPH Asia) 39, 1–14 (2020).
Article Google Scholar
Gor’kov, L. P. On the forces acting on a small particle in an acoustical field in an ideal fluid. Sov. Phys. Dokl. 6, 773 (1962).
Google Scholar
Bruus, H. Acoustofluidics 7: the acoustic radiation force on small particles. Lab Chip 12, 1014–1021 (2012).
Article Google Scholar
Hamilton, M. F. & Blackstock, D. T. Nonlinear Acoustics (Academic Press, 1998).
Gee, K. L. et al. The role of nonlinear effects in the propagation of noise from high-power jet aircraft. J. Acoust. Soc. Am. 123, 4082–4093 (2008).
Article Google Scholar
Contreras, V. & Marzo, A. Adjusting single-axis acoustic levitators in real time using rainbow schlieren deflectometry. Rev. Sci. Instrum. 92, 015107 (2021).
Article Google Scholar
Malkin, R., Todd, T. & Robert, D. A simple method for quantitative imaging of 2d acoustic fields using refracto-vibrometry. J. Sound Vibration 333, 4473–4482 (2014).
Article Google Scholar
Puskar, L. et al. Raman acoustic levitation spectroscopy of red blood cells and plasmodium falciparum trophozoites. Lab Chip 7, 1125 (2007).
Article Google Scholar
Yurduseven, O., Cooper, K. & Chattopadhyay, G. Point-spread-function (psf) characterization of a 340-GHz imaging radar using acoustic levitation. IEEE Trans. Terahertz Sci. Technol. 9, 20–26 (2019).
Article Google Scholar
Chen, X., Ding, Q., Bi, C., Ruan, J. & Yang, S. Lossless enrichment of trace analytes in levitating droplets for multiphase and multiplex detection. Nat. Commun. 13, 7807 (2022).
Article Google Scholar
Sasaki, Y., Kobayashi, K., Hasegawa, K., Kaneko, A. & Abe, Y. Transition of flow field of acoustically levitated droplets with evaporation. Phys. Fluids 31, https://doi.org/10.1063/1.5124499 (2019).
Onishi, R. et al. Two-dimensional measurement of airborne ultrasound field using thermal images. Phys. Rev. Appl. 18, 044047 (2022).
Article Google Scholar
Morales, R., Ezcurdia, I., Irisarri, J., Andrade, M. A. B. & Marzo, A. Generating airborne ultrasonic amplitude patterns using an open hardware phased array. Appl. Sci. (Switzerland) 11, https://doi.org/10.3390/app11072981 (2021).

Download references

Acknowledgements

This work was supported by JSPS KAKENHI Grant Number JP21K14103. We would like to thank Dr. Azier Marzo and Mr. Iñigo Ezcurdia for their assistance on the experimental setup. The authors would like to acknowledge OpenAI and ChatGPT for the mostly automated generation of the abstract. We would like to thank Editage (www.editage.com) for English language editing.

Author information

Authors and Affiliations

Institute of Library, Information and Media Science, University of Tsukuba, Kasuga Campus Kasuga 1-2, Tsukuba, 305-8550, Ibaraki, Japan
Tatsuki Fushimi & Yoichi Ochiai
R&D Center for Digital Nature, University of Tsukuba, Kasuga Campus Kasuga 1-2, Tsukuba, 305-8550, Ibaraki, Japan
Tatsuki Fushimi & Yoichi Ochiai
Graduate School of Comprehensive Human Sciences, University of Tsukuba, Kasuga Campus Kasuga 1-2, Tsukuba, 305-8550, Ibaraki, Japan
Daichi Tagami & Kenta Yamamoto
Pixie Dust Technologies, Inc., Misakicho 2-20-5, Chiyoda, 101-0061, Tokyo, Japan
Yoichi Ochiai

Authors

Tatsuki Fushimi
View author publications
You can also search for this author in PubMed Google Scholar
Daichi Tagami
View author publications
You can also search for this author in PubMed Google Scholar
Kenta Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar
Yoichi Ochiai
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.F. and Y.O. contributed to the conception of the system. T.F. performed the acquisition, analysis, and interpretation of the data. T.F., D.T., and K.Y. contributed to the creation of software. All authors contributed to the manuscript.

Corresponding author

Correspondence to Tatsuki Fushimi.

Ethics declarations

Competing interests

Y.O. has multiple unpaid advisory positions for governmental/non-governmental bodies in Japan. We further declare a competing interest with potential editor, Dr. Liangfei Tian (T.F. and Dr. Tian were in the same institution until 2019). T.F., K.Y., and D.T. declare no competing interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fushimi, T., Tagami, D., Yamamoto, K. et al. A digital twin approach for experimental acoustic hologram optimization. Commun Eng 3, 12 (2024). https://doi.org/10.1038/s44172-024-00160-0

Download citation

Received: 25 January 2023
Accepted: 02 January 2024
Published: 11 January 2024
DOI: https://doi.org/10.1038/s44172-024-00160-0