## Abstract

Acoustic holograms are the keystone of modern acoustics. They encode three-dimensional acoustic fields in two dimensions, and their quality determines the performance of acoustic systems. Optimisation methods that control only the phase of an acoustic wave are considered inferior to methods that control both the amplitude and phase of the wave. In this paper, we present Diff-PAT, an acoustic hologram optimisation platform with automatic differentiation. We show that in the most fundamental case of optimizing the output amplitude to match the target amplitude; our method with only phase modulation achieves better performance than conventional algorithm with both amplitude and phase modulation. The performance of Diff-PAT was evaluated by randomly generating 1000 sets of up to 32 control points for single-sided arrays and single-axis arrays. This optimisation platform for acoustic hologram can be used in a wide range of applications of PATs without introducing any changes to existing systems that control the PATs. In addition, we applied Diff-PAT to a phase plate and achieved an increase of > 8 dB in the peak noise-to-signal ratio of the acoustic hologram.

## Introduction

Acoustic hologram is becoming an imperative part of a wide range of acoustics applications such as in the fields of medicine^{1,2,3}, biology^{4,5,6,7,8}, and engineering^{9,10,11}. The reconstruction accuracy of the acoustic field from the hologram plays a significant role in determining the performance of the system. Thus, developing an acoustic hologram optimiser that can generate holograms with accurate field reconstruction is of significant interest of the field. Recent advancement of airborne phased array transducers (PATs) have enabled new applications such as airborne ultrasound tactile display (AUTD)^{12,13} and acoustic levitation^{14,15,16,17}. Acoustic levitation in particular is used for digital fabrication^{18,19}, display applications^{20,21,22,23,24,25} and sample holding in medicine^{26,27}, physics^{28,29} and chemistry^{30}. In all of these applications, acoustic holograms must generate a pressure control/focal point at a specified position. The control point is modulated at low frequencies to create haptic sensations that are sensible by human hands in an AUTD^{12,13}. Marzo et al. demonstrated that the control points can be converted into levitation points by adding a twin-trap hologram in acoustic levitation^{14,15}.

Generating an acoustic hologram for a single control point in space using PATs is trivial; however, it has been a significant challenge to generate a hologram that can create more than one control point in space. Multiple control points are becoming a necessary part of PAT applications, and low-quality acoustic hologram can lead to poor performances during practical use in applications. To address this challenge, Long et al. proposed Eigensolver and Tikhonov based regularisation in 2014^{13}. Marzo & Drinkwater proposed the iterative back propagation (IBP) method in 2018^{14}, and Plasencia et al. proposed GS-PAT in 2020^{31}. GS-PAT and IBP are both modified versions of the Gerchberg-Saxton algorithm^{32}. Most recently, Sakiyama et al.^{33} demonstrated acoustic hologram optimisation with the Levenberg–Marquardt algorithm (LMA); they examined the accuracy of ultrasonic stimulation in real and simulated environment. It should be noted that, while there are a number of hologram optimisers available within the research community, the usage of acoustic holograms to generate multiple focal points is a recent trend in the PAT community, and this field is yet to be developed. The listed acoustic hologram optimisers above are the most cited or considered state of the art in the field. LMA and IBP optimise only the phase of the transducer array, and GS-PAT and Eigensolver can optimise both the amplitude and phase of the acoustic hologram. Placensia et al.^{31} demonstrated that hologram optimisation methods with amplitude control (i.e. GS-PAT and Eigensolver) achieve higher-quality holograms than phase-only methods such as IBP, and that the Eigensolver method achieves the best optimised result because it seeks the global solution.

In this paper, we propose Diff-PAT, a phase-only, gradient-descent algorithm with automatic differentiation as a new platform for optimizing acoustic holograms. We demonstrate that in the most fundamental case of matching the output pressure to the target value; Diff-PAT achieve higher accuracy than the conventional Eigensolver and GS-PAT with both amplitude and phase control. Automatic differentiation is different from other differentiation operations (such as numerical differentiation and symbolic differentiation). This method calculates a derivative of a function at high accuracy by applying chain rules to each elementary operation of a given function. It is commonly used in machine learning. Our work was inspired by Peng et al.^{34}, who demonstrated this automatic differentiation based method for optical holograms. In comparison to acoustic holograms, optical holograms have been studied for a longer period, and a number of hologram optimisation methods have been developed. However, Peng et al. demonstrated that the method using automatic differentiation outperforms conventional methods such as the Gerchberg-Saxton method. The challenge in optimising an optical or acoustic holograms is based on the fact that the loss function converts a complex number to a real number (non-holomorphic objective^{35}). Whilst this function is challenging to differentiate analytically (the derivative of the function is not just a constant^{35}), latest automatic differentiation packages^{36,37,38} can differentiate these functions with high precision^{39} and ease^{34}. Diff-PAT does have multiple modes of implementation, and the gradient of the loss function does not necessarily need to be solved using automatic differentiation. Analytical and numerical differentiation can be just as effective for identifying the necessary information; however, it is cumbersome to analytically calculate the loss function (Wirtinger calculus is an advanced concept in methematics^{34,40}), and the implementation with numerical differentiation is not as efficient as automatic differentiation (details in section on computational time). Considering the development cycle of the ultrasonic application, the implementation and execution time should be kept minimal to allow the user to test out different array configurations and loss functions to fit their purpose. Thus, we argue that automatic differentiation is the best mode of implementation for Diff-PAT considering its potential usage in early to late stages of development. Achieving a high-quality acoustic hologram without amplitude control in PATs represents a significant contribution to the literature, as it allows PAT controllers to retain a simple design. Furthermore, we demonstrated the versatility of Diff-PAT by employing it for the optimisation of a phase plate^{41,42,43,44}. With regard to phase plates, the iterative angular spectrum approach (IASA) developed by Melde et al.^{41} is the standard method for optimizing acoustic holograms for arbitrary acoustic fields. We achieved enhanced performance by simply applying Diff-PAT to the optimisation process, without altering the fundamental design proposed by Melde et al^{41}.

The system overview of Diff-PAT is as shown in Fig. 1. We used JAX (ver. 0.2.5) on Python 3.6.9 to perform automatic differentiation and optimisation^{38}. To identify a suitable acoustic hologram (\(\phi_{n}\)) for a given control point \(x_{c}\), target amplitude \(A_{c}\), and transducer position \(x_{t}\), we adopted the Adam optimiser^{45}, which enables efficient stochastic optimisation with only first-order gradients. Before the optimisation of hologram \(\phi_{n}\), we defined the following optimisation problem and objective function :

where \({\mathbf{\mathcal{L}}}\left( {\phi_{n} , x_{c} , A_{c} , x_{t} } \right) = \mathop \sum \limits_{c = 0}^{C} (A_{c} - \left| {p_{t} \left( {\phi_{n} ,x_{c} ,x_{t} } \right)} \right|)^{2}\). \(C\) is the total number of the control points, and \(p_{t} \left( {\phi_{n} ,x_{c} ,x_{t} } \right)\) is the total acoustic pressure at \(x_{c}\) (see “Methods” section for details on \(p_{t}\)). In order to evaluate the effectiveness of Diff-PAT in PATs applications, acoustic hologram was generated for three setups of arrays (see Fig. 2a for layout); (1) a single-sided square array of 14 \(\times\) 14 transducers (M = 196), (2) two square arrays of 16 \(\times\) 16 transducers (M = 512), which face each other with a separation distance of 0.2355 m (i.e. a single-axis configuration) and (3) a single-sided square array of 32 \(\times\) 32 transducers (M = 1024). The single-sided array is a common arrangement for AUTDs, and the single-axis array is a popular configuration for acoustic levitation^{14,46,47,48}. The single-sided square array with M = 1024 was included in the evaluation in order to allow for the increasing demand of 1000 + transducer PATs^{18,49,50,51}. All results in this manuscript are based on numerical simulation; however, the utilised numerical models are well established in PATs, and the optimised results obtained from numerical simulation have been shown to translate well into physical experiments^{13,14,15,21,31}. Some discrepancy between numerical and experimental results has been reported; however its effect is limited to specific application requirements such as accurate positioning^{24,47,52} and is otherwise is rarely considered. The initial phase estimate was randomly generated for all transducers. The gradient is calculated by reverse-mode automatic differentiation according to the computational graph of the objective function \(\mathcal{L}\), and the phase is optimised by updating itself with the Adam optimiser on JAX. The Adam optimiser updates the current phase estimate according to the following^{45}:

where \(\widehat{{\mathrm{m}}_{\mathrm{t}}}=\frac{{\mathrm{m}}_{\mathrm{t}}}{1-{\upbeta }_{1}^{\mathrm{t}}}\) and, \(\widehat{{\mathrm{v}}_{\mathrm{t}}}=\frac{{\mathrm{v}}_{\mathrm{t}}}{1-{\upbeta }_{2}^{\mathrm{t}}}\) are bias-corrected first and second raw moment estimates. \({\mathrm{m}}_{\mathrm{t}}={\upbeta }_{1}\cdot {\mathrm{m}}_{\mathrm{t}-1}+(1-{\upbeta }_{1})\cdot \frac{\partial \mathcal{L}}{\partial\upphi }\) , and \({\mathrm{v}}_{\mathrm{t}}={\upbeta }_{2}\cdot {\mathrm{v}}_{\mathrm{t}-1}+(1-{\upbeta }_{2})\cdot {\left(\frac{\partial \mathcal{L}}{\partial\upphi }\right)}^{2}\), where t is the iteration number, and \({\mathrm{m}}_{\mathrm{t}}\) and \({\mathrm{v}}_{\mathrm{t}}\) are both initially zero vectors. The required hyperparameters for the Adam optimiser, \(\alpha \), \({\upbeta }_{1}\), \({\upbeta }_{2}\), and \(\varepsilon \) were set as; 0.1, 0.9, 0.999, and \(1\times {10}^{-8}\), respectively.

## Results and discussions

### Convergence rate of Diff-PAT

In the following section, the performance of Diff-PAT is evaluated. The best optimiser for PATs can be defined as the most versatile and accurate optimiser. In order to assess this, the performance of algorithms are evaluated using three array configurations and five levels of control point numbers. Control points are randomly generated using the process described in the “Methods” section, and this framework covers basic expectations of PATs in a wide range of applications and configurations available to date.

We show that the optimisation function converges quickly to the target value (evaluated by the ratio \({{\varvec{R}}}_{{\varvec{p}}}=\frac{\left|{{\varvec{p}}}_{{\varvec{t}}}({{\varvec{x}}}_{{\varvec{c}}})\right|}{{A}_{c}}\) between the target and current acoustic pressure amplitude), as shown in Fig. 2b (see Data Availability for results at each iteration). Increasing the number of control points (N) has a negative effect on the convergence rate of the solution, and the length of the error bar (which shows the standard deviation of \({{\varvec{R}}}_{{\varvec{p}}}\)) increases. However, the increase in the number of transducers has a negligible effect on the convergence rate (Fig. 2b). We confirm that our algorithm converges sufficiently between 100 and 150 iterations in Fig. 2b, and the number of iterations was set to 150 in all evaluations of Diff-PAT in this study. The convergence rate of the algorithm can be further improved by hyperparameter tuning; however, this is beyond the scope of the current study. Here, the sum of squared error was chosen as the loss function; however, it could be evaluated using a number of different loss functions (such as Huber loss and sum of absolute error). The choice of loss function has an impact on the final results, and the sum of squared error gave the best results in the preliminary analysis. Further improvement in the loss function may be possible; however, such investigation is beyond the scope of the current study.

### Accuracy of Diff-PAT and comparison to conventional solvers

The performance of Diff-PAT was compared with that of conventional algorithms in PATs which were made available by the authors of GS-PAT in C ++ ^{31}. Here, we compare Diff-PAT with the Eigensolver, corrected Eigensolver, and GS-PAT. The results are shown in the box-and-whisker plot in Fig. 3. Each algorithm was tasked to optimise the acoustic hologram for the same dataset, which included 1000 sets of randomised control points and amplitude (GS-PAT's iteration number was set to 100, as in their study^{31}). Raw data for Fig. 3 can be accessed as stated in the Data Availability section. For readers wishing to inspect the optimised acoustic pressure field in detail, the exemplary field from each optimiser is available in the supplementary material. The visual inspection of the acoustic pressure field shows that the target is achieved as specified in Diff-PAT. Furthermore, we performed a simple case study to demonstrate that the identified solutions by Diff-PAT are optimised solutions and that Diff-PAT strictly follows limits imposed by physical laws; details of which can be found in the supplementary material. The optimised phases from Diff-PAT were exported to a performance analyser developed by the author of GS-PAT, and the performance was analysed on it to independently verify our solution.

Interestingly, when the control number is small (N = 2), Eigensolver performs poorly (median values for M = 196, 512 and 1024 are 0.737, 0.728 and 0.721, respectively, in Fig. 3). This large deviation is considered to be caused by the tendency of the regularisation policy to “homogenise transducer's output rather than reconstruction accuracy”^{31}. This performance can be improved by using the corrected Eigensolver method, and the median value improves to unity for M = 196, 512 and 1024 when N = 2.

In addition, the performance of Eigensolver and the corrected Eigensolver improves when the control point number increases from N = 2 to N = 32 in all types of arrays. Specifically, the interquartile (IQ) range value for the corrected Eigensolver method when N = 2 and M = 196 is 0.058; however increasing N to 32 improves the IQ range value to 0.015. We attribute this to the variability of the target amplitude in a given control point geometry. As the number of control points increases, the average difference between the minimum and maximum amplitude decreases. This is partially because of how the target amplitude is assigned when control points are randomly generated. The sum of the amplitudes assigned to the control points is always equal to a constant (see the “Methods” section), and the difference between the minimum and maximum amplitude for a given control point geometry decreases from 523, 283, and 70 Pa for N = 2, 8, and 32, respectively (M = 196 with corrected Eigensolver). This statement is supported by further analysis of the dataset of the corrected Eigensolver when N = 2 and M = 196. When the control point geometries lie between the IQ range, the average amplitude difference between the control points is 443 Pa. However, beyond the IQ range, the average amplitude difference between the control points increases to 603 Pa.

As shown in Fig. 3, the performance of GS-PAT decreases as the number of control points N increases. When N = 2 and M = 196, the IQ range is 0.012; however, when N is increased to 32, the IQ range increases to 0.238. By contrast, increasing the number of transducers to M = 512 improves the performance of GS-PAT, and the IQ range drops to 0.119 when N = 32 and M = 512. These results are consistent with the observations made by Placensia et al. who claimed that GS-PAT is comparable to Eigensolver up to N = 8 when M = 512. However, as the transducer number increases to M = 1024, the performance of GS-PAT drops again (IQ range increases to 0.089 and 0.146 for N = 2 and 32 when M = 1024).

Conventional solvers (i.e. Eigensolver, corrected Eigensolver, and GS-PAT) show a trade-off between the numbers of transducers and control points. In contrast, Diff-PAT outperforms all solvers in all cases evaluated in Fig. 3. Diff-PAT is robust against amplitude variability within the control point geometries and can consistently achieve the target amplitude despite the reduction in transducer numbers. In addition, while the Eigensolver type method and GS-PAT use both amplitude and phase control, Diff-PAT outperforms both methods with only phase control. It should be noted that Diff-PAT does have an increased background noise in comparison to the other algorithms (see acoustic pressure distributions from each solver in supplementary material). However, the increased background noise as obtained in this case study does not invalidate the effectiveness of Diff-PAT, as this is the direct results of the clearly set objectives in the loss function (i.e. only acoustic pressure at the target point is considered). Whether the increased background noise become an issue is largely dependent on each application. For example, in ultrasonic tactile displays increased background noise could lead to blunt haptic sensation, whereas in acoustic levitation, high background noise away from the control point may not arise as significant problem. It is difficult to ascertain the effect of the increased background noise, sole from the numerical simulation. Thus, the purpose of the paper is to set to introduce the effectiveness of Diff-PAT as the platform for optimizing acoustic systems, and we hope researchers from each field will develop suitable loss function which meet their purpose in their system.

### Comparing computational time of each solver

The computational time of the solver is insignificant for non-active PATs; however, some applications of PATs require consideration of the computational efficiency. Figure 4 compares the computational time of each solver, and all of the solvers were executed on the same high-end desktop computer (4.2 GHz Core i7-7700 K, 64 GB RAM). Eigensolver and corrected Eigensolver (Fig. 4a and b) both show similar trends in computational time, and an increase in control points (N) does not have a significant effect on their computational time. However, doubling the number of transducers results in a magnitude increase in the computational time for both Eigensolver and corrected Eigensolver (e.g. when N = 2, the computational time for Eigensolver is 6.2, 110, and 830 ms for M = 196, 512, and 1024, respectively). GS-PAT achieves the lowest computational time of all solvers in all of the conditions (lowest computational time is 0.12 ms with N = 2 and M = 196, and the highest computational time is 21 ms with N = 32 and M = 1024). The authors of GS-PAT have reported up to 17,000 geometries per second^{31}, making it well suited for application requiring rapid calculations. However, as demonstrated in Figs. 3 and 4, GS-PAT is inaccurate in comparison to other solvers, and the regions in which GS-PAT can achieve both accuracy and speed is limited.

Diff-PAT does not have a computational efficiency that is as high as that of GS-PAT; however, its computational time is comparable to that of Eigensolver type optimiser with M = 512. When M = 512 and \(\mathrm{N}\le 4\), Diff-PAT is faster than the Eigensolver type method and when M = 1024, Diff-PAT is faster than the Eigensolver method in all cases of N. In addition, Diff-PAT scales well with an increasing number of transducers as shown in Fig. 4d. Given that the demand for high transducer number PATs is increasing^{18,49,50,51,53}: Diff-PAT is already more suited to optimise large transducer number PATs than an Eigensolver type method, both in terms of accuracy and computational efficiency. Furthermore, there are abundant applications of PATs where the accurate reconstruction of the acoustic field is prioritised over computational speed^{18,20,21,22,54}. Diff-PAT can also be implemented using numerical differentiation as stated earlier. The accuracy of Diff-PAT with numerical differentiation is comparable to that of automatic differentiation; however, its computational efficiency is lower than that of automatic differentiation, as shown in the supplementary material. This is due to the fact that numerical differentiation is an O(n) process, whereas automatic differentiation scales better with the increasing number of variables in the loss function^{39}.

### Application of Diff-PAT for Phase Plate

In this section, we further explore the versatility of Diff-PAT by employing Diff-PAT for the optimisation of a phase plate^{41,42,43,44} which is used underwater. It has a significantly higher number of elements than PATs^{43} and is well-suited for testing the capabilities of Diff-PAT. The IASA developed by Melde et al. is one of the most popular optimisation methods for phase plates. Similar to the GS-PAT and IBP methods, IASA also applies the Gerchberg–Saxton algorithm^{32}. To create a phase plate version of Diff-PAT, the acoustic pressure field was assumed to be a plane wave^{41}, and the angular spectrum approach^{55} was used to calculate the propagating acoustic field in the loss function (see “Methods” section for details).

Figure 5 compares the holograms optimised using IASA and Diff-PAT (see Supplementary Information for the acoustic holograms of IASA and Diff-PAT). The reconstruction accuracy of Diff-PAT is clearly higher than that of IASA. A simple visual inspection verifies that the acoustic hologram optimised using IASA, shown in Fig. 5a–c, has many artefacts and does not resolve the test image well (Fig. 5c). In contrast, the acoustic hologram optimised using Diff-PAT has a significantly improved image with minimal artefacts, as shown in Fig. 5a–b, and the USAF 1951 resolution test chart is clearly resolved in Fig. 5c. Quantitatively, the peak signal-to-noise ratio (PSNR) values for Diff-PAT are 16.4, 20.7 and 16.4 dB (Fig. 5a–c, respectively), and they are at least 8 dB higher the respective PSNRs of IASA. In practical applications of phase plates, these optimised acoustic holograms are encoded and fabricated into a plate^{41} using a 3D printer. As shown in the supplementary material, Diff-PAT’s acoustic hologram has a spatially higher frequency component in comparison with IASA. This could potentially cause issues during manufacture of the phase plates. The effect of such manufacturing tolerances was evaluated, as presented in the supplementary material, and it was found that the PSNR drops by 3 dB when the standard deviation of normally distributed manufacturing tolerance drops to σ≈20%. If manufacturing tolerances become an issue, some weighted regularization term or a low pass filter can be added in the loss function to reduce the high frequency component of the acoustic hologram. The detailed view of acoustic pressure distribution demonstrates that Diff-PAT achieves this high accuracy in the region of interest (ROI) by diffracting the acoustic wave outside the ROI as shown in the supplementary material. The improved accuracy of Diff-PAT with significantly reduced artefacts in the ROI can enhance performance of a wide range of acoustic systems used in medicine^{1,2,3}, biology^{4,5,6,7,8}, and engineering^{9,10,11} applications.

In conclusion, we presented Diff-PAT, which optimises only the phase of an acoustic hologram using automatic differentiation. We demonstrated the effectiveness of Diff-PAT as a new platform for optimising acoustic holograms in PATs and phase plates. In addition, contrary to common belief, the phase-only optimiser can achieve better accuracy than an optimiser with both amplitude and phase control in the simplest test case for acoustic holograms. For PATs with high transducer numbers (M = 1024), Diff-PAT is more suitable than Eigensolver type methods, both in terms of accuracy and computational efficiency. In future, this improved acoustic hologram optimiser will improve the performance of many applications by providing higher quality acoustic holograms. In addition, this study, along with that of Peng et al.^{31}, demonstrated the effectiveness of implementing automatic differentiation in the context of hologram optimization. We hope that these results will further facilitate the use of Diff-PAT platform in a wide range of fields and applications in acoustics.

## Methods

### Total acoustic pressure calculation

The total pressure was evaluated by \({p}_{t}=\sum_{m=0}^{M}\frac{{P}_{ref}}{d\left({x}_{c}, { x}_{t}\right)}D\left(\uptheta \right){e}^{j(kd\left({x}_{c},{ x}_{t}\right)+{\upphi }_{n,m})}\) where \(M\) is the total number of transducers, \({P}_{ref}\) is the reference pressure amplitude; \(d\left({x}_{c}, {x}_{t}\right)\) is the Euclidean distance between \({x}_{c}\) and \({x}_{t}\), and \(D\left(\theta \right)= \frac{2{J}_{1}(krsin\left(\uptheta \right))}{krsin\left(\uptheta \right)}\) is the far-field directivity function for the piston source based on the angle, \(\uptheta \). Moreover, \(\uptheta \) is evaluated as the angle between the transducer normal and \({x}_{c}\); \(r = 5\) mm is the transducer radius, and \({J}_{1}\) is the Bessel function of the first kind of order 1. \(k=\frac{2\uppi f}{{c}_{0}}\) is the wavenumber given the acoustic frequency, \(f = 40 \mathrm{kHz}\) and the speed of sound is \({c}_{0}\). \({P}_{ref}\) and \({c}_{0}\) are assumed to be 1.98 Pa at 1 m (12 V Pk-Pk^{25}) and \(346 {\mathrm{ms}}^{-1}\), respectively. In the implementation of Diff-PAT, the acoustic pressure calculation is split into pre-calculated (A) and re-calculated (B) components: \({\mathrm{p}}_{\mathrm{t}}=\sum_{\mathrm{m}=0}^{\mathrm{M}}\mathrm{AB}\), where \(\mathrm{A}=\frac{{\mathrm{P}}_{\mathrm{ref}}}{\mathrm{d}\left({\mathrm{x}}_{\mathrm{c}}, {\mathrm{ x}}_{\mathrm{t}}\right)}\mathrm{D}\left(\uptheta \right){\mathrm{e}}^{\mathrm{j}(\mathrm{kd}\left({\mathrm{x}}_{\mathrm{c}},{\mathrm{ x}}_{\mathrm{t}}\right))}\), and \(\mathrm{B}={\mathrm{e}}^{\mathrm{j}({\upphi }_{\mathrm{n},\mathrm{m}})}\). Given the transducer properties do not change, part A does not change at all, and it allows significant increase in computational efficiency. In addition, neither TensorFlow 2.3.0 nor JAX 0.2.5 currently supports differentiation of Bessel functions of the first kind of order 1, and these terms cannot be included within the computational graph of automatic differentiation.

### Random generation of control points and amplitude

Control points, \({x}_{c}\) are randomly generated such that \(x\), \(y\), and \(z\) coordinates are within the region of interest (ROI), i.e. \([-0.05, 0.05]\) m from the centre of the array. For single-sided arrays, the z-axis centre is set at \(z=0.1\) m. The physically achievable pressure amplitude was predetermined by making a singular focal point (\(\phi =-\frac{2\uppi f}{{c}_{0}}\left[d\left({x}_{r},{x}_{t}\right)-d\left(0,{x}_{r}\right)\right]\)) at the vertex of ROI, \({x}_{r}\) (8 points in total). The average acoustic pressures around the vertices for single-sided (M = 196), single-axis (M = 512), and large single-sided (M = 1024) PATs are 1512, 3812, and 4121 Pa, respectively. Amplitude \({A}_{c}\) for each control point is randomly assigned such that the minimum pressure amplitude is 10 Pa, and the amplitude at each control point sums up to the average acoustic pressure for the single focus of each transducer array. This target amplitude assignment method is technically inaccurate as it assumes the acoustic amplitude to be conserved (i.e. acoustic power/energy should be conserved). However, the assignment of target via acoustic energy is impractical as it requires assignment of both target pressure and velocity field. Thus, this target assignment method is employed in this manuscript, and it is considered appropriate since the optimizer can follow the targets specified.

### Application of Diff-PAT to the phase plate

For the optimisation of the phase plate, an underwater transducer with a resonance frequency of 2 MHz, diameter of 35 mm, and input amplitude of 1 Pa was assumed at the surface. The input pressure field was assumed to be a plane wave, and the initial estimate of the phase was set to zero for all elements. The speed of sound in water was assumed to be 1480 \({\mathrm{ms}}^{-1}\), and the pixel size was assumed to be 150 \(\mathrm{\mu m}\). For simplicity, the acoustic transmission loss through the plate was considered negligible, and the angular spectrum was solved using methods described by Zeng & McGough^{55} (following the implementation of k-Wave^{56}). Both IASA and Diff-PAT were programmed in Python 3.6.9, and the phase plate version of Diff-PAT used Tensorflow^{36} (ver. 2.3.0) to differentiate the loss function automatically. We also used the Adam optimiser in TensorFlow with the same hyperparameter setting as in the PAT version. IASA was calculated by following the steps described by Melde et al.^{41}, using the angular spectrum approach proposed by Zeng & McGough^{55}. The propagation distance from the transducer to the image plane was 20 mm, and the iteration number for the optimiser (both IASA and Diff-PAT) was set to 200. The loss function was \(\mathop \sum \limits_{x}^{N} \mathop \sum \limits_{y}^{N} \left| {A_{c} \left( {x,y} \right) - \left| {p\left( {x,y} \right)} \right|} \right|\), where \(A_{c} \left( {x,y} \right)\) represents the target image with a screen resolution (N) of 256 \(\times\) 256 pixels. The optimised acoustic hologram was exported from Python and the resultant acoustic pressure fields shown in Fig. 5 were calculated using k-Wave (ver. 1.3 *angularSpectrumCW*)^{56} on MATLAB. Finally, the PSNR was calculated using the MATLAB Image Processing Toolbox.

## Data availability

The data that support the findings of this study are available within this article. The data for transducer arrangements, control point geometries and amplitude, output phase by Diff-PAT, and the results from all solvers are available in Zenodo at https://doi.org/10.5281/zenodo.4906351.

## Code availability

The program code for optimisation and analysis is also made available at the link provided in Data Availability.

## References

Jiménez-Gambín, S., Jiménez, N., Benlloch, J. M. & Camarena, F. Holograms to Focus Arbitrary Ultrasonic Fields through the Skull.

*Phys. Rev. Appl.***12**, 1 (2019).Sapozhnikov, O. A., Tsysar, S. A., Khokhlova, V. A. & Kreider, W. Acoustic holography as a metrological tool for characterizing medical ultrasound sources and fields.

*J. Acoust. Soc. Am.***138**, 1515–1532 (2015).Baresch, D. & Garbin, V. Acoustic trapping of microbubbles in complex environments and controlled payload release.

*Proc. Natl. Acad. Sci.***117**, 15490–15496 (2020).Ma, Z.

*et al.*Acoustic holographic cell patterning in a biocompatible hydrogel.*Adv. Mater.***32**, 1–6 (2020).Kreider, W.

*et al.*Characterization of a multi-element clinical HIFU system using acoustic holography and nonlinear modeling.*IEEE Trans. Ultrason. Ferroelectr. Freq. Control***60**, 1683–1698 (2013).Baudoin, M.

*et al.*Folding a focalized acoustical vortex on a flat holographic transducer: Miniaturized selective acoustical tweezers.*Sci. Adv.***5**, 1–7 (2019).Baudoin, M.

*et al.*Spatially selective manipulation of cells with single-beam acoustical tweezers.*Nat. Commun.***11**, 1–10 (2020).Baresch, D., Thomas, J.-L. & Marchiano, R. Observation of a single-beam gradient force acoustical trap for elastic particles: acoustical tweezers.

*Phys. Rev. Lett.***116**, 024301 (2016).Kruizinga, P.

*et al.*Compressive 3D ultrasound imaging using a single sensor.*Sci. Adv.***3**, (2017).Melde, K.

*et al.*Acoustic Fabrication via the Assembly and Fusion of Particles.*Adv. Mater.***30**, 1–5 (2018).Gong, Z. & Baudoin, M. Particle assembly with synchronized acoustic tweezers.

*Phys. Rev. Appl.***12**, 1 (2019).Hoshi, T., Takahashi, M., Iwamoto, T. & Shinoda, H. Noncontact tactile display based on radiation pressure of airborne ultrasound.

*IEEE Trans. Haptics***3**, 155–165 (2010).Long, B., Seah, S. A., Carter, T. & Subramanian, S. Rendering volumetric haptic shapes in mid-air using ultrasound.

*ACM Trans. Graph.***33**, 1–10 (2014).Marzo, A. & Drinkwater, B. W. Holographic acoustic tweezers.

*Proc. Natl. Acad. Sci.***116**, 84–89 (2018).Marzo, A.

*et al.*Holographic acoustic elements for manipulation of levitated objects.*Nat. Commun.***6**, 8661 (2015).Ochiai, Y., Hoshi, T. & Rekimoto, J. Three-dimensional mid-air acoustic manipulation by ultrasonic phased arrays.

*PLoS ONE***9**, e97590 (2014).Marzo, A., Caleap, M. & Drinkwater, B. W. Acoustic virtual vortices with tunable orbital angular momentum for trapping of mie particles.

*Phys. Rev. Lett.***120**, 44301 (2018).Prisbrey, M., Guevara Vasquez, F. & Raeymaekers, B. 3D ultrasound directed self-assembly of high aspect ratio particles: On the relationship between the number of transducers and their spatial arrangement.

*Appl. Phys. Lett.***117**, (2020).Youssefi, O. & Diller, E. Contactless robotic micromanipulation in air using a magneto-acoustic system.

*IEEE Robot. Autom. Lett.***4**, 1580–1586 (2019).Ochiai, Y., Hoshi, T. & Rekimoto, J. Pixie dust: graphics generated by levitated and animated objects in computational acoustic-potential field.

*ACM Trans. Graph.***33**, Article 85 (2014).Morales, R., Marzo, A., Subramanian, S. & Martínez, D. LeviProps: Animating levitated optimized fabric structures using holographic acoustic tweezers.

*Proc. 32nd Annu. ACM Symp. User Interface Softw. Technol.*651–661, https://doi.org/10.1145/3332165.3347882 (2019).Sahoo, D. R.

*et al.*JOLED: A Mid-air Display based on Electrostatic Rotation of Levitated Janus Objects. in*Proceedings of the 29th Annual Symposium on User Interface Software and Technology - UIST ’16*437–448 (ACM Press, 2016). https://doi.org/10.1145/2984511.2984549.Uno, Y.

*et al.*Luciola: A Millimeter-Scale Light-Emitting Particle Moving in Mid-Air Based On Acoustic Levitation and Wireless Powering.*Proc. ACM Interactive, Mobile, Wearable Ubiquitous Technol.***1**, 1–17 (2018).Fushimi, T., Marzo, A., Drinkwater, B. W. & Hill, T. L. Acoustophoretic volumetric displays using a fast-moving levitated particle.

*Appl. Phys. Lett.***115**, 64101 (2019).Hirayama, R., Martinez Plasencia, D., Masuda, N. & Subramanian, S. A volumetric display for visual, tactile and audio presentation using acoustic trapping.

*Nature***575**, 320–323 (2019).Puskar, L.

*et al.*Raman acoustic levitation spectroscopy of red blood cells and Plasmodium falciparum trophozoites.*Lab Chip***7**, 1125 (2007).Hosseinzadeh, V. A., Brugnara, C. & Holt, R. G. Shape oscillations of single blood drops: applications to human blood and sickle cell disease.

*Sci. Rep.***8**, 1–8 (2018).Yurduseven, O., Cooper, K. & Chattopadhyay, G. Point-spread-function (PSF) characterization of a 340-GHz imaging radar using acoustic levitation.

*IEEE Trans. Terahertz Sci. Technol.***9**, 20–26 (2019).Hasegawa, K. & Kono, K. Oscillation characteristics of levitated sample in resonant acoustic field.

*AIP Adv.***9**, 035313 (2019).Andrade, M. A. B., Camargo, T. S. A. & Marzo, A. Automatic contactless injection, transportation, merging, and ejection of droplets with a multifocal point acoustic levitator.

*Rev. Sci. Instrum.***89**, 125105 (2018).Plasencia, D. M., Hirayama, R., Montano-Murillo, R. & Subramanian, S. GS-PAT: High-speed Multi-point Sound-fields for Phased Arrays of Transducers.

*ACM Trans. Graph.***39**, (2020).Gerchberg, R. W. A practical algorithm for the determination of phase from image and diffraction plane pictures.

*Optik (Stuttg).***35**, 237–246 (1972).Sakiyama, E.

*et al.*Midair Tactile Reproduction of Real Objects. in*Haptics: Science, Technology, Applications*(eds. Nisky, I., Hartcher-O’Brien, J., Wiertlewski, M. & Smeets, J.) 425–433 (Springer International Publishing, 2020).Peng, Y., Choi, S., Padmanaban, N. & Wetzstein, G. Neural holography with camera-in-the-loop training.

*ACM Trans. Graph. (SIGGRAPH Asia)***39**, 1–14 (2020).Chakravarthula, P., Peng, Y., Kollin, J., Fuchs, H. & Heide, F. Wirtinger holography for near-eye displays.

*ACM Trans. Graph.***38**, (2019).Abadi, M.

*et al.*TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems. (2016).Paszke, A.

*et al.*Pytorch: An imperative style, high-performance deep learning library. in*Advances in neural information processing systems*8026–8037 (2019).Bradbury, J.

*et al.*JAX: composable transformations of Python+NumPy programs. http://github.com/google/jax (2018).Güne, A., Baydin, G., Pearlmutter, B. A. & Siskind, J. M. Automatic differentiation in machine learning: a survey.

*J. Mach. Learn. Res.***18**, 1–43 (2018).Candès, E. J., Li, X. & Soltanolkotabi, M. Phase retrieval via wirtinger flow: theory and algorithms.

*IEEE Trans. Inf. Theory***61**, 1985–2007 (2015).Melde, K., Mark, A. G., Qiu, T. & Fischer, P. Holograms for acoustics.

*Nature***537**, 518–522 (2016).Melde, K.

*et al.*Acoustic fabrication via the assembly and fusion of particles.*Adv. Mater.***1704507**, 1704507 (2017).Cox, L., Melde, K., Croxford, A., Fischer, P. & Drinkwater, B. W. Acoustic hologram enhanced phased arrays for ultrasonic particle manipulation.

*Phys. Rev. Appl.***12**, 64055 (2019).Franklin, A., Marzo, A., Malkin, R. & Drinkwater, B. W. Three-dimensional ultrasonic trapping of micro-particles in water with a simple and compact two-element transducer.

*Appl. Phys. Lett.***111**, (2017).Kingma, D. P. & Ba, J. L. Adam: A method for stochastic optimization.

*3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc.*1–15 (2015).Fushimi, T., Hill, T. L., Marzo, A. & Drinkwater, B. W. Nonlinear trapping stiffness of mid-air single-axis acoustic levitators.

*Appl. Phys. Lett.***113**, 034102 (2018).Prisbrey, M. & Raeymaekers, B. Ultrasound noncontact particle manipulation of three-dimensional dynamic user-specified patterns of particles in air.

*Phys. Rev. Appl.***10**, 034066 (2018).Morris, R. H., Dye, E. R., Docker, P. & Newton, M. I. Beyond the Langevin horn: Transducer arrays for the acoustic levitation of liquid drops.

*Phys. Fluids***31**, 101301 (2019).Inoue, S.

*et al.*Acoustical boundary hologram for macroscopic rigid-body levitation.*J. Acoust. Soc. Am.***145**, 328–337 (2019).Hasegawa, K., Yuki, H. & Shinoda, H. Curved acceleration path of ultrasound-driven air flow.

*J. Appl. Phys.***125**, 1–7 (2019).Ochiai, Y., Hoshi, T. & Rekimoto, J. Three-dimensional mid-air acoustic manipulation by ultrasonic phased arrays.

*PLoS ONE***9**, 97590 (2014).Fushimi, T., Marzo, A., Hill, T. L. & Drinkwater, B. W. Trajectory Optimization of Levitated Particles in Mid-Air Ultrasonic Standing Wave Levitators.

*2018 IEEE Int. Ultrason. Symp.*1–9 (2018) https://doi.org/10.1109/ULTSYM.2018.8580093.Hasegawa, K., Qiu, L., Noda, A., Inoue, S. & Shinoda, H. Electronically steerable ultrasound-driven long narrow air stream.

*Appl. Phys. Lett.***111**, (2017).Prisbrey, M. & Raeymaekers, B. Aligning high-aspect-ratio particles in user-specified orientations with ultrasound-directed self-assembly.

*Phys. Rev. Appl.***12**, 1 (2019).Zeng, X. & McGough, R. J. Evaluation of the angular spectrum approach for simulations of near-field pressures.

*J. Acoust. Soc. Am.***123**, 68–76 (2008).Treeby, B. E. & Cox, B. T. k-Wave: MATLAB toolbox for the simulation and reconstruction of photoacoustic wave fields.

*J. Biomed. Opt.***15**, 21314 (2010).Nievinski, F. G. subtightplot. https://jp.mathworks.com/matlabcentral/fileexchange/39664-subtightplot (2013).

## Acknowledgements

This work was funded by Pixie Dust Technologies, Inc. We would like to acknowledge the authors of GS-PAT for making the code of the conventional optimisation algorithm available for comparison on C++. We would also like to acknowledge the developers of k-Wave. We would like to thank Tatsuya Minagawa for assisting with the visualization of the PATs in Fig. 1. Fig. 3 was generated using subtighplot^{57}. Fig. 5b was designed by Freepik from Flaticon; we have license to modify and use the image for commercial or non-commercial purposes. We would like to thank Editage (www.editage.com) for English language editing.

## Author information

### Authors and Affiliations

### Contributions

T.F. and K.Y. contributed equally to the work. T.F., K.Y., and Y.O. designed the study; T.F. and K.Y. performed the research; T.F. and K.Y. analysed the data; and T.F., K.Y., and Y.O. wrote the paper. All authors reviewed the manuscript.

### Corresponding author

## Ethics declarations

### Competing interests

Pixie Dust Technologies, Inc. filed Japanese patent JP2020-167367A on 1st October 2020, covering the Diff-PAT algorithm described in this paper, listing the authors T.F., K.Y., and Y.O. as inventors. Y.O. has multiple unpaid advisory position for governmental/non-governmental bodies in Japan. This work was funded by ″Strategic Research Platform towards Digital Nature Powered by Pixie Dust Technologies, Inc″.

## Additional information

### Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Supplementary Information

## Rights and permissions

**Open Access** This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

## About this article

### Cite this article

Fushimi, T., Yamamoto, K. & Ochiai, Y. Acoustic hologram optimisation using automatic differentiation.
*Sci Rep* **11**, 12678 (2021). https://doi.org/10.1038/s41598-021-91880-2

Received:

Accepted:

Published:

DOI: https://doi.org/10.1038/s41598-021-91880-2

## Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.