Quantum topology identification with deep neural networks and quantum walks

Ming, Yurui; Lin, Chin-Teng; Bartlett, Stephen D.; Zhang, Wei-Wei

doi:10.1038/s41524-019-0224-x

Download PDF

Article
Open access
Published: 27 August 2019

Quantum topology identification with deep neural networks and quantum walks

npj Computational Materials volume 5, Article number: 88 (2019) Cite this article

5014 Accesses
30 Citations
18 Altmetric
Metrics details

Subjects

Abstract

Topologically ordered materials may serve as a platform for new quantum technologies, such as fault-tolerant quantum computers. To fulfil this promise, efficient and general methods are needed to discover and classify new topological phases of matter. We demonstrate that deep neural networks augmented with external memory can use the density profiles formed in quantum walks to efficiently identify properties of a topological phase as well as phase transitions. On a trial topological ordered model, our method’s accuracy of topological phase identification reaches 97.4%, and is shown to be robust to noise on the data. Furthermore, we demonstrate that our trained DNN is able to identify topological phases of a perturbed model, and predict the corresponding shift of topological phase transitions without learning any information about the perturbations in advance. These results demonstrate that our approach is generally applicable and may be used to identify a variety of quantum topological materials.

Identifying quantum phase transitions using artificial neural networks on experimental data

Article 01 July 2019

Enhancing detection of topological order by local error correction

Article Open access 20 February 2024

Experimental unsupervised learning of non-Hermitian knotted phases with solid-state spins

Article Open access 24 September 2022

Introduction

The properties of topological quantum materials have been the subject of intense interest in recent years, due to their paradigm-changing implications for condensed matter physics^1,2,3,4 and potential applications to new technologies. The electric conductivity of topological materials such as topological insulators has potential applications for magnetoelectric devices with higher efficiency and lower energy consumption.^5,6,7 In addition, topological materials can support anyonic quasiparticle excitations, with exotic statistics under braiding transformations that may enable fault-tolerant quantum computing.^8,9 The topological ordering of quantum materials can be characterised with quantised, nonlocal topological invariants, such as the Chern number of the quantum Hall effect. These invariants determine all of the key topological properties of quantum systems, such as the number of topological edge states and the types of anyonic excitations in topological materials. The discovery and characterisation of novel topological quantum materials requires a general and efficient method to identify these topological invariants using experimentally accessible properties. For bulk systems of topological insulators, these can often be inferred from the existence of edge states,^2,10 or particle dynamics, such as the anomalous velocities obtained by wave packets under applied forces,^11,12 and quantum walks.^{13,14,15,16,17,18,19} However, despite the considerable theoretical progress in developing classification methods for topological phases, we still lack a universal automatic method for the discovery and characterisation of new materials.

Here, we propose and test a universal automated method for identifying topological phases of quantum materials, combining quantum walks to probe the phase and a deep neural network (DNN) to analyse the evolution. Using the particle density profiles formed during a particle’s evolution driven by the system’s Hamiltonian, we demonstrate that a novel DNN with external memory is able to identify the topological phases and phase transitions for a two-dimensional lattice model with spin–orbit coupling. Our method demonstrates high identification accuracy of 97.4%, and is shown to be robust to noise on the input data. Finally, although we train our model using data from a specific two-dimensional spin–orbit lattice Hamiltonian, we demonstrate that our method is able to classify the phases of a perturbed model with high accuracy, without any details about the perturbation. As such, our results demonstrate that quantum walks and DNN are a powerful and generic tool for the efficient discovery and analysis of novel topological quantum systems, and therefore the design of robust quantum technologies.

Results

Continuous-time quantum walks (CTQW) in topological quantum systems

The coherent dynamics of particles, with motion dependent on an internal degree of freedom, such as spin, are described as quantum walks. Along with providing a tool for building quantum algorithms, quantum walks also provide a platform to simulate and analyse complex physical systems.^20,21 There are two types of quantum walks: discrete-time and continuous-time quantum walks, where the main difference is the timing used to apply corresponding evolution operators. In the case of discrete-time quantum walks, the corresponding evolution operator of the system is applied only in discrete time steps, while in the continuous-time quantum walk case, the evolution operator is applied continuously. Discrete-time quantum walks have been successfully used to study topological properties of a quantum system. Specifically, the experimental observation of particle localisation at the boundary between materials possessing different topological ordering and its robustness to the defects have been used to prove the existence of topologically protected edge modes.^{13,14,15,16,22,23} Furthermore, the moments of the probability distribution for the walker’s position after many steps is an experimental signature of a topological quantum phase transition in one-dimensional quantum walks.¹⁵ In contrast to discrete-time quantum walks, which require pulsed control over the system, CTQW can arise directly in free Hamiltonian systems such as two-dimensional spin–orbit lattice models. These CTQW have been shown to reveal topological phase transitions,¹⁷ a fact supported by recent experiments.¹⁸ In such CTQW, the resulting density profile of an initially localised particle is expected to contain a wealth of information to identify the topological order of the underlying quantum system, provided one can extract this information efficiently.

In this work, we consider the topological phases of a parameterised Hamiltonian on a two-dimensional lattice (599 × 599 in our simulation). Following ref. ¹⁷, we use a CTQW for a initially localised spin-up particle under this Hamiltonian, where the behaviour of the distribution of the quantum state after long time evolution provides a signature of the topological phase. We will investigate the use of both the particle’s spatial, as well as its momentum density profiles, marginalising over the particles internal state. Specifically, we consider the two-dimensional spin–orbit lattice Hamiltonian,^17,18,24,25 as described in the section “Methods”. We use this model to test our method for topological phase identification because the topological invariant (Chern number) of this system is easily calculated, allowing us to check the accuracy of our method. This Hamiltonian supports five distinct topological phases, labelled by the Chern number ${\cal{C}} \in \left\{ {0, \pm 1, \pm 2} \right\}$, determined by the coupling parameters in this Hamiltonian, as shown in Fig. 1.

The density profile is strongly dependent on the system’s topology, and can be used as a diagnostic of topological phases, and the phase transitions between them, as discussed in refs ^17,18. From these previous studies, good signatures for topological phase identification are the central features of the position distribution and the ring pattern of the momentum distribution, which reveal that the Hamiltonian localises in a nontrivial topological phase with Chern number ${\cal{C}} = \pm 1$. However, these previous analyses are based on approximations, and we do not have a general method to analyse the density profiles for topological phases associated with other Chern numbers.

Learning topological phases using a DNN

Machine learning can determine the underlying characteristics of a physical system even without prior human knowledge.²⁶ Deep learning, a subset of machine learning, which represents the data as a nested hierarchy of concepts, provide great capability and adaptability in this regard.²⁷ Each concept is defined in relation to simpler concepts, and more abstract representations are computed in terms of less abstract ones. Deep learning has achieved breakthroughs across many applications,^27,28,29,30 indicating its potential benefit in the analysis of many different quantum problems.^{31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48} Inspired by the hierarchical bio-structures in visual systems,⁴⁹ DNNs can automatically extract the most suitable representations from input data and make accurate predictions. Generally speaking, during the end-to-end learning process, the representations of data will automatically emerge rather than being discovered or manually crafted.⁵⁰

We will apply DNN to the problem of topological identification by providing the network with the density profiles from a CTQW as input. As described above, the density profiles contain a wealth of information about the topological phase of the system, but identifying which features are important is challenging, especially for higher order phases. A DNN with external memory has the capacity to solve complex structural tasks that are challenging to stand-alone neural networks, and has shown the ability to answer synthetic questions designed to emulate reasoning and inference problems.^51,52,53,54 The architecture of our DNN is shown in Fig. 2, which consists of multiple computation blocks (CB) and fully connected layers (computation network), as well as an external memory coupled to the last convolutional layer (memory network). The computation network is of a supervised-learning paradigm and the memory network is of an unsupervised-learning paradigm. Supervised and unsupervised paradigms each have their own advantages in classification problems, as introduced in ref. ²⁷ and they are jointly trained during the process in our experiments.

Our experiment consists of three steps: data preparation, neural network training and validation, and testing. The data preparation stage is based on numerical simulations of CTQW with different Hamiltonian parameters, and is described in the section “Methods”. The data corresponding to different topological phases is randomised and split into three sets with the ratio 0.8:0.1:0.1 for training, validation and testing, respectively. Validation is integrated to the iterative training process to prevent overfitting and several different DNN architectures were manually compared as discussed in the supplementary files. Details of neural architecture evaluation and naive baseline are given in the Appendix. The prepared data is reused three times to evaluate the network. As the performance indicator for the corresponding prepared data, the accuracy in our results is the average over the three independent randomisation sets.

We analyse the outcome of our experiments using the principal component analysis (PCA) of memory, a t-distributed stochastic neighbour embedding (t-SNE) of the computation network output, and the statistical accuracy of the test. Both the PCA and the t-SNE are visualisation results, and the accuracy is a statistical evaluation. The t-SNE shows the topological classification of input data corresponding to different Chern numbers. The PCA demonstrates how the input data is clustered according to its correlation by self-organisation, which distinguishes the different topological phases of the input data. The accuracy represents the fraction of test data that is correctly identified (by comparing with the analytical solution).

The PCA and t-SNE based on the data—the density profiles in momentum and position space—are shown in Fig. 3, where the DNN identification forms separated clusters associated with the topological phases of our model Hamiltonian system. For the momentum space data, the identification clearly reveals five clusters corresponding to each of the distinct topological phases of the Hamiltonian. For the position space data covering the whole phase diagram, only four clusters are identified and the topological phases corresponding to Chern numbers ${\cal{C}} = \pm 2$ are not distinguished based on this data.

The statistical accuracy of our test, i.e., the ratio between the number of testing samples classified into correct topological phases and the total number of testing samples, is shown in Table 1. When based on momentum space density profiles, we obtain a very high accuracy for data covering both the whole phase diagram region, as well as for a restriction to the region around the phase transition (97.4% and 95.8% respectively). Position space density profiles lead to identification with relatively lower accuracy for the whole phase diagram, 76.1%, remaining high for the phase transitions regions 93.8%. The reduction in accuracy for the whole phase diagram is primarily because our DNN is unable to distinguish the phases ${\cal{C}} = \pm2$. By excluding the data for $|{\cal{C}}| = 2$, the accuracy obtained with data from the whole phase diagram reaches 94.9%. The low accuracies for distinguishing ${\cal{C}} = \pm2$ in this case may potentially be an affect of our parameterisation of phase space: the variation of our chosen hyper-parameter manifold in FBZ for Hamiltonians with ${\cal{C}} = 0, \pm 1, - 2$ are a continuous process, while the ${\cal{C}} = \pm2$ region is accessed through a discrete change; see the section “Methods” for details of the data generation. The relatively small region for $|{\cal{C}}| = 2$ in the phase diagram of this model also potentially restricts the learning ability of DNN.

Table 1 The statistical accuracy for the topological identification using our DNN

Full size table

Quantum walks on engineered topological quantum materials have been realised in different physical platforms including photonics systems^14,19,23,55 and cold atoms,¹⁸ amongst others. For our method to be useful on experimental data, it must be robust to noise. Here, we test the performance of our method with noisy input data for our trained DNN. We add Gaussian noise to our simulated data, at a level comparable with current experimental techniques in optical systems^23,55 and cold atoms systems;^10,18,56 details are discussed in the section “Methods”. In these tests, the accuracy statistics for topological phase identification shows limited degradation as indicated in Table 1. Using momentum density profiles, the accuracy decreases by only 0.3% on average, and this decrease could potentially be offset by increasing the size of the network.

General applicability of the method

As we now show, our DNN trained with the data from CTQWs governed by a known model is also able to identify the topology of a perturbed model without additional information or further training to learn the perturbation. Hereafter, we refer the DNN after training on the unperturbed model as our “trained DNN”. In our test, the perturbed model is obtained by adding an additional term to our training Hamiltonian;^57,58 see the section “Methods” for details. As the Chern number for the perturbed model can still be calculated analytically, we are able to test the accuracy of our trained DNN to identify the topology of the perturbed model.

We generate three sets of momentum density profiles using the perturbed Hamiltonian with three different perturbation strengths η = {3, 6, 9}. Our trained DNN is able to identify the topology of the perturbed Hamiltonian with an averaged accuracy 93.88%, with {97.96%, 93.88%, 89.80%} for η = {3, 6, 9}, respectively, where the accuracy decreases while increasing the perturbation strength.

Furthermore, to demonstrate that our trained DNN is able to detect changes to the topological system caused by the perturbation, we show that it can identify the location of the topological phase transitions and how these locations shift depending on the perturbation. Our trained DNN reveals that, while increasing the perturbation strength η, the phase boundary between ${\cal{C}} = 1$ and ${\cal{C}} = 0$ shifts in the direction of increasing magnitude of t₃, that is, the area of the ${\cal{C}} = 0$ phase region is increasing as a function of the perturbation strength. Specifically, our trained DNN predictions for the phase transition shifts are Δ_DNN = {1.005, 1.746, 2.686} for η = {3, 6, 9}, which are close to the theoretical analysis for the corresponding shifts Δ = {1, 2, 3}. We note that we classify phases using a grid of discrete points in parameter space, and that this discretisation accounts for a considerable uncertainty in our identified phase boundaries, comparable with the error in the estimates. Further details are give in the section “Methods”.

We have demonstrated a universal automatic method for the identification of distinct topological phases of quantum materials, and the related perturbed models. Our simulated experimental results show that the combination of the particle’s density profile from a CTQW and DNN augmented with external memory is a reliable and efficient method to identify topological phases and phase transitions in our trial system, even for the high order ${\cal{C}} = \pm 2$ and noisy data. We have also demonstrated the generality of this method, by using our trained DNN to classify the topological properties of a perturbed system without any knowledge of the perturbation.

Discussion

For the purpose of engineering novel topological systems using our method, we could use zero-shot learning methods, which aim to recognise objects whose instances may not have been seen during training.⁵⁹ By integrating the zero-shot learning into our DNN, the design and identification of novel topological phases will be possible.

Methods

Here we present the trial topological Hamiltonian system, and describe the generation of a particle’s density profile as used as the input data for our DNN. The perturbed model, which we use to assess the generality of our method, is detailed as well. We also provide the details of the architecture of our DNN.

The topological system in our simulated experiments

The two-dimensional spin–orbit lattice Hamiltonian we consider here is^17,18,24,25

$$\begin{array}{*{20}{l}} {\hat H} \hfill & = \hfill & {\mathop {\sum}\limits_{x,y} {\left[ {c_{x,y}^\dagger \frac{m}{2}\hat \sigma _3c_{x,y} + c_{x + 1,y}^\dagger (t_{1x}\hat \sigma _1 - {\mathrm{i}}\frac{3}{4}t_3\hat \sigma _3)c_{x,y}} \right.} } \hfill \\ {} \hfill & {} \hfill & {\left. { + c_{x,y + 1}^\dagger (t_{1y}\hat \sigma _2 - {\mathrm{i}}\frac{3}{4}t_3\hat \sigma _3)c_{x,y} + c_{x + 1,y + 1}^\dagger t_2\hat \sigma _3c_{x,y} + {\mathrm {h.c.}}} \right]} \hfill \\ {} \hfill & = \hfill & {\mathop {\sum}\limits_{k_x,k_y} {\vec h \cdot \vec \sigma } \left| {k_x,k_y} \right\rangle \left\langle {k_x,k_y} \right|,} \hfill \end{array}$$

(1)

using {m, t_1x, t_1y, t₂, t₃} as the coupling parameters, i ∈ {1, 2, 3}, $\sigma = \{ \hat \sigma _1,\hat \sigma _2,\hat \sigma _3\}$ as the Pauli operators and $\vec h = \left( {h_1,h_2,h_3} \right)$. The last line of Eq. (1) is obtained by using translation invariance and the Fourier Transformation $\left\{ {\left| {k_x} \right\rangle = \frac{1}{{\sqrt {2\pi } }}\mathop {\sum}\nolimits_x {{\mathrm {e}}^{ - ixk_x}} \left| x \right\rangle ,\left| {k_y} \right\rangle = \frac{1}{{\sqrt {2\pi } }}\mathop {\sum}\nolimits_y {{\mathrm {e}}^{ - iyk_y}} \left| y \right\rangle } \right\}$, the 2 × 2 block-diagonalized Hamiltonian in momentum space is

$$\begin{array}{*{20}{l}} {\vec h \cdot \vec \sigma } \hfill & = \hfill & {2t_{1x}{\mathrm{cos}}\,k_x\hat \sigma _1 + 2t_{1y}\,{\mathrm{cos}}\,k_y\hat \sigma _2} \hfill \\ {} \hfill & {} \hfill & { + \left\{ {m + 2t_2\,{\mathrm{cos}}\,\left( {k_x + k_y} \right) + \frac{3}{2}t_3\left( {{\mathrm{sin}}\,k_x + {\mathrm{sin}}\,k_y} \right)} \right\}\hat \sigma _3.} \hfill \end{array}$$

(2)

This Hamiltonian supports the topological phases with Chern numbers ${\cal{C}} \in \left\{ {0, \pm 1, \pm 2} \right\}$. We consider a parameter space given by varying the coupling parameters m and t₃ while fixing all other parameters. For example, while fixing t_1x = t_1y = 1, t₂ = 5 the Hamiltonian supports ${\cal{C}} \in \left\{ {0, \pm 1, - 2} \right\}$ and while fixing t_1x = 1, t_1y = −1, t₂ = 5 the Hamiltonian supports ${\cal{C}} \in \left\{ {0, \pm 1,2} \right\}$. The definition of Chern number is

$${\cal{C}} = \frac{1}{{4\pi }}\mathop {\int}\nolimits_{{\mathrm{BZ}}} {{\mathrm {d}}^2} k\,\hat h \cdot \left( {\partial _{k_x}\hat h \times \partial _{ky}\hat h} \right){\kern 1pt} ,$$

(3)

with $\hat h = \vec h/|\vec h|$.²⁴ The different topological phases labelled by Chern number ${\cal{C}}$, as a function of Hamiltonian parameters, are shown in Fig. 1.

The formation of particle’s density profile in both momentum and position spaces

In CTQW evolutions, a particle with spin up, initially localised in the centre of a two-dimensional lattice in position space, spreads out and gradually occupies a larger area of the lattice. Equivalently, the particle is initially uniformly distributed in momentum space and during the evolution the particle’s components at every momenta oscillates between spin up and spin down components. The particle’s wave functions and probability distributions in both position and momentum spaces form a certain pattern which is closely related with the Hamiltonian.

At evolution time t, the state of the particle initially spin up and localised at the centre of two-dimensional lattice is (setting ℏ = 1)

$$\begin{array}{*{20}{l}} {\left| {\psi (t)} \right\rangle } \hfill & = \hfill & {\mathop {\sum}\limits_{\boldsymbol{k}} {\left( {\alpha _{{\boldsymbol{k}} \uparrow }\left| \uparrow \right\rangle + \alpha _{{\boldsymbol{k}} \downarrow }\left| \downarrow \right\rangle } \right)} \left| {\boldsymbol{k}} \right\rangle } \hfill \\ {} \hfill & = \hfill & {\mathop {\sum}\limits_{\boldsymbol{k}} \left( {\begin{array}{*{20}{c}} {\frac{{h_3\left( { - {\mathrm{isin}}\left( {E_{\boldsymbol{k}}t} \right)} \right)}}{{E_{\boldsymbol{k}}}} - {\mathrm{cos}}\left( {E_{\boldsymbol{k}}t} \right)\frac{{\left( {h_1 + {\mathrm{i}}h_2} \right)\left( { - {\mathrm{isin}}\left( {E_{\boldsymbol{k}}t} \right)} \right)}}{{E_{\boldsymbol{k}}}}} \end{array}} \right)|{\boldsymbol{k}}\rangle {\kern 1pt} } \hfill \end{array}$$

(4)

where $E_{\boldsymbol{k}} = \sqrt {h_x^2 + h_y^2 + h_z^2} \ne 0$ is the eigenenergy of system’s Hamiltonian. When E_k = 0 we have α_k↑ = 1 and α_k↓ = 0, which is the case at Dirac point while the system is under topological phase transition. The particle’s state represented in position space is the Fourier transform of the corresponding spin components.

From the expression of Eq. (4) for particle’s state at time t, the amplitude and the relative phase of both spin up and spin down components are closely related with the energy E_k and sensitive to the band gap of the system which is min{2E_k} as discussed in refs. ^17,18 The topological phase of the system characterised with Chern number is revealed by the band structure of the system. Therefore, the particle’s density profile is a competitive candidate for the topological detection, even for higher order phases.

Here, we generate two sets of density profiles. One is the wave functions in momentum space and the other is the probability distributions in position space. For the training of the neural network, we decompose the complex values of both spin up and spin down components into two real values and map the amplitude and relative phase matrices into image representation. With this process, the input data set consists of the set of spatial or momentum distributions for the particle’s final states.

Dataset generation for our DNN identifying the topology of quantum matters

Our system supports topological phases with ${\cal{C}} = \{ 0, \pm 1, \pm 2\}$ as described above. The diagram showing the distribution of Chern number ${\cal{C}}$ with respect to m, t₃ and fixed t_1x = t_1y = 1, t₂ = 5 is shown in Fig. 1, where the shaded area represents the parameter area for the dataset labelled as “whole” and the dotted area represents the parameter area for the dataset labelled as “transition” in our tables. The dataset for ${\cal{C}} = 2$ is generated with the same m, t₃, t_1x, t₂ as ${\cal{C}} = - 2$, but with t_1y = −1. The sizes of our dataset generated for the whole phase diagram are {1449, 1478, 1486, 1488, 1449} and for the phase transition area of the diagram are {1575, 1506, 1474, 1408, 1575} corresponding to ${\cal{C}} = \{ - 2, - 1,0,1,2\}$, respectively. The conventional practice using DNN²⁸ indicates this data size is sufficient for training. The density profiles in our work are mimicking the theoretical density profiles after a long-time evolution on an infinite large lattice. Since our data are generated with numeric simulations, we choose an evolution time which enables the particle’s density profile occupying around 80% of the lattice area. This strategy ensures the evolutions avoid the boundary effects of a finite lattice and in the meanwhile they are good approximations to the long-time evolutions, which means they are time-independent and the minor evolution time changes will not affect our results.

The method to add the noise to our density profiles are different for the data collected in different measurement spaces, i.e. momentum or position. The experimental momentum data measurement can be implemented in cold atom systems as in refs, ^10,18 where the noise in the data are the shot-noise and Gaussian white noise. The standard deviation of Gaussian noise is set to be 0.02 in our simulated data, which is a reasonable estimation for current technology based on the error bar ranges in ref. ¹⁸. The experimental position data measurement can be implemented in cold atom system as in ref. ⁵⁶ and photonics systems as in refs ^23,55 by encoding the position of a walker in either time-bins or spatial modes. The noise in position data includes shot-noise and device noise resulting in the uncertainty in both relative phase and amplitude of the state, which is realised by the convolution between the perfect state and the point-spread function (PSF) of the system. In our noisy data, the PSF we used is a Gaussian with 0 as its mean and 2 as the standard deviation which is also within current experimental techniques level.^60,61

Identification of perturbed system

We consider a perturbation to the Hamiltonian of Eq. (2) given by the addition of a third nearest-neighbour (hopping) term in x direction, which has the expression $\hat h_{{\mathrm{N3}}}^x = \eta \,{\mathrm{cos}}\left( {2k_x} \right)\sigma _z$, with k_x ∈ [−π, π) as the momenta in x direction, and η as the perturbation strength. The block-diagonalized Hamiltonian having a third nearest-neighbour (hopping) term is $\vec h\prime \cdot \vec \sigma = \vec h \cdot \vec \sigma + \hat h_{{\mathrm{N3}}}^x$, where $\vec h \cdot \vec \sigma$ is as in Eq. (2).

All the data in this section is generated in momentum space. We consider a parameter space for $\vec h \cdot \vec \sigma$ by fixing t_1x = t_1y = 1, t₂ = 5 as before, and with $m \in \left[ { - 20, - 10} \right)$, t₃ ∈ [−20, 20] corresponding to the area ${\cal{C}} = \{ 0,1\}$ of the phase diagram of the unperturbed system as shown in Fig. 1. For the different perturbation strength η = {3, 6, 9} we generate 196 data, where seven values equally sampled from $m \in \left[ { - 20, - 10} \right)$ and 28 values equally sampled from t₃ ∈ [−20, 20], where the discretisation resolution for sampling the Hamiltonian parameters is 1.4815. With the three sets of perturbation data corresponding to η = {3, 6, 9} as the input, our trained DNN is able to identify their topological phases and phase transitions. The topology identification of the perturbed Hamiltonian is with an averaged accuracy 93.88%, with {97.96%, 93.88%, 89.80%} for η = {3, 6, 9}, respectively.

To show that our trained DNN is detecting the changes to the model caused by the perturbation, we use our trained DNN to track the movement of the topological phase transition as the perturbation is increased. With the outputs of our trained DNN, we can isolate the location of a topological phase transition as laying between points with different Chern numbers, (t_3L, t_3R). We take the middle point of the two locations (t_3L + t_3R)/2 as the estimate of the phase boundary, and calculate the corresponding shift Δt₃ from the location of the phase boundary t_3|η=0 in the unperturbed η = 0 model,

$${\mathrm{\Delta }}t_3 = \left( {t_{3{\mathrm{L}}} + t_{3{\mathrm{R}}}} \right)/2 - t_{3|\eta = 0}.$$

(5)

We define a phase boundary shift Δ_DNN for perturbation η to be the average of a collection of shifts sgn(t₃)Δt₃ with different m along the boundary. We note that this method to identify the phase boundary shift is very sensitive to the discretisation resolution of the parameter space on which the DNN is used. In our simulations, the phase shift affected by the parameters discretisation resolution is 0.7407, the half of the discretisation resolution.

Our trained DNN reveals that, while increasing the third nearest-neighbour coupling strength η, the phase boundary between ${\cal{C}} = 1$ and ${\cal{C}} = 0$ shifts outwards (in the direction of increasing magnitude |t₃|), i.e., the area of ${\cal{C}} = 0$ is increasing with the size of the perturbation. Specifically, our trained DNN predictions for the phase boundary shifts are Δ_DNN = {1.005, 1.746, 2.686} as shown in Fig. 4 for η = {3, 6, 9}, which are close to the theoretical analysis for the corresponding shifts Δ = {1, 2, 3} indicated from the gapless band structures.²⁴

The configurations of our DNN for topological phase identification of quantum systems

We use a DNN coupled with an external memory for identification of topological phases from the distributions from CTQWs. We take advantage of the most of up-to-date techniques for our computation network design. For the memory network, the simplification of memory operations is achieved by using a self-organising map (SOM), which is endowed with effective memory addressing and allocation mechanisms. A hybrid learning approach is devised to optimise the network for obtaining promising results.

The detailed architecture and the configuration of our network is illustrated in Fig. 2 and Table 2. There are six CB (two with size 8 × 8, two with size 16 × 16 and two with size 32 × 32), two fully connected layers and an external memory. During the training process, the learning rates (LR) for computation network and memory network are 0.0001 and 0.4, respectively. The batch size is set as 64 and the network is trained 1000 iterations. The learning rate decay factor in our computation network is 0.9 for every 100 iterations. The time constant for SOM is the number of iterations divided by the natural logarithm of initial radius (128 in our experiment). The labels for memory clusters are probed by tracking the corresponding coordinates of a few typical data from different topological phases. The details on the network architecture selection, naive baseline and the misclassified samples interpretation are shown in Appendix.

Table 2 DNN architecture configuration with LR as learning rate

Full size table

Our experiments run on a GPU cluster with three nodes. Each node is with two Intel CPUs of model E5-2680 and 128GB physical memory. For computing acceleration, each CPU manages a separate PCIe slot in which an NVIDIA Quadro P5000 GPU card with 16GB on-board memory installed.

Data availability

Data available on request from the authors.

Code availability

All the codes are available in our repository (https://github.com/mingyr/Quantum_TP_Identification_DNN).

References

Moore, J. E. The birth of topological insulators. Nature 464, 194–198 (2010).
Article CAS Google Scholar
Hasan, M. Z. & Kane, C. L. Colloquium: topological insulators. Rev. Mod. Phys. 82, 3045–3067 (2010).
Article CAS Google Scholar
Ryu, S., Schnyder, A. P., Furusaki, A. & Ludwig, A. W. Topological insulators and superconductors: tenfold way and dimensional hierarchy. New J. Phys. 12, 065010 (2010).
Article Google Scholar
Qi, X.-L. & Zhang, S.-C. Topological insulators and superconductors. Rev. Mod. Phys. 83, 1057–1110 (2011).
Article CAS Google Scholar
Li, C. H. et al. Electrical detection of charge-current-induced spin polarization due to spin-momentum locking in Bi2Se3. Nat. Nanotechnol. 9, 218–224 (2014).
Article CAS Google Scholar
Ando, Y. et al. Electrical detection of the spin polarization due to charge flow in the surface state of the topological insulator Bi1.5Sb0.5Te1.7Se1.3. Nano Lett. 14, 6226–6230 (2014).
Article CAS Google Scholar
DC, M. et al. Room-temperature high spin–orbit torque due to quantum confinement in sputtered Bi x Se (1–x) films. Nat. Mater. 17, 800–807 (2018).
Article CAS Google Scholar
Nayak, C. et al. Non-Abelian anyons and topological quantum computation. Rev. Mod. Phys. 80, 1083–1159 (2008).
Article CAS Google Scholar
Field, B. & Simula, T. Introduction to topological quantum computation with non-Abelian anyons. Quantum Sci. Technol. 3, 045004 (2018).
Article Google Scholar
Wu, Z. et al. Realization of two-dimensional spin–orbit coupling for Bose–Einstein condensates. Science 354, 83–88 (2016).
Article CAS Google Scholar
Price, H. M. & Cooper, N. R. Mapping the Berry curvature from semiclassical dynamics in optical lattices. Phys. Rev. A 85, 033620 (2012).
Article Google Scholar
Duca, L. et al. An Aharonov–Bohm interferometer for determining Bloch band topology. Science 347, 288–292 (2015).
Article CAS Google Scholar
Kitagawa, T., Rudner, M. S., Berg, E. & Demler, E. Exploring topological phases with quantum walks. Phys. Rev. A 82, 033429 (2010).
Article Google Scholar
Kitagawa, T. et al. Observation of topologically protected bound states in photonic quantum walks,. Nat. Commun. 3, 882 (2012).
Article Google Scholar
Cardano, F. et al. Statistical moments of quantum-walk dynamics reveal topological quantum transitions. Nat. Commun. 7, 11439 (2016).
Article CAS Google Scholar
Zhang, W.-W., Goyal, S. K., Simon, C. & Sanders, B. C. Decomposition of split-step quantum walks for simulating Majorana modes and edge states. Phys. Rev. A 95, 052351 (2017).
Article Google Scholar
Zhang, W.-W., Sanders, B. C., Apers, S., Goyal, S. K. & Feder, D. L. Detecting topological transitions in two dimensions by Hamiltonian evolution. Phys. Rev. Lett. 119, 197401 (2017).
Article Google Scholar
Sun, W. et al. Uncover topology by quantum quench dynamics. Phys. Rev. Lett. 121, 250403 (2018).
Article Google Scholar
Zhan, X. et al. Detecting topological invariants in nonunitary discrete-time quantum walks. Phys. Rev. Lett. 119, 130501 (2017).
Article Google Scholar
Venegas-Andraca, S. E. Quantum walks: a comprehensive review. Quantum Inf. Process. 11, 1015 (2012).
Article Google Scholar
Portugal, R. Quantum Walks and Search Algorithms (Springer: New York, 2013).
Flurin, E. et al. Observing topological invariants using quantum walks in superconducting circuits. Phys. Rev. X 7, 031023 (2017).
Google Scholar
Xiao, L. et al. Observation of topological edge states in parity–time–symmetric quantum walks. Nat. Phys. 13, 1117 (2017).
Article CAS Google Scholar
Sticlet, D., Piéchon, F., Fuchs, J.-N., Kalugin, P. & Simon, P. Geometrical engineering of a two-band Chern insulator in two dimensions with arbitrary topological index. Phys. Rev. B 85, 165456 (2012).
Article Google Scholar
Asbóth, J.K., Oroszlány, L. & Pályi, A. A short course on topological insulators. Lect. Notes Phys. 919, 85–98 (2016).
Schmidt, M. & Lipson, H. Distilling free-form natural laws from experimental data. Science 324, 81–85 (2009).
Article CAS Google Scholar
Goodfellow, I., Bengio, Y. & Courville, A. Deep Learning (MIT Press, USA, 2016).
Krizhevsky, A., Sutskever, I. & Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 1097, 1097–1105 (2012).
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS Google Scholar
Shallue, C. J. & Vanderburg, A. Identifying exoplanets with deep learning: a five-planet resonant chain around kepler-80 and an eighth planet around kepler–90. Astron. J. 155, 94 (2018).
Article Google Scholar
Cai, X.-D. et al. Entanglement-based machine learning on a quantum computer. Phys. Rev. Lett. 114, 110504 (2015).
Article Google Scholar
Schuld, M., Sinayskiy, I. & Petruccione, F. An introduction to quantum machine learning. Contemp. Phys. 56, 172–185 (2015).
Article Google Scholar
Dunjko, V., Taylor, J. M. & Briegel, H. J. Quantum-enhanced machine learning. Phys. Rev. Lett. 117, 130501 (2016).
Article Google Scholar
Biamonte, J. et al. Quantum machine learning. Nature 549, 195–202 (2017).
Article CAS Google Scholar
Mott, A., Job, J., Vlimant, J.-R., Lidar, D. & Spiropulu, M. Solving a Higgs optimization problem with quantum annealing for machine learning. Nature 550, 375–379 (2017).
Article CAS Google Scholar
Broecker, P., Carrasquilla, J., Melko, R. G. & Trebst, S. Machine learning quantum phases of matter beyond the fermion sign problem. Sci. Rep. 7, 8823 (2017).
Article Google Scholar
Carrasquilla, J. & Melko, R. G. Machine learning phases of matter. Nat. Phys. 13, 431–434 (2017).
Article CAS Google Scholar
Zhang, Y. & Kim, E.-A. Quantum loop topography for machine learning. Phys. Rev. Lett. 118, 216401 (2017).
Article Google Scholar
Zhang, P., Shen, H. & Zhai, H. Machine learning topological invariants with neural networks. Phys. Rev. Lett. 120, 066401 (2018).
Article CAS Google Scholar
Choo, K., Carleo, G., Regnault, N. & Neupert, T. Symmetries and many-body excitations with neural-network quantum states. Phys. Rev. Lett. 121, 167204 (2018).
Article CAS Google Scholar
Lu, S. et al. Separability-entanglement classifier via machine learning. Phys. Rev. A 98, 012315 (2018).
Article CAS Google Scholar
Arrazola, JuanMiguel et al. Machine learning method for state preparation and gate synthesis on photonic quantum computers. Quantum Sci. Technol. 4, 024004 (2019).
Article Google Scholar
Caio, M., Caccin, M., Baireuther, P., Hyart, T. & Fruchart, M. Machine learning assisted measurement of local topological invariants. arXiv:1901.03346 (2019).
Mehta, P. et al. A high-bias, low-variance introduction to machine learning for physicists. arXiv:1803.08823 (2019).
Article Google Scholar
Rem, B. S. et al. Identifying quantum phase transit using artificial neural networks on experimental data. arXiv:1809.05519 (2018).
Sarma, S. D., Deng, S.-L. & Duan, L.-M. Machine learning meets quantum physics. Phys. Today 72, 48 (2019).
Article Google Scholar
Schuld, M. Machine learning in quantum spaces. Nature 567, 179–181 (2019).
Article CAS Google Scholar
Rodriguez-Nieva, J. F. & Scheurer, M. S. Identifying topological order through unsupervised machine learning. Nat. Phys. 15, 790–795 (2019).
Article Google Scholar
Hubel, D. & Wiesel, T. David Hubel and Torsten Wiesel. Neuron 75, 182–184 (2012).
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article CAS Google Scholar
Graves, A. et al. Hybrid computing using a neural network with dynamic external memory. Nature 538, 471–476 (2016).
Article Google Scholar
McLaughlin, N., Del Rincon, J. M. & Miller, P. Data-augmentation for reducing dataset bias in person re-identification, In Proc. 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Vol. 1 (IEEE, Karlsruhe, Germany, 2015).
Crispell, D., Biris, O., Crosswhite, N., Byrne, J. & Mundy, J. L. Dataset augmentation for pose and lighting invariant face recognition. arXiv:1704.04326 (2017).
DeVries, T. & Taylor, G. W. Dataset augmentation in feature space. arXiv:1702.05538 (2017).
Chen, C. et al. Observation of topologically protected edge states in a photonic two-dimensional quantum walk. Phys. Rev. Lett. 121, 100502 (2018).
Article CAS Google Scholar
Robens, C. et al. High numerical aperture (NA = 0.92) objective lens for imaging and addressing of cold atoms. Opt. Lett. 42, 1043–1046 (2017).
Article CAS Google Scholar
Sticlet, D. & Piéchon, F. Distant-neighbor hopping in graphene and Haldane models. Phys. Rev. B 87, 115402 (2013).
Article Google Scholar
Montambaux, G. An equivalence between monolayer and bilayer honeycomb lattices. Eur. Phys. J. B 85, 375 (2012).
Article Google Scholar
Xian, Y., Lampert, C., Schiele, B. & Akata, Z. Zero-shot learning-a comprehensive evaluation of the good, the bad and the ugly. IEEE Trans. Pattern Anal. Mach. Intell. 41, 2251–2265 (2018).
Stallinga, S. & Rieger, B. Accuracy of the Gaussian point spread function model in 2D localization microscopy. Opt. Express 18, 24461–24476 (2010).
Article CAS Google Scholar
Minář, J. et al. Phase-noise measurements in long-fiber interferometers for quantum-repeater applications. Phys. Rev. A 77, 052325 (2008).
Article Google Scholar

Download references

Acknowledgements

This work is supported by the Australian Research Council via the Centre of Excellence in Engineered Quantum Systems project number CE170100009 and Discovery Project numbers DP170103073, DP180100670 and DP180100656, and USyd-SJTU Partnership Collaboration Awards. The authors acknowledge discussions about noisy experimental data with Wei Sun, Chao Chen, Yu He, and Steven Flammia, Xianmin Jin and comments from Robin Harper and John Manion. The authors acknowledge the University of Sydney and University of Technology Sydney for providing HPC resources that have contributed to the research results reported in this paper.

Author information

Authors and Affiliations

Centre for Artificial Intelligence, School of Computer Science, University of Technology Sydney, Sydney, Australia
Yurui Ming & Chin-Teng Lin
Centre for Engineered Quantum Systems, School of Physics, The University of Sydney, Sydney, Australia
Stephen D. Bartlett & Wei-Wei Zhang

Authors

Yurui Ming
View author publications
You can also search for this author in PubMed Google Scholar
Chin-Teng Lin
View author publications
You can also search for this author in PubMed Google Scholar
Stephen D. Bartlett
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Wei Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.M. and C.-T.L. designed and performed the DNN experiments. W.-W.Z. and S.D.B. proposed theoretical support. W.-W.Z. prepared the training data. All authors contributed to writing the paper.

Corresponding author

Correspondence to Wei-Wei Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary: Quantum topology identi.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ming, Y., Lin, CT., Bartlett, S.D. et al. Quantum topology identification with deep neural networks and quantum walks. npj Comput Mater 5, 88 (2019). https://doi.org/10.1038/s41524-019-0224-x

Download citation

Received: 19 November 2018
Accepted: 02 August 2019
Published: 27 August 2019
DOI: https://doi.org/10.1038/s41524-019-0224-x

This article is cited by

Entanglement detection with artificial neural networks
- Naema Asif
- Uman Khalid
- Hyundong Shin
Scientific Reports (2023)
Realising and compressing quantum circuits with quantum reservoir computing
- Sanjib Ghosh
- Tanjung Krisnanda
- Timothy C. H. Liew
Communications Physics (2021)
A data-driven approach to violin making
- Sebastian Gonzalez
- Davide Salvi
- Augusto Sarti
Scientific Reports (2021)
Characterization and control of open quantum systems beyond quantum noise spectroscopy
- Akram Youssry
- Gerardo A. Paz-Silva
- Christopher Ferrie
npj Quantum Information (2020)