Introduction

Quantum Internet is a communication network with quantum nodes and quantum links1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16 that allows to the parties to perform efficient quantum communications17,18,19. The aim of quantum Internet5,8 and quantum repeater networks1,17,18,19,20,21,22,23 is to distribute quantum entanglement between distant nodes through a chain of intermediate quantum repeater nodes24,25,26,27,28,29,30,31,32,33. In the quantum Internet, the quantum nodes share entangled connections that formulate entangled links1,2,3,4,5,6. The quantum nodes store the quantum states in their local quantum memory for path selection and path recovery purposes1,2,3,8,24,25,26,27,28,29,30,31,32,33. Since setting several attributes must be optimized in parallel in an arbitrary quantum network, the optimization problem formulates a multi-objective procedure. Formally, multi-objective optimization covers the minimization of quantum memory usage time (storage time), the maximization of entanglement throughput (number of transmitted entangled states per second of a particular fidelity) of entangled links1,2,3,4,5,6, and the reduction of the number of entangled links between a source and a target quantum node8,24,25,26,27,28,29,30,31,32,33. However, the problem does not end here since a quantum repeater network can be approached on the quantum transmission (quantum layer) level and on the auxiliary classical communication (classical layer) level that is required for the dynamic functioning of the quantum layer. Therefore, the problem is not just a multi-objective optimization problem in the quantum layer but also a multilayer optimization issue that covers the development of both the quantum and classical layers of a quantum repeater network.

Here, we define a multilayer optimization method for quantum repeater networks. This covers both the quantum layer and the classical layer of a quantum repeater network.

By utilizing the tools of quantum Shannon theory1,2,3,4,5,6,7, the optimization of the quantum layer includes minimizing the usage of quantum memories in the nodes to reduce the storage time of entangled states, the maximization of entanglement throughput of the entangled links, and also these conditions have to be satisfied for the shortest path between a given source node and target quantum node (i.e., a multi-objective optimization of the quantum layer).

The aim of classical-layer optimization is to curtail the cost of auxiliary classical communications, which is required for such optimization. The cost of the classical communication covers all communication costs required to achieve the optimal quantum network state including the classical communication steps for overall quantum storage time minimization, entanglement throughput maximization, and the selection of a shortest path.

The multilayer optimization employs advanced methods to solve the multi-objective optimization of the quantum layer. We define the structures of the quantum memory utilization graph and the entanglement throughput tree for the multi-objective optimization of the quantum layer of a quantum repeater network. The quantum memory utilization graph models the quantum memory usage for entanglement storage. The entanglement throughput tree shows the entanglement throughput of entangled links with respect to the number of transmittable entangled states at a particular fidelity. Using these advanced constructions, we also define a method for the optimal assignment of entangled states in the repeater nodes. The input of the quantum layer optimization procedure is the quantum memory utilization graph, while the output of the method is a set of entanglement throughput trees. The output identifies the optimal states of the quantum network with respect to the multi-objective optimization function.

Classical-layer optimization focuses on the minimization of the total cost of classical communications by utilizing swarm intelligence34,35,36,37,38. This also defines a multi-objective problem since the cost has to be reduced with respect to the classical communication cost required for the minimization of quantum memory usage, the classical cost of entanglement throughput maximization of entangled links, and for the selection of the shortest path in the quantum layer. Classical-layer optimization uses some fundamentals of bacteria foraging models26,30,31,32,33,34,35 and probabilistic multi-objective uncertainty characterization31,39,40,44,45,46.

The optimization framework requires no changes in the physical layer, so the framework is directly implementable by the current physical devices1,2,3,4,5,6,17,18,19 and quantum networking elements8,24,25,26,27,28,29,30,31,32,33. The method is useful in quantum networking environments with diverse physical attributes (different quantum memory characteristics, quantum error correction, physical quantum nodes attributes, and transmission capabilities of noisy quantum links).

The novel contributions of our paper are as follows:

  • We conceive a complex optimization framework for quantum networks. It integrates the development of the quantum and classical layers of quantum repeater networks.

  • Quantum-layer optimization utilizes the attributes of physical-layer quantum transmissions, quantum memory usage, and entanglement distribution via the framework of quantum Shannon theory.

  • Classical-layer optimization focuses on minimizing any auxiliary communications related to the quantum layer and optimization.

  • Multilayer optimization is applicable by current physical devices and quantum networking elements providing a solution for the optimization of arbitrary quantum networking scenarios with diverse physical attributes and environments.

This paper is organized as follows. In Section 2, the system model is proposed. In Section 3, the optimization procedure of the quantum layer of quantum repeater networks is defined. Section 4 studies the optimization of the classical layer. Section 5 provides a performance evaluation. Finally, Section 6 concludes the paper. Supplemental material is included in the Appendix.

System Model

Our model assumes that the quantum repeater network consists of a source and target node with intermediate repeater nodes and a quantum switcher node. A quantum switcher node S operates as follows. Node S is a quantum repeater node capable of switching between the entangled connections stored in its local quantum memory and a permit of applying entanglement swapping on the selected connections. While an i-th quantum repeater node establishes only the entangled connections with the neighbor quantum repeater nodes, a switcher node is equipped with an extended knowledge about the quantum network to select between the entangled links. A general repeater node is not allowed to perform any link selection, since it is assumed in the model that a quantum repeater node has only local knowledge about the network. A switcher node based on its network knowledge can also send entanglement swapping commands to the quantum repeater nodes to define new paths in the network.

Let N be a quantum network, \(N=(V,{\mathscr{S}})\), where V is a set of nodes, \({\mathscr{S}}\) is a set of entangled links. Without loss of generality, the level Ll of an entangled link E(x, y) is defined as follows. For an Ll-level entangled link, the hop distance between quantum nodes x and y is1,11

$$d{(x,y)}_{{{\rm{L}}}_{l}}={2}^{l-1},$$
(1)

with \(d{(x,y)}_{{{\rm{L}}}_{l}}-1\) intermediate nodes between the nodes x and y. The probability that an Ll-level entangled link E(x, y) exists between nodes x, y is \({{\rm{\Pr }}}_{{{\rm{L}}}_{l}}(E(x,y))\), which depends on the actual network.

The S quantum switcher is modeled as a quantum node with the following attributes and permissions:

  • knowledge about the physical attributes of distant quantum repeater nodes and the entangled connections of N (e.g., entanglement fidelity, quantum memory status, link noise, etc),

  • internal quantum memory for the storage of entangled states,

  • quantum functionality:

    • permission to set new entangled connections between its local quantum system and a selected quantum node of the quantum network,

    • permission to switch between the stored entangled states to construct new paths,

  • classical functionality:

    • permission to command distant quantum nodes of N via classical links (to construct new entangled connections in the network, to perform entanglement swapping between the selected nodes, other).

The network model is illustrated in Fig. 1. The example network in Fig. 1(a) consists of six quantum repeater nodes Ri, i = 1, …, 6, and a quantum switcher S that switches between the entangled connections using its local quantum memory. The switcher also can perform entanglement swapping in the network. The switcher node S has knowledge about the physical attributes (e.g., entanglement fidelity, quantum memory status, etc) of quantum repeater nodes to make a decision on a path. The knowledge about the repeater nodes can be transmitted over a classical link to the quantum switcher (classical links are not depicted). As it is depicted in Fig. 1(b), the switcher node has a permission to set new entangled connections via its local quantum state with a selected quantum node. The S switcher node decided to set a new entangled connection between its local quantum system and repeater node R2. A standard quantum repeater is not allowed to perform these operations (except with the direct neighbors in the entanglement distribution phase) without a dedicated command from the switcher.

Figure 1
figure 1

The network model with a quantum switcher S and quantum repeater nodes Ri, i = 1, …, 6. The L1, L2- and L3-level entangled links of N are depicted by gray, blue and orange, respectively (additional nodes are not shown). (a) For a current shortest path \({{\mathscr{P}}}_{1}=\{{R}_{1},{R}_{3},S,{R}_{4},{R}_{6}\}\), the active repeater nodes selected by S are R3 and R4. (b) Node S switches the entangled connections in the local quantum memory from R3 to R2 and from R4 to R6. The switching operation defines a new shortest path \({{\mathscr{P}}}_{2}=\{{R}_{1},{R}_{2},S,{R}_{6}\}\).

Note, path \({{\mathscr{P}}}_{1}\) in Fig. 1(a) provides a shortest path at a particular network situation at an initial network time T1. Since the quantum network N evolves in time (quality of the entangled links, the status of the nodes, internal quantum memories, etc), at a given time T2, by utilizing the functions of the switcher node, the switcher node determined a new shortest path, \({{\mathscr{P}}}_{2}\), as depicted in Fig. 1(b).

Quantum Memory Scheduling

In this section, we define a structure for scheduling quantum memory usage of the quantum nodes called the quantum memory utilization graph \({{\mathscr{G}}}_{m}\). This is a directed graph mapped from the network model, with several abstracted nodes and links.

Proposition 1

The \({{\mathscr{G}}}_{m}\) quantum memory utilization graph is a directed graph with abstract nodes and links to schedule the quantum memory usage mapped from the quantum repeater network.

The \({{\mathscr{G}}}_{m}\) graph of quantum memory utilization is constructed as follows. Assuming n quantum nodes (excluding the S switcher node) in the network, the graph contains n abstracted transmitter nodes and n abstracted receiver nodes with directed connections. Note if the quantum switcher S is modeled as the n-th node and nS = 1, where nS is the number of quantum switchers in N, the \({{\mathscr{G}}}_{m}\) graph also can be constructed via (n − 1) transmitter and (n − 1) receiver quantum nodes. A given N with an arbitrary number of quantum switcher nodes defines a particular \({{\mathscr{G}}}_{m}\), therefore the \({{\mathscr{G}}}_{m}\) graph is a combination of all other possible switcher modes. The \({{\mathscr{G}}}_{m}\) graph contains directed edges between quantum node pairs (Vx|Vy) and (Vy|Vz) of a particular switcher mode Si, i = 1, …, NS, where NS is the total number of switcher modes, depicted via nodes labeled as (x|y) and (y|z).

Let us assume that N has a single switcher node S, and it has two states, S1 and S2. In both modes S1 and S2, repeater node R1 serves as a transmitter node for node R2.

In switcher mode S1, the shared entangled connection defines the following relations. Repeater node R1 serves as a transmitter node for nodes R3 and R4. Node R3 serves as a transmitter node for nodes R4 and R5. Node R4 serves as a transmitter node for nodes R5 and R6. Node R5 serves as a transmitter node for node R6.

In switcher mode S2, the shared entangled connection defines the following relation. Repeater node R2 serves as a transmitter node for node R6.

The \({{\mathscr{G}}}_{m}\) graph of quantum memory utilization derived from the quantum network setting of Fig. 1. is illustrated in Fig. 2.

Figure 2
figure 2

The \({{\mathscr{G}}}_{m}\) graph of quantum memory utilization derived from the network setting in Fig. 1. The graph contains n abstracted transmitter nodes and n abstracted receiver nodes with directed entangled connections. An i-level entangled connect is depicted by Li. The S quantum switcher has two states, S1 and S2. The green circles represent quantum nodes operating on switcher mode S1. The yellow circles represent nodes operating on switcher mode S2.

Entanglement Throughput Tree

In this section, we define the structure of the entanglement throughput tree, which aims to extract information from the quantum memory utilization graph. The entanglement throughput tree is also the output format of the multi-objective optimization procedure of the quantum layer.

Lemma 1

The \({{\mathscr{G}}}_{et}\) entanglement throughput tree is a structure modeling the multi-objective optimization problem of quantum repeater networks. The tree structure is derived from the \({{\mathscr{G}}}_{m}\) quantum memory utilization graph.

Proof.

As a given source node is selected, the next nodes are added to the path with a given probability.

Let us assume that \({{\mathscr{G}}}_{m}\) is determined. Let us index the nodes by an ID identifier tag, ID = {A, B, …}, and let SI be the set of unvisited neighbor nodes of a node I.

Let \({B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J))\) refer to the entanglement throughput of a given Ll-level entangled link \({E}_{{{\rm{L}}}_{l}}(I,J)\) between nodes (I, J) measured in the number of d-dimensional entangled states per second at a particular fidelity F1,3,4.

Further, let J be a neighbor node of I with entangled connection \({E}_{{{\rm{L}}}_{l}}(I,J)\) and with entanglement throughput \({B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J))\).

A cost function, Ω(I, J), between nodes (I, J) is defined as

$${\rm{\Omega }}(I,J)=\frac{1}{C({E}_{{{\rm{L}}}_{l}}(I,J))+{\zeta }_{J}},$$
(2)

where \(C({E}_{{{\rm{L}}}_{l}}(I,J))\) is the cost of entangled link \({E}_{{{\rm{L}}}_{l}}(I,J)\) defined as

$$C({E}_{{{\rm{L}}}_{l}}(I,J))=\frac{1}{{B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J))},$$
(3)

while ζJ is the cost of quantum storage in node J.

Let \({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}\) be the entanglement utility coefficient of entangled link \({E}_{{{\rm{L}}}_{l}}(I,J)\) between nodes I and J, initialized as \({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}\ge 0\). This amount is equivalent to the utility of the entangled link \({E}_{{{\rm{L}}}_{l}}(I,J)\) that it has taken to arrive at the current node J from I.

At a given \({B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J))\), the initial \({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}\) entanglement utility4 of link \({E}_{{{\rm{L}}}_{l}}(I,J)\) is updated to \({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}^{^{\prime} }\) as

$$\begin{array}{rcl}{\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}^{^{\prime} } & = & {(\frac{1}{{\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}}+{B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J)))}^{-1}\\ & = & \frac{{\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}}{1+{B}_{F}({E}_{{{\rm{L}}}_{l}}(I,J)){\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}}.\end{array}$$
(4)

Using these cost functions, the Pr(I, J) probability that from node I a node J is selected is as follows:

$${\rm{\Pr }}(I,J)=\{\begin{array}{ll}\frac{{({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,J)}^{^{\prime} })}^{{\omega }^{\ast }}{({\rm{\Omega }}(I,J))}^{\delta }}{{\sum }_{\forall X\in {S}_{I}}\,{({\lambda }_{{E}_{{{\rm{L}}}_{l}}(I,X)}^{^{\prime} })}^{{\omega }^{\ast }}{({\rm{\Omega }}(I,X))}^{\delta }}, & {\rm{if}}\,J\in {S}_{I}\\ 0,{\rm{otherwise}} & \end{array},$$
(5)

where ω* and δ are weighting coefficients4.

Using (2), (3) and (5) for all node pairs, the \( {\mathcal M} \) method to build a random \({{\mathscr{G}}}_{et}\) entanglement throughput tree using a \({{\mathscr{G}}}_{m}\) graph is as follows36,37.

Let S′ refer to the set of already reached destination nodes, and let \( {\mathcal I} \) be the set of initial nodes, \({ {\mathcal F} }_{I}\) the set of feasible neighboring nodes to node I. Let \({\mathscr{D}}\) be the set of destination nodes.

The method is given in Procedure 1.

Procedure 1
figure a

Random Entanglement Throughput Tree Construction.

The proof is concluded here.■

The structure of a \({{\mathscr{G}}}_{et}\) entanglement throughput tree is illustrated in Fig. 3.

Figure 3
figure 3

The structure of a \({{\mathscr{G}}}_{et}\) entanglement throughput tree. A quantum node has an ID identifier tag, ID = {A, B, …, L}, and all incoming and outcoming entangled links are identified by the sID source neighbor node and the tID target neighbor node. Node A represents a source, while the destination nodes are J, K, L. The \({B}_{F}({E}_{{{\rm{L}}}_{l}}(x,y))\) entanglement throughput of all (x, y) node pairs are depicted above the directed lines; the link identifier \({E}_{{{\rm{L}}}_{l}}(x,y)\) is depicted under the links.

Entanglement Assignment Cycle

In this section, we propose a solution for an optimal assignment (scheduling) of stored entanglement called the entanglement assignment cycle, \({\alpha }_{{{\mathscr{G}}}_{et}}\). The goal of \({\alpha }_{{{\mathscr{G}}}_{et}}\) is to achieve a minimal overall storage time \({t}_{s}^{\ast }({{\mathscr{G}}}_{et})\) at a given \({{\mathscr{G}}}_{et}\) entanglement throughput tree.

Lemma 2

An entanglement assignment cycle can be determined by a weighted graph coloring method.

Proof.

To determine the minimal overall storage time for a \({{\mathscr{G}}}_{et}\) entanglement throughput tree, the \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\) conflict graph of that \({{\mathscr{G}}}_{et}\) is constructed first. In the \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\) graph, each vertex corresponds to a directed link of \({{\mathscr{G}}}_{et}\) (an entangled connection). An edge exists between two vertices of \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\), if only the vertices (entangled connections) have a conflict. A conflict occurs if two (stored) entangled connects are associated to the same physical link. The problem is therefore to associate each link of \({{\mathscr{G}}}_{et}\) a \({\alpha }_{{{\mathscr{G}}}_{et}}\) storage schedule (optimal assignment of stored entanglement), which includes the list of time slots when a given link can transmit the stored entangled states such that \({\alpha }_{{{\mathscr{G}}}_{et}}\) total number of time units is minimized. Therefore, our goal is to determine what entangled connects of the given \({{\mathscr{G}}}_{et}\) should be scheduled in which time unit, such that the total storage time is minimal in \({\alpha }_{{{\mathscr{G}}}_{et}}\).

Let τn,t {0, 1} be an indicator variable, defined as

$${\tau }_{n,t}=\{\begin{array}{ll}1, & {\rm{if}}\,n\,{\rm{is}}\,{\rm{associated}}\,{\rm{at}}\,t\\ {\rm{0}}, & {\rm{otherwise}}\end{array}$$
(6)

For a periodic scheduling,

$${\tau }_{n,t}={\tau }_{n,t+iT},$$
(7)

where T is a period and i is a constant.

For an entangled connection n, let us define \(\wedge (n)\) as the set of entangled connects n′ that are scheduled in the same time unit t, but the physical link can transmit only n or n′. As follows, for any \(n^{\prime} \in \wedge (n)\),

$${\tau }_{n,t}+{\tau }_{n^{\prime} ,t}\le 1.$$
(8)

As it can be concluded, this problem is analogous to the coloring of the conflict graph \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\).

Then, let us assume that each entangled connection (entangled link) has a weight w(n), which is defined as

$$w(n)=\{\begin{array}{ll}1, & {\rm{if}}\,{F}_{i}={F}_{{\rm{\max }}}\\ \lceil \frac{{F}_{{\rm{\max }}}}{{F}_{i}}\rceil , & {\rm{if}}\,{F}_{i} < {F}_{{\rm{\max }}}\end{array},$$
(9)

where Fi is the fidelity of entangled connection i and Fmax is the largest fidelity available. As follows, for a lower-fidelity entangled connection, the transmission of a given amount of information requires more time units.

As the weights are determined, the problem is analogous to a weighted graph coloring of the conflict graph \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\), which is the assignment of at least w(n) distinct colors to each entangled connection n such that no two entangled connections sharing the same color interfere with each other on the physical link. For this purpose, a simple distributed weighted coloring algorithm46 can be straightforwardly applied.

The method for the \({\mathscr{W}}({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}})\) weighted coloring of conflict graph \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}\) is summarized in Procedure 2.

Procedure 2
figure b

Weighted Coloring of a Conflict Graph.

After an entanglement assignment cycle has been determined by the weighted graph coloring method, in the next step the \({\rm{\Delta }}({\mathscr{W}}({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}}))\) time intervals between each time unit of a given cycle is computed. For this purpose, a linear programming method44 can be applied.■

Quantum-Layer Optimization

In this section, we define the multi-objective optimization for the quantum layer. The multi-objective function covers a parallel optimization of quantum memory usage and entanglement throughput for a shortest path.

Theorem 1

At a given \({{\mathscr{G}}}_{m}\), the quantum layer of a quantum repeater network N can be optimized by a procedure \({{\mathscr{P}}}_{Q}\) that achieves a parallel minimization of the quantum memory usage time ts, the maximization of entanglement throughput BF, and the minimization of the number \(|{\mathscr{P}}|\) of entangled links between two arbitrary nodes.

Proof.

The aim of the procedure is to find an optimal entanglement throughput tree \({{\mathscr{G}}}_{et}^{\ast }\) for which the \({t}_{s}({{\mathscr{G}}}_{et}^{\ast })\) overall storage time is minimal, that is,

$${t}_{s}({{\mathscr{G}}}_{et}^{\ast })={t}_{s}^{\ast }({{\mathscr{G}}}_{et}^{\ast });$$
(10)

the \({B}_{F}({{\mathscr{G}}}_{et}^{\ast })\) entanglement throughput for all links is maximal,

$${B}_{F}({{\mathscr{G}}}_{et}^{\ast })={B}_{F}^{\ast }({{\mathscr{G}}}_{et}^{\ast });$$
(11)

and the \(|{\mathscr{P}}({{\mathscr{G}}}_{et}^{\ast })|\) number of entangled links of a path is minimal, thus

$$|{\mathscr{P}}({{\mathscr{G}}}_{et}^{\ast })|=|{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et}^{\ast })|,$$
(12)

where \({{\mathscr{P}}}^{\ast }\) is the shortest path.

The procedure is based on the fact that for a given \({{\mathscr{G}}}_{et}^{\ast }\) with conflict graph \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}^{\ast }}\), from the knowledge of \({\mathscr{W}}({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}^{\ast }})\) weighted coloring of conflict graph \({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}^{\ast }}\), \({\rm{\Delta }}({\mathscr{W}}({{\mathscr{C}}}_{{{\mathscr{G}}}_{et}^{\ast }}))\) time intervals between each time unit of a given cycle, the required objective values of (10), (11), and (12) can be determined.

Let \({{\mathscr{S}}}_{{{\mathscr{G}}}_{et}}^{\ast }\) be the set of optimal \({{\mathscr{G}}}_{et}^{\ast }\) entanglement throughput trees. Then the aim of the procedure is to determine \({{\mathscr{S}}}_{{{\mathscr{G}}}_{et}}^{\ast }\).

The problem can be rewritten via a solution set X with decision variables44

$${\rm{X}}=\{{x}_{1},\ldots ,{x}_{n}\},$$
(13)

where n is the number of all links in a given quantum memory utilization graph \({{\mathscr{G}}}_{m}\), and xi {0, 1} is defined as

$${x}_{i}=\{\begin{array}{ll}1, & {\rm{if}}\,{\rm{link}}\,i\,{\rm{of}}\,{{\mathscr{G}}}_{m}\,{\rm{is}}\,{\rm{selected}}\,{\rm{by}}\, {\mathcal M} \\ 0, & {\rm{otherwise}}.\end{array}$$
(14)

Let \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) be a solution i from solution set \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}\). Let \({{\rm{X}}}_{{{\mathscr{G}}}_{et},j}\angle {{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) refer to solution \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) dominating solution \({{\rm{X}}}_{{{\mathscr{G}}}_{et},j}\), which is

$$\begin{array}{lll}{X}_{{{\mathscr{G}}}_{et},j}\angle {X}_{{{\mathscr{G}}}_{et},i} & : & \{{t}_{s}^{\ast }({{\mathscr{G}}}_{et},i)\le {t}_{s}^{\ast }({{\mathscr{G}}}_{et},j),\\ & & {B}_{F}^{\ast }({{\mathscr{G}}}_{et},i)\ge {B}_{F}^{\ast }({{\mathscr{G}}}_{et},j),\\ & & |{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},i)|\le |{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},j)|\},\end{array}$$
(15)

and for these relations, there is at least strict inequality44. If \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) dominates all other possible solutions, then \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) is a set of nondominated solutions.

For each set X, let κ refer to the set that contains the best nondominated solutions that have been found at a particular iteration. If κ changes, then the entanglement utilities of the links are updated. If κ is stationary, then the elements of set κ are used to update the entanglement utilities. The former serves as an improvement in the exploration process whereas the latter aims to yield more information via the best solutions44.

At a given graph \({{\mathscr{G}}}_{m}\) of quantum memory utilization, the procedure \({{\mathscr{P}}}_{Q}\) for the optimization of the quantum layer is defined as follows. The output of \({{\mathscr{P}}}_{Q}\) is an optimal set \({{\mathscr{S}}}_{{{\mathscr{G}}}_{et}}^{\ast }\) of entanglement throughput trees that realize the conditions of the multi-objective optimization function. The steps of the quantum layer optimization are summarized in Algorithm 1.

Note if there are no nondominated solutions, then the values of the weighting coefficients ρ, ς, and \(\Upsilon \) in Step 5 of Algorithm 1 can be selected according to the actual trade-off requirements.

The proof is therefore concluded here.■

Algorithm 1
figure c

Quantum Layer Optimization.

Classical-Layer Optimization

In this section we characterize the classical layer optimization procedure.

Theorem 2

The cost function of classical communications can be minimized by a procedure \({{\mathscr{P}}}_{C}\).

Proof.

Let

$$\begin{array}{l}{f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i})\\ \begin{array}{rcl} & = & (\sum _{g=1}^{{N}_{{t}_{s}^{\ast }}}\,{S}_{{t}_{s}^{\ast },g}(i)+{c}_{{N}_{{t}_{s}^{\ast },g}}^{L}(i))+(\sum _{g=1}^{{N}_{{B}_{F}^{\ast }}}\,{S}_{{B}_{F}^{\ast },g}(i)+{c}_{{N}_{{B}_{F}^{\ast },g}}^{L}(i))\\ & & +\,(\sum _{g=1}^{{N}_{|{{\mathscr{P}}}^{\ast }|}}\,{S}_{|{{\mathscr{P}}}^{\ast }|,g}(i)+{c}_{{N}_{|{{\mathscr{P}}}^{\ast }|},g}^{L}(i))\end{array}\end{array}$$
(16)

be the cost function of classical communication of the multilayer optimization procedure, where Θi Rp is a p-dimensional real vector of an i-th system state of the quantum network, \({N}_{{t}_{s}^{\ast }}\), \({N}_{{B}_{F}^{\ast }}\), \({N}_{|{{\mathscr{P}}}^{\ast }|}\) are the number of nodes that require the determination of optimal \({t}_{s}^{\ast }\), \({B}_{F}^{\ast }\) and \(|{{\mathscr{P}}}^{\ast }|\), \({S}_{{t}_{s}^{\ast },g}(i)\), \({S}_{{B}_{F}^{\ast },g}(i)\), \({S}_{|{{\mathscr{P}}}^{\ast }|,g}(i)\) refer to the number of classical steps required to find \({t}_{s}^{\ast }\), \({B}_{F}^{\ast }\) and \(|{{\mathscr{P}}}^{\ast }|\) for a particular node g of network N, while \({c}_{{N}_{{t}_{s}^{\ast },g}}^{L}(i)\), \({c}_{{N}_{{B}_{F}^{\ast },g}}^{L}(i)\), and \({c}_{{N}_{|{{\mathscr{P}}}^{\ast }|,g}}^{L}(i)\) are the costs of classical link L used for the determination of \({t}_{s}^{\ast }\), \({B}_{F}^{\ast }\) and \(|{{\mathscr{P}}}^{\ast }|\) for a given g.

Then, let introduce indices for Θi as

$${{\rm{\Theta }}}_{i}(j,k,l),$$
(17)

where j is the index of a desired optimal system state, k is the index of an optimal system state reproduction step, l is the index of a non-optimal system state event39.

Let assume that there is a set of S sub-states {Θ1, …,ΘS} in the network, then the T total network state is evaluated as

$$T(j,k,l)=\{{{\rm{\Theta }}}_{i}(j,k,l)|i=1,\ldots ,S\}.$$
(18)

Let \({f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j,k,l))\) be the cost function of classical communication at a given \({{\rm{\Theta }}}_{i}(j,k,l)|\). Then, if

$${f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j+1,k,l)) < {f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j,k,l)),$$
(19)

then the system state evolves from j to j + 1 as

$${{\rm{\Theta }}}_{i}(j+1,k,l)={{\rm{\Theta }}}_{i}(j,k,l)+c(i)u(j),$$
(20)

where c(i) is the number of random system states, while u(j) quantifies a unit cost of system change39,40.

Therefore, for a set of S sub-states {Θ1, …,ΘS} and current vector \({\rm{\Theta }}\in {{\mathbb{R}}}^{p}\), the CN total cost of classical communication is yielded as

$${C}_{N}({\rm{\Theta }},{{\rm{\Theta }}}_{i}(j,k,l))=\sum _{i=1}^{S}\,{f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j,k,l)),\,i=1,\ldots ,S,$$
(21)

which can be rewritten39 as

$$\begin{array}{l}{C}_{N}({\rm{\Theta }},T(j,k,l))\\ \begin{array}{rcl} & = & \sum _{i=1}^{S}\,(-{\rm{A}}{e}^{-{R}_{{\rm{A}}}}\,\sum _{m=1}^{p}\,{({{\rm{\Theta }}}^{(m)}-{{\rm{\Theta }}}_{i}^{(m)})}^{2})+\sum _{i=1}^{S}\,(\nu {e}^{-{R}_{\nu }}\,\sum _{m=1}^{p}\,{({{\rm{\Theta }}}^{(m)}-{{\rm{\Theta }}}_{i}^{(m)})}^{2})\end{array}\end{array}$$
(22)

where A is the distribution-entity of a current system state, RA is the information transmission rate of A, ν is the distribution-entity of a system state, Rν is the information transmission rate of ν, while Θ(m) is the m-th element of a current network state vector Θ, and \({{\rm{\Theta }}}_{i}^{(m)}\) is the m-th element of Θi.

Using CN(Θ, T(j, k, l)) (see (22)), an environment-dependent cost function Ce is defined as

$${C}_{e}={C}_{N}({\rm{\Theta }},T(j,k,l)){e}^{(M-{f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j,k,l)))},$$
(23)

where M is a tuning parameter39.

From (23), the \({F}_{cost}^{i}\) cost function at a given Θi(j, k, l) is defined as

$${F}_{\cos \,t}^{i}={f}_{{t}_{s}^{\ast },{B}_{F}^{\ast },|{{\mathscr{P}}}^{\ast }|}({{\rm{\Theta }}}_{i}(j,k,l))+{C}_{e}({{\rm{\Theta }}}_{i}(j,k,l)).$$
(24)

Using the proposed basic model, we define an optimization procedure \({{\mathscr{P}}}_{C}\) of the classical-layer to achieve a CN(Θ, T(j, k, l)) minimized cost function. The method is summarized in Algorithm 2.

These results conclude the proof.■

Algorithm 2
figure d

Classical Layer Optimization.

Large-Constrained Optimization

The optimization efficiency can be further improved for a large constrained network scenario. This step can be replayed by a different approach, which allows an optimization of the classical layer for an arbitrary constrained setting. The solution is based on the idea of system state merging.

Lemma 3

The optimization of the classical-layer can be extended to a large-constrained optimization by state merging.

Proof.

An extended model of the classical-layer optimization is as follows.

Let \({N}_{{t}_{s}^{\ast }}\), \({N}_{{B}_{F}^{\ast }}\), \({N}_{|{{\mathscr{P}}}^{\ast }|}\) be the number of nodes that require the determination of optimal \({t}_{s}^{\ast }\), \({B}_{F}^{\ast }\) and \(|{{\mathscr{P}}}^{\ast }|\). Then, let J be the objective function34, as

$$J=\,{\rm{\min }}\,\sum _{t=1}^{T}\,({\alpha }_{{t}_{s}^{\ast }}(t)+{\alpha }_{{B}_{F}^{\ast }}(t)+{\alpha }_{|{{\mathscr{P}}}^{\ast }|}(t)),$$
(25)

where

$${\alpha }_{{t}_{s}^{\ast }}(t)=\sum _{g=1}^{{N}_{{t}_{s}^{\ast }}}\,{S}_{{t}_{s}^{\ast },g}(t)+{c}_{{N}_{{t}_{s}^{\ast },g}}^{L}(t),$$
(26)
$${\alpha }_{{B}_{F}^{\ast }}(t)=\sum _{g=1}^{{N}_{{B}_{F}^{\ast }}}\,{S}_{{B}_{F}^{\ast },g}(t)+{c}_{{N}_{{B}_{F}^{\ast },g}}^{L}(t),$$
(27)

and

$${\alpha }_{|{{\mathscr{P}}}^{\ast }|}(t)=\sum _{g=1}^{{N}_{|{{\mathscr{P}}}^{\ast }|}}\,{S}_{|{{\mathscr{P}}}^{\ast }|,g}(t)+{c}_{{N}_{|{{\mathscr{P}}}^{\ast }|,g}}^{L}(t),$$
(28)

where the coefficients \({S}_{{t}_{s}^{\ast },g}(t)\), \({S}_{{B}_{F}^{\ast },g}(t)\), \({S}_{|{{\mathscr{P}}}^{\ast }|,g}(t)\) refer to the number of classical steps required to find \({t}_{s}^{\ast }\), \({B}_{F}^{\ast }\) and \(|{{\mathscr{P}}}^{\ast }|\) at a particular network node g and time t, t = 1, …, T, while \({c}_{{N}_{{t}_{s}^{\ast },g}}^{L}(t)\), \({c}_{{N}_{{B}_{F}^{\ast },g}}^{L}(t)\), \({c}_{{N}_{|{{\mathscr{P}}}^{\ast }|,g}}^{L}(t)\) are the costs of classical link L at a particular g and t. Let assume that Θa(j, k, l), Θb(j, k, l) and Θc(j, k, l) are some system states of the classical layer subject of state merging. From these sub-states, the ΘM(j, k, l) merged system is defined as

$${{\rm{\Theta }}}_{M}(j,k,l)={{\rm{\Theta }}}_{a}(j,k,l)+{\rm{\Phi }}({{\rm{\Theta }}}_{b}(j,k,l)-{{\rm{\Theta }}}_{c}(j,k,l)),$$
(29)

where Φ is a merging factor, Φ  [0, 1].

Using (29), an i-th system state Θi(j, k, l) is updated as

$${{\rm{\Theta }}}_{i}(j,k,l)={({\tilde{{\rm{\Theta }}}}_{i}^{a}(j,k,l),{\tilde{{\rm{\Theta }}}}_{i}^{b}(j,k,l),{\tilde{{\rm{\Theta }}}}_{i}^{c}(j,k,l))}^{T},$$
(30)

where

$${\tilde{{\rm{\Theta }}}}_{i}^{a}(j,k,l)=\{\begin{array}{ll}{{\rm{\Theta }}}_{i}(j,k,l), & {\rm{if}}\,{x}_{a} > u\\ {{\rm{\Theta }}}_{M}(j,k,l), & {\rm{otherwise}}\end{array},$$
(31)
$${\tilde{{\rm{\Theta }}}}_{i}^{b}(j,k,l)=\{\begin{array}{ll}{{\rm{\Theta }}}_{i}(j,k,l), & {\rm{if}}\,{x}_{b} > u\\ {{\rm{\Theta }}}^{\ast }(j,k,l), & {\rm{otherwise}}\end{array},$$
(32)

and

$${\tilde{{\rm{\Theta }}}}_{i}^{c}(j,k,l)=\{\begin{array}{ll}{{\rm{\Theta }}}^{\ast }(j,k,l), & {\rm{if}}\,{x}_{c} > u\\ {{\rm{\Theta }}}_{M}(j,k,l), & {\rm{otherwise}}\end{array},$$
(33)

where u is a uniform random number, xa, xb and xc are random numbers, xa, xb, xc [0, 1], Θ*(j, k, l) is a system state that minimizes the objective function J. Then for Q (a, b, c), the objective function value \(J({\tilde{{\rm{\Theta }}}}_{i}^{Q}(j,k,l))\) is compared with the objective function Ji(j, k, l)) of Θi(j, k, l), and \({\tilde{{\rm{\Theta }}}}_{i}^{Q}(j,k,l)\) is updated by the following rule:

$${\tilde{{\rm{\Theta }}}}_{i}^{Q}(j,k,l)=\{\begin{array}{ll}{{\rm{\Theta }}}_{i}(j,k,l), & {\rm{if}}\,J({\tilde{{\rm{\Theta }}}}_{i}^{Q}(j,k,l)) < J({{\rm{\Theta }}}_{i}(j,k,l))\\ {\tilde{{\rm{\Theta }}}}_{i}^{Q}(j,k,l), & {\rm{otherwise}}.\end{array},$$
(34)

The proof is concluded here.■

Cost Uncertainty of Large-Scaled Optimization

The determination of the minimal cost function (25) is can be approached by an output variable OJ = f(ψin), where ψin is the set of input variables, and f(·) is a function that transfers the uncertainty from the independent input random variables ψin to the output variable OJ34. The output set OJ can be rewritten as

$${O}_{J}=f(q,{w}_{1},\ldots ,{w}_{z}),$$
(35)

where q is the set of certain variables, while wi is an input variable under certainty with probability function \({\delta }_{{f}_{{w}_{i}}}\).

Then, for a given variable wi, two pc(wi) probability concentrations34, pc(1)(wi) and pc(2)(wi) are defined as

$$p{c}^{(1)}({w}_{i})=({w}_{i,1},{\zeta }_{i,1})$$
(36)

and

$$p{c}^{(2)}({w}_{i})=({w}_{i,2},{\zeta }_{i,2}),$$
(37)

where wi,g, is the poth location of wi34, g = 1, 2, while ζi,g is a weighting factor.

Therefore, at a given (i, g) parameterization, where i = 1, …, z and g = 1, 2, OJ is expressed by variable \({O}_{J}^{(i,g)}\) as

$${O}_{J}^{(i,g)}=f(q,{w}_{i,g},{\mu }_{{w}_{1}},{\mu }_{{w}_{2}},\ldots ,{\mu }_{{w}_{z}}),$$
(38)

where \({\mu }_{{w}_{i}}\) is the mean of wi, while wi,1 and wi,2 are the poth locations of wi. Therefore, the problem is reduced to 2z deterministic equations34, from which the mean and standard deviation of the output random variable \({O}_{J}^{(i,g)}\) can be computed.

Performance Evaluation

In this section we study the performance of the proposed quantum layer and classical layer optimization methods.

Quantum Layer Optimization

To study the convergence of the quantum layer optimization, we characterize the D(·) closeness function of the elements of the solution set from an optimal Pareto front, and the ζ(·) ratio of optimal solutions in the solution set.

Let \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}=\{{t}_{s}^{\ast }({{\mathscr{G}}}_{et},i),{B}_{F}^{\ast }({{\mathscr{G}}}_{et},i),|{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},i)|\}\) be an i-th solution from the solution set \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it})\) found at a Nit finite number of iterations, and let \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty }=\{{t}_{s}^{\ast }({{\mathscr{G}}}_{et},z),{B}_{F}^{\ast }({{\mathscr{G}}}_{et},z),|{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},z)|\}\) refer to a solution z from \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty )\) at Nit → ∞. Then, let \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) be the distance on the Pareto front44,45,46 between \({{\rm{X}}}_{{{\mathscr{G}}}_{et},i}\) and \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty }\), defined as

$$D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })=\tfrac{1}{\chi }(A+B+C),$$
(39)

with \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\in [0,1]\), and where

$$A=\frac{|{t}_{s}^{\ast }({{\mathscr{G}}}_{et},z)-{t}_{s}^{\ast }({{\mathscr{G}}}_{et},i)|}{{t}_{s}^{\ast }({{\mathscr{G}}}_{et},z)},$$
(40)
$$B=\frac{|{B}_{F}^{\ast }({{\mathscr{G}}}_{et},z)-{B}_{F}^{\ast }({{\mathscr{G}}}_{et},i)|}{{B}_{F}^{\ast }({{\mathscr{G}}}_{et},z)},$$
(41)

and

$$C=\frac{||{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},z)|-|{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},i)||}{|{{\mathscr{P}}}^{\ast }({{\mathscr{G}}}_{et},z)|},$$
(42)

while χ is a control parameter, χ > 0.

Then, let \(\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))\) be a ratio of the solution sets \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it})\) and \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty )\) found at finite Nit and Nit → ∞ as

$$\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))=\tfrac{card({X}_{{{\mathscr{G}}}_{et}}({N}_{it})\wedge {X}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))}{card({X}_{{{\mathscr{G}}}_{et}}({N}_{it}))},$$
(43)

where \(card\,({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}))\) is the cardinality of set \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it})\), while \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it})\wedge {{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty )\) is the intersection of solution sets \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it})\) and \({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty )\) (solutions included by both solution sets). Note, since the χ control parameter in (39) determines the D distance from the optimal set, χ also affects the ratio of the solution sets (43). If the value of χ is high, the D distance in (39) is low, that results in a high cardinality of the intersection set in (43). If χ is low, the D distance in (39) is high, therefore the cardinality of the intersection set is small.

The quantity of (39) for various Nit and χ are depicted in Fig. 4. From the results it can be concluded that as Nit increases, the \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) distance significantly decreases, while the speed of convergence is controllable by χ. It also can be verified that the χ control parameter has a significant impact on \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\), and the \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) distance can be made arbitrarily small via a moderate value for Nit.

Figure 4
figure 4

The \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) distance in function of Nit and χ. (a) The values of \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) for 0 < Nit ≤ 2 · 103 and 1 ≤ χ ≤ 10. (b) The values of \(D({{\rm{X}}}_{{{\mathscr{G}}}_{et},i},{{\rm{X}}}_{{{\mathscr{G}}}_{et}}^{\infty })\) for 0 < Nit ≤ 2 · 103 and 11 ≤ χ ≤ 20.

In Fig. 5, the quantity of (43) is illustrated for different values of Nit. As Nit increases the \(\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))\) ratio increases significantly, while the χ control parameter has a moderate effect on the ratio of (43). The ratio exceeds 0.5 at Nit ≈ 0.5 · 103, and can be increased to arbitrarily high via a moderate increment in Nit.

Figure 5
figure 5

The ratio \(\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))\) in function of Nit and χ. (a) The values of \(\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))\) for 0 < Nit ≤ 2 · 103 and 1 ≤ χ ≤ 10. (b) The values of \(\zeta ({{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}),{{\rm{X}}}_{{{\mathscr{G}}}_{et}}({N}_{it}\to \infty ))\) for 0 < Nit ≤ 2 · 103 and 11 ≤ χ ≤ 20.

The performance of the quantum layer optimization method is therefore approachable via the distance function (39) and ratio (43). The analysis revealed that at moderate Nit values, the precision of the optimization method can be arbitrary high via the selection of the χ control parameter. It also has been concluded, that a high ratio of the solutions at a finite and moderate Nit, are identical to the solutions at the limit case of Nit → ∞.

Classical Layer Optimization

Let ϕs(i) be the step-size function defined for an i-th state as

$${\varphi }_{s}(i)={\varphi }_{{\rm{\max }}}-{\rm{\Delta }}(\varphi )\cdot \exp (\tfrac{-f(i,j,k,l)}{j}),$$
(44)

where

$${\rm{\Delta }}(\varphi )=({\varphi }_{{\rm{\max }}}-{\varphi }_{{\rm{\min }}}),$$
(45)

where \({\varphi }_{{\rm{\min }}}\) and \({\varphi }_{{\rm{\max }}}\) are some lower and upper bounds, \({\varphi }_{s}(i)\in [{\varphi }_{{\rm{\min }}},{\varphi }_{{\rm{\max }}}]\), while f(i, j, k, l) is defined as

$$f(i,j,k,l)=\omega \tfrac{{J}^{\ast }}{{J}_{P}},$$
(46)

where ω is a constant (restriction factor41,42,43), J* is the minimal cost function (for J, see (25)) at a particular parameter setting (i, j, k, l), JP is the minimal cost function associated to the population.

Then, at an iteration number j let κ(j) be a ratio as

$$\kappa (j)=\tfrac{f(i,j,k,l)}{j}.$$
(47)

The step-size (44) in function of \({\varphi }_{{\rm{\min }}}\) and κ(j) is depicted in Fig. 6.

Figure 6
figure 6

(a) The ϕs(i) step-size function for \({\varphi }_{{\rm{\min }}}\in [0,2\cdot {10}^{-3}]\), \({\varphi }_{{\rm{\max }}}=500{\varphi }_{{\rm{\min }}}\) and κ(j)  [0, 10]. (b) The ϕs(i) step-size function for \({\varphi }_{{\rm{\min }}}\in [0,5\cdot {10}^{-1}]\), \({\varphi }_{{\rm{\max }}}=2{\varphi }_{{\rm{\min }}}\) and κ(j)  [0, 40].

From (44) and (46), the optimal cost function J* at a particular setting of (i, j, k, l) is yielded as

$${J}^{\ast }=-\tfrac{\mathrm{ln}(\tfrac{{\varphi }_{{\rm{\max }}}-{\varphi }_{s}(i)}{{\rm{\Delta }}(\varphi )})j}{\omega }{J}_{P}.$$
(48)

Then, let

$$\kappa (j)=x,$$
(49)

from which the iteration number can be rewritten as

$$j=\tfrac{f(i,j,k,l)}{x},$$
(50)

and also fix f(i, j, k, l) as

$$f(i,j,k,l)={x}^{2}.$$
(51)

Then, from (49) and (51), J* can be evaluated at a given x in function JP and ω, as

$${J}^{\ast }=\tfrac{{x}^{2}{J}_{P}}{\omega }.$$
(52)

From (50) and (51), the function in (48) can be rewritten as

$$\begin{array}{rcl}{J}^{\ast } & = & -\tfrac{\mathrm{ln}(\tfrac{{\varphi }_{{\rm{\max }}}-{\varphi }_{s}(i)}{{\rm{\Delta }}(\varphi )})f(i,j,k,l)}{\omega x}{J}_{P}\\ & = & -\tfrac{\mathrm{ln}(\tfrac{{\varphi }_{{\rm{\max }}}-{\varphi }_{s}(i)}{{\rm{\Delta }}(\varphi )})x}{\omega }{J}_{P},\end{array}$$
(53)

therefore the cost function J* at a particular (49) and (51) can also be evaluated in function of the step size (44).

The J* cost function values associated to the ϕs(i) step size function values of Fig. 6 at JP = 1 and JP = 1 and ω = 100 are depicted in Fig. 7.

Figure 7
figure 7

The J* cost function values associated to the ϕs(i) step size function value, (a) JP = 1 and ω = 1, (b) JP = 1 and ω = 100.

The analysis is therefore revealed that for any ϕs(i), the cost function J* increases with the κ(j) ratio, and the values of J* are tunable via the control parameter ω for arbitrary JP and κ(j) values. The proposed classical layer optimization model is therefore flexible, and allows a dynamics adaption for diverse environmental settings. As future work, our aim is to provide a transmission analysis and comparisons with other schemes.

Conclusions

Here we conceived a multilayer optimization method for the quantum Internet. Multilayer optimization defines separate procedures for the optimization of the quantum layer and the classical layer. Quantum-layer optimization defines a multi-objective function by minimizing the total usage of quantum memories in the quantum nodes, maximizing the entanglement throughput over all entangled links, and reducing the number of entangled links between the arbitrary source and target quantum nodes. We defined the structure of the quantum memory utilization graph and entanglement throughput tree. The classical-layer optimization utilizes the fundaments of swarm intelligence for the minimization of the cost function. Since the proposed multilayer optimization method has no physical layer requirements, it can serve as a useful tool for quantum network communications and future quantum Internet.