Asymmetric quantum decision-making

Shiratori, Honoka; Shinkawa, Hiroaki; Röhm, André; Chauvet, Nicolas; Segawa, Etsuo; Laurent, Jonathan; Bachelier, Guillaume; Yamagami, Tomoki; Horisaki, Ryoichi; Naruse, Makoto

doi:10.1038/s41598-023-41715-z

Download PDF

Article
Open access
Published: 05 September 2023

Asymmetric quantum decision-making

Honoka Shiratori¹,
Hiroaki Shinkawa¹,
André Röhm¹,
Nicolas Chauvet¹,
Etsuo Segawa²,
Jonathan Laurent³,
Guillaume Bachelier³,
Tomoki Yamagami¹,
Ryoichi Horisaki¹ &
…
Makoto Naruse¹

Scientific Reports volume 13, Article number: 14636 (2023) Cite this article

752 Accesses
1 Citations
Metrics details

Subjects

Abstract

Collective decision-making plays a crucial role in information and communication systems. However, decision conflicts among agents often impede the maximization of potential utilities within the system. Quantum processes have shown promise in achieving conflict-free joint decisions between two agents through the entanglement of photons or the quantum interference of orbital angular momentum (OAM). Nonetheless, previous studies have shown symmetric resultant joint decisions, which, while preserving equality, fail to address disparities. In light of global challenges such as ethics and equity, it is imperative for decision-making systems to not only maintain existing equality but also address and resolve disparities. In this study, we investigate asymmetric collective decision-making theoretically and numerically using quantum interference of photons carrying OAM or entangled photons. We successfully demonstrate the realization of asymmetry; however, it should be noted that a certain degree of photon loss is inevitable in the proposed models. We also provide an analytical formulation for determining the available range of asymmetry and describe a method for obtaining the desired degree of asymmetry.

Entangled N-photon states for fair and optimal social decision making

Article Open access 24 November 2020

Entangled-photon decision maker

Article Open access 22 August 2019

Conflict-free collective stochastic decision making by orbital angular momentum of photons through quantum interference

Article Open access 26 October 2021

Introduction

Even in situations with limited knowledge, people are required to make decisions by estimating and believing which choice is profitable¹. The multi-armed bandit problem model depicts the decision-making process in uncertain environments, wherein each player is assumed to intend to maximize reward by predicting the best one among several slot machines, referred to as arms, whose reward probabilities are unknown². In a multi-armed bandit problem, exploration is necessary to predict reward probabilities precisely; however, excessive exploration can diminish the sum of obtained rewards^{3, 4}, whereas minimal explorations can result in the best arm being missed. Furthermore, when numerous players are engaged in the game, the problem is referred to as a competitive multi-armed bandit problem⁵. In this case, decision conflicts are another problem because multiple players choosing the same arm can result in a bottleneck and consequently impede the profits of the entire group^{6, 7}.

Quantum approaches have been extensively studied to solve uncertain problems^{8,9,10,11,12,13}. The quantum properties of photons can aid in solving the problem of decision conflicts in collective decision-making^{5, 14, 15}. Two previous studies developed quantum systems enabling conflict-free decision-making between two players. The first study utilized the Hong–Ou–Mandel effect of orbital angular momentum (OAM)^{14, 15}, whereas the second one utilized entangled photons⁵.

However, these systems prohibit conducting affirmative actions¹⁶ to reduce disparities between players, primarily because the decisions made are always symmetric. Namely, the probability of player X selecting arm l and player Y choosing arm m is inevitably the same as that of player X selecting arm m and player Y choosing arm l. This property is referred to as symmetry, owing to which both players are always treated evenly; essentially, equality is ensured¹⁷. We refer to the previous study utilizing the Hong–Ou–Mandel effect of OAM as the symmetric OAM system, and the other one utilizing entangled photons as the symmetric entangled photon decision maker. The symmetric property is suitable when players are equal since the beginning of the game because, on average, equality is ensured at all times by symmetry. However, consider if one player is in a much more advantageous position compared with the other prior to the game; this inequality cannot be resolved by the aforementioned systems owing to symmetry (Fig. 1a). Thus, these previous systems are superior in maintaining equality; however, they cannot reduce disparities.

To facilitate affirmative actions in resolving inequalities, decision-making must be asymmetric such that a disadvantaged person or entity is more likely to choose the better arm than an advantaged person or entity. Asymmetry is the property that allows the probability of player X selecting arm l and player Y choosing arm m to differ from that of the opposite case. Previously established systems enabled only symmetric treatments, whereas the decision-making systems proposed in this study can control asymmetry by enabling asymmetric treatments, thus being able to facilitate advantageous outcomes for underprivileged agents (Fig. 1b). Note that the initial aforementioned disparities are recognized in various serious social issues ranging from earning differentials, gender gaps, and educational inequalities^18,19,20,21. In addition, the importance of focusing on the wider context of global challenges, such as ethics and fairness, is recognized in the field of responsible artificial intelligence (AI) as responsible research and innovation paradigm (RRI)^{22, 23}. Thus, considering the social context and RRI, ensuring existing equality may be insufficient, and affirmative actions must be enabled to diminish disparities. Another context is setting priority in information and communication services. Prioritized agents or entities should receive higher rewards than others while avoiding decision conflicts.

This study proposed improvements in quantum models and incorporated the potential to address disparities by realizing asymmetric decision-making and enabling control of asymmetry in the competitive multi-armed bandit problem. First, a quantum model was proposed by applying the Hong–Ou–Mandel effect with polarization dependencies, which is referred to as the asymmetric OAM system. This corresponded to an enhanced version of the symmetric OAM system proposed by Amakasu et al.¹⁴ by further incorporating the polarization-dependent effects. Next, the achievable asymmetric decision-making range was clarified analytically. Furthermore, two models to be compared with the asymmetric OAM system were investigated. One was the extension of the symmetric entangled photon decision-maker⁵, whereas the other was an extension of the symmetric OAM system¹⁴. The proposed asymmetric OAM system can provide asymmetric decision-making with negligible photon loss, provided the intended asymmetry is significant, whereas the entangled-photon approach suffers from significant photon loss. Conversely, the proposed asymmetric OAM system must accompany photon loss or decision conflicts when the decision is required to be symmetric, whereas the entangled photon approach accomplishes negligible photon loss in the corresponding situation. Thus, a trade-off exists between the proposed asymmetric OAM system and the entangled photon system.

Asymmetric decision-making by OAM

This section proposes the manner in which asymmetric decision-making can be realized by the decision-making system utilizing OAM. Figure 2a shows the construction of the system for the two-player-K-armed bandit problem. The OAM detected at X corresponds to the arm selected by player X, whereas that detected at Y corresponds to the arm selected by player Y. Two inputs, $\Phi$ and $\Psi$, are represented by two bases: OAM and polarization. This system differs from that in the previous study¹⁴ in that one polarization beam splitter (PBS) is added to it, and photons have polarizations. The polarization of photons is represented by $\alpha$ and $\beta$, i.e., the probability amplitudes of photons having horizontal and vertical polarizations are $\alpha$ and $\beta$, respectively, and the relation $|\alpha |^2 + |\beta |^2 = 1$ holds. Note that $\alpha ,~\beta \in {\mathbb {R}}$ are represented by $\cos \theta$ and $\sin \theta$ respectively, later in this study.

Formulation

This section provides a mathematical derivation of the probabilities corresponding to pairs of decisions based on the system presented in Fig. 2a for the case involving K choices, that is, K OAMs.

First, we give the space to treat polarization and OAMs. Let us denote the horizontal and vertical polarization states by

$$\begin{aligned} \vert {H}\rangle =\left[ \begin{array}{c} 1 \\ 0 \end{array}\right] , ~~ \vert {V}\rangle =\left[ \begin{array}{c} 0 \\ 1 \end{array}\right] , \end{aligned}$$

(1)

respectively. Then the Hilbert space corresponding to polarization states is described as

$$\begin{aligned} {\mathscr {H}}_p:= {{\,\mathrm{\textrm{span}}\,}}\{\vert {H}\rangle ,\ \vert {V}\rangle \} \simeq {\mathbb {C}}^2. \end{aligned}$$

(2)

OAM states are represented by integers; their signs and absolute values denote directions (right $(+)$- or left $(-)$-handed) and numbers of intertwined helices, respectively²⁴. Especially, the numbers of intertwined helices are utilized to identify the selected arms; for example, OAM $\pm k$ corresponds to arm k. Thus, we limit the possible values of OAMs to $\pm 1,\ \pm 2,\ \cdots \pm K$. Here, for $k\in [K] := \{1,\,2,\,\ldots ,\,K\}$, we define vector $\vert {\pm k}\rangle \in {\mathbb {C}}^{2K}$ corresponding to OAM $\pm k$ as follows:

$$\begin{aligned} \vert {+k}\rangle = [~0,\dots ,{\mathop {\breve{1}}\limits ^{\hbox{} k}},\dots ,0~]^\top ,~~ \vert {-k}\rangle = [~0,\dots ,{\mathop {\breve{1}}\limits ^{\hbox { } k+K}},\dots ,0~]^\top , \end{aligned}$$

(3)

where superscript $\top$ on a matrix represents the transpose of the matrix. Then the Hilbert space corresponding to OAM states is described as

$$\begin{aligned} {\mathscr {H}}_o:= {{\,\mathrm{\textrm{span}}\,}}\{\vert {\ell }\rangle \,|\,\ell \in \pm [K]\} \simeq {\mathbb {C}}^{2K}, \end{aligned}$$

(4)

where $\pm [K] = \{\pm 1,\,\pm 2,\,\cdots ,\,\pm K\}$. As polarization states and OAM states are independent, the hybrid states are in the composite Hilbert space defined as

$$\begin{aligned} {\mathscr {H}}_s = {\mathscr {H}}_p \otimes {\mathscr {H}}_o = {{\,\mathrm{\textrm{span}}\,}}\{\vert {P}\rangle \otimes \vert {\ell }\rangle \,|\,P\in \{H,\ V\},\ \ell \in \pm [K]\} \simeq {\mathbb {C}}^{4K}, \end{aligned}$$

(5)

as in²⁵.

The hybrid states of OAM and polarization can be generated using spatial light modulators (SLMs). The first input is represented as:

$$\begin{aligned} \Phi = \left[ \begin{array}{c} \alpha \\ \beta \end{array} \right] \otimes ~~ \tilde{\Phi },~~ \tilde{\Phi }:= \sum _{k=1}^{K} a_k e^{i\phi _k}\vert {+k}\rangle = [ ~a_1 e^{i\phi _1}, ~\ldots , ~a_k e^{i\phi _k}, ~\ldots , ~a_K e^{i\phi _K},\overbrace{~0, ~\ldots , ~0}^{k\, \text {elements}} ~]^\top \in {\mathscr {H}}_o, \end{aligned}$$

(6)

where $a_k \in {\mathbb {R}}$ and $\phi _k\in [0,\ 2\pi )$ for all $k\in [K]$. The coefficient $a_k$ represents the probability amplitude of the superposed OAM with index $+k$, and due to probability conservation the equation $\sum _{k=1}^K a_k^2 =1$ holds. The elements $\alpha$ and $\beta$, which represent a vector in ${\mathscr {H}}_p$, are the probability amplitudes of photon $\Phi$ with horizontal and vertical polarizations, respectively. The elements in the latter half of $\tilde{\Phi }$ were all zero because $\Phi$ was designed to have only positive OAM. In the previous research, one player manipulated $\tilde{\Phi }$ according to his or her preference. $\Phi$ is a 4K dimensional vector because of the tensor product. The first 2K elements of $\Phi$ correspond to the probability amplitudes of horizontal polarization, consisting of the OAMs. Essentially, the squared sum of the first 2K elements of $\Phi$, $\left[ ~ \alpha , ~~0 ~\right] ^\top \otimes \tilde{\Phi }$, is $|\alpha |^2$, which is the probability that a photon $\Phi$ exhibits horizontal polarization. In addition, the latter 2K elements, $\left[ ~ 0, ~~\beta ~\right] ^\top \otimes \tilde{\Phi }$, are the probability amplitudes of vertical polarization.

After passing the first beam splitter, $\Phi$ is transformed to:

$$\begin{aligned} \Phi '= \left( I_2 \otimes A\right) \Phi ,~~ A:= \sum _{\ell \in \pm [K]}\frac{1}{\sqrt{2}}\Bigl ({\vert {\ell }\rangle }{\langle {\ell }\vert } + i{\vert {-\ell }\rangle }{\langle {\ell }\vert }\Bigr ) = \frac{1}{\sqrt{2}} \left[ \begin{array}{cc} I_K &{} iI_K \\ iI_K &{} I_K \end{array} \right] , \end{aligned}$$

(7)

where A corresponds to the effect of a beamsplitter in the 2K-dimensional Hilbert space of OAM, ${\mathscr {H}}_o$. Here, for $N \in {\mathbb {N}}$, $I_N$ indicates a N by N identity matrix. Essentially, OAM did not change if a photon transmits through a beam splitter, whereas the sign of OAM reversed, and the probability amplitude was multiplied by i if it was reflected. However, because both the OAM and polarization were considered herein, the effect of a beam splitter on photon states was $I_2 \otimes A$, which performs a unitary transformation on ${\mathscr {H}}_s$. BS plays a crucial role in inducing quantum interference. The probabilistic nature of transmission and reflection of light at the BS makes it impossible to distinguish whether the photon after passing through the BS came from $\Phi$ or $\Psi$, which is the essence of quantum interference. Note that we consider the two inputs, $\Phi$ and $\Psi$, separately first at this stage of the calculation. The interference effect does not appear in Eq. (7). The effect appears later in Eq. (14). Subsequently, based on the reflection at mirrors after the beam splitter, $\Phi '$ is transformed to:

$$\begin{aligned} \Phi '' = \left( I_2 \otimes R \right) \Phi ' = \left( I_2 \otimes RA \right) \Phi ,~~ R:= \sum _{\ell \in \pm [K]} i{\vert {-\ell }\rangle }{\langle {\ell }\vert } = \left[ \begin{array}{cc} O_K &{} iI_K \\ iI_K &{} O_K \end{array} \right] , \end{aligned}$$

(8)

wherein R corresponds to the effect of the reflection by a mirror in the 2K-dimensional Hilbert space of OAM, and $O_N$ implies a N by N zero matrix. R implies that the reflection reverses the signs of OAMs, and probability amplitudes are multiplied by i. However, because the polarization must be considered in addition to OAM, the effect of reflections on photon states should be $I_2 \otimes R$, which performs a unitary transformation on ${\mathscr {H}}_s$.

At a polarization beam splitter, OAM with horizontal polarization is transmitted, whereas OAM with vertical polarization is reflected and multiplied by i. Therefore, the effect of a polarization beam splitter, which acts on ${\mathscr {H}}_s={\mathscr {H}}_p \otimes {\mathscr {H}}_o$, can be represented by the following 4K by 4K matrix C:

$$\begin{aligned} C:= {\vert {H}\rangle }{\langle {H}\vert } \otimes I_{2K} + {\vert {V}\rangle }{\langle {V}\vert }\otimes \sum _{\ell \in \pm [K]} i{\vert {-\ell }\rangle }{\langle {\ell }\vert } = \left[ \begin{array}{cc} I_{2K} &{} O_{2K} \\ O_{2K} &{} i\sigma \otimes I_K \end{array} \right] , ~~ \sigma := \left[ \begin{array}{cc} 0 &{} 1\\ 1 &{} 0 \end{array} \right] . \end{aligned}$$

(9)

The operator C performs a unitary transformation on ${\mathscr {H}}_s$. Because detectors are sensitive only to OAMs, herein, we should consider the corresponding map ${\mathscr {H}}_s \rightarrow {\mathscr {H}}_o$ which is represented by a 2K by 4K matrix $\left[ \begin{array}{ccc} I_{2K}&\Big |&I_{2K} \end{array} \right]$. The probability amplitude of OAM k and horizontal polarization and that of OAM k and vertical polarization were added. Therefore, the effect of the asymmetric OAM system on one input is expressed as:

$$\begin{aligned} V:= \left[ \begin{array}{ccc} I_{2K}&\Big |&I_{2K} \end{array} \right] C \left( I_2 \otimes RA \right) = \frac{1}{\sqrt{2}} \left[ \begin{array}{cccc} -I_K &{} iI_K &{} -I_K &{} -iI_K \\ iI_K &{} -I_K &{} -iI_K &{} -I_K \end{array} \right] . \end{aligned}$$

(10)

Hence, the observed output of input $\Phi$ is:

$$\begin{aligned} \Phi _{\textrm{out}} = V \Phi = V \left[ \begin{array}{c} \alpha a_1 e^{i\phi _1} \\ \vdots \\ \alpha a_K e^{i\phi _K} \\ 0_K \\ \beta a_1 e^{i\phi _1} \\ \vdots \\ \beta a_K e^{i\phi _K} \\ 0_K \end{array} \right] = \frac{1}{\sqrt{2}} \left[ \begin{array}{c} -(\alpha + \beta ) a_1 e^{i\phi _1} \\ \vdots \\ -(\alpha + \beta ) a_K e^{i\phi _K} \\ i(\alpha - \beta ) a_1 e^{i\phi _1} \\ \vdots \\ i(\alpha - \beta ) a_K e^{i\phi _K} \end{array} \right] . \end{aligned}$$

(11)

Note that $0_N$ indicates a zero vector with N elements.

Similarly, by considering the second input having the same polarization and OAMs with only negative signs:

$$\begin{aligned} \Psi = \left[ \begin{array}{c} \alpha \\ \beta \end{array} \right] \otimes ~~ \tilde{\Psi },~~ \tilde{\Psi }:= \sum _{k=1}^{K} b_k e^{i\psi _k}\vert {-k}\rangle = [~\overbrace{0,~\ldots , ~0}^{k\, \text {elements}},~ b_1 e^{i\psi _1}, ~\ldots , ~ b_k e^{i\psi _k}, ~ \ldots , ~ b_K e^{i\psi _K} ~]^\top , \end{aligned}$$

(12)

$$\begin{aligned} \Psi _{\textrm{out}} = V \Psi = V \left[ \begin{array}{c} 0_K \\ \alpha b_1 e^{i\psi _1} \\ \vdots \\ \alpha b_K e^{i\psi _K} \\ 0_K \\ \beta b_1 e^{i\psi _1} \\ \vdots \\ \beta b_K e^{i\psi _K} \end{array} \right] = \frac{1}{\sqrt{2}} \left[ \begin{array}{c} i(\alpha - \beta ) b_1 e^{i\psi _1} \\ \vdots \\ i(\alpha - \beta ) b_K e^{i\psi _K} \\ -(\alpha + \beta ) b_1 e^{i\psi _1} \\ \vdots \\ -(\alpha + \beta ) b_K e^{i\psi _K} \end{array} \right] . \end{aligned}$$

(13)

Herein, the coefficient $b_k \in {\mathbb {R}}$ represents the probability amplitude of the superposed OAM $-k$, and the equation $\sum _{k=1}^K b_k^2 =1$ holds. The sign of OAM of the two inputs $\Phi$ and $\Psi$ was fixed such that quantum interference could occur at the first beam splitter (see Fig. 2b). While the sign of $\Phi$ only contains positive OAM, those of $\Psi$ were all negative. Thus, $\tilde{\Psi }$ is a 2K dimensional vector, whose ith element is the probability amplitude OAM $+i$ when $i \le K$ and that of OAM $-i$ otherwise. The elements in the first half of $\tilde{\Psi }$ were all zero because $\Psi$ contained only minus OAM.

The output of the total system is $\Phi _{\textrm{out}} \otimes \Psi _{\textrm{out}}$, as shown in Fig. 2b. The j-th element of $\Phi _{\textrm{out}}$ or $\Psi _{\textrm{out}}$ is the probability amplitude of a photon $\Phi$ or $\Psi$ having OAM j being detected at detector X when $j \le K$. Further, it is the probability amplitude of a photon $\Phi$ or $\Psi$ having OAM j being detected at detector Y when $j \ge K+1$. Herein, the focus was placed on the cases where the two photons were detected by two different detectors. Such probability amplitudes can be obtained by the tensor product of the latter half of $\Phi _{\textrm{out}}$ and the first part of $\Psi _{\textrm{out}}$ by Eqs. (11) and (13):

$$\begin{aligned} -\frac{(\alpha -\beta )^2}{2} \left[ \begin{array}{cc} a_1 e^{i\phi _1} \\ \vdots \\ a_K e^{i\phi _K} \end{array} \right] \otimes \left[ \begin{array}{cc} b_1 e^{i\psi _1} \\ \vdots \\ b_K e^{i\psi _K} \end{array} \right] , \end{aligned}$$

(14)

and by the tensor product of the latter half of $\Psi _{\textrm{out}}$ and the first half of $\Phi _{\textrm{out}}$:

$$\begin{aligned} \frac{(\alpha + \beta )^2}{2} \left[ \begin{array}{cc} b_1 e^{i\psi _1} \\ \vdots \\ b_K e^{i\psi _K} \end{array} \right] \otimes \left[ \begin{array}{cc} a_1 e^{i\phi _1} \\ \vdots \\ a_K e^{i\phi _K} \end{array} \right] . \end{aligned}$$

(15)

Table 1 Probabilities of pairs of decisions made by the asymmetric OAM system.

Full size table

Therefore, by Eqs. (14) and (15), the probability amplitude of OAM $k_1$ is detected at X, and OAM $k_2$ is detected at Y, that is, the probability amplitude of player X choosing option $k_1$ and player Y selecting option $k_2$, is:

$$\begin{aligned} \frac{1}{2} \left( (\alpha +\beta )^2 a_{k_2} b_{k_1} e^{i(\phi _{k_2}+\psi _{k_1})} - (\alpha -\beta )^2 a_{k_1} b_{k_2} e^{i(\phi _{k_1} +\psi _{k_2} )} \right) . \end{aligned}$$

(16)

Hence, by considering the squared absolute values, the probability of player X choosing option $k_1$ and player Y selecting option $k_2$ is:

$$\begin{aligned} P(X:k_1, Y:k_2) = \frac{1}{4} a_{k_1}^2 b_{k_2}^2 (\alpha -\beta )^4 + \frac{1}{4} a_{k_2}^2 b_{k_1}^2 (\alpha + \beta )^4 - \frac{1}{2}a_{k_1}a_{k_2}b_{k_1}b_{k_2} (\alpha -\beta )^2 (\alpha + \beta )^2 \cos (\theta _{k_1}-\theta _{k_2}) \end{aligned}$$

(17)

with $\theta _{k}:= (\phi _k -\psi _k)/2$ for $k\in [K]$. Therefore, the difference between the probability of player X choosing arm $k_1$ and player Y selecting arm $k_2$ and that of player X choosing arm $k_2$ and player Y selecting arm $k_1$ is expressed as

$$\begin{aligned} P~(X:k_1,Y:k_2) - P(X:k_2,Y:k_1)= 2\alpha \beta ({a}_{k_2}^2{b}_{k_1}^2 - {a}_{k_1}^2 {b}_{k_2}^2). \end{aligned}$$

(18)

Hence, if the following condition holds true,

$$\begin{aligned} \alpha \beta \ne 0,~ {a}_{k_2}{b}_{k_1} \ne \pm {a}_{k_1} {b}_{k_2} \end{aligned}$$

(19)

the difference expressed as Eq. (18) is non-zero. Thus, $P(X:k_1, Y:k_2) \ne P(X:k_2, Y:k_1)$ is achieved; i.e., asymmetry in decision-making is realized, which is the purpose of adding the PBS in Fig. 2.

However, conflicts can arise with a certain probability at the same time. By substituting $k_1$ and $k_2$ of k, the probability of the conflict occurring with arm k is expressed as:

$$\begin{aligned} P(X:k, Y:k) = \alpha ^2\beta ^2a_k^2 b_k^2 \ne 0. \end{aligned}$$

(20)

Results

Next, the two-players (players X and Y), two-arms (arms 1 and 2; i.e., $K=2$) situation was examined in detail. Table 1 summarizes the probabilities of each pair of decisions, where $p_{k_1 k_2}$ with $k_1,\ k_2\in \{1,\ 2\}$ implies that player X chooses arm $k_1$ and player Y chooses arm $k_2$. Figure 3a demonstrates the feasible pairs of $p_{12}$ and $p_{21}$ by blue-colored region on a plane, with the horizontal and vertical axes being $p_{12}$ and $p_{21}$, respectively. The line of $p_{12}=p_{21}$ implies the symmetric decision-making. As evident, the blue-colored region exists outside the $p_{12}=p_{21}$ line, thus validating the feasibility of asymmetric decision-making.

However, the asymmetric OAM system cannot realize all combinations of $(p_{12},p_{21})$. For example, $(p_{12}, p_{21})=(0.5, 0.5)$ is outside the feasible zone. Indeed, the red curve in Fig. 3a shows the boundary between the feasible and infeasible zones of $(p_{12},p_{21})$. The first right side of the boundary belongs to the impossible zone, whereas the lower left side belongs to the possible zone. This boundary also corresponds to the cases without loss. The formula of this boundary is expressed as:

$$\begin{aligned} 2(p_{12} + p_{21}) = 1+ (p_{12} - p_{21})^2. \end{aligned}$$

(21)

See the Supplementary Information for the derivation of Eq. (21).

Thus, the conflict probability, the probability of both players choosing the same arm, is defined as $p_{12} + p_{21}$, the asymmetry ratio of the decision-making as $p_{21}/p_{12}$, and the loss probability of photons as $1-(p_{11}+p_{12}+p_{21}+p_{22})$. Figure 4a shows the relationship between the conflict probability plus loss probability and symmetry ratio. The red-colored boundary in Fig. 4a denotes the minimum-loss-plus-conflict boundary. By defining the conflict probability plus loss probability as x and the asymmetry ratio as y, the formula is expressed as:

$$\begin{aligned} y={\left\{ \begin{array}{ll} \dfrac{(1+\sqrt{1-2x})^2}{(1-\sqrt{1-2x})^2} ~~&{} \text {when } y \ge 1,\\ \\ \dfrac{(1-\sqrt{1-2x})^2}{(1+\sqrt{1-2x})^2}~~&{} \text {when } y \le 1. \end{array}\right. } \end{aligned}$$

(22)

The detailed derivation of Eq. (22) is presented in Supplementary Information.

In the entangled photon decision maker, described later, 50% loss or conflict is inevitable in obtaining any asymmetry ratio. This rate is smaller than the smallest percentage necessary to realize all asymmetry ratios in the OAM attenuation. For situations when a lower rate of loss or conflict is appealing, an extreme asymmetry ratio, such as more than 100 or smaller than 0.01, is obtained by the asymmetry OAM system. Therefore, the decision-making system using OAM is more suitable when inequality between players is serious such that more powerful affirmative actions are necessary.

To obtain Figs. 3 and 4, first the parameters $a_1, a_2, b_1, b_2, \alpha ,$ and $\beta$ are varied with a step size of $\pi /200$ as follows:

$$\begin{aligned}{}&a_1 = \cos x, ~~a_2= \sin x, ~~x=0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \pi , \end{aligned}$$

(23)

$$\begin{aligned}{}&b_1 = \cos y, ~~b_2= \sin y, ~~y=0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \pi , \end{aligned}$$

(24)

$$\begin{aligned}{}&\alpha =\cos \theta , ~~\beta =\sin \theta , ~~~\theta =0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \frac{\pi }{4} . \end{aligned}$$

(25)

Next, $p_{12}$ and $p_{21}$ are calculated, and data points, which are represented by blue dots in the figures, are obtained.

Obtaining a specific asymmetry ratio

In terms of application, the method to obtain the intended asymmetry ratio must be determined. First, any asymmetry ratio is possible while avoiding decision conflicts. Based on the results presented in Table 1, the conflict probability becomes zero when $a_2=b_1=0$ or $a_1=b_2=0$. Note that the loss probability is not zero.

Let r be the desired asymmetry ratio. When $a_2=b_1=0$, the asymmetry ratio is expressed as:

$$\begin{aligned} r=\frac{p_{21}}{p_{12}} = \frac{(\alpha +\beta )^4}{(\alpha -\beta )^4}. \end{aligned}$$

(26)

By introducing $\theta$ such that $\alpha =\cos \theta , \beta = \sin \theta$, Eq. (26) becomes

$$\begin{aligned} r=\frac{(\cos \theta +\sin \theta )^4}{(\cos \theta -\sin \theta )^4}. \end{aligned}$$

(27)

Organizing Eq. (27) about $\theta$, we obtain

$$\begin{aligned} 3-3r-\cos 4\theta + r\cos 4\theta + 4\sin 2\theta + 4r\sin 2\theta = 0. \end{aligned}$$

(28)

By solving Eq. (28), we obtain $\alpha$ and $\beta$ to realize r without conflicts. Figure 5a shows the relationship between $\theta$ and r based on Eq. (27), showing that r can take every value with $\theta$ from $-\pi /4$ to $\pi /4$. The realization of any r is significant because the degree of asymmetry can be balanced depending on the current inequality between players.

Similarly, when $a_1=b_2=0$, the asymmetry ratio is:

$$\begin{aligned} r=\frac{p_{21}}{p_{12}} = \frac{(\alpha -\beta )^4}{(\alpha +\beta )^4} \end{aligned}$$

(29)

which is reformulated using $\theta$ as

$$\begin{aligned} r=\frac{(\cos \theta -\sin \theta )^4}{(\cos \theta +\sin \theta )^4}. \end{aligned}$$

(30)

Hence, by solving the following Eq. (31), we obtain $\alpha$ and $\beta$ to realize r without conflicts.

$$\begin{aligned} 3-3r-\cos 4\theta + r\cos 4\theta - 4\sin 2\theta - 4r\sin 2\theta = 0 \end{aligned}$$

(31)

Figure 5b shows the relationship between $\theta$ and r based on Eq. (31), showing that r can acquire every value with $\theta$ from 0 to $\pi /4$.

Table 2 Probabilities of decisions when BS is added to symmetric OAM system instead of PBS.

Full size table

Origin of the asymmetry

The difference between the asymmetric and symmetric OAM system is the existence of the PBS in the system and the addition of polarization to the photon state. The polarization of the photon state is expressed by two parameters: $\alpha$ and $\beta$. With the probabilities of $|\alpha |^2$ and $|\beta |^2$, photons are detected as horizontal and vertical polarizations, respectively. These parameters satisfy $|\alpha |^2+|\beta |^2=1$. When $\alpha =0$ or $\beta =0$, the photons simply transmit or are reflected at the PBS. Therefore, quantum interference does not occur at the PBS, with PBS playing no role; this situation corresponds to the symmetric OAM system¹⁴. However, when $\alpha \ne 0$ and $\beta \ne 0$, whether the photons are transmitted or reflected at the PBS is decided stochastically. Therefore, quantum interference can occur. Thus, the occurrence of quantum interference at the PBS renders a difference between the asymmetric and symmetric OAM systems.

Indeed, asymmetric decision-making is possible via the addition of both PBS and BS. Table 2 lists the probabilities of pairs of decisions in the case where BS is added instead of PBS. When BS was added instead of PBS, fewer states could be achieved. For example, by adding PBS to the symmetric OAM system, any nonnegative asymmetry ratio can be achieved without conflicts. This is because parameters $\alpha$ and $\beta$ possess the degree of freedom even if $a_1,~~a_2,~~b_1~~b_2$ are set to (0, 1, 1, 0) or (1, 0, 0, 1) to render conflict probability zero. However, when attempting to render conflicts free in the system where BS is added, only two states can be realized: $p_{12}=0, ~~p_{21}=1$ or $p_{12}=1, ~~p_{21}=0$. Therefore, the addition of PBS to the symmetric OAM system yields superior results.

Asymmetric decision-making by entangled photon decision maker

This section presents the entangled photon decision-maker that can fulfill asymmetric decision-making, particularly for the two-players, two-arms bandit problem. Figure 6 shows a schematic of the entangled decision-maker. The input to the system is two entangled photons. One photon entering PBS 1 decides player X’s choice while another entering PBS 2 decides player Y’s. In a previous study⁵, conflict-free, symmetric decision-making among two players was theoretically and experimentally demonstrated. The system shown in Fig. 6 realizes the asymmetry by discarding photons with specific probabilities at the polarizers before APDs or avalanche photodiodes. This system is different from that in the previous study⁵ owing to the presence of polarizers. Note that herein, a specific input is assumed:

$$\begin{aligned} \frac{1}{\sqrt{2}}\Bigl (\vert {\theta _1, \theta _2}\rangle - \vert {\theta _2, \theta _1}\rangle \Bigr ). \end{aligned}$$

(32)

This is a superposition of the following two states. One is the state with photons with polarizations of $\theta _1$ and $\theta _2$ entering the PBS 1 and 2, respectively. The other is the state with photons with polarization $\theta _2$ and $\theta _1$ entering the PBS 1 and 2, respectively. In particular, the latter state employs a $\pi$ phase shift to consider the minus sign of the second term in Eq. (32), i.e. the second term is actually $e^{i\pi } \vert {\theta _2, \theta _1}\rangle$. Herein, the polarizations of $\theta _1$ and $\theta _2$ are orthogonal to each other and satisfy the following condition.

$$\begin{aligned} \theta _2 = \theta _1 + \frac{\pi }{2}. \end{aligned}$$

(33)