Introduction

Even in situations with limited knowledge, people are required to make decisions by estimating and believing which choice is profitable1. The multi-armed bandit problem model depicts the decision-making process in uncertain environments, wherein each player is assumed to intend to maximize reward by predicting the best one among several slot machines, referred to as arms, whose reward probabilities are unknown2. In a multi-armed bandit problem, exploration is necessary to predict reward probabilities precisely; however, excessive exploration can diminish the sum of obtained rewards3, 4, whereas minimal explorations can result in the best arm being missed. Furthermore, when numerous players are engaged in the game, the problem is referred to as a competitive multi-armed bandit problem5. In this case, decision conflicts are another problem because multiple players choosing the same arm can result in a bottleneck and consequently impede the profits of the entire group6, 7.

Quantum approaches have been extensively studied to solve uncertain problems8,9,10,11,12,13. The quantum properties of photons can aid in solving the problem of decision conflicts in collective decision-making5, 14, 15. Two previous studies developed quantum systems enabling conflict-free decision-making between two players. The first study utilized the Hong–Ou–Mandel effect of orbital angular momentum (OAM)14, 15, whereas the second one utilized entangled photons5.

However, these systems prohibit conducting affirmative actions16 to reduce disparities between players, primarily because the decisions made are always symmetric. Namely, the probability of player X selecting arm l and player Y choosing arm m is inevitably the same as that of player X selecting arm m and player Y choosing arm l. This property is referred to as symmetry, owing to which both players are always treated evenly; essentially, equality is ensured17. We refer to the previous study utilizing the Hong–Ou–Mandel effect of OAM as the symmetric OAM system, and the other one utilizing entangled photons as the symmetric entangled photon decision maker. The symmetric property is suitable when players are equal since the beginning of the game because, on average, equality is ensured at all times by symmetry. However, consider if one player is in a much more advantageous position compared with the other prior to the game; this inequality cannot be resolved by the aforementioned systems owing to symmetry (Fig. 1a). Thus, these previous systems are superior in maintaining equality; however, they cannot reduce disparities.

To facilitate affirmative actions in resolving inequalities, decision-making must be asymmetric such that a disadvantaged person or entity is more likely to choose the better arm than an advantaged person or entity. Asymmetry is the property that allows the probability of player X selecting arm l and player Y choosing arm m to differ from that of the opposite case. Previously established systems enabled only symmetric treatments, whereas the decision-making systems proposed in this study can control asymmetry by enabling asymmetric treatments, thus being able to facilitate advantageous outcomes for underprivileged agents (Fig. 1b). Note that the initial aforementioned disparities are recognized in various serious social issues ranging from earning differentials, gender gaps, and educational inequalities18,19,20,21. In addition, the importance of focusing on the wider context of global challenges, such as ethics and fairness, is recognized in the field of responsible artificial intelligence (AI) as responsible research and innovation paradigm (RRI)22, 23. Thus, considering the social context and RRI, ensuring existing equality may be insufficient, and affirmative actions must be enabled to diminish disparities. Another context is setting priority in information and communication services. Prioritized agents or entities should receive higher rewards than others while avoiding decision conflicts.

This study proposed improvements in quantum models and incorporated the potential to address disparities by realizing asymmetric decision-making and enabling control of asymmetry in the competitive multi-armed bandit problem. First, a quantum model was proposed by applying the Hong–Ou–Mandel effect with polarization dependencies, which is referred to as the asymmetric OAM system. This corresponded to an enhanced version of the symmetric OAM system proposed by Amakasu et al.14 by further incorporating the polarization-dependent effects. Next, the achievable asymmetric decision-making range was clarified analytically. Furthermore, two models to be compared with the asymmetric OAM system were investigated. One was the extension of the symmetric entangled photon decision-maker5, whereas the other was an extension of the symmetric OAM system14. The proposed asymmetric OAM system can provide asymmetric decision-making with negligible photon loss, provided the intended asymmetry is significant, whereas the entangled-photon approach suffers from significant photon loss. Conversely, the proposed asymmetric OAM system must accompany photon loss or decision conflicts when the decision is required to be symmetric, whereas the entangled photon approach accomplishes negligible photon loss in the corresponding situation. Thus, a trade-off exists between the proposed asymmetric OAM system and the entangled photon system.

Figure 1
figure 1

(a) The necessity of asymmetric treatments. The symmetric OAM system in the previous study can maintain the existing equality but cannot solve inequality. (b) Stochastic detection of OAM corresponds to a probabilistic selection of a player. Polarization enables us to diminish inequalities between players.

Asymmetric decision-making by OAM

This section proposes the manner in which asymmetric decision-making can be realized by the decision-making system utilizing OAM. Figure 2a shows the construction of the system for the two-player-K-armed bandit problem. The OAM detected at X corresponds to the arm selected by player X, whereas that detected at Y corresponds to the arm selected by player Y. Two inputs, \(\Phi\) and \(\Psi\), are represented by two bases: OAM and polarization. This system differs from that in the previous study14 in that one polarization beam splitter (PBS) is added to it, and photons have polarizations. The polarization of photons is represented by \(\alpha\) and \(\beta\), i.e., the probability amplitudes of photons having horizontal and vertical polarizations are \(\alpha\) and \(\beta\), respectively, and the relation \(|\alpha |^2 + |\beta |^2 = 1\) holds. Note that \(\alpha ,~\beta \in {\mathbb {R}}\) are represented by \(\cos \theta\) and \(\sin \theta\) respectively, later in this study.

Figure 2
figure 2

(a) System architecture for the asymmetric OAM decision making. It differs from the symmetric OAM system in that a PBS is added, and the basis of polarization is added to the photon state. PBS polarization beam splitter, BS beam splitter, SLM spatial light modulator, HW half wave plate. The zeroth-order extraction enables successful modulation with the SLM by extracting only the zeroth-order OAM from the initially generated light in a superposition of various OAMs. (b) Schematic illustration of the paths of photons.

Formulation

This section provides a mathematical derivation of the probabilities corresponding to pairs of decisions based on the system presented in Fig. 2a for the case involving K choices, that is, K OAMs.

First, we give the space to treat polarization and OAMs. Let us denote the horizontal and vertical polarization states by

$$\begin{aligned} \vert {H}\rangle =\left[ \begin{array}{c} 1 \\ 0 \end{array}\right] , ~~ \vert {V}\rangle =\left[ \begin{array}{c} 0 \\ 1 \end{array}\right] , \end{aligned}$$
(1)

respectively. Then the Hilbert space corresponding to polarization states is described as

$$\begin{aligned} {\mathscr {H}}_p:= {{\,\mathrm{\textrm{span}}\,}}\{\vert {H}\rangle ,\ \vert {V}\rangle \} \simeq {\mathbb {C}}^2. \end{aligned}$$
(2)

OAM states are represented by integers; their signs and absolute values denote directions (right \((+)\)- or left \((-)\)-handed) and numbers of intertwined helices, respectively24. Especially, the numbers of intertwined helices are utilized to identify the selected arms; for example, OAM \(\pm k\) corresponds to arm k. Thus, we limit the possible values of OAMs to \(\pm 1,\ \pm 2,\ \cdots \pm K\). Here, for \(k\in [K] := \{1,\,2,\,\ldots ,\,K\}\), we define vector \(\vert {\pm k}\rangle \in {\mathbb {C}}^{2K}\) corresponding to OAM \(\pm k\) as follows:

$$\begin{aligned} \vert {+k}\rangle = [~0,\dots ,{\mathop {\breve{1}}\limits ^{\hbox{} k}},\dots ,0~]^\top ,~~ \vert {-k}\rangle = [~0,\dots ,{\mathop {\breve{1}}\limits ^{\hbox { } k+K}},\dots ,0~]^\top , \end{aligned}$$
(3)

where superscript \(\top\) on a matrix represents the transpose of the matrix. Then the Hilbert space corresponding to OAM states is described as

$$\begin{aligned} {\mathscr {H}}_o:= {{\,\mathrm{\textrm{span}}\,}}\{\vert {\ell }\rangle \,|\,\ell \in \pm [K]\} \simeq {\mathbb {C}}^{2K}, \end{aligned}$$
(4)

where \(\pm [K] = \{\pm 1,\,\pm 2,\,\cdots ,\,\pm K\}\). As polarization states and OAM states are independent, the hybrid states are in the composite Hilbert space defined as

$$\begin{aligned} {\mathscr {H}}_s = {\mathscr {H}}_p \otimes {\mathscr {H}}_o = {{\,\mathrm{\textrm{span}}\,}}\{\vert {P}\rangle \otimes \vert {\ell }\rangle \,|\,P\in \{H,\ V\},\ \ell \in \pm [K]\} \simeq {\mathbb {C}}^{4K}, \end{aligned}$$
(5)

as in25.

The hybrid states of OAM and polarization can be generated using spatial light modulators (SLMs). The first input is represented as:

$$\begin{aligned} \Phi = \left[ \begin{array}{c} \alpha \\ \beta \end{array} \right] \otimes ~~ \tilde{\Phi },~~ \tilde{\Phi }:= \sum _{k=1}^{K} a_k e^{i\phi _k}\vert {+k}\rangle = [ ~a_1 e^{i\phi _1}, ~\ldots , ~a_k e^{i\phi _k}, ~\ldots , ~a_K e^{i\phi _K},\overbrace{~0, ~\ldots , ~0}^{k\, \text {elements}} ~]^\top \in {\mathscr {H}}_o, \end{aligned}$$
(6)

where \(a_k \in {\mathbb {R}}\) and \(\phi _k\in [0,\ 2\pi )\) for all \(k\in [K]\). The coefficient \(a_k\) represents the probability amplitude of the superposed OAM with index \(+k\), and due to probability conservation the equation \(\sum _{k=1}^K a_k^2 =1\) holds. The elements \(\alpha\) and \(\beta\), which represent a vector in \({\mathscr {H}}_p\), are the probability amplitudes of photon \(\Phi\) with horizontal and vertical polarizations, respectively. The elements in the latter half of \(\tilde{\Phi }\) were all zero because \(\Phi\) was designed to have only positive OAM. In the previous research, one player manipulated \(\tilde{\Phi }\) according to his or her preference. \(\Phi\) is a 4K dimensional vector because of the tensor product. The first 2K elements of \(\Phi\) correspond to the probability amplitudes of horizontal polarization, consisting of the OAMs. Essentially, the squared sum of the first 2K elements of \(\Phi\), \(\left[ ~ \alpha , ~~0 ~\right] ^\top \otimes \tilde{\Phi }\), is \(|\alpha |^2\), which is the probability that a photon \(\Phi\) exhibits horizontal polarization. In addition, the latter 2K elements, \(\left[ ~ 0, ~~\beta ~\right] ^\top \otimes \tilde{\Phi }\), are the probability amplitudes of vertical polarization.

After passing the first beam splitter, \(\Phi\) is transformed to:

$$\begin{aligned} \Phi '= \left( I_2 \otimes A\right) \Phi ,~~ A:= \sum _{\ell \in \pm [K]}\frac{1}{\sqrt{2}}\Bigl ({\vert {\ell }\rangle }{\langle {\ell }\vert } + i{\vert {-\ell }\rangle }{\langle {\ell }\vert }\Bigr ) = \frac{1}{\sqrt{2}} \left[ \begin{array}{cc} I_K &{} iI_K \\ iI_K &{} I_K \end{array} \right] , \end{aligned}$$
(7)

where A corresponds to the effect of a beamsplitter in the 2K-dimensional Hilbert space of OAM, \({\mathscr {H}}_o\). Here, for \(N \in {\mathbb {N}}\), \(I_N\) indicates a N by N identity matrix. Essentially, OAM did not change if a photon transmits through a beam splitter, whereas the sign of OAM reversed, and the probability amplitude was multiplied by i if it was reflected. However, because both the OAM and polarization were considered herein, the effect of a beam splitter on photon states was \(I_2 \otimes A\), which performs a unitary transformation on \({\mathscr {H}}_s\). BS plays a crucial role in inducing quantum interference. The probabilistic nature of transmission and reflection of light at the BS makes it impossible to distinguish whether the photon after passing through the BS came from \(\Phi\) or \(\Psi\), which is the essence of quantum interference. Note that we consider the two inputs, \(\Phi\) and \(\Psi\), separately first at this stage of the calculation. The interference effect does not appear in Eq. (7). The effect appears later in Eq. (14). Subsequently, based on the reflection at mirrors after the beam splitter, \(\Phi '\) is transformed to:

$$\begin{aligned} \Phi '' = \left( I_2 \otimes R \right) \Phi ' = \left( I_2 \otimes RA \right) \Phi ,~~ R:= \sum _{\ell \in \pm [K]} i{\vert {-\ell }\rangle }{\langle {\ell }\vert } = \left[ \begin{array}{cc} O_K &{} iI_K \\ iI_K &{} O_K \end{array} \right] , \end{aligned}$$
(8)

wherein R corresponds to the effect of the reflection by a mirror in the 2K-dimensional Hilbert space of OAM, and \(O_N\) implies a N by N zero matrix. R implies that the reflection reverses the signs of OAMs, and probability amplitudes are multiplied by i. However, because the polarization must be considered in addition to OAM, the effect of reflections on photon states should be \(I_2 \otimes R\), which performs a unitary transformation on \({\mathscr {H}}_s\).

At a polarization beam splitter, OAM with horizontal polarization is transmitted, whereas OAM with vertical polarization is reflected and multiplied by i. Therefore, the effect of a polarization beam splitter, which acts on \({\mathscr {H}}_s={\mathscr {H}}_p \otimes {\mathscr {H}}_o\), can be represented by the following 4K by 4K matrix C:

$$\begin{aligned} C:= {\vert {H}\rangle }{\langle {H}\vert } \otimes I_{2K} + {\vert {V}\rangle }{\langle {V}\vert }\otimes \sum _{\ell \in \pm [K]} i{\vert {-\ell }\rangle }{\langle {\ell }\vert } = \left[ \begin{array}{cc} I_{2K} &{} O_{2K} \\ O_{2K} &{} i\sigma \otimes I_K \end{array} \right] , ~~ \sigma := \left[ \begin{array}{cc} 0 &{} 1\\ 1 &{} 0 \end{array} \right] . \end{aligned}$$
(9)

The operator C performs a unitary transformation on \({\mathscr {H}}_s\). Because detectors are sensitive only to OAMs, herein, we should consider the corresponding map \({\mathscr {H}}_s \rightarrow {\mathscr {H}}_o\) which is represented by a 2K by 4K matrix \(\left[ \begin{array}{ccc} I_{2K}&\Big |&I_{2K} \end{array} \right]\). The probability amplitude of OAM k and horizontal polarization and that of OAM k and vertical polarization were added. Therefore, the effect of the asymmetric OAM system on one input is expressed as:

$$\begin{aligned} V:= \left[ \begin{array}{ccc} I_{2K}&\Big |&I_{2K} \end{array} \right] C \left( I_2 \otimes RA \right) = \frac{1}{\sqrt{2}} \left[ \begin{array}{cccc} -I_K &{} iI_K &{} -I_K &{} -iI_K \\ iI_K &{} -I_K &{} -iI_K &{} -I_K \end{array} \right] . \end{aligned}$$
(10)

Hence, the observed output of input \(\Phi\) is:

$$\begin{aligned} \Phi _{\textrm{out}} = V \Phi = V \left[ \begin{array}{c} \alpha a_1 e^{i\phi _1} \\ \vdots \\ \alpha a_K e^{i\phi _K} \\ 0_K \\ \beta a_1 e^{i\phi _1} \\ \vdots \\ \beta a_K e^{i\phi _K} \\ 0_K \end{array} \right] = \frac{1}{\sqrt{2}} \left[ \begin{array}{c} -(\alpha + \beta ) a_1 e^{i\phi _1} \\ \vdots \\ -(\alpha + \beta ) a_K e^{i\phi _K} \\ i(\alpha - \beta ) a_1 e^{i\phi _1} \\ \vdots \\ i(\alpha - \beta ) a_K e^{i\phi _K} \end{array} \right] . \end{aligned}$$
(11)

Note that \(0_N\) indicates a zero vector with N elements.

Similarly, by considering the second input having the same polarization and OAMs with only negative signs:

$$\begin{aligned} \Psi = \left[ \begin{array}{c} \alpha \\ \beta \end{array} \right] \otimes ~~ \tilde{\Psi },~~ \tilde{\Psi }:= \sum _{k=1}^{K} b_k e^{i\psi _k}\vert {-k}\rangle = [~\overbrace{0,~\ldots , ~0}^{k\, \text {elements}},~ b_1 e^{i\psi _1}, ~\ldots , ~ b_k e^{i\psi _k}, ~ \ldots , ~ b_K e^{i\psi _K} ~]^\top , \end{aligned}$$
(12)
$$\begin{aligned} \Psi _{\textrm{out}} = V \Psi = V \left[ \begin{array}{c} 0_K \\ \alpha b_1 e^{i\psi _1} \\ \vdots \\ \alpha b_K e^{i\psi _K} \\ 0_K \\ \beta b_1 e^{i\psi _1} \\ \vdots \\ \beta b_K e^{i\psi _K} \end{array} \right] = \frac{1}{\sqrt{2}} \left[ \begin{array}{c} i(\alpha - \beta ) b_1 e^{i\psi _1} \\ \vdots \\ i(\alpha - \beta ) b_K e^{i\psi _K} \\ -(\alpha + \beta ) b_1 e^{i\psi _1} \\ \vdots \\ -(\alpha + \beta ) b_K e^{i\psi _K} \end{array} \right] . \end{aligned}$$
(13)

Herein, the coefficient \(b_k \in {\mathbb {R}}\) represents the probability amplitude of the superposed OAM \(-k\), and the equation \(\sum _{k=1}^K b_k^2 =1\) holds. The sign of OAM of the two inputs \(\Phi\) and \(\Psi\) was fixed such that quantum interference could occur at the first beam splitter (see Fig. 2b). While the sign of \(\Phi\) only contains positive OAM, those of \(\Psi\) were all negative. Thus, \(\tilde{\Psi }\) is a 2K dimensional vector, whose ith element is the probability amplitude OAM \(+i\) when \(i \le K\) and that of OAM \(-i\) otherwise. The elements in the first half of \(\tilde{\Psi }\) were all zero because \(\Psi\) contained only minus OAM.

The output of the total system is \(\Phi _{\textrm{out}} \otimes \Psi _{\textrm{out}}\), as shown in Fig. 2b. The j-th element of \(\Phi _{\textrm{out}}\) or \(\Psi _{\textrm{out}}\) is the probability amplitude of a photon \(\Phi\) or \(\Psi\) having OAM j being detected at detector X when \(j \le K\). Further, it is the probability amplitude of a photon \(\Phi\) or \(\Psi\) having OAM j being detected at detector Y when \(j \ge K+1\). Herein, the focus was placed on the cases where the two photons were detected by two different detectors. Such probability amplitudes can be obtained by the tensor product of the latter half of \(\Phi _{\textrm{out}}\) and the first part of \(\Psi _{\textrm{out}}\) by Eqs. (11) and (13):

$$\begin{aligned} -\frac{(\alpha -\beta )^2}{2} \left[ \begin{array}{cc} a_1 e^{i\phi _1} \\ \vdots \\ a_K e^{i\phi _K} \end{array} \right] \otimes \left[ \begin{array}{cc} b_1 e^{i\psi _1} \\ \vdots \\ b_K e^{i\psi _K} \end{array} \right] , \end{aligned}$$
(14)

and by the tensor product of the latter half of \(\Psi _{\textrm{out}}\) and the first half of \(\Phi _{\textrm{out}}\):

$$\begin{aligned} \frac{(\alpha + \beta )^2}{2} \left[ \begin{array}{cc} b_1 e^{i\psi _1} \\ \vdots \\ b_K e^{i\psi _K} \end{array} \right] \otimes \left[ \begin{array}{cc} a_1 e^{i\phi _1} \\ \vdots \\ a_K e^{i\phi _K} \end{array} \right] . \end{aligned}$$
(15)
Table 1 Probabilities of pairs of decisions made by the asymmetric OAM system.

Therefore, by Eqs. (14) and (15), the probability amplitude of OAM \(k_1\) is detected at X, and OAM \(k_2\) is detected at Y, that is, the probability amplitude of player X choosing option \(k_1\) and player Y selecting option \(k_2\), is:

$$\begin{aligned} \frac{1}{2} \left( (\alpha +\beta )^2 a_{k_2} b_{k_1} e^{i(\phi _{k_2}+\psi _{k_1})} - (\alpha -\beta )^2 a_{k_1} b_{k_2} e^{i(\phi _{k_1} +\psi _{k_2} )} \right) . \end{aligned}$$
(16)

Hence, by considering the squared absolute values, the probability of player X choosing option \(k_1\) and player Y selecting option \(k_2\) is:

$$\begin{aligned} P(X:k_1, Y:k_2) = \frac{1}{4} a_{k_1}^2 b_{k_2}^2 (\alpha -\beta )^4 + \frac{1}{4} a_{k_2}^2 b_{k_1}^2 (\alpha + \beta )^4 - \frac{1}{2}a_{k_1}a_{k_2}b_{k_1}b_{k_2} (\alpha -\beta )^2 (\alpha + \beta )^2 \cos (\theta _{k_1}-\theta _{k_2}) \end{aligned}$$
(17)

with \(\theta _{k}:= (\phi _k -\psi _k)/2\) for \(k\in [K]\). Therefore, the difference between the probability of player X choosing arm \(k_1\) and player Y selecting arm \(k_2\) and that of player X choosing arm \(k_2\) and player Y selecting arm \(k_1\) is expressed as

$$\begin{aligned} P~(X:k_1,Y:k_2) - P(X:k_2,Y:k_1)= 2\alpha \beta ({a}_{k_2}^2{b}_{k_1}^2 - {a}_{k_1}^2 {b}_{k_2}^2). \end{aligned}$$
(18)

Hence, if the following condition holds true,

$$\begin{aligned} \alpha \beta \ne 0,~ {a}_{k_2}{b}_{k_1} \ne \pm {a}_{k_1} {b}_{k_2} \end{aligned}$$
(19)

the difference expressed as Eq. (18) is non-zero. Thus, \(P(X:k_1, Y:k_2) \ne P(X:k_2, Y:k_1)\) is achieved; i.e., asymmetry in decision-making is realized, which is the purpose of adding the PBS in Fig. 2.

However, conflicts can arise with a certain probability at the same time. By substituting \(k_1\) and \(k_2\) of k, the probability of the conflict occurring with arm k is expressed as:

$$\begin{aligned} P(X:k, Y:k) = \alpha ^2\beta ^2a_k^2 b_k^2 \ne 0. \end{aligned}$$
(20)

Results

Figure 3
figure 3

Pairs of \((p_{12},~p_{21})\) each system is able to realize. Blue dots are the results obtained by numerical experiments. Because blue dots exist outside the \(p_{12}=p_{21}\) line, asymmetric decision-making is possible by all systems. (a) Asymmetric OAM system. (b) Entangled photon decision maker. (c) OAM attenuation.

Figure 4
figure 4

The relationship between the loss probability plus conflict probability and the asymmetry ratio of each system. Blue dots are the results obtained by numerical experiments. The light blue area is the area mathematical consideration can prove that each system can realize. (a) Asymmetric OAM system. (b) Entangled photon decision maker. (c) OAM attenuation.

Next, the two-players (players X and Y), two-arms (arms 1 and 2; i.e., \(K=2\)) situation was examined in detail. Table 1 summarizes the probabilities of each pair of decisions, where \(p_{k_1 k_2}\) with \(k_1,\ k_2\in \{1,\ 2\}\) implies that player X chooses arm \(k_1\) and player Y chooses arm \(k_2\). Figure 3a demonstrates the feasible pairs of \(p_{12}\) and \(p_{21}\) by blue-colored region on a plane, with the horizontal and vertical axes being \(p_{12}\) and \(p_{21}\), respectively. The line of \(p_{12}=p_{21}\) implies the symmetric decision-making. As evident, the blue-colored region exists outside the \(p_{12}=p_{21}\) line, thus validating the feasibility of asymmetric decision-making.

However, the asymmetric OAM system cannot realize all combinations of \((p_{12},p_{21})\). For example, \((p_{12}, p_{21})=(0.5, 0.5)\) is outside the feasible zone. Indeed, the red curve in Fig. 3a shows the boundary between the feasible and infeasible zones of \((p_{12},p_{21})\). The first right side of the boundary belongs to the impossible zone, whereas the lower left side belongs to the possible zone. This boundary also corresponds to the cases without loss. The formula of this boundary is expressed as:

$$\begin{aligned} 2(p_{12} + p_{21}) = 1+ (p_{12} - p_{21})^2. \end{aligned}$$
(21)

See the Supplementary Information for the derivation of Eq. (21).

Thus, the conflict probability, the probability of both players choosing the same arm, is defined as \(p_{12} + p_{21}\), the asymmetry ratio of the decision-making as \(p_{21}/p_{12}\), and the loss probability of photons as \(1-(p_{11}+p_{12}+p_{21}+p_{22})\). Figure 4a shows the relationship between the conflict probability plus loss probability and symmetry ratio. The red-colored boundary in Fig. 4a denotes the minimum-loss-plus-conflict boundary. By defining the conflict probability plus loss probability as x and the asymmetry ratio as y, the formula is expressed as:

$$\begin{aligned} y={\left\{ \begin{array}{ll} \dfrac{(1+\sqrt{1-2x})^2}{(1-\sqrt{1-2x})^2} ~~&{} \text {when } y \ge 1,\\ \\ \dfrac{(1-\sqrt{1-2x})^2}{(1+\sqrt{1-2x})^2}~~&{} \text {when } y \le 1. \end{array}\right. } \end{aligned}$$
(22)

The detailed derivation of Eq. (22) is presented in Supplementary Information.

In the entangled photon decision maker, described later, 50% loss or conflict is inevitable in obtaining any asymmetry ratio. This rate is smaller than the smallest percentage necessary to realize all asymmetry ratios in the OAM attenuation. For situations when a lower rate of loss or conflict is appealing, an extreme asymmetry ratio, such as more than 100 or smaller than 0.01, is obtained by the asymmetry OAM system. Therefore, the decision-making system using OAM is more suitable when inequality between players is serious such that more powerful affirmative actions are necessary.

To obtain Figs. 3 and 4, first the parameters \(a_1, a_2, b_1, b_2, \alpha ,\) and \(\beta\) are varied with a step size of \(\pi /200\) as follows:

$$\begin{aligned}{}&a_1 = \cos x, ~~a_2= \sin x, ~~x=0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \pi , \end{aligned}$$
(23)
$$\begin{aligned}{}&b_1 = \cos y, ~~b_2= \sin y, ~~y=0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \pi , \end{aligned}$$
(24)
$$\begin{aligned}{}&\alpha =\cos \theta , ~~\beta =\sin \theta , ~~~\theta =0,~ \frac{\pi }{200},~ \frac{2\pi }{200}, \ldots ,~ \frac{\pi }{4} . \end{aligned}$$
(25)

Next, \(p_{12}\) and \(p_{21}\) are calculated, and data points, which are represented by blue dots in the figures, are obtained.

Obtaining a specific asymmetry ratio

In terms of application, the method to obtain the intended asymmetry ratio must be determined. First, any asymmetry ratio is possible while avoiding decision conflicts. Based on the results presented in Table 1, the conflict probability becomes zero when \(a_2=b_1=0\) or \(a_1=b_2=0\). Note that the loss probability is not zero.

Let r be the desired asymmetry ratio. When \(a_2=b_1=0\), the asymmetry ratio is expressed as:

$$\begin{aligned} r=\frac{p_{21}}{p_{12}} = \frac{(\alpha +\beta )^4}{(\alpha -\beta )^4}. \end{aligned}$$
(26)

By introducing \(\theta\) such that \(\alpha =\cos \theta , \beta = \sin \theta\), Eq. (26) becomes

$$\begin{aligned} r=\frac{(\cos \theta +\sin \theta )^4}{(\cos \theta -\sin \theta )^4}. \end{aligned}$$
(27)

Organizing Eq. (27) about \(\theta\), we obtain

$$\begin{aligned} 3-3r-\cos 4\theta + r\cos 4\theta + 4\sin 2\theta + 4r\sin 2\theta = 0. \end{aligned}$$
(28)

By solving Eq. (28), we obtain \(\alpha\) and \(\beta\) to realize r without conflicts. Figure 5a shows the relationship between \(\theta\) and r based on Eq. (27), showing that r can take every value with \(\theta\) from \(-\pi /4\) to \(\pi /4\). The realization of any r is significant because the degree of asymmetry can be balanced depending on the current inequality between players.

Similarly, when \(a_1=b_2=0\), the asymmetry ratio is:

$$\begin{aligned} r=\frac{p_{21}}{p_{12}} = \frac{(\alpha -\beta )^4}{(\alpha +\beta )^4} \end{aligned}$$
(29)

which is reformulated using \(\theta\) as

$$\begin{aligned} r=\frac{(\cos \theta -\sin \theta )^4}{(\cos \theta +\sin \theta )^4}. \end{aligned}$$
(30)

Hence, by solving the following Eq. (31), we obtain \(\alpha\) and \(\beta\) to realize r without conflicts.

$$\begin{aligned} 3-3r-\cos 4\theta + r\cos 4\theta - 4\sin 2\theta - 4r\sin 2\theta = 0 \end{aligned}$$
(31)

Figure 5b shows the relationship between \(\theta\) and r based on Eq. (31), showing that r can acquire every value with \(\theta\) from 0 to \(\pi /4\).

Figure 5
figure 5

(a) The relationship between \(\theta\) and r when \(a_1=b_2=0\). (b) The relationship between \(\theta\) and r when \(a_2=b_1=0\). In both cases, any asymmetry ratio can be achieved by \(\theta\), \(-\pi /4 \le \theta \le \pi /4\).

Table 2 Probabilities of decisions when BS is added to symmetric OAM system instead of PBS.

Origin of the asymmetry

The difference between the asymmetric and symmetric OAM system is the existence of the PBS in the system and the addition of polarization to the photon state. The polarization of the photon state is expressed by two parameters: \(\alpha\) and \(\beta\). With the probabilities of \(|\alpha |^2\) and \(|\beta |^2\), photons are detected as horizontal and vertical polarizations, respectively. These parameters satisfy \(|\alpha |^2+|\beta |^2=1\). When \(\alpha =0\) or \(\beta =0\), the photons simply transmit or are reflected at the PBS. Therefore, quantum interference does not occur at the PBS, with PBS playing no role; this situation corresponds to the symmetric OAM system14. However, when \(\alpha \ne 0\) and \(\beta \ne 0\), whether the photons are transmitted or reflected at the PBS is decided stochastically. Therefore, quantum interference can occur. Thus, the occurrence of quantum interference at the PBS renders a difference between the asymmetric and symmetric OAM systems.

Indeed, asymmetric decision-making is possible via the addition of both PBS and BS. Table 2 lists the probabilities of pairs of decisions in the case where BS is added instead of PBS. When BS was added instead of PBS, fewer states could be achieved. For example, by adding PBS to the symmetric OAM system, any nonnegative asymmetry ratio can be achieved without conflicts. This is because parameters \(\alpha\) and \(\beta\) possess the degree of freedom even if \(a_1,~~a_2,~~b_1~~b_2\) are set to (0,   1,   1,   0) or (1,   0,   0,   1) to render conflict probability zero. However, when attempting to render conflicts free in the system where BS is added, only two states can be realized: \(p_{12}=0, ~~p_{21}=1\) or \(p_{12}=1, ~~p_{21}=0\). Therefore, the addition of PBS to the symmetric OAM system yields superior results.

Asymmetric decision-making by entangled photon decision maker

Figure 6
figure 6

Experimental setup of the entangled photon decision maker. PBS polarization beam splitter, HW half-wave plate, APD avalanche photodiode, POLH polarizer (allowing only horizontal polarization to pass), POLV polarizer (allowing only vertical polarization to pass).

This section presents the entangled photon decision-maker that can fulfill asymmetric decision-making, particularly for the two-players, two-arms bandit problem. Figure 6 shows a schematic of the entangled decision-maker. The input to the system is two entangled photons. One photon entering PBS 1 decides player X’s choice while another entering PBS 2 decides player Y’s. In a previous study5, conflict-free, symmetric decision-making among two players was theoretically and experimentally demonstrated. The system shown in Fig. 6 realizes the asymmetry by discarding photons with specific probabilities at the polarizers before APDs or avalanche photodiodes. This system is different from that in the previous study5 owing to the presence of polarizers. Note that herein, a specific input is assumed:

$$\begin{aligned} \frac{1}{\sqrt{2}}\Bigl (\vert {\theta _1, \theta _2}\rangle - \vert {\theta _2, \theta _1}\rangle \Bigr ). \end{aligned}$$
(32)

This is a superposition of the following two states. One is the state with photons with polarizations of \(\theta _1\) and \(\theta _2\) entering the PBS 1 and 2, respectively. The other is the state with photons with polarization \(\theta _2\) and \(\theta _1\) entering the PBS 1 and 2, respectively. In particular, the latter state employs a \(\pi\) phase shift to consider the minus sign of the second term in Eq. (32), i.e. the second term is actually \(e^{i\pi } \vert {\theta _2, \theta _1}\rangle\). Herein, the polarizations of \(\theta _1\) and \(\theta _2\) are orthogonal to each other and satisfy the following condition.

$$\begin{aligned} \theta _2 = \theta _1 + \frac{\pi }{2}. \end{aligned}$$
(33)

Formulation

Next, the probabilities corresponding to pairs of decisions were derived. Here, we give the space to represent the photon states and the corresponding decision-making. The option 1 and 2 are represented by vectors \(\vert {1}\rangle = [~1,~0~]^\top ,~~\vert {2}\rangle = [~0,~1~]^\top\), respectively. By associating options with detectors, photon states can be described in the Hilbert space:

$$\begin{aligned} {\mathscr {H}} = \textrm{span}\{ \vert {1}\rangle ,\vert {2}\rangle \} \simeq {\mathbb {C}}^2. \end{aligned}$$
(34)

The first photon, present in the upper part of the system, corresponds to options 1 and 2 when detected at APD 1 and APD 2, respectively. Similarly, the output of the second photon, present in the lower part, corresponds to options 1 and 2 when detected at APD 3 and APD 4, respectively. Therefore, the output of the total system is in the Hilbert space \({\mathscr {H}} \otimes {\mathscr {H}}\).

First, consider the first term of Eq. (32), \(\vert {\theta _1, ~~\theta _2}\rangle\). The output of the photon injected into PBS 1 is:

$$\begin{aligned} \phi _1'= \left[ \begin{array}{l} \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _1) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _1) \end{array} \right] . \end{aligned}$$
(35)

The first element is the probability amplitude of the photon detected at APD1 or the horizontal component, whereas the second one is that of the photon detected at APD2 or the vertical component. Similarly, the output of the photon entering PBS 2 is expressed as:

$$\begin{aligned} \phi _1''= \left[ \begin{array}{l} \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _2) \\ \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _2) \end{array} \right] . \end{aligned}$$
(36)

Here, \(\alpha _X, \alpha _Y, \beta _X, \beta _Y\) are the orientations of the polarizers. The first element implies the probability amplitude of the photon detected at APD3, whereas the second one is that of the photon detected at APD4. Therefore, by considering the tensor of \(\phi _1'\) and \(\phi _1''\), the output of the first term of Eq. (32), \(\vert {\theta _1,~~\theta _2}\rangle\), is expressed as:

$$\begin{aligned} \phi _1=\phi _1' \otimes \phi _1'' = \left[ \begin{array}{l} \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _1) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _2) \\ \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _1) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _2) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _1) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _2) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _1) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _2) \end{array} \right] . \end{aligned}$$
(37)

The first element is the probability amplitude of two photons being detected at APD1 and APD3, whereas the second is that of them being detected at APD1 and APD4, the third is that of them being detected at APD2 and APD3, and the fourth is that of them being detected at APD2 and APD4.

Next, consider the second term of Eq. (32), \(\vert {\theta _2, ~~\theta _1}\rangle\). Similar to the first term, the output of the second term is expressed as:

$$\begin{aligned} \phi _2= \left[ \begin{array}{l} \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _2) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _1) \\ \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _2) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _1) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _2) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _1) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _2) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _1) \end{array} \right] . \end{aligned}$$
(38)

Considering the superposition of \(\phi _1\) and \(\phi _2\) with the factor of \(1/\sqrt{2}\) as expressed in Eq. (32), the output probability amplitudes is expressed as:

$$\begin{aligned}{} & {} \frac{1}{\sqrt{2}}\left( \phi _1 - \phi _2\right) \nonumber \\{} & {} \quad = \frac{1}{\sqrt{2}} \left[ \begin{array}{l} \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _1) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _2) - \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _2) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _1) \\ \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _1) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _2) - \cos \alpha _X \cos (2\theta _{\textrm{HW}_1} - \theta _2) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _1) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _1) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _2) - \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _2) \cos \alpha _Y \cos (2\theta _{\textrm{HW}_2} - \theta _1) \\ \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _1) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _2) - \sin \beta _X \sin (2\theta _{\textrm{HW}_1} - \theta _2) \sin \beta _Y \sin (2\theta _{\textrm{HW}_2} - \theta _1) \end{array} \right] . \end{aligned}$$
(39)

Hence, by considering the squared absolute value of each element and the condition of Eq. (33) along with assuming that \(\theta _{\textrm{HW}_1}\) is equal to \(\theta _{\textrm{HW}_2}\), the probabilities are expressed as:

$$\begin{aligned} \left| \frac{1}{\sqrt{2}}\left( \phi _1 - \phi _2\right) \right| ^2= \frac{1}{2} \left[ \begin{array}{c} 0 \\ \sin ^2\beta _Y \cos ^2\alpha _X \\ \sin ^2\beta _X \cos ^2\alpha _Y \\ 0 \end{array}\right] . \end{aligned}$$
(40)

Herein, the first element is the probability of photons detected at APD1 and APD3, essentially implying that both players X and Y choose arm 1. Further, the fourth element is the probability of photon detection at APD2 and APD4, corresponding to both players X and Y selecting arm 2. Owing to the entanglements, the corresponding probabilities are zero (\(p_{11}=p_{22}=0\)).

The second term implies that photons are received by APD1 and APD4, corresponding to player X choosing arm 1 and player Y choosing arm 2. Similarly, the third term implies that photons are received by APD2 and APD4, thus indicating that players X and Y select arms 2 and 1, respectively.

The original entangled-photon decision-maker is with \(\alpha _i\) being 0 and \(\beta _i\) being \(\pi /2\), which results in the second and the third term being 1/2. Consequently, perfect equality is ensured. With the inclusion of polarizers, such equality can be broken, as expressed in Eq. (40).

Results

Table 3 Probabilities of decisions in the entangled-photon decision maker.

Table 3 summarizes the probabilities corresponding to pairs of decisions derived using Eq. (40). Note that conflicts never happen in this system; this implies that two players always choose different options, whereas asymmetric decision-making (\(p_{12} \ne p_{21}\)) can be achievable by adequately setting \(\alpha _i\) and \(\beta _i\) (\(i=1,2\)). Nonetheless, \(p_{12}\) and \(p_{21}\) cannot be larger than 0.5; thus, photon losses are inevitable unless \(\alpha _X=\alpha _Y=0\) and \(\beta _X=\beta _Y=\pi /2\).

The blue-colored region in Fig. 3b presents the achievable pairs of \(p_{12}\) and \(p_{21}\) in the diagram of \(p_{12}\) and \(p_{21}\) in the horizontal and vertical axes, respectively. This demonstrates clearly that any \(p_{12}\) and \(p_{21}\) are accepted if they are equal or less than 0.5; thus, the blue-colored feasible zone resulted in a square area in the lower left corner of the diagram.

Figure 4b shows the relationship between the conflict probability plus loss probability and asymmetry ratio. The mathematical formula of the red-lines border is expressed as:

$$\begin{aligned} y={\left\{ \begin{array}{ll} \dfrac{1}{1-2x} ~~&{} \text {when } y \ge 1, \\ \\ 1-2x~~&{} \text {when } y \le 1. \end{array}\right. } \end{aligned}$$
(41)

The detailed derivation is presented in the Supplementary Information. Figure 4b shows that at most, 50% loss of photons must be tolerated to obtain any asymmetry ratios. The loss is minimal for the symmetric case where \(p_{12}=p_{21}=0.5\). This implies that the entangled photon decision maker is appropriate when maintaining an even treatment is ideal because the two players are almost equal.

Obtaining a specific asymmetry ratio

The intended asymmetry ratio r must be obtained by the entanglement system. For this, the focus was placed on two cases: \(p_{12}=0.5\) and \(p_{21}=0.5\), because the loss probability is the smallest on the possible zone in Fig. 3b under the same asymmetry ratio. When \(p_{12}=0.5\),

$$\begin{aligned} r=\frac{p_{21}}{p_{12}}=\sin ^2 \alpha _X \cos ^2 \alpha _Y \le 1. \end{aligned}$$
(42)

When \(p_{21}=0.5\),

$$\begin{aligned} r=\frac{p_{21}}{p_{12}}=\frac{1}{\sin ^2 \alpha _Y \cos ^2 \alpha _X} \ge 1. \end{aligned}$$
(43)

Therefore, for an asymmetry ratio greater than 1, Eq. (43) must be solved. Otherwise, Eq. (42) must be solved. Thus, the parameters required to achieve the intended r can be obtained. However, loss becomes great when the desired ratio is extreme because either \(p_{12}\) or \(p_{21}\) is fixed to 0.5.

Asymmetric decision-making by OAM attenuation

Formulation

This section presents decision-making by the OAM attenuation system. Figure 7 shows the architecture of the OAM attenuation setup. The two upper detectors, APD1 and APD2, are related to the decision-making of player X, whereas the two lower ones, APD3 and APD4, are related to the decision-making of player Y. The probability of each pair of decisions is the probability of the symmetric decision-making system multiplied by attenuation rates. Thus, the probability of player X choosing option i and player Y selecting option j is expressed as:

$$\begin{aligned} p_{ij} = \left( a_i ^2 b_j ^2 + a_j^2 b_i^2 -2 a_i a_j b_i b_j \cos (\theta _i - \theta _j) \right) d_{X i}^2 d_{Yj}^2, ~~i, j \in \{1, 2\}. \end{aligned}$$
(44)

When \(d_{X1} \ne d_{X2}\) and \(d_{Y1} \ne d_{Y2}\), the OAM attenuation system fulfills asymmetric decision-making.

Figure 7
figure 7

Decision-making system by OAM attenuation. HG hologram, ATT attenuator, BS beam splitter, SLM spatial light modulator.

Results

Table 4 Probabilities of decisions in the OAM attenuation.

Table 4 represents the probabilities for pairs of decision-making. Figure 3c shows the available pairs of \(p_{12}\) and \(p_{21}\). As evident, photon loss is inevitable in this system. Figure 4c shows the relationship between the loss probability and asymmetry ratio. If 75% of photon loss is allowed, all asymmetry ratios can be obtained. However, 75% is the highest probability for achieving all asymmetry ratios among the three systems. Therefore, OAM attenuation performs less effectively compared to the other two systems. Despite its inferior performance, the OAM attenuation system excels in the aspect of no conflict arising, like the entangled photon decision maker, because it is based on the symmetric OAM system. The mathematical formula of the red-lined boundary in Fig. 4c is expressed as:

$$\begin{aligned} y = {\left\{ \begin{array}{ll} 3 - 4x ~~ \text {when} ~~ y \le 1, \\ \\ \dfrac{1}{3-4x} ~~ \text {when} ~~ y\ge 1. \end{array}\right. } \end{aligned}$$
(45)

The derivation of Eq. (45) is presented in the Supplementary Information.

Obtaining a specific asymmetry ratio

Next, we explore how an intended asymmetry ratio r can be obtained in the OAM attenuation setup. For this, we focus on the case \(p_{12}=0.25\) or \(p_{21}=0.25\), because the loss probability is the smallest within the possible zone in Fig. 3c for this particular asymmetry ratio. When \(p_{12} = 0.25\),

$$\begin{aligned} r= \frac{p_{21}}{p_{12}}=\left( a_2^2 b_1^2 + a_1^2 b_2^2 - 2 a_1 a_2 b_1 b_2 \cos (\theta _1-\theta _2) \right) d_{X2}^2 d_{Y1}^2 \le 1. \end{aligned}$$
(46)

When \(p_{21} = 0.25\),

$$\begin{aligned} r= \frac{1}{\left( a_2^2 b_1^2 + a_1^2 b_2^2 - 2 a_1 a_2 b_1 b_2 \cos (\theta _1-\theta _2) \right) d_{X1}^2 d_{Y2}^2} \ge 1. \end{aligned}$$
(47)

If we introduce u,  v such that \(a_1 = \cos {u},~~a_2 = \sin {u}\) and \(b_1 = \cos {v},~~b_2=\sin {v}\), then Eq. (46) becomes:

$$\begin{aligned} r = \frac{p_{21}}{p_{12}} = \frac{1}{2} \left( 1- \cos {2u} \cos {2v} - \cos (\theta _1 - \theta _2) \sin {2u} \sin {2v} \right) d_{X2}^2 d_{Y1}^2, \end{aligned}$$
(48)

and Eq. (47) becomes:

$$\begin{aligned} r = \frac{2}{\left( 1- \cos {2u} \cos {2v} - \cos (\theta _1 - \theta _2) \sin {2u} \sin {2v} \right) d_{X1}^2 d_{Y2}^2}. \end{aligned}$$
(49)

When the intended r is greater than 1, the parameters required to achieve r can be obtained by solving Eq. (48) about u and v. Otherwise, Eq. (49) must be solved. However, loss becomes great when the desired ratio is extreme because either \(p_{12}\) or \(p_{21}\) is fixed to 0.25.

Conclusion

This study analysed asymmetric collective decision-making via quantum properties, quantum interference, and entanglement to explore the possibility of how to implement affirmative actions to reduce disparities in the case of two players. Asymmetry in decision-making was successfully realized by three systems: the asymmetric OAM system, asymmetric entangled photon decision-maker, and OAM attenuation. The probability of players X and Y choosing options 1 and 2, respectively, which is denoted by \(p_{12}\), can be different from the probability of players X and Y choosing options 2 and 1, respectively, which is denoted by \(p_{21}\). Thus, \(p_{12} \ne p_{21}\) was demonstrated to be achievable. Previous studies were limited to symmetric collective decision-making (\(p_{12} = p_{21}\)). Through asymmetric joint decisions, the unfairness and inequalities among agents, which might be innate prior to the games, could be modulated.

Further, herein, several limitations in achieving the asymmetry were clarified. In all systems, some impossible pairs of \(p_{12}\) and \(p_{21}\) were shown to exist. This is partly because of photon loss. The photon loss causes a reduction in \(p_{12}\) and \(p_{21}\), which in turn limits the available combinations of \((p_{12}, p_{21})\). This is because there is a relationship: \(Loss ~probability + p_{11} + p_{12} + p_{21} + p_{22} = 1\). Without any loss, it would be possible to obtain a more diverse set of \((p_{12}, p_{21})\) combinations. However, loss of photons or decision conflicts were inevitably presented with a certain probability. For example, only two combinations of \((p_{12}, ~p_{21})\) were achievable without any loss or conflicts in the asymmetric OAM system, whereas the asymmetric entangled photon decision maker could realize zero-loss decisions only in the symmetric case. However, the OAM attenuation setup could not realize any situation without loss or conflict. The minimum probability of loss or conflicts to achieve any ratio of \(p_{12}\) and \(p_{21}\) was the smallest, that is, 50%, in the asymmetric OAM system and asymmetric entangled photon decision maker. In the case of the OAM attenuation, 75% loss was required to be tolerated to obtain any ratio. This study analytically clarified the boundary between the feasible and infeasible zones of the combination of \((p_{ 12}, ~p_{21})\) for each of the three systems.

However, the existence of unavailable pairs of \(p_{12}\) and \(p_{21}\) may not be a serious concern because any ratio of \(p_{12}\) and \(p_{21}\) can be accomplished by accepting the certain probability of loss or conflicts. Further, to enable affirmative actions to tackle inequalities, all pairs were not required if an appropriate degree of asymmetry could be achieved. The formula for configuring the physical parameters in the quantum systems to realize a specified value of the ratio of \(p_{12}\) and \(p_{21}\) was analytically derived. Finally, because all systems were tuned for only two-player situations, extensions to cases with more players are expected.

In situations when powerful affirmative action is needed as the inequality between players is serious, the asymmetric OAM system is best suited because it exhibits superior performance with low conflict rates and photon loss. Whereas the use of the asymmetric entangled photon decision maker is advisable owing to a minor loss of photons and lower conflict rate when the inequality is rather small and moderate affirmative action is sufficient. This study contributes to extending the photonic and quantum collective decision-making to asymmetric properties, thus paving the way toward covering a broader sense of equality and social welfare based on quantum principles.