Introduction

The field of quantum computing is living an unprecedented expansion due to recent experimental advances. Quantum technologies as a whole1,2,3,4,5,6,7,8 are being benefited from recent results both on quantum computer implementations9,10,11,12,13,14,15 as well as on quantum communications16,17,18,19,20,21,22. Due to this, people have started to think seriously about industrial applications of quantum computers4. Among the different verticals, finance is among the most promising ones, given the ubiquitousness of intractable mathematical problems. For a detailed description of applications of quantum computing in finance, see Ref.23. Among these applications, one of the most prominent is quantum optimization. There are many important optimization problems in finance which can be solved more efficiently using quantum computing. See Refs.24,25,26,27,28 for some examples. In this setting, the most paradigmatic optimization problem in finance is that of portfolio optimization, both in its static and dynamic versions.

Our aim in this work is to solve the dynamic portfolio optimization problem. An issue that investors often face is that short term investments tend to be taxed much higher than long term investments. It is common for investors to impose a minimal holding period, preventing any purchased asset from being sold before a predetermined period of time. We build upon the work from Ref.29, and demonstrate an efficient post-selection protocol to impose the minimal holding constraint. Similarly to Ref.29, we further impose that investors must invest in integer bundles, as is typically the case for exchange-traded funds (ETF) shares.

Financial model

Our aim is to find the best investment trajectory for an investor, given a level of risk that they want to take. According to Modern Portfolio Theory, the optimal investment at a defined level of risk is the one which maximizes profit30. The portfolio’s risk—or volatility—is computed from the assets’ covariance matrix. The ratio of returns to risk is the Sharpe ratio, which is our metric for comparing investments. These are defined mathematically in what follows.

Let us define \({\omega }_{tn}\), the fraction of the total budget invested in asset n at time t. The optimal holding trajectory \({\omega }_{tn}\) minimizes the Modern Portfolio Theory cost function:

$$\begin{aligned} H_0 = \sum _{t} -\mu _t^T {\omega }_t + \frac{\gamma }{2} {\omega }_t^T \Sigma _t {\omega }_t. \end{aligned}$$
(1)

Here, \(\mu _t\) is the vector of the logarithmic returns at time t and \(\Sigma _t\) is the covariance matrix. The risk aversion \(\gamma \) controls the portfolios penalty for risk. This determines the amount of risk an investor is willing to take. The forecasted returns and covariance matrices can be deduced from the stocks’ prices. This is detailed, for instance, in Ref.29. Following this notation, the Sharpe ratio is given by

$$\begin{aligned} \mathrm{Sharpe} \equiv \frac{\sum _{t} \mu _t^T {\omega }_t}{\sqrt{ \sum _t {\omega }_t^T \Sigma _t {\omega }_t}}, \end{aligned}$$
(2)

which, as we said, is nothing but the ration between the total return (numerator) and volatility (denominator).

At any point in time, we would like the entire available budget to be invested. This constrains the holdings \({\omega }_t\) to be normalized at any time t. We enforced this by penalizing portfolios which did not respect this constraint. The cost function we optimized is therefore

$$\begin{aligned} H = H_0 + \sum _t \rho \left( \sum _n {\omega }_{tn} - 1 \right) ^2, \end{aligned}$$
(3)

where the Lagrange multiplier \(\rho \) is an hyperparameter of the model. Note also that Eq. (3) can be written as:

$$\begin{aligned} H = \sum _t h_t, \end{aligned}$$
(4)

for some \(h_t, t \in \{0, N_t\}\). This means that the optimal investment at time t is independent of our investment history, as long as no more constraints are included that correlate trading times among each other. In such a situation it is therefore sufficient to minimize \(h_t\) at every time t to compute the optimal holdings \({\omega }_t\). This is a good approximation, for instance, for the trading investment funds of an everyday investor, for which transaction costs and market impact may be negligible.

We assume shares can only be sold in large bundles. These constraints imply that our objective variables—the instantaneous asset holdings \({\omega }_{tn}\)—are integer variables. At any trading time, we can encode the qubit values to our qubits using a binary encoding:

$$\begin{aligned} {\omega }_{n} = \frac{1}{K} \sum _{q=0}^{N_q-1} 2^q x_{n, q}, \end{aligned}$$
(5)

where \(x_{n, q} \in \{0,1\}\) is the readout value of the \(q\text {th}\) qubit assigned to institution n, and K is the total investment.

This encoding has several consequences. First, the holdings are always bound by: \({\omega }_{tn} \in [0, 1]\). Since \({\omega }_{tn} \ge 0\), investors are not given the option to sell short securities. Allowing short-selling could be an interesting extension for this problem. Second, this encoding allows investors to split their total investment in a maximum of K bundles. This discretization is part of the reason why this problem is so hard to solve on classical computers29. From a practical perspective, using the bit variables the cost function can then be written as

$$\begin{aligned} H = x^T Q x, \end{aligned}$$
(6)

with x a vector of bit variables, and Q a matrix of real numbers. In the language of combinatorial optimization, this is a quadratic unconstrained binary optimization (QUBO) problem, and is the natural input for the D-Wave quantum annealer.

In Eq. (5), we have introduced the bit depth \(N_q\), the number of qubits which encode the \(n\text {th}\) asset holdings. Typically, we would choose \(N_q\) such that \(2^{N_q} -1 \ge K\). This choice allows the investor to invest their entire budget into a single asset. We may, however, want to impose a diversification constraint. Typically, large financial institutions are not allowed to invest more than \(5\%\) of their total budget in any single asset. Choosing \(N_q\) such that \(2^{N_q} -1 = 0.05 K\) naturally implements this diversification constraint.

Minimum holding period

The holding period is the amount of time which elapses between an investment’s purchase and its sale (or sale of a security). Because long-term gains are taxed more favourably than short-term gains, it is common to demand that investments meet a minimum holding period. In this study, we imposed a minimal holding period of 7 days, which is a natural timeframe for some realistic banking products.

Here we imposed this constraint by post-selecting investment trajectories which respected this condition. The number of possible trajectories grows exponentially \(O\left( N_a^{K \times N_t}\right) \). It is therefore computationally prohibitive to successively consider each trajectory and verify if they meet the minimal 7 day holding period.

Instead, we build the investment trajectory iteratively, following the flow chart presented Fig. 1. At every time t, we choose holdings \({\omega }_\text {guess}\) for that instant. We compute them by sampling at each time t the low-energy subspace of \(h_t\) in Eq. (4) using, in our case, the D-Wave machine. the fact that the problem can be decoupled into separate times t allows us to tackle more assets at every given time with the quantum annealer. The sampled instantaneous portfolios are considered in order of decreasing Sharpe ratio. As customary in finance, a large Sharpe ratio is an indicator of a good-quality portfolio, since it implies that the return is large for the risk being assumed. We note, however, that other figures of merit could be used, such as the returns (the larger the better), or the volatility (the lower the better). In our case, though we choose to work with the Sharpe ratio because of the afore-mentioned reasons. We only retain candidates which fulfil the minimum holding constraint.

Figure 1
figure 1

Flow chart detailing the post-selection algorithm used to efficiently eliminate trajectories which do not meet the minimum 7 day holding period.

By following iteratively, the algorithm implements a tree-like selection and efficiently rules out trajectories which do not meet the minimal 7-day holding period, as is illustrated in Fig. 2. Note that at any time t, there always exists at least one solution which meets the minimal 7-day holding period: the solution \({\omega }_t = {\omega }_{t-1}\).

Figure 2
figure 2

Candidate investment trajectories are efficiently ruled out by the post-selection algorithm. Node \({\omega }^{(i)}_t\) represents the \(i\text {th}\) candidate holdings at time t. Green nodes meet the minimum holding period, while grey nodes do not. When the constraint is not met at time t, the node is crossed out and all resulting investment trajectories are eliminated.

Alternatively, we could include this constraint by penalizing in Eq. (3) trajectories which do not meet the minimum holding period. The corresponding penalty term would then correlate different time steps between each other, such that the cost function would not have the separable form of Eq. (4). In such a case, the problem would become even more computationally intractable, and a global optimization strategy over all possible trading times would be necessary from the very beginning. It is therefore remarkable that our simple approach, based on quantum sampling and post-selection, produces such high-quality portfolios in an extremely efficient manner.

Dimensional reduction

The number of objective variables in this problem is proportional to the number of assets, multiplied by the bit depth. For problems of commercial value, this can be very large. Because current quantum resources in Noisy Intermediate Scale Quantum (NISQ) devices are limited, a good option is to apply dimensional reduction techniques. Reducing the space of solutions searched also makes it easier for the optimization routine to converge to the global minimum.

In Ref.29, authors described a method to cluster assets based on their time series’ similarity. In the following, we apply this method, and discard all except for the best asset in each cluster (based on its historical Sharp ratio). The dimensional reduction strategy follows from applying first a Hodrick-Prescott smoothing, to extract data trends, and then computing the euclidean distance between different pairs of trends of assets. Thanks to this, we can perform a clustering of the assets into clusters of assets of similar behavior. This dimensional reduction strategy also allows us to significantly lower the portfolio’s risk, by diversifying our investment among maximally uncorrelated assets. Moreover, our target investors typically select their investment risk category. To meet this risk requirement, we performed a pre-selection among available assets. We computed each assets’ historical volatility, and discarded any asset which significantly exceeded the agreed risk level. This allows us to construct portfolios for investors with different risk profiles: high-risk, medium-risk, low-risk, and so forth.

In practice, we determined the optimal number of clusters \(N_c\) by studying each clusters’ variance. In the studied data, we found that the mean clusters’ variance decreased exponentially with the number of clusters. For \(N_c>7\), the clusters’ variance did not significantly decrease, indicating that 7 clusters capture almost all of the system’s variations. We therefore set \(N_c=7\) in our calculations.

Results

Following the protocol detailed in previous sections, based on looking at optimal Sharpe ratios of portfolios computed with the D-Wave quantum annealer, we found the optimal investment trajectory between May the 31st 2019 and May the 31st 2020 among a total of 50 international assets and indices. Daily transactions were allowed, but investments had to meet a minimum 7 day holding period. We allowed investments to be split into a maximum of \(K=5\) bundles, and set the bit depth to \(N_q=2\).

We found the optimal portfolios by minimizing Eq. (3) on the D-Wave 2000Q processor. Computed investment trajectories for five risk packages (minimum risk, \(5\%\), \(10\%\), \(15\%\), and \(20\%\) annual risk) are plotted in Fig. 3. Also plotted are 1000 random investment trajectories which meet the minimum 7 day holding period. These give us some intuition about the space of allowed portfolios. We see that allowed portfolios tend to exist between \(-20\) and \(20\%\) annualized profit, and between \(10\) and \(17\%\) risk (as measured by the annualized volatility). It is clearly visible that the investment trajectories obtained using our method based on quantum optimization, dimensional reduction and post-selection, tend to gather much closer to the efficient frontier in portfolio space than those obtained by investing at random.

Figure 3
figure 3

Investment trajectories, chosen among 50 assets spanning May the 31st 2019 to May the 31st 2020. The blue dots represent randomly selected trajectories. The coloured dots are investment trajectories with different levels of risk obtained using our quantum optimization toolbox.

In Fig. 4, we show an example investment trajectory. This corresponds to the \(15 \%\) risk package (purple dot in Fig. 3), which yields \(\approx 60\%\) return on investment. We can see from this figure that the \(N_c=7\) most profitable assets which meet the \(15 \%\) risk limit are Bankia Fonduxo Universal FI, Bankia Futuro Sostenible Universal FI, Bankia Indice Eurostoxx Universal FI, Amundi Fds Pioneer US Bd A2 EUR C, Fidelity Global Technology A-Acc-EUR, Nordea 1 − Chinese Equity Y EUR, and Franklin India A(acc)EUR. As can be read off from Fig. 4, the optimal investment on May the 31st 2019 meeting these constraints is: \(1/5\text {th}\) in Fidelity Global Technology A-Acc-EUR, \(1/5\text {th}\) in Amundi Fds Pioneer US Bd A2 EUR C, \(1/5\text {th}\) in Franklin India A(acc)EUR, and \(2/5\text {th}\) in Nordea 1 − Chinese Equity Y EUR. The everyday composition of the trading trajectory can be read form the chart, and it always fulfils by construction the minimum holding period.

Figure 4
figure 4

The optimal investment trajectory between May the 31st 2019 and May the 31st 2020 among considered assets for an investor wanting to take \(\le 15 \%\) risk. As can be seen from Fig. 3, this portfolio provided \(\approx 60\%\) annualized return on investment (purple dot in the Fig. 3). Data for this calculation was obtained from daily prices of international assets and indices as explained in the text, which are public and can be obtained from e.g. Bloomberg, Yahoo Finance, and/or Morningstar databases.

Finally, we would like to discuss about the computational running time and quality of the computed portfolios, in comparison with other classical and quantum optimization strategies. The running time to compute the optimal portfolios in Fig. 3 using our strategy of combining D-Wave and post-selection was of just a few minutes, for daily portfolios of 50 assets. For comparison, other quantum optimization strategies, such as Variational Quantum Eigensolvers (VQE) on universal quantum processors are highly limited at the moment and can only handle very small portfolios with typically 2–3 assets at most29. Compared to classical solvers, our method is way faster than brute-force classical search, which scales exponentially fast in the number of qubits and quickly hits astronomical numbers (the problem is clearly intractable in this sense for 50 assets). Compared to other solvers, D-Wave quantum annealing is known to be faster and allow larger portfolios than solvers such as Gekko, see Table IV in Ref.29, which equally applies in our case.

Conclusion

Here we have proposed a hybrid quantum-classical algorithm for dynamic portfolio optimization with minimal holding period. Our algorithm is based on quantum sampling of near-optimal portfolios at each trading step, combined with dimensional reduction and post-selection. We tested our idea using quantum optimization with the D-Wave 2000Q processor, and real data from 50 assets during 1 year. Our study shows that the method is remarkably efficient and produces in few minutes results close to the optimal efficient frontier in portfolio space, much better than typical random portfolios. We also showed how our approach can be used to produce trajectories corresponding to different risk profiles, in accordance with typical products offered by financial institutions to investors. Our results are a clear example of how the combination of quantum and classical techniques can provide novel and valuable methods to deal with real-life financial data in today’s NISQ quantum processors. Moreover, we believe that the use of distributed quantum computer architectures, such as those discussed in Ref.3, could accelerate even more the implementation of this type of quantum computing algorithms to solve real-life financial mathematical problems.