Doubly elastic net regularized online portfolio optimization with transaction costs

Yao, Xiaoting; Zhang, Na

doi:10.1038/s41598-023-46059-2

Download PDF

Article
Open access
Published: 02 November 2023

Doubly elastic net regularized online portfolio optimization with transaction costs

Xiaoting Yao¹ &
Na Zhang¹

Scientific Reports volume 13, Article number: 18937 (2023) Cite this article

566 Accesses
Metrics details

Subjects

Abstract

Online portfolio optimization with transaction costs is a big challenge in large-scale intelligent computing community, since its undersample from rapidly-changing market and complexity from varying transaction costs. In this paper, we focus on this problem and solve it by machine learning system. Specifically, we reformulate the optimization problem with the minimization over simplex containing three items, which are negative expected return, the elastic net regularization of transaction costs controlled term and portfolio variable, respectively. We propose to apply linearized augmented Lagrangian method (LALM) and the alternating direction method of multipliers (ADMM) to solve the optimization model in a higher efficiency, meanwhile theoretically guarantee their convergence and deduce closed-form solutions of their subproblems in each iteration. Furthermore, we conduct extensive experiments on five benchmark datasets from real market to demonstrate that the proposed algorithms outperform compared state-of-the-art strategies in most cases in six dimensions.

Forecasting stock prices changes using long-short term memory neural network with symbolic genetic programming

Article Open access 03 January 2024

Hybrid quantum investment optimization with minimal holding period

Article Open access 01 October 2021

Multi-level deep Q-networks for Bitcoin trading strategies

Article Open access 08 January 2024

Introduction

Online portfolio optimization has become a hot topic in the intelligent computing and machine learning community^1,2,3,4. It always obtains high return in the short time with extensive calculation. Hence, there are lots of difficulties such as the undersample in the short time for calculation and the transaction cost problem, which we attack in this paper based on machine learning system.

Portfolio optimization originates from the Markowitz mean-variance (MV) theory⁵. However, the online portfolio problem can not be managed by most extended MV theories, since its instantaneous transaction data can not provide a stable and reliable distribution to estimate significant financial parameters. Besides, it is difficult to guarantee the generalization of the optimization model, since online portfolio selection problem is in the rapidly-changing circumstance. In this case, we usually have only a small window to make decision and always undersample for training. Nevertheless, online portfolio optimization is attractive and more general in reality, since its timely and sensitive response to the dynamic market. One common way to solve above problems is decomposing raw data structure to extract the return or risk information. Chen et al.⁶ exploited wavelet feature engineering to transform stock price data, which resulted in higher Sharpe ratio in long-short portfolio trading. Lai et al.⁷ extracted instantaneous risk structure from price covariance matrix, to improve profit by lost control. Another common way to manage online portfolio problem is using machine learning methods. Kelly’s criterion in the information theory^8,9,10 brings out the exponential growth rate (EGR) approach, which focuses on the wealth change integrated with online learning frameworks^1,11,12. The EGR approach is suitable for online portfolio optimization for the reason that it always works via data-driven rather than requiring strict statistical assumptions. Recently, researchers focus more on various machine learning schemes^13,14,15,16 in the basis of EGR approach. Zou et al.¹⁷ proposed that elastic net (the linear combination of L1 norm and the square of L2 norm) leads to sparsity while maintaining a higher accuracy. Ho et al.¹⁸ applied elastic net for portfolio optimization, and Shen et al.¹⁹ constructed sparse portfolio with doubly norm regularized terms, showing an excellent performance. Hence, elastic net interests us and we illustrate it in the following text. Furthermore, researches^20,21 indicate that regularized the portfolio variable contributes to the generalization of the optimization model, which we would also elaborate in the following. Lai et al.²² applied L1 regularized on the portfolio variable and resulted sparse and stable portfolios in the case of short-term investment. Moreover, Luo et al.²³ regularized portfolio variable by L0 norm while showing higher sparsity and better return in online portfolio selection.

However, above strategies even most portfolio strategies do not consider transaction costs, which could not be ignored (especially in online portfolio problem) in reality. It is worth noting that transaction costs can be classified into explicit and implicit. Implicit transaction costs are usually caused by market impact or liquidity risk^24,25, and the explicit is from the transaction. Particularly, we only focus on explicit transaction costs, such as taxes²⁶, buying and selling transaction fees, since they influence more on retailers while we incline to research the online portfolio for retailers.

The first explicit extension was proposed by Bauer et al.²⁷ that Cover’s Universal Portfolio⁹ is still suitable for imposing transaction costs. However, they did not take transaction costs into the decision process. Albeverio et al.²⁸ proposed a new transaction costs optimization model, where reformulated transaction costs as the distance between portfolios. Certainly, it is widely to manage transaction costs via machine learning system. Györfi et al.²⁹ augmented the original Markowitz objective function by adding a penalty term proportional to the sum of the absolute of the portfolio weights, which encourage sparse portfolios and allow transaction costs to be considered. Das et al.³⁰ applied machine learning method to study transaction costs problem, and solve it by GP algorithm. Furtherly, Li et al.³¹ solve portfolio selection problem with transaction costs by proposing a TCO framework, which can get closed-form formulae for portfolio update, and they also found the relationship between transaction costs and portfolios.

Although online portfolio optimization have been studied for decades, it lacks of models considering both transaction costs and sparsity as well as correlations of portfolio variables, simultaneously. Thus, we aim to design in this paper a mathematical model concerning transaction costs as well as regularization of portfolios to enhance the cumulative net wealth and the generalization ability of the model. Noting that elastic net regularization can perform automatic variable selection and maintain related variable groups, we propose a doubly elastic net regularized model for portfolio selection problems. We then apply the linearized augmented Lagrangian method (LALM) and alternating direction method of multipliers (ADMM) to solve the proposed model. It is worth mentioning that guaranteeing the solving algorithm converging to the corresponding model is difficult. A number of researches^18,19,31 did not prove the convergence of the algorithm theoretically, but we do that. Numerical experiments show the efficiency of the proposed algorithms. The innovations and main contributions of this paper are as follows:

We propose a minimization problem over simplex which concerns transaction costs and regularization of portfolios simultaneously. The objective of the problems contains three terms: the negative expected return, the elastic net regularization of the difference between the portfolios of the next and last periods to control the transaction cost, the elastic net regularization (reduces to the square of L2 norm due to the simplex) of the portfolio vector to improve the generalization of the model. Thanks to the properties of the elastic net, the proposed model considers the transaction cost, the sparse property and the correlation between variables simultaneously.
In order to solve the proposed model, we apply the LALM to the model and demonstrate the sequence generated by the algorithm converges to a solution of the proposed model. Further, the closed-form solution of the subproblem in each iteration is established, enabling the computational efficiency of the algorithm. We point out that LALM does not need to project any vector to the simplex set in each iteration, which may save computational time in practice.
We further apply ADMM to solve the proposed model by appropriately splitting the variable into two variables. The convergence of ADMM is established through proving the existence of saddle point of the corresponding augmented Lagrangian function. Different from LALM, each update of ADMM is restricted to the simplex set, which may improve the accuracy of the algorithm.
We compare the proposed algorithms with the state-of-the-art methods for portfolio selection on four benchmark datasets. Numerical experiments illustrate that the proposed algorithms perform better than other compared methods in most cases.

This paper is organized as follows. We present some preliminaries and related works in section “Preliminaries”. The whole portfolio selection system and its solving algorithms are illustrated in section “Portfolio optimization”. Section “Experiment” focuses on the experimental results to evaluate the efficiency of our proposed algorithms and section “Conclusion” summarizes the paper.

Preliminaries

Problem setting

In the real market, transaction costs can not be ignored especially in the short-term investment. Assuming that there are m assets invested for T periods in a financial market. The relative prices can be collected as a vector $\mathbf {x_t}=(x_{t,1},\ldots ,x_{t,i},\ldots ,x_{t,m}),\ t=1,2,\cdots ,T$, where $x_{t,i}=\frac{P_{t,i}}{P_{t-1,i}}$ denotes the relative price and $P_{t,i}$ is the closed price of i th asset in t th period, respectively. A portfolio vector $\mathbf {b_t}=(x_{b,1},\ldots ,x_{b,i},\ldots ,x_{b,m})$ is restricted in the simplex $\bigtriangleup _m=\{{\textbf{b}}: {\textbf{b}}\in {\mathbb {R}}_+^m,\ \sum _{i=1}^{m}b_i=1 \}$ with assumptions of self-financed and non-margin and non-shorting, where $b_{t,i}$ indicates the proportion of total wealth invested in i th asset of t th period.

At the beginning of t th period, the portfolio is set to $\mathbf {b_t}$, thus the wealth allocation at the end is changed to ${\hat{b}}_{t}=\frac{\mathbf {b_t}\centerdot \mathbf {x_t}}{<\mathbf {b_t},\mathbf {x_t}>}$, where $<\cdot \ ,\ \cdot>$ is the inner product and $\cdot$ is the element-wise product, since the fluctuation of market without rebalancing the portfolio during $t^{th}$ period. Supposing transaction costs denoted as $\gamma$, researches^27,28,29,31 proposed the relationship of $\gamma$ and wealth:

$$\begin{aligned} 1=w_{t-1}+\gamma \vert \vert {\hat{\textbf{b}}_{t-1}}-\mathbf {b_t} w_{t-1}\vert \vert _1\, \end{aligned}$$

(1)

where $w_{t-1}$ denotes the net proportion wealth after transaction costs, and $\vert \vert \cdot \vert \vert _p$ denotes the ${\mathcal {l}}_p$ norm. Furthermore, Li et al.³¹ pointed out that final cumulative wealth implicated transaction costs should be updated as:

$$\begin{aligned} S_T=S_0\prod _{t=1}^{T}\left( <\mathbf {b_t}, \mathbf {x_t}>w_{t-1}\right) \, \end{aligned}$$

(2)

where $S_0$ is always normalized to 1. $S_T$ is also called net cumulative wealth, which possesses more computing and comparing significance.

Related works

Brenchmarks systems

Uniformly buy-and-hold (UBAH), Beststock (BEST) and Best Constant Rebalanced Portfolios (BCRP) are three benchmark portfolio strategies. UBAH is a simple but widely-used system, which invests evenly at the beginning and holds until the whole period: ${\textbf{b}}_1=\left( \frac{1}{m},\ldots ,\frac{1}{m}\right)$. Thus, it is obviously that the cumulative wealth of UBAH is $S_T={\textbf{b}}_1 \left( \odot _{i=1}^T{\textbf{x}}_t \right)$, where $\odot$ denotes the element-wise product. Similarly but differently, BEST invests totally in the best outperformance asset in hindsight and remains unchanged: ${\textbf{b}}_1=\mathop {\arg \max }\limits _{\mathbf {b \in \bigtriangleup _m}}\ {\textbf{b}} \left( \odot _{i=1}^T{\textbf{x}}_t \right)$ and $S_T=\mathop {\max }\limits _{\mathbf {b \in \bigtriangleup _m}}\ {\textbf{b}} \left( \odot _{i=1}^T{\textbf{x}}_t \right)$. Besides, BCRP updates portfolio to ${\textbf{b}}^*=\mathop {\arg \max }\limits _{\mathbf {b \in \bigtriangleup _m}}\ \sum _{t=1}^{T}\log ({\textbf{b}}^T{\textbf{x}}_t)$ in each period, where ${\textbf{b}}^*$ denotes the portfolio maximizing return in hindsight. Therefore, the cumulative wealth of BCRP is $S_T=\prod _{t=1}^{T}\left( {\textbf{b}}^{*T}{\textbf{x}}_t \right)$.

Related works on transaction costs

Gy$\mathrm {\ddot{o}}$rfi et al.²⁹ extended the Markowitz portfolio framework by adding penalty term to allow proportional transaction costs considered. The model is widely adopted and attracts an amount of attraction in the area of controlling transaction costs by using penalty term in the portfolio selection problem. They denoted transaction costs related factor as the ratio of net wealth after rebalancing to wealth before rebalancing, that is, $w_{t-1}=\frac{N_{t-1}}{S_{t-1}}$. The rate of proportional transaction costs during sales and buys were denoted by $0<c_s<1$ and $0<c_b<1$, which are controlled by the following formula

$$\begin{aligned} 1=w_{t-1}+c_s\sum _{i=1}^{m}\left( \frac{b_{t-1,i}x_{t-1,i}}{{\textbf{b}}_{t-1}\cdot {\textbf{x}}_{t-1}}-b_{t,i}w_{t-1}\right) ^+ +c_b\sum _{i=1}^{m}\left( b_{t,i}w_{t-1}-\frac{b_{t-1,i}x_{t-1,i}}{{\textbf{b}}_{t-1}\cdot {\textbf{x}}_{t-1}}\right) ^+. \end{aligned}$$

(3)

They proposed the recursive portfolio strategy as follows

$$\begin{aligned} {\textbf{b}}_{t+1}=\mathop {\arg \max }\limits _{\mathbf {b'}}\ v({\textbf{b}}_{t},{\textbf{b}}',{\textbf{X}}_t)+(1-\delta _t)\textrm{E}\{F_{\delta _t}({\textbf{b}}',{\textbf{X}}_{t+1})\vert {\textbf{X}}_t\}, \end{aligned}$$

(4)

where $F_{\delta }({\textbf{b}},{\textbf{X}})$ is the recursive function formulated by the discounted Bellman equation:

$$\begin{aligned} F_{\delta }({\textbf{b}},{\textbf{X}})=\max _{\mathbf {b'}}\ v({\textbf{b}},{\textbf{b}}',{\textbf{x}})+(1-\delta )\textrm{E}\{F_{\delta }({\textbf{b}}',{\textbf{X}}_{2})\vert {\textbf{X}}_1={\textbf{x}}\}, \end{aligned}$$

(5)

where $\delta _t$ is a discount factor such that $\delta _t \rightarrow 0$, ${{\textbf{X}}_t}$ is the homogeneous and first order Markov process, and $v({\textbf{b}},{\textbf{b}}',{\textbf{x}})$ is the inner function defined in²⁹.

Li et al.³¹ considered another proportional transaction cost model named Transaction Cost Optimization (TCO), which is a sparse portfolio selection model by adding norm penalized. They took transaction costs as the regularization term of the portfolio model by L1 norm penalized, and obtained the closed-form solution of portfolio update through proximal gradient descent method. Li et al. theoretically guaranteed that transaction cost is related to $\vert \vert {\textbf{b}}-{\hat{\textbf{b}}}_t\vert \vert _1$ by proving

$$\begin{aligned} \begin{aligned} \frac{1-\gamma }{1-\gamma +\gamma \vert \vert {\hat{\textbf{b}}_{t-1}}-\mathbf {b_t}\vert \vert _1}\le w_{t-1} \le \frac{1+\gamma }{1+\gamma +\gamma \vert \vert {\hat{\textbf{b}}_{t-1}}-\mathbf {b_t}\vert \vert _1}\, \end{aligned} \end{aligned}$$

(6)

for which indicated that the net proportion is inversely related to $\vert \vert {\textbf{b}}-{\hat{\textbf{b}}}_t\vert \vert _1$. Thus they proposed the following model:

$$\begin{aligned}{} & {} {\textbf{b}}_{t+1}=\mathop {\arg \min }\limits _{{\textbf{b}}}\ -{\textrm{E}}\{\log {\textbf{b}}\cdot \tilde{{\textbf{x}}}_{t+1}\}+\lambda \vert \vert {\hat{\textbf{b}}_{t-1}}-\mathbf {b_t}\vert \vert _1 \nonumber \\{} & {} s.t.\ {\textbf{b}}\cdot {\textbf{1}}=1,\ {\textbf{b}}\ge 0\, \end{aligned}$$

(7)

where $\tilde{{\textbf{x}}}_{t+1}$ denotes the predicted price relative vector and $\lambda$ is a regularization parameter. They acquired the closed-form solution in the following:

$$\begin{aligned} \tilde{{\textbf{b}}}_{t+\frac{1}{2}}&=\eta _t\left( {\textrm{E}}\left\{ \frac{\tilde{{\textbf{x}}}_{t+1}}{{\hat{\textbf{b}}}_t\cdot \tilde{{\textbf{x}}}_{t+1}}\right\} -\frac{1}{m}{\textbf{1}}\cdot {\textrm{E}}\left\{ \frac{\tilde{{\textbf{x}}}_{t+1}}{{\hat{\textbf{b}}}_t\cdot \tilde{{\textbf{x}}}_{t+1}}\right\} \right) , \end{aligned}$$

(8)

$$\begin{aligned} {\textbf{b}}_{t+1}&={\hat{\textbf{b}}}_t+sign\left( \tilde{{\textbf{b}}}_{t+\frac{1}{2}}\right) \left[ \vert \tilde{{\textbf{b}}}_{t+\frac{1}{2}}\vert -\lambda \eta _{t+\frac{1}{2}}\right] _+, \end{aligned}$$

(9)

where $\eta _t$ is an inner variable, $[v]_+=\max (0,v)$, and sign(v) denotes the sign of v.

Portfolio optimization

Price information

We should consider price forecasting method to achieve the data-driven ideology, which can lessen the influence by irrational factors^32,33,34 in the market. Specifically, we formulate the expected return by price forecasting method based on historical information.

PAMR³⁵ and CWMR¹² showed that the predicted relative price in next period is inversely to the current period: $\mathbf {x_{t+1}}=\frac{1}{\mathbf {x_t}}$, which utilized the properties of single-period mean reversion to balance risk and return. Besides, OLMAR³ exploited muti-period mean reversion to solve the moving case. It proposed that the relative price in next period will revert to the moving average:

$$\begin{aligned} \mathbf {x_{t+1}}=\frac{1}{w}\left( {\textbf{1}}+\frac{1}{\mathbf {x_t}}+\cdots +\frac{1}{\odot _{i=0}^{w-2}\mathbf {x_{t-i}}}\right) , \end{aligned}$$

(10)

where w is the window size, and it smooths the price volatility in online portfolio problem.

Moreover, above mean reversion strategies may be sub-optimal subject to the noise from real market, due to the real market is not normally distributed³⁶. Meanwhile, the robust median reversion (RMR)³⁷ is robust to real market and can withstand nontrivial transaction costs, which utilizes L1 median estimator^38,39 and online machine learning. RMR can solve the long tail distribution of real market and is shown below:

$$\begin{aligned} \mathbf {x_{t+1}}=\left( \mathop {\arg \min }\limits _{\textbf{p}} \sum _{i=0}^{w-1}\vert \vert {\textbf{P}}_{t-i}-{\textbf{P}}\vert \vert _2\right) /\mathbf {x_{t}}. \end{aligned}$$

(11)

On the other hand, most investors will follow the trend and keep purchasing rising stocks, thus they always consider ${\textbf{P}}_{MAX}$ as a potential level that the future price can probably reach. Consequently, a generalized logarithmic return (GLR)²² was proposed to predict the relative price in next period:

$$\begin{aligned} \mathbf {x_{t+1}}=1.1\log \left( \frac{{\textbf{P}}_{MAX}}{{\textbf{P}}_t}\right) +1. \end{aligned}$$

(12)

In order to illustrate the adaptability to various price processes of the optimization model, and comprehensively predict the relative price, we adopt in the paper above methods to further consider the online portfolio optimization with transaction costs.

The proposed doubly elastic net regularized online portfolio optimization with transaction costs

Considering the goals of maximizing cumulative wealth and minimizing transaction costs, the portfolio model we build in this paper as follows

$$\begin{aligned} \begin{aligned} {\textbf{b}}_{t+1}\in \ \mathop {\arg \min }\limits _{\mathbf {b \in \bigtriangleup _m}} \ -{\textbf{f}}^T{\textbf{b}}+\lambda \vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _2^2+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2\, \end{aligned} \end{aligned}$$

(13)

where $\lambda >0$, ${\textbf{f}}$ is the predicted relative price in the next period. Here, we consider the following four cases: ${\textbf{f}}=\frac{1}{\mathbf {x_t}}$, ${\textbf{f}}=\frac{1}{w}\left( 1+\frac{1}{\mathbf {x_t}}+\cdots +\frac{1}{\odot _{i=0}^{w-2} \mathbf {x_{t-i}}}\right)$, ${\textbf{f}}=(\mathop {\arg \min }\limits _{\textbf{p}} \sum _{i=0}^{w-1}\vert \vert {\textbf{P}}_{t-i}-{\textbf{P}}\vert \vert _2)/\mathbf {x_{t}}$ and ${\textbf{f}}=1.1\log \left( \frac{{\textbf{P}}_{MAX}}{{\textbf{P}}_t}\right) +1$. Since model (13) involves in the elastic net regularization terms for ${\textbf{b}}-{\hat{\textbf{b}}_{t}}$ and ${\textbf{b}}$, which will be explained below, we call model (13) the doubly elastic net regularized portfolio optimization (DENRPO) model.

It can be found that ${\textbf{f}}^T{\textbf{b}}$ represents the predicted wealth increasing factor, implying expected return potential of the whole portfolio. Therefore, the goal of maximizing cumulative wealth can be reformulated to this item and negative expected return can change the maximization to a minimization.

Further, we implement the proposed model to manage transaction costs with better generalization. On one hand, transaction costs can be reflected by the wealth growth denoted by net asset proportion. Inspired by the research in that transaction costs influent the net proportion by the distance of $\mathbf {b_t}$ and ${\hat{\textbf{b}}_{t-1}}$, we innovatively apply elastic net for this term, since it makes a difference in the following aspect. Firstly, elastic net tends to preserve the highly correlated variables by L2 norm structure while maintaining sparsity by L1 norm, thus it can avoid extreme positions and improve the diversification and stability. Besides, it is according to regression analysis that elastic net is particularly useful when assets are large-scale, since it overcomes poor sample sentiments. The above analyses lead to minimize

$$\begin{aligned} \lambda \vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _2^2 \end{aligned}$$

to trade off transaction costs, in which $\lambda$ and $\eta$ are the regularization parameters controlling transaction costs. The smaller the $\lambda$ and $\eta$ are will the smaller the regularization strength is, meanwhile indicating that the model is more inclined to obtain high returns. When the $\lambda$ and $\eta$ are larger, these regularization terms will make $\mathbf {b_{t+1}}$ more inclined to ${\hat{\textbf{b}}_{t}}$, that is the number of rebalanced assets is reduced, thus lessens the transaction cost.

On the other hand, researches show that the portfolio variable regularization $\vert \vert {\textbf{b}}\vert \vert$ makes sense in the portfolio optimization. Fan et al.⁴⁰ showed that L1 norm penalty on portfolio variable is equivalent to constraining the risk or utility cumulative statistical estimation error, since constraints on individual assets limits total exposure, thus controlling for risk approximation errors as well as closer the empirical and actual risk. In addition, Brodie et al.²⁰ proposed that portfolio weights can represent transaction costs. Furthermore, Li⁴¹ proposed that penalizing portfolio weights favors the sparsity and stability of portfolio, since shifting and scaling the portfolio weights derived from the sample estimates towards zero allows small portfolio weights to be set to zero and extremely large positions to be regulated, resulting in sparse and stable portfolios. Besides, DeMiguel et al.⁴² showed that the global minimum variance of a portfolio can be generated through the portfolio regularization term. The above researches provide us with ideas for constructing the model, since this regularization contributes to the generalization of the portfolio model can be theoretically guaranteed. Thus we apply elastic net for ${\textbf{b}}$, which the form is the combination of L1 norm and the square of L2 norm for the portfolio vector with some tuning parameters, as the regularization term of the proposed model. Since ${\textbf{b}}$ is restricted to the simplex, the first term of the elastic net is equivalent to the constant 1. Therefore, we need to minimize $\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2$ to control the sparsity and stability of our model, in which $\tau$ is the regularization parameter controlling the generalization of the model. Based on above discussions, we propose optimization model (13).

The next theorem establishes the existence of solutions to the problem (13).

Theorem 1

For $\lambda ,\ \eta ,\ \tau \ge 0$, problem (13) has optimal solutions. Further, if $\eta >0$ or $\tau >0$, problem (13) has a unique optimal solution.

Proof

It is easy to know that the objective function in model (13) is continuous and the constrain $\bigtriangleup _m$ is closed and bounded. Thus we can obtain that problem (13) has optimal solutions. If $\eta >0$ or $\tau >0$, the objective function is strongly convex. Therefore, problem (13) has a unique optimal solution. $\square$

Solving algorithms

In this section, we develop algorithms to solve the problem (13) in a higher efficiency.

Linearized augmented Lagrangian method

Augmented Lagrangian method (ALM) is an excellent algorithm due to its efficiency for solving the linear equality constrained optimization problem. Hence, we apply the ALM to solve the proposed model (13). Since the nonlinear term in the model will increase the solving difficulty of using general ALM, we develop a linearized augmented Lagrangian method (LALM), which linearizes the quadratic term of ALM, to solve the problem in a higher efficiency.

We first introduce the notion of indicator function on ${\mathbb {R}}_+^m$, denoted by ${\textbf{I}}_{{\mathbb {R}}_+^m}$, which is defined at ${\textbf{b}} \in {\mathbb {R}}^m$ by

$$\begin{aligned} {\textbf{I}}_{{\mathbb {R}}_+^m}({\textbf{b}})= {\left\{ \begin{array}{ll} 0, \hspace{5mm}&{}{\textbf{b}} \in {\mathbb {R}}_+^m,\\ +\infty , &{} \textrm{otherwise}. \end{array}\right. } \end{aligned}$$

(14)

By this way, the augmented Lagrangian function of problem (13) is

$$\begin{aligned} \begin{aligned} L({\textbf{b}},\xi )=&-{\textbf{f}}^T{\textbf{b}}+\lambda \vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _2^2+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+{\textbf{I}}_{{\mathbb {R}}_+^m}({\textbf{b}})+\xi ({\textbf{1}}^T{\textbf{b}}-1) +\frac{\rho }{2} ({\textbf{1}}^T {\textbf{b}}-1)^2\, \end{aligned} \end{aligned}$$

(15)

where $\xi \in {\mathbb {R}}$ is the Lagrange multiplier and $\rho >0$ is a penalty parameter. Then, the LALM updates ${\textbf{b}}^{k+1}$ and $\xi ^{k+1}$ in each iteration by

$$\begin{aligned}{} & {} {\textbf{b}}^{k + 1} \in \mathop {\arg \min }\limits _{{\textbf{b}}}\ g({\textbf{b}})+\xi ^{k}({\textbf{1}}^{T}{\textbf{b}}- 1)+< {\textbf{b}} - {\textbf{b}}^{k}, \rho {\textbf{1}} ({\textbf{1}}^{T}{\textbf{b}}^{k}-1)>+\frac{1}{2\alpha }\vert \vert {\textbf{b}} - {\textbf{b}}^{k}\vert \vert _{2}^{2}\, \end{aligned}$$

(16)

$$\begin{aligned}{} & {} \xi ^{k+1}= \xi ^k+\rho ({\textbf{1}}^T{\textbf{b}}^{k+1}-1)\, \end{aligned}$$

(17)

where $g({\textbf{b}})=-{\textbf{f}}^T{\textbf{b}}+\lambda \vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{b}}-{\hat{\textbf{b}}_{t}}\vert \vert _2^2+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+{\textbf{I}}_{{\mathbb {R}}_+^m}({\textbf{b}})$, and $\alpha$ is a parameter which will be described in the following text.

The following lemma tells us that the problem (16) can be computed in an easy way. Specifically, the closed-form solution of problem (16) only involves in the soft-thresholding operator and the projection onto ${\mathbb {R}}_+^m$.

Lemma 1

The closed-form solution for the optimization problem (16) is

$$\begin{aligned} {\textbf{b}}^{k+1}={\textrm{P}}_{{\mathbb {R}}_+^m}\left\{ \ {\hat{\textbf{b}}_{t}}+\textrm{sign}({\textbf{q}})\left[ \vert {\textbf{q}}\vert -\frac{\lambda }{\tau +\eta +\frac{1}{\alpha }}\right] _+\right\} \, \end{aligned}$$

(18)

where $([{\textbf{x}}]_+)_i=max\{{\textbf{x}}_i,0\}$ for $i=1,\ldots ,m$ and ${\textbf{x}}\in {\mathbb {R}}_+^m$,

$$\begin{aligned} \begin{aligned} {\textbf{q}}=&\frac{\eta }{\tau +\eta +\frac{1}{\alpha }}{\hat{\textbf{b}}_t}+\frac{\frac{1}{\alpha }}{\tau +\eta +\frac{1}{\alpha }}{\textbf{b}}^k-\frac{\rho }{\tau +\eta +\frac{1}{\alpha }}{\textbf{1}}({\textbf{1}}^T{\textbf{b}}^k-1)+\frac{1}{\tau +\eta +\frac{1}{\alpha }}{\textbf{f}}-\frac{1}{\tau +\eta +\frac{1}{\alpha }}{\textbf{1}}\xi ^k-{\hat{\textbf{b}}_{t}} \, \end{aligned} \end{aligned}$$

and $\textrm{P}_{{\mathbb {R}}_+^m}$ is the projection onto ${\mathbb {R}}_+^m$.

Proof

We derive from (16) that

$$\begin{aligned} \begin{aligned} {\textbf{b}}^{k+1}\in&\mathop {\arg \min }\limits _{{\textbf{b}}\in {\mathbb {R}}_+^m} \lambda \vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _2^2+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+\frac{1}{2\alpha }\vert \vert {\textbf{b}}-{\textbf{b}}^k\vert \vert _2^2+<{\textbf{b}},\rho {\textbf{1}}({\textbf{1}}^{T}{\textbf{b}}^{k}-1)-{\textbf{f}}+{\textbf{1}}\xi ^k> \\ =\,&\mathop {\arg \min }\limits _{{\textbf{b}}\in {\mathbb {R}}_+^m} \lambda \vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _1+\frac{\eta }{2}\vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _2^2+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+\frac{1}{2\alpha }\vert \vert {\textbf{b}}-{\textbf{b}}^k\vert \vert _2^2+<{\textbf{b}},{\textbf{d}}> \, \end{aligned} \end{aligned}$$

(19)

where ${\textbf{d}}=\rho {\textbf{1}}({\textbf{1}}^{T}{\textbf{b}}^{k}-1)-{\textbf{f}}+{\textbf{1}}\xi ^k$. Then the formula (19) can be further changed into

$$\begin{aligned} \begin{aligned} {\textbf{b}}^{k+1}\in&\mathop {\arg \min }\limits _{{\textbf{b}}\in {\mathbb {R}}_+^m} \lambda \vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _1+\frac{1}{2}\left( \tau +\eta +\frac{1}{\alpha }\right) \vert \vert {\textbf{b}}-\frac{\eta {\hat{\textbf{b}}_t}}{\tau +\eta +\frac{1}{\alpha }}-\frac{\frac{1}{\alpha }{\textbf{b}}^k}{\tau +\eta +\frac{1}{\alpha }}+\frac{{\textbf{d}}}{\tau +\eta +\frac{1}{\alpha }}\vert \vert _2^2\\ =&\mathop {\arg \min }\limits _{{\textbf{b}}\in {\mathbb {R}}_+^m} \lambda \vert \vert \textbf{b}-\hat{\textbf{b}_\textbf{t}}\vert \vert _1+\frac{1}{2}\left( \tau +\eta +\frac{1}{\alpha }\right) \vert \vert {\textbf{b}}-{\textbf{w}}\vert \vert _2^2\, \end{aligned} \end{aligned}$$

(20)

where ${\textbf{w}}=\frac{\eta }{\tau +\eta +\frac{1}{\alpha }}{\hat{\textbf{b}}_t}+\frac{\frac{1}{\alpha }}{\tau +\eta +\frac{1}{\alpha }}{\textbf{b}}^k-\frac{1}{\tau +\eta +\frac{1}{\alpha }}{\textbf{d}}$. Afterwards, let ${\textbf{u}}=\textbf{b}-\hat{\textbf{b}_\textbf{t}}$, then we can find that

$$\begin{aligned} \begin{aligned} {\textbf{u}}^{k+1}\in&\mathop {\arg \min }\limits _{{\textbf{u}}\in \textrm{T}} \lambda \vert \vert {\textbf{u}}\vert \vert _1+\frac{1}{2}\left( \tau +\eta +\frac{1}{\alpha }\right) \vert \vert {\textbf{u}}-({\textbf{w}}-\hat{\textbf{b}_\textbf{t}})\vert \vert _2^2\\ =\,&\mathop {\arg \min }\limits _{{\textbf{u}}\in \textrm{T}}\frac{\lambda }{\tau +\eta +\frac{1}{\alpha }}\vert \vert {\textbf{u}}\vert \vert _1+\frac{1}{2}\vert \vert {\textbf{u}}-({\textbf{w}}-\hat{\textbf{b}_\textbf{t}})\vert \vert _2^2\\ =\,&\mathop {\arg \min }\limits _{{\textbf{u}}\in \textrm{T}}\frac{\lambda }{\tau +\eta +\frac{1}{\alpha }}\vert \vert {\textbf{u}}\vert \vert _1+\frac{1}{2}\vert \vert {\textbf{u}}-{\textbf{q}}\vert \vert _2^2\\ =\,&\textrm{P}_{\textrm{T}}\left\{ \ \textrm{sign}({\textbf{q}})\left[ \vert {\textbf{q}}\vert -\frac{\lambda }{\tau +\eta +\frac{1}{\alpha }}\right] _+\right\} \, \end{aligned} \end{aligned}$$

(21)

where ${\rm P}_{\rm T}$ is the projection onto the set $\textrm{T}$, ${\textbf{q}}={\textbf{w}}-\hat{\textbf{b}_\textbf{t}}$, that is

$$\begin{aligned} {\textbf{q}}=&\frac{\eta }{\tau +\eta +\frac{1}{\alpha }}{\hat{\textbf{b}}_t}-{\hat{\textbf{b}}_t}+\frac{\frac{1}{\alpha }}{\tau +\eta +\frac{1}{\alpha }}{\textbf{b}}^{k}-\frac{1}{\tau +\eta +\frac{1}{\alpha }}{\textbf{1}}\xi ^{k}-\frac{\rho }{\tau +\eta +\frac{1}{\alpha }}{\textbf{1}}({\textbf{1}}^T{\textbf{b}}^{k}-1)+\frac{1}{\tau +\eta +\frac{1}{\alpha }}{\textbf{f}}\ , \end{aligned}$$

and $\textrm{T}=\{{\textbf{u}}\in {\mathbb {R}}^n:{\textbf{u}}+{\hat{\textbf{b}}}_t \in {\mathbb {R}}^m_+\}$.

Then

$$\begin{aligned} \begin{aligned} {\textbf{b}}^{k+1}=&{\hat{\textbf{b}}}_t+{\textbf{u}}^{k+1}\\ =\,&\textrm{P}_{{\mathbb {R}}^m_+}\ \left\{ {\hat{\textbf{b}}}_t+\textrm{sign}({\textbf{q}})\left[ \vert {\textbf{q}}\vert - \frac{\lambda }{\tau +\eta +\frac{1}{\alpha }} \right] _+\right\} \end{aligned} \end{aligned}$$

$\square$

The above iterative update process is summarized into Algorithm 1. We point out that our proposed algorithm does not need to compute the projection onto the simplex set. This enables the computationally efficiency of our proposed algorithm.

It can be proved that the LALM can be equivalently reformulated as the Chambolle-pock algorithm⁴³. Thus, we can easily get the following theorem regarding the convergence of the proposed algorithm.

Theorem 2

Let $\{{\textbf{b}}^k:k\in {\mathbb {N}}\}$ be generated by Algorithm 1. Then, there exists a ${\textbf{b}}^*\in {\mathbb {R}}^m$ such that

$$\begin{aligned}{\lim _{k\rightarrow \infty }}{\textbf{b}}^k={\textbf{b}}^* \end{aligned}$$

and ${\textbf{b}}^*$ is an optimal solution of problem (13).

In Algorithm 1, Theorem 2 can be established when parameters $\rho$ and $\alpha$ satisfy $\alpha < \frac{1}{\rho m}$ and $\rho >0$, where m is the number of assets. Theorem 2 indicates that our portfolio update algorithm outputs an optimal solution of model (13), which can be supported by theory.

Alternating direction method of multipliers (ADMM)

From the above discussion, the augmented Lagrangian function of problem (13) has a quadratic term $\frac{\rho }{2}\left( {\textbf{1}}^T{\textbf{b}}-1\right) ^2$, which will increase the computational difficulty, that is why we linearize the ALM. To further consider this problem, we apply ADMM to solve this problem, since ADMM introduces an auxiliary variable to guarantee that variables can be updated alternately, while remains applying gradient ascent to update the Lagrangian multiplier, which cleverly avoids the tedious process of solving the quadratic term. Specifically, we introduce an auxiliary variable ${\textbf{d}}\in {\mathbb {R}}^m$ to approach ${\textbf{b}}$, and decompose the iterative update problem of ${\textbf{b}}$ into a complete quadratic minimum solving problem for ${\textbf{b}}$ and a soft-threshold solving problem for ${\textbf{d}}$. We will illustrate the process in the following text.

Besides, since applying ADMM does not require the process of linearization, which leads to a higher accuracy, that it can restrict the solution in the simplex through project ${\textbf{b}}_{t+1}$ onto the simplex to form an eligible portfolio, as instructed by Duchi et al.⁴⁴.

In addition, not all formulations of ADMM have saddle points. Few methods take bother to figure out and prove the existence of saddle point. However, we can prove that the augmented Lagrangian function based on the proposed model (13) has a saddle point, which makes the iterative formulae of ADMM appropriate. Next we reveal the approach of ADMM applied in the model (13).

We first formulate the problem (13) as

$$\begin{aligned} \begin{aligned} {\textbf{b}}^{k+1} \in&\mathop {\arg \min }\limits _{{\textbf{b}}} g_1({\textbf{b}})+g_2({\textbf{d}})\\&s.t.\ \ {\textbf{b}}={\textbf{d}} \end{aligned} \end{aligned}$$

where $g_1({\textbf{b}})=-{\textbf{f}}^T{\textbf{b}}+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2 +{\textbf{I}}_{\Delta _m}({\textbf{b}})$, $g_2({\textbf{b}})=\lambda \vert \vert {\textbf{b}}-{\hat{\textbf{b}}}_t\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{b}}-{\hat{\textbf{b}}}_t\vert \vert _2^2$, and

$$\begin{aligned} {\textbf{I}}_{\Delta _m}({\textbf{b}})= {\left\{ \begin{array}{ll} 0, \hspace{5mm}&{}{\textbf{b}} \in \Delta _m,\\ +\infty , &{} \textrm{otherwise}. \end{array}\right. } \end{aligned}$$

(22)

By this way, the alternating direction method of multipliers (ADMM) can be applied to the problem (13) and its augmented Lagrangian function is

$$\begin{aligned} L({\textbf{b}},{\textbf{d}},{\textbf{y}})=g_1({\textbf{b}})+g_2({\textbf{d}})+{\textbf{y}}^T({\textbf{b}}-{\textbf{d}})+{\frac{\rho }{2}\vert \vert {\textbf{b}}-{\textbf{d}}\vert \vert ^2_2} \end{aligned}$$

(23)

The ADMM generates a new iterate $({\textbf{b}}^{k+1},{\textbf{d}}^{k+1},{\textbf{y}}^{k+1})$ by

$$\begin{aligned} {\left\{ \begin{array}{ll} {\textbf{b}}^{k+1}\in \mathop {\arg \min }\limits _{b}{L({\textbf{b}},{\textbf{d}}^k,{\textbf{y}}^{k})}\\ {\textbf{d}}^{k+1}\in \mathop {\arg \min }\limits _{d}{L({\textbf{b}}^{k+1},{\textbf{d}},{\textbf{y}}^{k})}\\ {\textbf{y}}^{k+1}={\textbf{y}}^{k}+\rho ({{\textbf{b}}}^{k+1}-{\textbf{d}}^{k+1}) \end{array}\right. } \end{aligned}$$

(24)

The following lemma gives the closed-form solution to the above problems.

Lemma 2

The closed-form solutions to ${\textbf{b}}$ and ${\textbf{d}}$ in the optimization problem (24) are

$$\begin{aligned} {\textbf{b}}^{k + 1} =&\textrm{P}_{\Delta _m}\left\{ \frac{1}{\tau + \rho } \left( {\textbf{f}}-{\textbf{y}}^k+\rho {\textbf{d}}^k \right) \right\} \ ,\\ {\textbf{d}}^{k + 1}=&{\hat{\textbf{b}}}_t+\textrm{sign}({\textbf{D}})\left[ \vert {\textbf{D}}\vert - \frac{\lambda }{\eta + \rho }\right] _+\ , \end{aligned}$$

where ${\textbf{D}} = \frac{1}{\eta + \rho }\left( {\textbf{y}}^k+\rho {\textbf{b}}^{k + 1}-\rho {\hat{\textbf{b}}}_t \right)$, and $\textrm{P}_{\Delta _m}$ is the projection onto the simplex $\Delta _m$.

Proof

For the iteration of ${\textbf{b}}$ in the problem (24), we can find that

$$\begin{aligned} \begin{aligned} {\textbf{b}}^{k + 1} \in&\mathop {\arg \min }\limits _{{\textbf{b}}\in \Delta _m} -{\textbf{f}}^T{\textbf{b}}+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+{\textbf{y}}{^k}^T({\textbf{b}}-{\textbf{d}}^k)+\frac{\rho }{2}\vert \vert {\textbf{b}}-{\textbf{d}}^k\vert \vert ^2_2\\ =&\mathop {\arg \min }\limits _{{\textbf{b}}\in \Delta _m}<{\textbf{b}},-{\textbf{f}}>+<{\textbf{b}},{\textbf{y}}^k>+\frac{\tau }{2}\vert \vert {\textbf{b}}\vert \vert _2^2+\frac{\rho }{2}\vert \vert {\textbf{b}}-{\textbf{d}}^k\vert \vert ^2_2\\ =&\mathop {\arg \min }\limits _{{\textbf{b}}\in \Delta _m} \frac{1}{2}(\tau +\rho )\vert \vert {\textbf{b}}-\frac{1}{\tau +\rho } \left( {\textbf{f}}-{\textbf{y}}^k+\rho {\textbf{d}}^k \right) \vert \vert ^2_2\\ =&\textrm{P}_{\Delta _m}\left\{ \frac{1}{\tau +\rho } \left( {\textbf{f}}-{\textbf{y}}^k+\rho {\textbf{d}}^k \right) \right\} \, \end{aligned} \end{aligned}$$

(25)

For the iteration of ${\textbf{d}}$ in the problem (24), we can solve it in the following:

$$\begin{aligned} \begin{aligned} {\textbf{d}}^{k + 1} \in&\mathop {\arg \min }\limits _{{\textbf{d}}} \lambda \vert \vert {\textbf{d}}-{\hat{\textbf{b}}}_t\vert \vert _1+\frac{\eta }{2}\vert \vert {\textbf{d}}-{\hat{\textbf{b}}}_t\vert \vert _2^2+{{\textbf{y}}^k}^T({\textbf{b}}^{k+1}-{\textbf{d}})+\frac{\rho }{2}\vert \vert {\textbf{b}}^{k+1}-{\textbf{d}}\vert \vert ^2_2\\ =&\mathop {\arg \min }\limits _{{\textbf{d}}} \lambda \vert \vert {\textbf{d}}-{\hat{\textbf{b}}}_t\vert \vert _1+<{\textbf{d}},-{\textbf{y}}^k>+\frac{\eta }{2}\vert \vert {\textbf{d}}-{\hat{\textbf{b}}}_t\vert \vert _2^2+\frac{\rho }{2}\vert \vert {\textbf{d}}-{\textbf{b}}^{k+1}\vert \vert ^2_2\\ =&\mathop {\arg \min }\limits _{{\textbf{d}}}\lambda \vert \vert {\textbf{d}}-{\hat{\textbf{b}}}_t\vert \vert _1+\frac{1}{2}(\eta +\rho )\vert \vert {\textbf{d}}-\frac{1}{\eta +\rho } \left( {\textbf{y}}^k+\eta {\hat{\textbf{b}}}_t +\rho {\textbf{b}}^{k+1}\right) \vert \vert ^2_2 \end{aligned} \end{aligned}$$

(26)

Let ${\textbf{w}}={\textbf{d}}-{\hat{\textbf{b}}}_t$, then the formula (26) is equal to

$$\begin{aligned} \begin{aligned} {\textbf{w}}^{k+1} \in&\mathop {\arg \min }\limits _{{\textbf{w}}}\lambda \vert \vert {\textbf{w}}\vert \vert _1+\frac{1}{2}(\eta +\rho )\vert \vert {\textbf{w}}-\frac{1}{\eta +\rho }({\textbf{y}}^k +\rho {\textbf{b}}^{k+1}-\rho {\hat{\textbf{b}}}_t)\vert \vert ^2_2\\ =&\mathop {\arg \min }\limits _{{\textbf{w}}}\frac{\lambda }{\eta +\rho }\vert \vert {\textbf{w}}\vert \vert _1+\frac{1}{2}\vert \vert {\textbf{w}}-\frac{1}{\eta +\rho }({\textbf{y}}^k +\rho {\textbf{b}}^{k+1}-\rho {\hat{\textbf{b}}}_t)\vert \vert ^2_2\\ =&\textrm{sign}({\textbf{D}})\left[ \vert {\textbf{D}}\vert - \frac{\lambda }{\eta +\rho }\right] _+\, \end{aligned} \end{aligned}$$

(27)

where ${\textbf{D}} = \frac{1}{\eta +\rho }({\textbf{y}}^k +\rho {\textbf{b}}^{k+1}-\rho {\hat{\textbf{b}}}_t)$, then we can acquire the result:

$$\begin{aligned} \begin{aligned} {\textbf{d}}^{k+1}=&{\hat{\textbf{b}}}_t+{\textbf{w}}^{k+1}\\ =&{\hat{\textbf{b}}}_t+\textrm{sign}({\textbf{D}})\left[ \vert {\textbf{D}}\vert - \frac{\lambda }{\eta +\rho }\right] _+ \end{aligned} \end{aligned}$$

(28)

$\square$

The ADMM solving problem (13) is summarized in Algorithm 2.

We next establish the convergence of ADMM applied to problem (13) for the existence of a saddle point for the Lagrangian function (23). We show this result in the following proposition.

Proposition 3

The Lagrangian function (23) has a saddle point, i.e., there exists $({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}^{*})$ such that

$$\begin{aligned} L({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}})\le L({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}^{*})\le L({\textbf{b}},{\textbf{d}},{\textbf{y}}^{*}) \end{aligned}$$

for all ${\textbf{b}},{\textbf{d}},{\textbf{y}} \in {\mathbb {R}}^{m}$.

Proof

By Theorem 1, we suppose ${\textbf{b}}^{*}$ is an optimal solution of problem (13). That is

$$\begin{aligned} {\textbf{b}}^{*}\in \mathop {\arg \min }\limits _{{\textbf{b}}} g_1({\textbf{b}})+g_2({\textbf{b}}). \end{aligned}$$

Then, the Fermat’s rule leads to ${\textbf{0}} \in \partial {(g_1+g_2)({\textbf{b}}^{*})}$. Besides, it is obviously that $g_1$ and $g_2$ can be easily proved their convexity in Theorem 1. Since $g_1$ and $g_2$ are convexity and

$$\begin{aligned} dom(g_2):=\{ {\textbf{x}} \in {\mathbb {R}}^{m}:g_2(x) < +\infty \}={\mathbb {R}}^m, \end{aligned}$$

we have ${\textbf{0}} \in \partial {g_1({\textbf{b}}^{*})}+\partial {g_2({\textbf{b}}^{*})}$. Thus, there exists

$$\begin{aligned} {\textbf{y}}^* \in \partial {g_2({\textbf{b}}^{*})}\ \ \textrm{such}\ \textrm{that}\ -{\textbf{y}}^{*} \in \partial {g_1({\textbf{b}}^{*})}. \end{aligned}$$

Set ${\textbf{d}}^{*}={\textbf{b}}^{*}$. We next show $({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}^{*})$ is a saddle point of $L({\textbf{b}},{\textbf{d}},{\textbf{y}}^*)$. First, let $L({\textbf{b}},{\textbf{d}},{\textbf{y}}^*)$ take the partial derivative of ${\textbf{b}}$ at ${\textbf{b}}^*$ and take the partial derivative of ${\textbf{d}}$ at ${\textbf{d}}^*$, then we can get

$$\begin{aligned} {\left\{ \begin{array}{ll} {\textbf{0}}\in \partial {g_1({\textbf{b}}^{*})+{\textbf{y}}^{*}+\rho ({\textbf{b}}^{*}-{\textbf{d}}^{*})}, \\ {\textbf{0}}\in \partial {g_2({\textbf{d}}^{*})-{\textbf{y}}^{*}+\rho ({\textbf{d}}^{*}-{\textbf{b}}^{*})}, \end{array}\right. } \end{aligned}$$

which leads to $({\textbf{b}}^{*},{\textbf{d}}^{*})$ is a minimizer of $L({\textbf{b}},{\textbf{d}},{\textbf{y}}^{*})$ due to the convexity of $L({\textbf{b}},{\textbf{d}},{\textbf{y}}^{*})$. This implies that

$$\begin{aligned} L({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}^{*})\le L({\textbf{b}},{\textbf{d}},{\textbf{y}}^{*}) \end{aligned}$$

(29)

for all ${\textbf{b}},{\textbf{d}}\in {\mathbb {R}}^m$. Second, the proposed model (13) is strongly convex if the regularization parameters satisfy. By Slater’s theorem⁴⁵, strongly duality holds and there guarantees that

$$\begin{aligned} L({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}^{*})\ge L({\textbf{b}}^{*},{\textbf{d}}^{*},{\textbf{y}}) \end{aligned}$$

(30)

for any ${\textbf{y}} \in {\mathbb {R}}^m$ as ${\textbf{b}}^{*}={\textbf{d}}^{*}$. Combining (29) and (30), we complete the proof. $\square$

To integrately illustrate the ADMM applied in the proposed model (13), we are now ready to establish the convergence result in the following theorem, which is a direct consequence of Proposition 3 and Proposition 5.4.1 in⁴⁶.

Theorem 4

Let $\{\left( {\textbf{b}}^{k},{\textbf{d}}^{k},{\textbf{y}}^{k}\right) :k\in {\mathbb {N}}\}$ be generated by Algorithm 2. Then, $\{{\textbf{b}}^k,{\textbf{d}}^k,{\textbf{y}}^k\}$ is a convergent series, $\{{\textbf{b}}^k-{\textbf{d}}^k\}$ converges to ${\textbf{0}}$, and $\{{\textbf{b}}^k\}$ converges to an optimal solution of problem (13).

Experiment

Data-sets

We compare the performance of DENRPO and other strategies in four datasets, which are NYSE (O)⁹, NYSE (N)^12,21, TSE⁴⁷, MSCI³⁵ and DJIA. These datasets collect the historical relative price information, where the element in i th row and j th column denotes the relative price of j th asset in i th period. NYSE (O) and NYSE (N) are the data collected from the New York Exchange, NYSE(O) contains 36 stocks ranging from 7 March 1962 to 31 December 1984 and NYSE(N) contains 23 survived till 30 June 2010. TSE comes from Toronto Stock Exchange and contains 88 stocks ranging from 4 January 1994 to 31 December 1998. MSCI contains 24 indices that represent the equity markets of 24 countries around the world, ranging from 1 April 2006 to 31 March 2010. The finally dataset DJIA collects the Dow Jones Industrial index of 30 stocks in the whole 2010. The first four datasets mainly test the performance of algorithms in the stock market, and the last dataset is applied for testing algorithms in the long-short transaction. These datasets are publicly available from the real market, so it is effective and comparable to evaluate the proposed optimization model in these datasets.

Experimental setting

Parameter setting

In the proposed model, there are three regularized parameters of model, namely $\lambda$, $\eta$ and $\tau$, and four parameters for algorithm which are $\xi$, ${\textbf{y}}$, $\alpha$ and $\rho$. Thereinto, $\alpha$ is an inner variable, $\xi$ is alternately updated by dual ascent method in LALM and ${\textbf{y}}$ is the Lagrangian multiplier updated by ADMM applied in the proposed model. Therefore, the above variables will not affect the performance of the methods so that we do not discuss them. In order to control the iteration, we set the tolerance $\epsilon =10^{-8}$ and the $max \_ iteration=10^8$. We take $\alpha =\frac{0.999}{\rho m}$, $\rho =0.618$ and discuss $\lambda$, $\eta$ and $\tau$ in the following to control the regularization.

We apply the method that fixes two parameters and then change the other to determine the value of each parameter. Assuming that the transaction cost is denoted to $\gamma$, for the value of $\lambda$, referring to $\lambda =10\gamma$ in the TCO framework³¹, we fixed $\eta =\tau =0$ and search around $\lambda =10\gamma$. Our experiments show that cumulative wealth is relatively high in the same running time when $\lambda =10\gamma$ for which we take $\lambda =10\gamma$. For $\eta$ and $\tau$, we still fix one in 0 and change values of the other. We arrange and combine $\eta$ and $\tau$ performing better in above step and select the combination making the cumulative wealth higher in the same running time. For simplicity, Table 1 are the results of DENRPO1-OLMAR parameters debugging meanwhile Table 2 are the results of DENRPO2-OLMAR parameters debugging in $\gamma =0.5\%$, the value of the parameter result can be understood as the convergence speed of the solving algorithms. It is observing from the experimental result that $\eta =0.00025$ and $\tau =0.00005$ obtains a relatively outstanding comprehensive performance, thus we decided to apply these two values in all experiments.

Comparison approaches

We employ DENRPO method to solve the online portfolio selection problem based on the above four benchmark data sets. As a comparison, 12 other online portfolio selection algorithms also run in our experiment. Specifically, UBAH, BEST and BCRP are three benchmark approaches, where UBAH is able to reflect the stock price trend of real financial market. SSPO, S1, S2 and S3 are sparse strategies based on short-term investment. TCO1 and TCO2 are excellent approaches considering transaction costs, furtherly, TCO-RMR and TCO-GLR utilize RMR and GLR price prediction on the base of transaction cost optimization framework, respectively. WFDA is the portfolio strategy considering the long-short transaction, which is executed by wavelet feature engineering. The details of the algorithms and their parameter value, which are taken from the original paper or derived from numerical experiments based on the original paper, we list below:

(a)
UBAH: Uniformly buy-and-hold strategy;
(b)
BEST: Beststock in hindsight;
(c)
BCRP⁹: Best Constant Rebalanced Portfolios in hindsight;
(d)
TCO1³¹: Transaction Cost Optimization with parameters $\lambda =10\gamma$, $\eta =10$;
(e)
TCO2³¹: Transaction Cost Optimization with parameters $\lambda =10\gamma$, $w=4$, $\eta =10$;
(f)
TCO-RMR³¹: Transaction Cost Optimization utilizing robust L1 median reversion with parameters $\lambda =10\gamma$, $\eta =10$;
(g)
TCO-GLR³¹: Transaction Cost Optimization utilizing GLR price prediction with parameters $\lambda =10\gamma$, $w=4$, $\eta =10$;
(h)
SSPO²²: Short-term Sparse Portfolio Optimization with parameters $\eta =0.005$, $\zeta =500$, $w=5$, $\lambda =0.5$, $\gamma = 0.01$;
(i)
S1²³: $w=5$, $\epsilon =0.001$ (NYSE(O), NYSE(N)), $\epsilon =0.01$ (TSE), $\epsilon =0.003$ (MSCI), where $\epsilon =0.003$ (MSCI) is derived from the data range of the original paper and our experimental effect and S2, S3 take the value in the same way;
(j)
S2²³: $w=5$, $\epsilon =0.001$ (NYSE(O), NYSE(N)), $\epsilon =0.01$ (TSE), $\epsilon =0.003$ (MSCI);
(k)
S3²³: $w=5$, $s_t=3$, $\epsilon =0.001$ (NYSE(O), NYSE(N)), $\epsilon =0.01$ (TSE), $\epsilon =0.003$ (MSCI).
(l)
WFDA⁶: Risk Assessment with Wavelet Feature Engineering for High-Frequency Portfolio Trading.

Cumulative wealth

Cumulative wealth with fixed transaction costs

We fix the transaction cost rate to show the daily cumulative return trend of the proposed algorithms implemented on the NYSE(O) and MSCI for simplicity, by observing the trend of wealth growth under fixed transaction costs facilitates evaluating the performance of the algorithms. Figures 1 and 2 are the daily cumulative return trend of the proposed algorithms compared with TCOs in the transaction cost rate fixed in $0.25\%$. It can be found that the wealth growth structures of DENRPO and TCO strategies are roughly the same, but the daily return of DENRPO can always be a little higher than that achieved by TCO in most cases. Thus after investing for a period, DENRPO can always obtain higher cumulative wealth, which demonstrates the superiority and practicality of the proposed method.

Cumulative wealth with varying transaction costs

To better show the effectiveness of the introduced elastic net term for non-zero transaction costs and portfolio variable, meanwhile analyze the trend of the cumulative wealth in the condition of changing transaction costs, Figs. 3 and 4 as well as Table 3 compare the cumulative wealth achieved by the proposed DENRPO strategies and other methods we list above.

Table 1 Cumulative wealth achieved by DENRPO1-OLMAR about different permutations and combinations of $\eta$ and $\tau$ with transaction costs rates is 0.5%.

Full size table

Table 2 Cumulative wealth achieved by DENRPO2-OLMAR about different permutations and combinations of $\eta$ and $\tau$ with transaction costs rates is 0.5%.

Full size table

We can draw several observations in them. Firstly in Figs. 3 and 4, the cumulative wealth obtained by the three benchmark algorithms almost distributes in a straight line as transaction costs increase, indicating that the cumulative wealth obtained by these benchmark algorithms is less affected by transaction costs. Since UBAH and BEST will not rebalance the portfolio in the transaction period so that they will not produce the rebalancing cost, while BCRP fixes daily rebalancing and is less affected by transaction costs. Certainly for above reasons, they only can obtain less wealth.

Table 3 Cumulative wealth obtained by various algorithms on the four data-sets with transaction costs.

Full size table

Moreover, SSPO, S1, S2 and S3 perform better when transaction cost is 0 in Table 3. However, obviously find that their performance greatly reduces when operating in the non-zero transaction costs. In most data-sets, the cumulative wealth close to 0 when transaction costs only close to $0.5\%$. It is because that the goal of short-term investment is to obtain high return in a short time, thus frequent transactions are required, which will produce a amount of transaction costs. Hence, transaction costs have a greater impact on short-term investment, that leads to cumulative wealth decreases rapidly as transaction costs increase.

In addition, comparing TCO and DENRPO both transaction costs optimization strategies in Figs. 3 and 4, it can be found that the cumulative wealth of DENRPO is significantly higher than that of TCO in most cases. As transaction costs increase, DENRPO strategy can also achieve two small peaks of cumulative wealth in TSE when around $\gamma =0.3\%$ to $\gamma =0.7\%$, indicating that DENRPO strategy has the ability to counteract transaction costs, which ensures the stability and the better out-of-sample performance of the proposed method. Furtherly comparing TCO and DENRPO in Table 3, it is obviously that DENRPO obtains 8.02E+06, 893.22, 7.84, 1.30 in NYSE(O), NYSE(N), TSE and MSCI in $\gamma =0.5\%$, respectively, which indicates that DENRPO survives better than TCO in high transaction costs environment. It explains the stability of DENRPO and the significance of elastic net in the data angle.

Mean excess return

To measure the daily return performance of each algorithm, we first need to know whether proportion of total wealth gained or lost on this day. Due to the existence of transaction costs, we develop to represent this concept as a term related to the net proportion wealth:

$$\begin{aligned} r_s=({\textbf{b}}_t^T{\textbf{x}}_t-1)*w_{t-1}, \end{aligned}$$

(31)

it can be understood as the net proportion gained or lost wealth. Mean excess return (MER)⁴⁸ is defined as the average value of the daily excess returns compared with the UBAH strategy in the paper, which is given as follows:

$$\begin{aligned} MER={\bar{r}}_s-{\bar{r}}_m=\frac{1}{T}\sum _{t=1}^{T}r_{s,t}-r_{m,t}, \end{aligned}$$

(32)

where $r_{s,t}$ and $r_{m,t}$ are the daily excess returns of the compared portfolio strategy and the UBAH on the $t^{th}$ day, respectively.

It is obviously by the definition of MER that a superior portfolio strategy should have a larger MER value, and the larger the MER leads to the better the performance of the strategy. Certainly even a small gap in MER can indicate a larger difference in portfolio strategies, especially for the long-term investments²². We present the MERs for our proposed methods and the TCO strategy, which both consider the transaction cost, in Table 4.

It is obviously that DENRPO performs the best in most case, since it always gains the biggest MER. For example, the MER are 0.0069, 0.0023, 0.0060, 0.0017 in $\gamma =0.25\%$ in NYSE(O), NYSE(N), TSE and MSCI, respectively, which even many strategies that do not take into account transaction costs can not achieve. This is the reason why DENRPO outperforms other systems in cumulative wealth.

Table 4 Mean excess return obtained by several algorithms on the four data-sets with transaction costs.

Full size table

$\alpha$ Factor

We evaluate our proposed method whether outperforms the benchmark and the TCO method considering transaction costs in a statistical significance. The Capital Asset Pricing Model (CAPM)⁴⁹ proposed that intrinsic excess return composes the part of the expected return, which is usually called $\alpha$ Factor in the finance industry⁵⁰. $\alpha$ Factor can be improved by a excellent portfolio strategy and it can be represented in the following way:

$$\begin{aligned} {\hat{\beta }}=\frac{{\hat{c}}(r_s,r_m)}{{\hat{\sigma }}^2(r_s,r_m)}, \end{aligned}$$

(33)

$$\begin{aligned} {\hat{\alpha }}=\bar{r_s}-{\hat{\beta }}\bar{r_m}, \end{aligned}$$

(34)

where ${\hat{c}}(\cdot ,\cdot )$ and ${\hat{\sigma }}(\cdot ,\cdot )$ denote the sample covariance and the sample standard deviation (STD) computed on T trading days, respectively. Table 5 represents the $\alpha$ Factor of our proposed method compared to benchmark and TCO methods. It is obviously that DENRPO achieves 0.0067, 0.0027, 0.0057 in $\gamma =0.25\%$ in NYSE(O), NYSE(N) and TSE, respectively, which are much higher than TCO. It furtherly guarantees the better performance of the proposed methods.

Table 5 $\alpha$ Factor obtained by several algorithms on four data-sets with transaction costs.

Full size table

$\beta$ Factor

Table 6 $\beta$ Factor obtained by several algorithms on four data-sets with transaction costs.

Full size table

In addition to measuring returns, we also need to measure risk indices in order to evaluate a strategy more comprehensively. $\beta$ Factor is a commonly used risk indicator, which measures the volatility of the portfolio strategy return with respect to the market benchmark. In the case of $\beta >0$ and $\beta <1$, shows that the strategy return has a positive correlation with the market return, and if $\beta$ is smaller, then the strategy return fluctuates less greatly than the market return. The calculation method of $\beta$ Factor is given in (33) and Table 6 shows the result of the proposed method compared with benchmark and TCO methods. It is obviously that DENRPO obtains 1.0851, 0.9954 and 1.2146 in $\gamma =0.5\%$ in NYSE(O), NYSE(N) and TSE, respectively, which are much smaller than TCO, indicating that the proposed method can keep the stability as transaction costs increase. This test guarantees the generalization ability of DENRPO.

Sharpe ratio

In order to more synthetically illustrate the empirical superiority of DENRPO, we compare it with WFDA strategy in Sharpe ratio in the long-short baseline. Sharpe ratio measures the excess return in the unit risk, thus the higher ratio leads to better performance of the strategy. It can be computed as:

$$\begin{aligned} Sharpe\ ratio = \frac{\bar{r_s}-\bar{r_m}}{{\hat{\sigma }}^2(r_s)}. \end{aligned}$$

(35)

Besides, we compare the long-short baseline setting that simultaneously buys the top five and shorts the bottom five stocks sorted by VaR and CVaR in an hour, and holds the position for one day in DJIA dataset. Table 7 shows the Sharpe ratio of DENRPO, raw long-short term baseline and WFDA-based long-short transaction, where $\alpha$ denotes the confidence level, raw and WFDA denotes VaR and CVaR computed by raw data and WFDA processed data, respectively. We can find that the Sharpe ratio of DENRPO is the highest in all compared strategies, indicating its effectiveness and superiority in the real market.

Table 7 Sharpe Ratio obtained by DENRPO and WEDA on DJIA dataset in long-short transaction.

Full size table

Conclusion

In this paper, we study the online portfolio selection problem with transaction costs via machine learning. First, we formulate the problem as a minimization problem on the simplex. By minimizing negative expected returns and applying elastic net regularization to transaction cost controlled terms and portfolio variables, a practical and robust model is constructed to achieve the goal of maximizing return while minimizing transaction costs. Since the augmented Lagrangian function based on the proposed model has a quadratic term, we develop to apply LALM and ADMM to solve the model, which subtly reduces the computational difficulty. Further, we theoretically guarantee that the sequences generated by the proposed algorithms converge to the solution of the proposed model, and we also establish the closed-form solutions of the subproblems in each iteration. Moreover, we compare with state-of-the-art portfolio algorithms on five commonly used benchmark datasets. Extensive numerical experiments demonstrate that the cumulative wealth obtained by proposed algorithms outperforms all compared algorithms as transaction costs increasing in most cases and it also outshine in long-short transaction scenario, indicating that our proposed portfolio selection algorithms are highly competitive and have practical significance.

Data availibility

The datasets generated and analysed during the current study are available in the Github repository, https://github.com/Ting221/DENRPO.

Accession codes

Code used for the numerical experiments can be found here: https://github.com/Ting221/DENRPO.

References

Agarwal, A., Hazan, E., Kale, S. & Schapire, R. E. Algorithms for portfolio management based on the newton method. In Proceedings of the 23rd International Conference on Machine Learning 9–16 (2006).
Li, B., Hoi, S. C. & Gopalkrishnan, V. Corn: Correlation-driven nonparametric learning approach for portfolio selection. ACM Trans. Intell. Syst. Technol. 2, 1–29 (2011).
Article Google Scholar
Li, B., Hoi, S. C. H., Sahoo, D. & Liu, Z. Y. Moving average reversion strategy for on-line portfolio selection. Artif. Intell. 222, 104–123 (2015).
Article MathSciNet Google Scholar
Li, B., Sahoo, D. & Hoi, S. C. Olps: A toolbox for on-line portfolio selection. J. Mach. Learn. Res. 17, 1242–1246 (2016).
MathSciNet Google Scholar
Markowitz, H. M. Portfolio selection. J. Financ. 7, 77–91 (1952).
Google Scholar
Chen, Y.-T., Sun, E. W. & Yu, M.-T. Risk assessment with wavelet feature engineering for high-frequency portfolio trading. Comput. Econ. 52, 653–684 (2018).
Article Google Scholar
Lai, Z. R., Tan, L. M., Wu, X. T. & Fang, L. D. Loss control with rank-one covariance estimate for short-term portfolio optimization. J. Mach. Learn. Res. 21, 1–37 (2020).
MathSciNet MATH Google Scholar
Algoet, P. H. & Cover, T. M. Asymptotic optimality and asymptotic equipartition properties of log-optimum investment. Ann. Prob. 16, 876–898 (1988).
Article MathSciNet MATH Google Scholar
Cover, T. Universal portfolios. Math. Financ. 1, 1–29 (1991).
Article MathSciNet MATH Google Scholar
Kelly, J. L. A new interpretation of information rate. Bell Syst. Tech. J. 35, 917–926 (1956).
Article MathSciNet Google Scholar
Li, B. & Hoi, S. C. H. Online portfolio selection: A survey. ACM Comput. Surv. 46, 1–36 (2014).
Article ADS MATH Google Scholar
Li, B., Hoi, S. C. H., Zhao, P. L. & Gopalkrishnan, V. Confidence weighted mean reversion strategy for online portfolio selection. ACM Trans. Knowl. Discov. Data 7, 1–38 (2013).
Article Google Scholar
Lai, Z. R., Dai, D. Q., Ren, C. X. & Huang, K. K. Radial basis functions with adaptive input and composite trend representation for portfolio selection. IEEE Trans. Neural Netw. Learn. Syst. 29, 6214–6226 (2018).
Article PubMed Google Scholar
Lim, Q. Y. E., Cao, Q. & Quek, C. Dynamic portfolio rebalancing through reinforcement learning. Neural Comput. Appl. 34, 7125–7139 (2022).
Article Google Scholar
Zhao, L., Zou, S., Huang, M. & Wang, G. Distributed regularized stochastic configuration networks via the elastic net. Neural Comput. Appl. 33, 3281–3297 (2021).
Article Google Scholar
Corsaro, S., De Simone, V., Marino, Z. & Scognamiglio, S. L1-regularization in portfolio selection with machine learning. Mathematics 10, 1–15 (2022).
Article Google Scholar
Zou, H. & Hastie, T. Regularization and variable selection via the elastic net. J. R. Stat. Soc. 67, 768 (2005).
Article MathSciNet MATH Google Scholar
Ho, M., Sun, Z. & Xin, J. Weighted elastic net penalized mean-variance portfolio design and computation. SIAM J. Financ. Math. 6, 1220–1244 (2015).
Article MathSciNet MATH Google Scholar
Shen, W. W., Wang, J. & Ma, S. Q. Doubly regularized portfolio with risk minimization. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, AAAI’14 1286–1292 (2014).
Brodie, J., Daubechies, I., Mol, C. D., Giannone, D. & Loris, I. Sparse and stable markowitz portfolios. Proc. Natl. Acad. Sci. 106, 12267–12272 (2009).
Article ADS CAS PubMed PubMed Central MATH Google Scholar
Györfi, L., Ottucsak, G. & Walk, H. Machine learning for financial engineering. (World Scientific, 2012).
Lai, Z. R., Yang, P. Y., Fang, L. D. & Wu, X. T. Short-term sparse portfolio optimization based on alternating direction method of multipliers. J. Mach. Learn. Res. 19, 1–28 (2018).
MathSciNet MATH Google Scholar
Luo, Z., Yu, X., Xiu, N. & Wang, X. Closed-form solutions for short-term sparse portfolio optimization. Optimization 71, 1937–1953 (2020).
Article MathSciNet MATH Google Scholar
Sun, E. W., Kruse, T. & Yu, M.-T. High frequency trading, liquidity, and execution cost. Ann. Oper. Res. 223, 403–432 (2014).
Article MathSciNet MATH Google Scholar
Sun, E. W., Kruse, T. & Chen, Y.-T. Stylized algorithmic trading: Satisfying the predicted near-term demand of liquidity. Ann. Oper. Res. 218, 315–347 (2019).
Article MATH Google Scholar
Sun, E. W., Kruse, T. & Yu, M.-T. Financial transaction tax: Policy analytics based on optimal trading. Comput. Econ. 46, 103–141 (2015).
Article Google Scholar
Bauer, E. & Kohavi, R. An empirical comparison of voting classification algorithms: Bagging, boosting, and variants. Mach. Learn. 36, 105–139 (1999).
Article Google Scholar
Albeverio, S., Lao, L. J. & Zhao, X. L. On-line portfolio selection strategy with prediction in the presence of transaction costs. Math. Methods Oper. Res. 54, 133–161 (2001).
Article MathSciNet MATH Google Scholar
Györfi, L. & Vajda, I. Growth optimal investment with transaction costs. In International Conference on Algorithmic Learning Theory 108–122 (2008).
Das, P., Johnson, N. & Banerjee, A. Online lazy updates for portfolio selection with transaction costs. In Twenty-Seventh AAAI Conference on Artificial Intelligence 202–208 (2013).
Li, B., Wang, J. L., Huang, D. J. & Hoi, S. C. H. Transaction cost optimization for online portfolio selection. Quant. Financ. 18, 1411–1424 (2017).
Article MathSciNet MATH Google Scholar
Bondt, W. & Thaler, R. Does the stock market overreact?. J. Financ. 40, 793–805 (1985).
Article Google Scholar
Kahneman, D. & Tversky, A. Prospect theory: An analysis of decision under risk. Econometrica 47, 263–292 (1979).
Article MathSciNet MATH Google Scholar
Shiller, J. From efficient markets theory to behavioral finance. J. Econ. Perspect. 17, 83–104 (2003).
Article Google Scholar
Li, B., Zhao, P. L., Hoi, S. C. H. & Gopalkrishnan, V. Pamr: Passive aggressive mean reversion strategy for portfolio selection. Mach. Learn. 87, 221–258 (2012).
Article MathSciNet MATH Google Scholar
Merton, R. C. On estimating the expected return on the market: An exploratory investigation. J. Financ. Econ. 8, 323–361 (1980).
Article Google Scholar
Huang, D., Zhou, J., Li, B., Hoi, S. C. H. & Zhou, S. Robust median reversion strategy for online portfolio selection. IEEE Trans. Knowl. Data Eng. 28, 2480–2493 (2016).
Article Google Scholar
Weiszfeld, E. Sur le point pour lequel la somme des distances de n points donnes est minimum. Tohoku Math. J. 43, 559–61 (1936).
MATH Google Scholar
Vardit, Y. & Zhang, C. H. The multivariate l1-median and associated data depth. Proc. Natl. Acad. Sci. U. S. A. 97, 1423–1426 (2000).
Article ADS Google Scholar
Fan, J. Q., Zhang, J. J. & Yu, K. Vast portfolio selection with gross-exposure constraints. J. Am. Stat. Assoc. 107, 592–606 (2012).
Article MathSciNet PubMed PubMed Central MATH Google Scholar
Li, J. H. Sparse and stable portfolio selection with parameter uncertainty. J. Business Econ. Stat. 33, 381–392 (2015).
Article MathSciNet Google Scholar
DeMiguel, V., Garlappi, L. & Uppal, R. Optimal versus naive diversification: How inefficient is the 1/n portfolio strategy?. Rev. Financ. Stud. 22, 1915–1953 (2009).
Article Google Scholar
Chambolle, A. & Pock, T. A first-order primal-dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis. 40, 120–145 (2011).
Article MathSciNet MATH Google Scholar
Duchi, J., Shalev-Shwartz, S., Singer, Y. & Chandra, T. Efficient projections onto the l1-ball for learning in high dimensions. In Proceedings of the 25th International Conference on Machine Learning 272–279 (2008).
Boyd, S. & Vandenberghe, L. Convex optimization. (Cambridge university press, 2004).
Bertsekas, D. P. Convex optimization algorithms. (Athena Scientific, 2015).
Borodin, A., El-Yaniv, R. & Gogan, V. Can we learn to beat the best stock. J. Artif. Intell. Res. 21, 579–594 (2004).
Article MathSciNet MATH Google Scholar
Jegadeesh, N. Evidence of predictable behavior of security returns. J. Financ. 45, 881–898 (1990).
Article Google Scholar
Sharpe, W. F. Capital asset prices: A theory of market equilibrium under conditions of risk. J. Financ. 19, 442–452 (1964).
Google Scholar
Lintner, J. The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets. Rev. Econ. Stat. 47, 13–37 (1965).
Article Google Scholar

Download references

Acknowledgements

This research was funded by the National Science Foundation of China grant number 12271181, by the Opening Project of Guangdong Province Key Laboratory of Computational Science at the Sun Yat-sen University grant number 2021001, by the Guangzhou Basic Research Program grant number 202201010426 and by the Guangdong Basic and Applied Basic Research Foundation grant number 2023A1515030046.

Author information

Authors and Affiliations

Department of Applied Mathematics College Mathematics and Informatics, South China Agricultural University, Guangzhou, 510642, People’s Republic of China
Xiaoting Yao & Na Zhang

Authors

Xiaoting Yao
View author publications
You can also search for this author in PubMed Google Scholar
Na Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

X.T.Y. and N.Z. conceived the experiments, X.T.Y. and N.Z. conducted the experiments, X.T.Y. and N.Z. analysed the results. All authors reviewed the manuscript.

Corresponding author

Correspondence to Na Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yao, X., Zhang, N. Doubly elastic net regularized online portfolio optimization with transaction costs. Sci Rep 13, 18937 (2023). https://doi.org/10.1038/s41598-023-46059-2

Download citation

Received: 09 April 2023
Accepted: 27 October 2023
Published: 02 November 2023
DOI: https://doi.org/10.1038/s41598-023-46059-2

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Forecasting stock prices changes using long-short term memory neural network with symbolic genetic programming

Hybrid quantum investment optimization with minimal holding period

Multi-level deep Q-networks for Bitcoin trading strategies

Introduction

Preliminaries

Problem setting

Related works

Brenchmarks systems

Related works on transaction costs

Portfolio optimization

Price information

The proposed doubly elastic net regularized online portfolio optimization with transaction costs

Theorem 1

Proof

Solving algorithms

Linearized augmented Lagrangian method

Lemma 1

Proof

Theorem 2

Alternating direction method of multipliers (ADMM)

Lemma 2

Proof

Proposition 3

Proof

Theorem 4

Experiment

Data-sets

Experimental setting

Parameter setting

Comparison approaches

Cumulative wealth

Cumulative wealth with fixed transaction costs

Cumulative wealth with varying transaction costs

Mean excess return

\(\alpha\) Factor

\(\beta\) Factor

Sharpe ratio

Conclusion

Data availibility

Accession codes

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links