Introduction

For countries and regions where the birth rate is declining rapidly and population ageing is accelerating, it is necessary to continuously verify the financial situation of the public pension system to maintain long-term sustainability. Since the public pension system plays an essential role in supporting a large part of household expenditures, the adequacy of its benefits must also be examined.Footnote 1 The Organization for Economic Co-operation and Development (OECD, 2021a) has discussed the challenges common to pension systems in developed countries: The sustainability of these systems must be maintained while ensuring sufficient benefits. However, it is challenging to implement policies that fulfil both simultaneously because providing adequate benefits is a factor that worsens the health of the system. Therefore, we conduct a novel stochastic simulation study using the Japanese government’s official pension verification programmes. We contribute to the policy debate on the risk of benefit levels and the long-term financial sustainability of the Japanese public pension system by focusing on the investment risk-taking of the reserve fund, which affects the benefit level and sustainability.

The problems of the public pension system are generally caused by the trends of the birth rate, the ageing of the population, increasing longevity, declining productivity growth, and these fluctuations. However, social security systems, including public pensions, have been established and reformed through political processes. Cremer and Pestieau (2000) argued that economic and demographic factors play a relatively minor role in the pay-as-you-go pension system’s problems, but political factors are far more critical. For example, politicians may offer higher pension payments as subsidies for winning elections without sufficient contributions to cover them. Another problem for public pension systems is their inability to develop a credible institutional framework for contributors and pensioners in which the promise of payments is reasonably respected (Besley and Prat, 2005).Footnote 2

An automatic balancing mechanism (ABM) for a public pension scheme is a set of legally defined measures that are automatically applied depending on the solvency or sustainability indicator. It aims to restore the financial equilibrium of pay-as-you-go pension schemes through successive applications to make them viable without repeated intervention by the legislator (Vidal-Meliá et al., 2009). Recently, many countries have adopted an ABM, including Sweden, Canada, Germany, Finland, and Japan.Footnote 3 There are several ways to view this mechanism. ABM features range from simple solvency tests to complex multifactorial approaches that increase the burden on contributors and beneficiaries to make the system solvent. An example of the former is the Canadian system, where the contribution rate (tax rate) automatically increases under predetermined conditions. The second example is the Swedish ABM, which provides for the automatic adjustment of benefits. The United States (US) social security system includes specific ABM features, such as a cost-of-living adjustment (American Academy of Actuaries, 2011).

Regarding the public pension system in Japan, an ABM was introduced in 2004, accompanied by a fixed (capped) pension premium.Footnote 4 The Japanese ABM is classified into a scheme of benefits linked to life expectancy, demographic ratios, and wage growth by the OECD (2021a). Real pension benefits are reduced based on the negative growth rate of insured persons through modified indexation (macroeconomic slide in Japanese terms) to re-establish the financial equilibrium. The reserved fund is used to make up for the shortfall in immediate contributions so that the projected reserve-to-expenditure ratio at the end of the verification period (95 years later) is one, meaning that the pension reserve fund equips 1 year of total expenditure. The reserve-to-expenditure ratio (RER) is defined as the reserve fund amount of the previous year over the total pension expenditure, including benefit payments.

Meanwhile, the government has a target replacement rate to measure the adequacy of pension benefits. The replacement rate (RR) is the ratio of pension benefits to the income of insured persons. A representative household is assumed for both benefits and income. The current RR is approximately 60%, and the government’s target is at least 50% (Ministry of Health, Labour and Welfare [MHLW], 2020a). Under Japan’s ABM, the government and the Government Pension Investment Fund (GPIF) can only control the investment risk-taking of the reserve funds, which could cause significant fluctuations in the RR and RER. In this context, a stochastic simulation study is required to assess the adequacy of the benefits and sustainability of the pension system.

Regarding the financial verification of the public pension system in Japan, the MHLW (2019a) has set up several scenarios for future population and economic conditions and predicted the changes in the value of future pension reserve funds and RR without using a stochastic simulation. For future population, the MHLW (2019a) uses the National Institute of Population and Social Security Research (2017) and assumes three scenarios (medium, high, and low) for total fertility and mortality rates. For economic assumptions, MHLW (2019a) uses Cabinet Office (2019) for two short-term scenarios to 2028 and then considers several long-term economic scenarios, including economic growth and labour force participation assumptions under the government’s standard assumptions (Cases 1 to 3), lower assumptions (Cases 4 and 5), and risky situations (Case 6). The values of future pension reserve funds and RR were predicted separately for each scenario. However, even if one of these scenarios occurs as a long-term average, economic conditions and financial markets will fluctuate in the short-term, and the RR and RER will fluctuate accordingly. Furthermore, the risk of the pension system is not straightforward because of the ABM; thus, it cannot be analysed by a simple scenario-based prediction. The characteristics of the pension system are complex and similar to those of derivative products in financial markets. Even if a simple scenario-based prediction can analyse the future on average, the risks of RR and RER can hardly be captured without a stochastic simulation (Kitamura et al., 2006).

Previous studies use stochastic simulation models to estimate probability distributions and confidence intervals as indicators of the future financial status of the pension system.Footnote 5 The analysis provides insights that supplement future forecasts in addition to the scenario analysis for the RR and RER (Board of Trustees, Federal Old-Age and Survivors Insurance, and Federal Disability Insurance Trust Funds, 2020; Giang and Pfau, 2008). Moreover, as public pensions are complex systems and there are interactions among variables, it is difficult to analyse their risk using a simple model; their risk should be explicitly analysed with stochastic simulations (Castañeda et al., 2021; Chen and Matkin, 2017; Mielczarek, 2013). It is challenging to interpret the RR and RER in the context of the Japanese ABM with a scenario-based simple analysis; therefore, a probabilistic simulation model is required (Kitamura et al., 2006; Nakashima and Kitamura, 2021; Usuki et al., 2003). In particular, the risk-taking of reserve funds is a crucial factor for the future financial condition of public pensions in Japan (Kitamura et al., 2006; MHLW, 2020b) and the only decision parameter that the government can control.

Moreover, previous studies examine the effectiveness of ABMs for benefit adequacy and financial sustainability of the public pension system (Boado-Penas et al., 2020; Devolder et al., 2021; Fujisawa and Li, 2012; Vidal-Meliá et al., 2009). For example, Vidal-Meliá et al. (2009) study the introduction of ABM in the Spanish public pension system, where the level of insolvency is high, and there is a clear problem of structural actuarial imbalances. They find that ABM and official solvency indicators provide a credible institutional framework that can be used to increase the probability of pension payment promises and minimise the use of the pension system as an electoral tool. Furthermore, Boado-Penas et al. (2020) assess the adequacy and sustainability of a pay-as-you-go pension system using an ABM. Countries such as Australia, Canada, Norway, Sweden, Latvia, Poland, and Japan have public pension systems with pay-as-you-go elements and investment funds. They consider different risk-taking strategies for the investment fund of the pension system. Using the non-linear optimisation proposed by Godínez-Olivares et al. (2016), they find trade-offs between financial sustainability and adequacy of the system. For example, a higher allocation to equity increases the probability of a higher RR. Fujisawa and Li (2012) examine the effect of an ABM on pension benefits in Japan. They use a stochastic simulation model in which mortality and fertility rates change stochastically, calculate the present value of the difference between the real benefits with and without an ABM, and estimate the average deficit and its value at risk (VaR).

Using a stochastic simulation model that applies the government’s public pension verification programmes used in MHLW (2019a), we examine the adequacy and sustainability of the Japanese public pension system as a function of the risk-taking of pension reserve funds. Adequacy and sustainability are measured by the RR and RER, and risk-taking is measured by the stock weight (SW), the percentage of investments in stocks (including domestic and foreign stocks) in total reserve funds. We consider investments in traditional asset classes: domestic stocks, domestic bonds, foreign stocks, and foreign bonds. We use only the medium variant of population projections to focus on economic fluctuations. Our assumptions for the stochastic simulation are based on two government economic cases: Case 3 (the third-highest growth scenario: standard scenario) and Case 5 (the second-lowest growth scenario: pessimistic scenario) in the MHLW (2019b), hereafter referred to as economic assumptions 3 and 5, respectively.Footnote 6 Primarily, we focus on economic assumption 5 because we analyse the downside risk of RR and RER.

We contribute to the literature by examining the future joint distributions of RR and RER under ABM in different stock risk-taking settings. The RR and RER are essential parameters for measuring the adequacy and sustainability of the public pension system. In particular, we simultaneously assess the downside risk of these measures, although the literature examines them separately. We demonstrate that both the RR and the RER have significant downside risks despite the existence of the ABM; the RR can be less than half, and the RER can be exhausted (RER < 0) with a certain probability. In addition, we apply dynamic investment strategies under the ABM, where the SW is changed according to the RER level. The dynamic investment strategies we consider are portfolio insurance and contrarian strategies. We then study the impact of these strategies on the downside risks of RR and RER. These studies have not yet been addressed in the literature.

Boado-Penas et al. (2020) consider different equity risk-taking patterns in an ABM framework. Although their outcomes are similar to our study, including a version of RER and RR, their decision variables are the contribution rate, pension age, and pension indexation rate. Unlike them, our decision variable is equity risk-taking, the only decision variable the government has under ABM; the contribution rate, pension age, and pension indexation rate are predetermined. As for studies in Japan, Fujisawa and Li (2012) investigate the financial sustainability of the Japanese public pension system under ABM using a stochastic simulation model. However, their model treats mortality and fertility rates as stochastic: they do not consider the stochastic change in economic variables such as inflation and wage growth rates. Using a stochastic simulation method applied to the government verification programme in 2014, Hizu (2020) examines the RER in Japanese public pensions. However, the author does not consider the risk-taking of the reserve fund and RR, which is the main theme of this study. Nakashima and Kitamura (2021) study the RR of basic pension and employees’ pension insurance using a simple original stochastic simulation model. Their main objective is to examine the effectiveness of pension revision plans, and they do not consider the RER and risk-taking of pension reserve funds.

We find that the pension reserve fund should take an adequate risk when investing in stocks. For high-risk investments such as SW = 100%, there are many cases where the ABM does not stop during our simulation periods. The RR and RER continue to decline in these cases. For low-risk investments, such as SW = 0% (domestic bonds only), the investment returns are insufficient, and the RR declines almost certainly to two-thirds or less of the current RR. By contrast, middle-range investment risk-taking (between 25% and 75%) is better. Second, our results show that the pension reserve fund could be exhausted for a certain probability. We estimate the government’s obligation to compensate for the shortage of reserve funds and find that the government has a risk of a considerable burden in some cases. Third, we consider dynamic investment strategies and find that portfolio insurance can limit the downside risks of RR and RER, while a contrarian strategy increases the median value of RR and RER. Finally, the government may temporarily increase its liabilities, the negative RER, to finance the payment of benefits as the ABM further reduces its benefits and eventually restores solvency.

An international reader may benefit from this study as follows: Japan has the oldest population and a large public pension system, and this public pension system has a large reserve fund and invests in stocks. Therefore, it is valuable for researchers and policymakers to study how the ABM works in such an established pension system depending on the different risk-taking in stocks. The discussion on the adequacy and sustainability of the public pension system with reserve funds is helpful for countries with an ABM and those considering its introduction in the future.

The remainder of this paper is organised as follows. Section “Simulation method and assumption” explains the simulation method, and section “Results” presents the results. Finally, section “Conclusions” concludes the paper.

Simulation method and assumption

Simulation procedure

The MHLW (Ministry of Health, Labour and Welfare) discloses the calculation programme used to verify the actuarial valuations every 5 years (MHLW, 2019b). Actuarial valuations include predictions of the long-term status of financial conditions, such as the projection of future premium revenue and benefit expenditure over an approximately 100-year period. These require assumptions about future demographic, social, and economic conditions. Multiple sets of assumptions covering a range of possible scenarios were used. Figure 1 illustrates the actuarial verification process implemented using MHLW’s computer programme written in FORTRAN and C++.Footnote 7 The initial assumptions are mainly the current number of insured persons, the population projection, and the projected labour force participation rate. Then, the number of insured persons in the future is projected. Second, the number of future pensioners is projected based on the current number of pensioners, pension plan participation rates, and pension rules, such as pension age and benefit payments. Third, as explained below, pension benefits and premium revenue are projected using the wage growth rate, inflation rate, and modified indexation rate. Fourth, total expenditure including pension benefits and total revenue are projected considering the national subsidy. Fifth, the value of the pension reserve fund is projected using investment returns. The investment returns differ according to the SWs in this study. This study considers five SWs: SW = 0%, 25%, 50%, 75%, and 100%. Finally, the RR and RER, the main measures in this study defined below, are projected. We repeatedly run this set of programmes by stochastically changing the inflation rate, wage growth rate, and investment returns. The start year of the simulation is 2019, and the end year is 2115. We conducted a Monte Carlo simulation with 9000 replications for each SW and economic assumption (explained below) using a large-scale computer at one of the authors’ institutions.

Fig. 1: Actuarial verification model.
figure 1

The actuarial verification process implemented using MHLW’s computer programme. Our stochastic variables are the inflation rate, wage growth rate, and investment return.

Modified indexation rate, replacement rate (RR), and reserve-to-expenditure ratio (RER)

According to the ABM, the pension benefits for pensioners in the basic pension (BP: the first tier) and employees’ pension insurance (EPI: the second tier) and the notional accrual wage amounts for employees’ pension insurance are adjusted annually by the modified indexation rate (m):

$$\begin{array}{*{20}{c}} {m_{h,t} \equiv Min\left[ {f\left( {\theta _t,\varphi _t} \right),Max\left\{ {f\left( {\theta _t,\varphi _t} \right) + Min\left( {\delta _t - \gamma ,0} \right),0} \right\}} \right] \cdot I_{\left\{ {t \le t_h^ \ast } \right\}}} \\ { + \,f\left( {\theta _t,\varphi _t} \right) \cdot I_{\left\{ {t > t_h^ \ast } \right\}},} \\ {t_h^ \ast = inf_{2010 \le t \le T}\{ t|RER_{h,T} = 1\} } \end{array}$$
(1)

where h represents the public pension system, \(h \in \left\{ {BP,EPI} \right\}\); θ and φ are the inflation and wage growth rates, respectively; the function f differs depending on the insured persons and pensioners: \(f\left( {\theta ,\varphi } \right) \equiv \varphi\) for insured persons and \(f\left( {\theta ,\varphi } \right) \equiv \min \left( {\theta ,\varphi } \right)\) for pensioners; δ represents the annual growth rate of the total number of insured persons, which is usually negative; and γ is constant and set to 0.003. The notional accrual wage amounts are used to calculate the pension benefits at the pension-claiming age. The modified indexation rate is applied to the basic pension and employees’ pension insurance separately until the year (\(t \le t_h^ \ast\)) when the predicted RER for the corresponding pension system becomes one at the end of the verification period (\(RER_{h,T} = 1\)), or equivalently, the pension system re-establishes financial equilibrium. In our simulation, θ and φ are the stochastic variables. The pension benefits fluctuate based on these variables. Furthermore, the investment returns for reserve funds are stochastic. As the RER depends on θ, φ, and investment returns, the year the ABM ends (\(t_h^ \ast\)) fluctuates according to these variables. Therefore, the RR and RER fluctuate through the change in \(t_h^ \ast\) in addition to changes in θ, φ, and investment returns. After the application of modified indexation ends (\(t > t_h^ \ast\)), the indexation rate is \(f\left( {\theta ,\varphi } \right)\).

The notional accrual wage amounts (N) and full basic pension benefits (FB) for each individual (i) are adjusted by the modified indexation rate (m):

$$\begin{array}{l}N_{t,i} = N_{t - 1,i}\left( {1 + m_{EPI,t}} \right) + W_{t,i},\\ FB_{t,i} = FB_{t - 1,i}\left( {1 + m_{BP,t}} \right),\end{array}$$

where W is the standardised wage income for each individual. The pension benefits (B) at the year pension claimed (pc) are:

$$\begin{array}{l}B_{EPI,pc,i} = \pi _{pc,i} \cdot N_{pc - 1,i},\\ B_{BP,pc,i} = \gamma _{pc,i} \cdot FB_{pc - 1,i},\end{array}$$

where π is the multiplier for determining the employees’ pension benefit, and γ is the proportion of insured months to 480 months (for full pension) in the basic pension. The pension benefits (B) for each pension system (h) after the claiming age are:

$$B_{h,t,i} = B_{h,t - 1,i}\left( {1 + m_{h,t}} \right).$$

Our outcomes (RR and RER) are:

$$RR_t \equiv \frac{{\mathop {\sum}\nolimits_{h \in \left\{ {BP,EPI} \right\}} {Representative\_standard\_benefits_{h,t - 1}\left( {1 + m_{h,t}} \right)} }}{{Representative\_standard\_wage_{t - 1}\left( {1 + \varphi _t} \right)}},$$
(2)
$$\begin{array}{l}RER_{t}\equiv \frac{\sum_{h\in\{BP,EPI\}}Reserve\_fund_{h,t-1}}{\sum_{h\in\{BP,EPI\}}Total\_expenditure_{h,t}}.\end{array}$$
(3)

where Representative_standard_benefitsh,0 is a function of pension benefits of each pension system at the start of the simulation, and Representative_standard_wage0 is a function of notional accrual wage amounts at the start of the simulation; those are pension benefits and wage income of a representative household defined in MHLW (2019b, p. 14). Total_expenditureh,t is the expenditure of each public pension system that depends on total pension benefits (\(B_{h,t} \equiv \mathop {\sum}\nolimits_i {B_{h,t,i}}\) and other expenditures. These expenditures depend on the modified indexation rate (m). The reserve fund of each pension system is

$$\begin{array}{*{20}{c}} {Reserve\_fund_{h,t}} \\ { = Total\_revenue_{h,t} - Total\_expenditure_{h,t} + Reserve\_fund_{h,t - 1}\left( {1 + r_t} \right),} \end{array}$$
(4)

where Total_revenue is the income of each pension system including premiums and rt is the investment returns.

Japanese public pension system and investments of the reserve fund

The starting year of our simulation is 2019. For actual public pensions in Japan, the total revenue in 2019 was JPY 52.9 trillion, the total expenditure was JPY 53.3 trillion, and the size of the reserve fund was JPY 190.5 trillion (MHLW, 2019c). The RR was 61.7% (MHLW, 2019b), and the RER was 3.76 in 2019 (MHLW, 2019c). The ABM started in 2015 and, therefore, applied in 2019. In our simulation, the initial value of RR is 61.7%, and the RER is 4.0 (MHLW, 2019b). The RER differs slightly from the above value as the size of a reserve fund sometimes varies depending on coverage, measure, and purpose, such as the difference between market value and book value.

According to the GPIF (2020a, 2020b), it reconstructs its strategic asset allocation (or policy asset mix) of asset classes with allowances from the strategic allocation almost every 5 years, together with the timing of the actuarial valuation of public pensions. A strategic allocation is a long-term target asset allocation for each asset class. The current strategic allocation was determined in 2020. The GPIF uses four traditional asset classes: domestic stocks, domestic bonds, foreign stocks, and foreign bonds. The GPIF sets benchmark indices for each asset class to manage risk or measure investment performance. The benchmarks for domestic stocks, domestic bonds, foreign stocks, and foreign bonds are Tokyo Stock Exchange Price Index (TOPIX) including dividends, Nomura Bond Price Index (Nomura–BPI) excluding asset-backed securities, MSCI All Country World Index (ACWI) excluding Japan including dividends (JPY base), and FTSE World Government Bond Index (WGBI) excluding Japan and China without currency hedge (JPY base), respectively. Approximately 86–94% (62–79%) of stock (bond) investments have been passively managed with their respective benchmarks over the past 5 years, while the remainder has been actively managed (GPIF, 2022). After constructing the strategic allocation, the GPIF monitors and rebalances its portfolio periodically: If the portfolio deviates from the strategic allocation beyond the allowance, the portfolio is rebalanced to the strategic allocation. The strategic allocation is determined considering the actuarial valuation and the economic and capital market environment. The GPIF may change its strategic asset allocation significantly for some periods (e.g., change in 2014: increase in stock allocation from 24% to 50%), while it may remain at small changes for other periods (e.g., change in 2020: stock allocation remains at 50%, domestic bond allocation decreases by 10%, while foreign bond allocation increases by 10%). Regarding alternative assets (e.g., infrastructure, private equity, and real estate), the GPIF has invested in these assets since 2014, and its current balance is approximately JPY 2.1 trillion (GPIF, 2022). It classifies them into traditional asset classes according to their risk and return characteristics, with the restriction that the maximum allocation to them is 5% of total assets.

In our main simulation, we use a constant mix strategy. Once the SW is determined at the beginning of a simulation, it is maintained throughout the simulation. In other words, the SW is rebalanced each year to the predetermined SW. The GPIF also uses this approach to examine the risk and return characteristics of the reserve fund when the GPIF constructs its strategic asset allocation. The asset allocation for SW = 50% in our simulation corresponds to the GPIF’s current strategic asset allocation. Other asset allocations could be potential candidates for the GPIF’s future strategic asset allocation.

Simulation assumption

Table 1 shows the assumptions used in our simulations. We assume that the inflation rate, wage growth rate, and investment returns follow a normal distribution. As for the parameter settings depending on the economic assumption, the mean values of these variables differ, while we use the same standard deviation and correlations for all economic assumptions. We assume that no significant structural change occurs in the distributions, except for the mean, across the economic assumptions. We also assume that reserve funds invest only in four traditional asset classes: domestic stocks, domestic bonds, foreign stocks, and foreign bonds. Although the OECD (2021b) reports that many private and public pension funds invest in alternative assets, including private equity, venture capital, real estate, and infrastructure, we do not explicitly consider these assets because GPIF’s strategic asset allocation only considers the four traditional assets. The OECD (2021b) shows that public pension funds with large assets under management, such as the Social Security Trust Fund in the US, the GPIF in Japan, and the Government Pension Fund—Global in Norway, invest mainly in traditional assets. Investments in alternative assets are limited, partly due to the liquidity problem. The supply of these alternative assets may not be sufficient for large public pension funds, especially in Japan. The mean and standard deviation of each variable and their correlations were taken from the GPIF (2020a). The variables for which no assumptions are made in the GPIF (2020a), such as the correlation between the inflation rate and asset class returns, are estimated using annual data from the last 25 years (1993 to 2017). Table 1 shows the assumptions for our simulations. The mean and standard deviation of the investment returns differ depending on the stock weighting. The correlation between investment returns and inflation and between investment returns and the wage growth rate also differ depending on the SW.

Table 1 Simulation assumption.

The GPIF’s current strategic asset allocation is domestic stocks (25%), domestic bonds (25%), foreign stocks (25%), and foreign bonds (25%). In our simulation, we assume that the domestic and foreign asset classes are equally weighted, as the current strategic allocation weights domestic and foreign asset classes equally. In the case of SW = 25%, domestic and foreign stocks are 12.5% each. Similarly, domestic and foreign bonds are 12.5% each. We run simulations for different domestic and foreign weights to check the robustness of the results in subsection “Robustness checks”.

Our economic assumptions are based on the settings of MHLW (2019a, 2019b), which uses Cabinet Office (2019) projections until 2028. Thereafter, the MHLW (2019a, 2019b) estimates economic assumptions using a Cobb–Douglas aggregate production function. The MHLW uses the National Institute of Population and Social Security Research (2017) for the population assumption. Cabinet Office (2019) uses its economic and fiscal model, a kind of IS-LM Phillips Curve style model. Gross domestic product (GDP) growth, inflation, wages, and interest rates are thus determined endogenously. The MHLW uses a Cobb–Douglas aggregate production function for long-term economic assumptions: Real wage growth rates and investment returns. The parameters of the Cobb–Douglas aggregate production function are exogenous in terms of total factor productivity (TFP), capital’s share of output, the depreciation rate, and the ratio of gross fixed capital formation to output. The inflation rates are also given exogenously. In the case of the Cobb–Douglas aggregate production function, depopulation negatively affects the real growth rate and asset returns, as discussed in Arnott and Chaves (2012).

We acknowledge that the default risk on the national debt in Japan is not negligible because it is about twice its GDP and larger than in other countries. In addition, reserve funds considerably invest in Japanese government bonds (JGBs), which indicates that reserve funds face considerable default risk. However, our simulation assumes no default risk for the Japanese government; therefore, the public pension system does not face default risk for the JGBs. The default risks of the Japanese government and the public pension system are beyond the scope of this study and will be investigated in our future research.

In Japan, the ABM aims to control the RER with the RR policy target. The RER and RR transitions are built into the ABM and cannot be intentionally changed once the law is implemented. The only variable that the policy can change is the SW. The policy sets the RER to be one at the end of the verification period. Once this target is set, it is controlled by the ABM. However, as economic conditions fluctuate, the RER may not be 1 in the final verification period. This value may also fall below zero. Unlike the RER, the RR is changed by the ABM, but it is not automatically controlled to achieve a specific policy target (e.g., RR ≥ 50%). RER and RR risks need to be managed because economic conditions fluctuate. Since RER and RR can vary, these risks need to be examined jointly. It is also necessary to discuss both the situation in the last year of the simulation and the levels of these measures before the last year. The ABM unilaterally lowers the RR, and the RER becomes one in the last year of the verification period due to the ABM. However, their development is complicated by the timing of the termination of the ABM. The RER may fall very low in some cases but not in others; it falls and rises or vice versa.

Results

Transitions of marginal distribution for the replacement rate (RR) and reserve-to-expenditure ratio (RER)

Figure 2 illustrates the transition of percentile values by year and stock weight (SW) for economic assumption 5 (pessimistic scenario). Panels (A) and (B) illustrate the results for the RR (replacement rate) and the RER (reserve-to-expenditure ratio). The RR represents the public pension benefits for pensioners over the wage income of the insured, simulated by Eq. (2), and the RER represents the reserve fund amount of the previous year over the total pension expenditure, simulated by Eq. (3). Both indicators are measured on a total public pension basis: the sum of the basic pension and employees’ pension insurance, given the government’s standard family structure assumptions. For each panel, the lower, middle, and upper lines represent the 10th (dashed line), 50th (solid line), and 90th (dash-dot line) percentile values, according to the SW (stock weight): SW = 0% (domestic bonds only), 25%, 50% (GPIF’s current strategic asset allocation), 75%, and 100% (stocks only). The RR remains at the same level and graphically horizontal after the ABM stops, indicating that the pension system re-establishes financial equilibrium.

Fig. 2: Transition of percentile values of RR and RER for economic assumption 5.
figure 2

A RR for economic assumption 5. B RER for economic assumption 5. The lines in the figure represent the transition of the 10th (dashed line), 50th (solid line), and 90th (dash-dot line) percentile values, respectively, in order from the bottom.

Panel (A) displays the transition of percentile values of the RR with the passage of the simulation year for economic assumption 5, according to the SW. Generally, increasing SW tends to decrease the 10th percentile value and increase the 90th percentile value of the RR; however, the effect of the ABM on the RR differs according to the SW. The 10th percentile values are below 40% in around 2060 and continue to decrease as the SW rises. This decrease is because there are some sample paths in which the ABM for the basic pension does not stop during the simulation period. Thus, the ABM continues to decrease the benefit level.Footnote 8 These sample paths consist of nearly 7%, 20%, and 26% of the total sample paths in the cases of SW = 0%, 50%, and 100%, respectively. The 90th percentile values increase with increasing SW. However, the lines for SW = 75% and 100% are similar because there is an upper limit to the increase in the RR. As the modified indexation rate (m) of the RR is defined by Eq. (1), future RR does not exceed the current RR. In Eq. (1), the modified indexation rate (m) is at most wage growth rate (φ) by definition. In Eq. (2), the growth rate of the numerator of RR is at most φ, and that of the denominator is always φ. Therefore, RR can only decrease or maintain the current level (approximately 60%).Footnote 9 In addition, there are many sample paths in these SWs where ABM ends earlier and remains at a higher level than government verification. The sample paths where ABM ends before 2030 account for almost 9% and 12% of the total sample paths in the basic pension for SW = 75% and 100%, respectively. They are approximately 27% of the total sample paths in the employees’ pension insurance for SW = 75% and 100%. The 50th percentile values reach about 40% and remain similar for all SW, although the government target is 50%.

Panel (B) illustrates the change in the distribution of the RER for the total public pension under economic assumption 5 according to the SWs.Footnote 10 The RER should be one in the last year of the simulation period, according to the ABM, for both the basic pension and employees’ pension insurance.Footnote 11 However, the RER can be more than one during the simulation periods for the sample paths if investment returns are high and the ABM ends early. By contrast, it can be less than one if the ABM does not stop even in the last year, which means that the financial condition of the public pension is not recovered, and pension benefits continue to be reduced. The current reserve fund balance for the employees’ pension insurance at the end of 2018 is JPY 200.0 trillion, whereas that of the basic pension is JPY 11.5 trillion. Thus, the employees’ pension insurance greatly affects the overall funding level. In addition, even if the RER falls below one during the simulation period, it can recover to one towards the end of the simulation period for some sample paths because the ABM further reduces pension benefits. As predicted, the 50th percentile values for any SWs mostly decline and reach one at the end of the simulation period. The 10th percentile values decline during the early simulation periods mainly because of low (negative) investment returns. They can be below zero for all SWs, meaning the pension reserve fund can be depleted during the simulation period. For example, the 10th percentile value for SW = 50% becomes negative in 2065–2106 and reaches approximately –0.5. It then increases in the later simulation periods because of the reduction in benefits by the ABM. However, the 10th percentile values of the RER do not attain one at the end of the simulation period for all SWs, except SW = 0%. Some sample paths exist where the ABM does not stop, even in the last year of the simulation period, and the pension system does not re-establish financial equilibrium. The 90th percentile values rise first during the early simulation periods because of the higher investment returns for SW = 25 or more and then fall because the high benefits are maintained owing to the early stop of the ABM.

Panel (A) in Fig. 3 illustrates the result of the RR under economic assumption 3 (standard scenario): the general tendency of the effects of the SWs is almost similar to that of economic assumption 5, whereas the overall levels of the RR are generally higher because the investment returns are higher than in economic assumption 5. The ABM thus continues for a shorter period than economic assumption 5. The 10th percentile value decreases as SW increases. It bottoms out and remains at the same level for SW = 0%, 25%, and 50% after the ABM ends, restoring financial equilibrium. In contrast, it decreases when SW = 75% or more because there are sample paths where the ABM for the basic pension does not end during the simulation period, and thus the benefit level continues to decrease. The RR has a lower limit in these cases because some components of the modified indexation rate are predetermined in our model and have a lower bound during our simulation periods.Footnote 12 These sample paths for the basic pension consist of nearly 10% and 14% of the total sample paths for SW = 75% and 100%, respectively.Footnote 13 The 90th percentile values increase with the SW increase, but the lines for SW = 75% and 100% are similar because of the upper limit. There are many sample paths in these SWs where the ABM stops before 2030, approximately 20% and 60% of sample paths for the basic pension and employees’ pension insurance, respectively. The 50th percentile value increases with an increase in SW because the expected return for the reserve fund increases, but they are almost the same for SW = 75% and 100% because of the upper limits. Even with the better economic assumption in economic assumption 3, the 50th percentile value does not attain 50%, which is the government’s target.

Fig. 3: Transition of percentile values of RR and RER for economic assumption 3.
figure 3

A RR for economic assumption 3. RER for economic assumption 3. B The lines in the figure represent the transition of the 10th (dashed line), 50th (solid line), and 90th (dash-dot line) percentile values, respectively, in order from the bottom.

Panel (B) illustrates the RER results for economic assumption 3. The general characteristics are similar to economic assumption 5, although the RER level tends to be higher. It should be noted that the 10th percentile value becomes one in the last year of the simulation periods for all SWs and does not reach zero, except for SW = 100%. The 90th percentile values for SW ≥ 50% increase and do not fall to one.

Figures 2 and 3 show that the RR and RER have considerable downside risks despite the existence of the ABM. We explore the main source of these fluctuations in subsection “Robustness checks” below.

Examination of joint movements of the replacement rate (RR) and reserve-to-expenditure ratio (RER)

Figure 4 displays statistics that capture the joint distribution of the RR (vertical axis) and RER (horizontal axis). Panels (A) and (B) show the results of economic assumptions 5 and 3, respectively. The pink triangle, red square, and light green triangle represent the 90th, 50th, and 10th percentile values, respectively. The 10th percentile value corresponds to this study’s value at risk (VaR). A marginal conditional value at risk (CVaR) is defined as the mean of each variable (RR or RER) below its VaR. The green diamond is located at a cross point of each CVaR. The point indicated by an arrow is the year 2019, the start year of our simulation, and the opposite end is the year 2115, the last year of the simulation. In between indicates intermediate years of simulation. We use the VaR and CVaR to measure the risk of RR and RER (risk indicators). However, we cannot manage risk only by measuring risk; criteria for evaluating these indicators are needed. We use RR = 40% and RER = 0 as our criteria. The government target is RR = 50% and RER = 1. However, our results show that RR = 50% is too high to accomplish on average. Whether or not the pension reserve fund is depleted is an important political issue; we thus use RER = 0.Footnote 14

Fig. 4: Joint transition of percentile value and marginal conditional value at risk by year and stock weight.
figure 4

A Joint transition of RR and RER for economic assumption 5. B Joint transition of RR and RER for economic assumption 3. Pink triangles, red squares, and light green triangles represent the 90th, 50th, and 10th percentile values, respectively, and green diamonds represent the marginal CVaR (mean value of RR and RER under their 10th particle value) according to the stock weight (SW). The arrow indicates the initial value in 2019, and the opposite end indicates 2115.

Panel (A) displays the results for economic assumption 5. For SW (stock weight) = 0% (the upper left plot), all statistics go down and move to the left from the start to the last year of simulation, meaning that the RR and RER decrease as the simulation year increases. The 50th percentile values should be located on the vertical line of one in the last year of simulation by the ABM for all SWs.Footnote 15 For SW = 25% or more, the 10th percentile value moves to the left until the middle year of the simulation, indicating that the RER worsens. Then, towards the last year of the simulation, it moves to the right and almost reaches the vertical line of one for most SWs, which means that the RER improves. These results indicate that temporary government borrowing is possible. The borrowing can be repaid by reducing pension benefits through the ABM. The movements of CVaR are similar to the 10th percentile values, except for SW = 75% and 100%. The CVaRs move to the left towards the last year of the simulation, indicating that the CVaR of the RER worsens. The 90th percentile values for SW = 25% or more move to the right, meaning the RER improves. The 90th percentile values then move to the vertical line of one in a later simulation period because of the ABM, whereas that of SW = 100% continues to move to the right.Footnote 16

Panel (B) displays the results for economic assumption 3. The general tendency regarding SW is similar to that of economic assumption 5, although all statistics shift upward, indicating that RR improves. It should be noted that from SW = 0% and 75%, the VaR and CVaRs are close, whereas the CVaR for SW = 100% moves in the left direction mainly because of the negative investment returns. The 90th percentile values for SW = 50% or more move in the right direction and do not turn towards the vertical line of one.Footnote 17

Examination of the replacement rate (RR) and reserve-to-expenditure ratio (RER) specific for economic assumption 5 and year 2075

In this subsection, we choose a particular year (2075) and analyse the relationship between the RER and RR for economic assumption 5 to deepen our understanding of the effect of the SW (stock weight). Figure 5 displays the RR and RER scatter plots for economic assumption 5 in 2075. The orange dots represent the points where the RR or RER is less than the 10th percentile value (VaR), and the blue dots represent the other points. The pink triangle, red square, and light green triangle represent the 90th, 50th, and 10th percentile values, respectively. The green diamonds represent the CVaR of RR and RER.Footnote 18 Fig. 5 reveals that the scatter plot for SW = 0% has a round shape, whereas other scatter plots have a distinctive shape. The RR and RER fluctuate depending on the SW. The realised distribution is broad and distinct, narrow in the lower left, and wide in the upper right. As SW increases, the scattering becomes more extensive.

Fig. 5: Scatter plot of RR and RER for economic assumption 5 and year = 2075.
figure 5

The pink triangle, red square, and light green triangle represent the 90th percentile, 50th percentile, and 10th percentile values, respectively, and green diamonds represent the marginal CVaR (mean value of RR and RER under their 10th particle value) according to the stock weight (SW). The orange dots represent the RR and RER below the 10th percentile value.

Panel (A) of Fig. 6 displays the 50th percentile value, VaR, and CVaR of the RER for economic assumption 5 in 2075. The 50th percentile value increases approximately from 1.0 to 2.0 as the SW increases, peaks at SW = 50% and 75%, and then decreases (concave shape). By contrast, the VaR is approximately 0.0 in the case of SW = 0% and 25% and then decreases. The CVaR is similar to the VaR but decreases sharply for SW = 50% or more. Panel (B) displays the 50th percentile value, VaR, and CVaR of the RR for economic assumption 5 in 2075. The 50th percentile value increases approximately from 38% to 40% as the SW increases, peaks at SW = 50% and 75%, and then decreases. By contrast, the VaR and CVaR are higher around SW = 0% and 25%, decreasing as the SW increases. Panel (C) illustrates these two probabilities. The upper line illustrates the probability that the RR is less than its VaR or the RER is less than its VaR. The probability decreases from approximately 16% to 14% as the SW increases until SW = 50%, after which it is flat for SW = 50% or more. The lower line shows the probability that the RR is less than its VaR and the RER is less than its VaR, representing a more severe risk situation. The probability increases from approximately 4.0% to 6.0% as the SW increases until SW = 50%, after which it is flat for SW = 50% or more.

Fig. 6: Simulation result for economic assumption 5 and year = 2075.
figure 6

A RER for economic assumption 5 and Year = 2075. B RR for economic assumption 5 and Year = 2075. C Probability of risk situation for economic assumption 5 and Year = 2075. Marginal CVaR represents the mean of a variable below the 10th percentile value. Prob (RR < p10 or PER < p10) represents the probability that the RR is less than the 10th percentile value or the RER is its 10th percentile value. Prob (RR < p10 and PER < p10) represents the probability that the RR is less than the 10th percentile value, and the RER is its 10th percentile value.

In summary, when considering the 50th percentile value, the RR and RER are jointly highest around SW = 50% and 75%. By contrast, the RR and RER are jointly stable around SW = 50% or less when considering the VaR and CVaR. The VaR and CVaR for each RR or RER continue to decrease for SW ≥ 75% or more. The probability that the RR or RER is less than its VaR is relatively flat for SW = 25%, 50%, and 75%. These results indicate that the SW should not be high or low. The following section examines these relationships for other years (2050, 2100, and 2115).

Examination of the effect of the stock weight (SW) on the replacement rate (RR) and reserve-to-expenditure ratio (RER)

Table 2 summarises the results of the analysis. Panels A and B display the simulation results for economic assumptions = 5 and 3, respectively. Panel A shows that the RR (replacement rate) and RER (reserve-to-expenditure ratio) tend to have the highest values for moderate SW (stock weight), such as SW = 50% and 75%, when considering the 50th percentile values. The RR and RER tend to have the highest values for low SW, such as SW = 0% and 25%, when we focus on the risk (VaR and CVaR) in 2050, 2075, 2100, and 2115. The probability that either the RR or RER is less than its VaR is the lowest for SW = 75% in 2050 and 2075, SW = 25% in 2100, and SW = 0% in 2115. Panel B shows the results for economic assumption 3, similar to economic assumption 5; a higher SW can pursue a higher RR and RER because of the better economic conditions for economic assumption 3 than for economic assumption 5. The 50th percentile values tend to be the highest for higher SWs (SW = 75% and 100%), whereas the VaR and CVaRs tend to be the highest for lower SWs (SW = 0% and 25%). The probabilities that the RR or RER is less than its VaR have a U-shape concerning the SW, meaning that the probabilities tend to be lower for moderate SWs except for the year 2115. The SW should be determined considering risk and return. How to determine SW should be a matter of policy discussion. The current government verification uses only the median values for valuation, but discussing whether risk and return should be emphasised and how they should be valued is necessary. Based on our simulation results, moderate risk-taking, such as SW = 50%, is expected to perform better when considering the risk and return of RR and RER. However, further policy discussion is needed to determine the optimal SW, considering the timeline to assume, the economic situation, the expected RR compared to the current target (RR = 50%), and the burden on negative RER, which will be our future study.

Table 2 Simulation results. Panel A: Simulation results for economic assumption 5. Panel B: Simulation results for economic assumption 3.

Government obligation in the case of depleted reserve funds

Figure 2 and Table 2 show that reserve funds can be depleted with certain probabilities. Once depleted, the difference between total revenue and total expenditure is expected to be covered by the government. It would be important to consider the extent to which the government will have to bear the burden of making up for this shortfall. We assess the present value of this government obligation if the RER (reserve-to-expenditure ratio) becomes negative. We estimate two measures under four different discount rates. One is the present value of the net cash flow (the difference between total revenue and total expenditure) of the public pension when RER < 0 (cashflow-based measure), and the other is the present value of the minimum reserve funds when RER < 0 (balance-based measure). The former captures the present value of total government spending to compensate for the shortfall in reserve funds, and the latter captures the present value of the maximum government debt. We use four different discount rates: the average 10-year Japanese government bond (JGB) yield (0.19%), the expected return of domestic bonds (0.49%), the inflation rate (0.80%), and the expected return of investments (2.82%). The average 10-year JGB yield is a daily average over the last 10 years (2012/12–2022/12) for 10-year maturity JGBs. The expected returns on domestic bonds, inflation rate, and the expected returns on investments use our simulation assumptions in economic assumption 5, and SW = 50%. Although the government indicates that the pension system will become a full pay-as-you-go system once the reserve fund is exhausted, we assume that ABM will continue.

Table 3 shows the (conditional) present values of the government’s obligations when the RER becomes negative, which range from JPY 9.5–65.3 trillion depending on the measures and discount rates. The probability of the RER becoming negative for at least 1 year is 20%. The results also show that the standard deviations of the two measures are large enough to indicate that the obligation could become even larger. As for the cashflow-based measure, the relationship between the discount rate and the present value is inconsistent. For example, the average value (–10.8) of the lowest discount rate (0.19%) is higher than that of the higher discount rates (0.49% and 0.80%). These results are due to the timing of the cash flows. The net cash flows tend to be negative when the RER becomes negative, and they can become positive over time as ABM reduces the benefit level. If we compare these obligations with the current outstanding face value of Japanese government bonds (JGBs), which is JPY 1027 trillion (Ministry of Finance, 2023a), the present value of government obligations is 0.9%–6.4% of the outstanding value. These amounts of government obligations are insignificant compared to the JGB, as the JGB has a huge outstanding balance. Now, comparing government obligations to GDP, the cashflow-based measure is not a considerable number. However, when looking at the balance-based measure, there is a considerable risk that the obligation becomes a significant burden (11.4% of GDP) compared to other countries (Ministry of Finance, 2023b). The obligation is lower in the cashflow-based measure than the balance-based measure, which is considered to be the effect of ABM reducing benefits.

Table 3 Present value of government obligation when reserve funds are depleted.

Robustness checks

In the previous subsections, we noted that the RR (replacement rate) and RER (reserve-to-expenditure ratio) show significant downside risks despite the existence of ABM. Instead, ABM seems to amplify the risk of RR and RER. If the year in which ABM ends fluctuates, RR and RER also fluctuate, as explained in section “Simulation method and assumption”. In addition, the fluctuations of RER and RR are also influenced by the stochastic behaviour of the inflation rate, wage growth rate, and investment returns. The question naturally arises as to which factor contributes most to the fluctuations in RER and RR. Therefore, we ran additional simulations to investigate the source of volatility in RER and RR by varying only one variable and fixing the others at their mean.

Columns (2)–(4) in Table 4 show the results when only one variable is stochastic under economic assumption 5 and SW = 50%. Column (1) again shows that the results for all three variables are stochastic, with SW = 50% as the reference (referred to as the base case). The column of the government verification displays that all our stochastic variables are fixed at their means, which mimics the MHLW (2019b) through our simulation. The 50th percentile values of RR and RER in Column (1) are lower than those of the government verification because investment returns are low in some cases, and modified indexation continues (modified indexation does not end during our simulation for some paths, as explained in subsection “Transitions of marginal distribution for the replacement rate (RR) and reserve-to-expenditure ratio (RER)”).

Table 4 Robustness checks.

Column (2), where only investment returns are stochastic, shows the result close to the base case in Column (1). In Columns (3) and (4), the variations of RR and RER for the results in which the stochastic variable is only wage growth rate or inflation rate are small. The results suggest that the volatility of investment returns mainly explains the variations in RR and RER. The results in Columns (3) and (4) are close to those of the government verification (which has no variation): Wage growth rate and inflation rate have less impact on the variations of RR and RER than investment returns. These results are consistent with previous studies (Kitamura et al., 2006; MHLW, 2020b).

Furthermore, the literature shows that the volatilities of economic variables change over time. The volatility of many economic variables increases during recessions, and in particular, stock volatility increases when stock prices fall and debt rises (Schwert, 1989). A US study also shows that the volatility of GDP, macroeconomic, and financial variables decreased significantly after the mid-1980s (Justiniano and Primiceri, 2008), referred to as the great modification. The literature has discussed factors for this phenomenon, including simple luck, monetary policy, and inventory management. Recent market integration causes a phenomenon in one market to spill over to another (Engle and Susmel, 1993): Changes in volatility in the US can lead to changes in volatility in other markets, including Japan (Schwert, 2011). Our simulation uses the volatility assumptions estimated from the last 25 years of data (1993–2017), which is after the great modification, and the volatilities happen to be low: the future volatilities may be higher or lower than in the past. To further check the robustness of our results, we run additional simulations with double and half the volatility of investment returns: all asset class volatilities are doubled (halved), and the volatility is estimated for SW = 50%. We keep the same assumptions for the expected returns and correlations. Columns (5) and (6) in Table 4 show the simulation results for doubling and halving the volatility of the asset classes. We calculate the investment returns based on SW = 50% under economic assumption 5. As expected, fluctuations of RR and RER are larger in Column (5) than in Column (6). However, the results in Columns (5) and (6) are not uniformly worsened or improved compared with the base case. For the doubled volatilities, RR and RER are worsened except for the 90th percentile values of RER. For the halved volatilities, RR and RER are improved except for the 90th percentile values of RR and RER. These results indicate that the effect of volatility is not simple: asset volatility assumption is one of the critical factors regarding fluctuations of RR and RER because of the ABM.

Finally, in our simulation, we assume that domestic and foreign stocks (bonds) are equally weighted for each different SW. However, the variations in RR and RER may be sensitive to the allocation of domestic and foreign assets because the impact of economic assumptions about exchange rates, interest rates, and economic growth differs between foreign and Japanese markets. To examine the effect of allocation to foreign assets on the variation of RR and RER, we run additional simulations for two polar cases: foreign-only or domestic-only allocation. For the foreign-only, stocks are allocated to foreign stocks and bonds to foreign bonds. The domestic-only allocation is similar. We then simulate these polar cases under SW = 50% and economic assumption 5 to compare the results with the base case: The allocation of the foreign-only is that foreign stocks are 50% and foreign bonds are 50%. Because of these changes in allocation, the assumptions on the expected returns, risk, and correlations between investment returns, inflation rate, and wage growth rate differ from the base case. Columns (7) and (8) in Table 4 display the results for foreign-only and domestic-only allocation, respectively. The results show that RR and RER tend to be lower than the base case. The values of RR and RER for the domestic-only are lower than those of the base case, partly because the domestic-only has lower expected returns. There are notable differences for the foreign-only in Column (7). The 10th percentile values of RER are distinctly lower than that of the base case, indicating that the allocation only to foreign assets is risker than the base case. By contrast, the 90th percentile values of RER are higher than those of the base case because of the higher expected returns for foreign allocation only. These results indicate that a certain level of mixing asset classes with domestic and foreign assets is preferable to the polar cases by diversifying risks and maintaining expected returns. However, we only examine the two polar cases and do not address the other allocations. There may be more efficient allocations between domestic and foreign investments, and further research is needed, which will be future work.

Dynamic investment strategies

In the previous sections, the SW (stock weight) was fixed (a constant mix strategy) during the simulation period. However, SW need not be constant and can be changed dynamically according to investment objectives (Perold and Sharpe, 1988; Sharpe, 1987). In this subsection, we consider two dynamic investment strategies to examine their effectiveness in controlling the downside risk of RR (replacement rate) and RER (reserve-to-expenditure ratio): portfolio insurance and contrarian strategy.

Portfolio insurance is a risk management strategy to protect the downside risk of a portfolio while maintaining the return potential. Various methods and strategies to secure risky assets against large losses can be found in the literature, such as the stop-loss strategy (Bird et al., 1988), the synthetic put strategy (Rubinstein and Leland, 1981), the constant proportion portfolio insurance (CPPI) strategy (Black and Jones, 1987; Black and Perold, 1992), the time-invariant portfolio protection methodology (Estep and Kritzman, 1988), and the VaR dynamic investment strategy (Herold et al., 2007). In general, to maintain a predetermined floor, the SW decreases when the stock price falls to avoid further losses. By contrast, it increases when the stock price rises due to the increased risk buffer.

The literature analyses the performance of portfolio insurance. Cesari and Cremonini (2003) show that the performance of portfolio insurance depends on the market environment. For example, the constant proportion portfolio insurance strategy dominates other strategies in bear and no-trend markets, while a constant mix strategy is preferred in bull markets. Annaert et al. (2009) find that portfolio insurance strategies outperform a buy-and-hold strategy regarding downside protection but provide lower excess returns. They argue that portfolio insurance strategies are a valuable alternative to fixed strategies for investors considering downside protection.

Previous studies also analyse the investors’ suitability of portfolio insurance for investors. Dichtl and Drobetz (2011) analyse portfolio insurance in the context of cumulative prospect theory and suggest that loss aversion and probability weighting contribute to portfolio insurance being a preferred investment strategy for a prospect theory investor. Dierkes et al. (2010) also demonstrate that portfolio insurance is attractive to investors with preferences described by cumulative prospect theory. Institutional investors often use portfolio insurance to protect their portfolios from large losses. Shefrin and Statman (1993) and Kahneman and Riepe (1998) argue that investment advisors should consider strategies limiting the downside risk while retaining some possible upside gains. A major drawback of portfolio insurance is the cost of buying derivatives or managing option duplicating portfolios. In addition, protection may not be perfect: changing market conditions and discrete transactions incur additional risk to keep the floor.

A contrarian strategy, in general, is formed by buying past losers and selling past winners (De Bondt and Thaler, 1985, 1987). A contrarian strategy is profitable by exploiting investors’ suboptimal behaviour, such as overreaction. Various variables have been used to construct contrarian portfolios, including past stock returns (De Bondt and Thaler, 1985) and accounting ratios such as book-to-market, cash flow-to-price and earnings-to-price (Lakonishok et al., 1994). Studies have demonstrated the profitability of contrarian strategies in various countries and regions, including European countries (Brouwer et al., 1997; Mun et al., 1999), China (Kang et al., 2002; Chen et al., 2018), New Zealand (Chin et al., 2002), Turkey (Bildik and Gülay, 2007), Hong Kong (Ramiah et al., 2011), and Japan (Chou et al., 2007). The contrarian strategy is utilised by institutional investors such as pension funds, life insurance companies, and mutual funds (De Haan and Kakes, 2011; Wei et al., 2015). Although the contrarian strategy originally aimed to exploit market inefficiencies and generate profits, this study examines the rule-based contrarian strategy to compare it with portfolio insurance. Since this study uses simulations and not market data, we analyse the impact of a rule-based contrarian strategy on the downside risk of RR and RER.

This section presents an application example for portfolio insurance and a contrarian strategy for RER under an ABM. The RER represents the long-term sustainability of the public pension system and the target of the ABM. We then compare the effect of these strategies with the constant mix strategy (SW = 50%) for RER and RR risk management. Table 5 displays the SW change rules based on portfolio insurance and (rule-based) contrarian strategy. We focus on RER instead of the value of reserve funds to decide on a stock allocation: For example, SW = 25% in portfolio insurance if 3.5 > RER ≥ 2.5, whereas SW = 75% in the contrarian strategy for the same RER range. We apply this rule in 2040, 2060, 2080, and 2100 only for paths in which the ABM does not stop (financial equilibrium is not attained). We retain the previous year’s SW for paths in which the ABM stops. In addition, we do not control the RR, as the ABM does not control it.Footnote 19 We assume that the inflation rate, wage growth rate, and investment returns follow a random walk. In this sense, these stochastic variations do not work to the advantage or disadvantage of portfolio insurance and the contrarian strategy, but the characteristics of the strategies (the rules for changing the SW shown in Table 5) determine the impact of each strategy on the RR and RER.Footnote 20 We run a Monte Carlo simulation with 9000 replications, similar to our main analysis.

Table 5 Stock weight for portfolio insurance and contrarian strategies.

Table 6 presents the simulation results for the portfolio insurance and the contrarian strategy. We include the results of SW = 50% (referred to as the base case) for reference and display the same statistics for RR and RER in 2050, 2075, 2100, and 2115 from Panel A of Table 2. The results reveal that portfolio insurance allows us to control RR and RER risks while giving up on attaining the median values of the base case. The VaR and CVaR in the portfolio insurance are higher than in the base case as the year gets later, indicating that the portfolio insurance can successfully manage the downside risks of RER and RR. The 90th percentile values for RR and RER are also higher than in the base case. By contrast, the 50th percentile values are lower than in the base case. The contrarian strategy leads to opposite results. It can achieve higher median values for the RER and RR, whereas the downside risk increases.

Table 6 Simulation results for dynamic investment strategies.

The results indicate that portfolio insurance lowers the downside risk of the RER and RR, and, therefore, it is a candidate strategy for achieving long-term stability in public pensions. However, there are also problems with the introduction of portfolio insurance: The size of the reserve funds of the Japanese public pension system is enormous. To avoid downside risk in adverse market conditions, portfolio insurance for such large public pension funds could accelerate the destabilisation of capital markets. One example of portfolio insurance damaging stock markets is the 1987 stock market crash, Black Monday. Investors with portfolio insurance contributed to large market decline (Leland and Rubinstein, 1988): When stock prices fell sharply, investors with portfolio insurance began to sell their holdings to limit their losses. These sales accelerate the further decline in stock prices. Instead, the public pension reserve fund is expected to be a contrarian in such an unfavourable situation and stabilise the market. This study assumes that the investment behaviour of the public pension fund does not influence stock prices: the effects of the use of dynamic strategies by large public pension funds are not fully considered. Whether portfolio insurance, a contrarian strategy, or a constant mix strategy is appropriate for investing in public pensions in Japan needs further discussion. Moreover, the dynamic strategy considered in this subsection involves a change in stock weighting every 20 years, and we examine only one example of an application for portfolio insurance and the contrarian strategy. In the future, we would like to discuss which rule is more desirable for managing the downside risks of RR and RER through these strategies.

Conclusions

Using the stochastic simulation method and applying the government’s verification programme for public pensions, we examine the effect of stock investments on the RR (replacement rate) and RER (reserve-to-expenditure ratio). We find that, first, the 50th percentile values of the RR for any SW (stock weight) are below 50%, which indicates that the simple scenario-based prediction the government uses overestimates the RR. Second, the risk and return of RR and RER depend on the SW. The analysis considering the SW and ABM (automatic balancing mechanism) also increases complexity. A higher SW can pursue a higher RR and RER, on average. However, as SW increases, sample paths for which the ABM does not stop during the simulation periods increase. The VaR and CVaR for the RER can fall below zero, especially for SW = 75% or more. By contrast, for SW = 0% (domestic bonds only), the 50th percentile of the RR is lower because the investment return is insufficient. As for the desirable SW considering the risk and return of the RR and RER, moderate risk-taking around SW = 50% and 75% provides the highest 50th percentile values for the RR and RER. The VaR and CVaR for RR and RER are the permissible values compared to the other SWs. Therefore, the moderate risk-taking of the reserve funds should be appropriate by accepting certain severe risk situations that may occur: The probability of both RR and RER being below their VaR remains at 5%. Third, our results show that there is a certain probability that the pension reserve fund may be depleted, and the government’s obligation to compensate for the shortage of reserve funds may be a significant burden in some cases. Fourth, portfolio insurance can limit the downside risks of RR and RER, while a contrarian strategy increases the median value of RR and RER. However, determining the appropriate level of the SW, considering the risk and return of the RR and RER, is not a simple process. This study assumes that the reserve fund takes a short position with a predetermined SW when it is exhausted. According to the MHLW (2020a), “a situation in which the reserve fund is exhausted and hinders the payment of benefits should not exist even temporarily.” However, we assume that the ABM continues borrowing when the reserve fund is exhausted. Our results reveal that the RER recovers to zero in many cases and often reaches one in the final year, even when it temporarily falls below zero. Currently, the government is considering switching the public pension system to a full pay-as-you-go system once the pension reserve funds are depleted. This will lead to an immediate and substantial decline in the RR. Our findings suggest that this situation could become a reality in the near future. Therefore, the government should consider borrowing to delay the system’s transition.Footnote 21