Introduction

Inflation reflects the rise in price level in a society for a certain period. A lower inflation level is detrimental to the growth of social wealth, while a higher inflation level leads to a decrease in social wealth, and a given society needs a moderate level of inflation to promote sustained wealth growth. Thus, inflation stands as one focus in academia because of its diverse economic and financial aspects and has become one of the most important goals of macroeconomic regulation in various countries (Woodford and Walsh, 2005; Yellen, 2017; Effah Nyamekye and Adusei Poku, 2017). Recently, the urgency of research on inflation has further strengthened and has received increasing attention from central banks due to multiple factors, such as COVID-19, the excessive issuance of global currency, carbon neutrality expectations, and variations in international crude oil prices.

The explanation and forecasting of inflation are two attractive aspects of research on inflation and have made numerous achievements. For example, Fama (1975) and Zakaria et al. (2021) highlight the importance of understanding inflation with traditional economic and financial factors, i.e., interest rates and crude oil; McKnight et al. (2020) and Choi (2021) adopt sophisticated models to forecast actual inflation and argue for the suitability of the selected models in inflation forecasting. However, the shortcomings of current research are also evident. For example, current research overly emphasizes the role of traditional economic and financial factors in inflation determination and lacks in-depth investigations on the roles of emerging factors (Aparicio and Bertolotto, 2020). Additionally, the commonly adopted New Keynes Phillips Curve (NKPC)-based models for inflation forecasting have reached a limit, and their forecast accuracy is weak (Mavroeidis et al. 2014, Aparicio and Bertolotto, 2020).

Thus, discovering novel factors to explain and accurately forecast inflation has become an interesting and important issue. However, relevant investigations seem to be limited, which represents a research gap and motivates the authors to identify and explore novel factors and help fill the gap.

Investor attention, generated from behavioural finance, is an emerging issue in current research on finance and economics and is theoretically connected with inflation. First, investor attention affects asset pricing in diverse ways (Chen et al. 2022; Cai et al. 2022; Liu et al. 2022), and inflation is a direct reflection of general price variations in a society. Second, investor attention reflects individuals’ information acquisition (Chen and Lo, 2019), and information is one of the most critical factors in forming inflation expectations (Larsen et al. 2021). According to the theory of NKPC (Friedman, 1968; Chen and Lo, 2019), inflation expectations affect inflation. Therefore, investor attention and inflation are connected.

Despite the theoretical connections, few studies discuss the connections between investor attention and inflation, making this paper of significance to research both inflation and behavioural finance. In this context, the research objective of this paper is to explore whether investor attention can empirically affect inflation. Consequently, this paper aims to combine behavioural finance and macroeconomics to explore what kind of relationship is hidden between the two issues. Based on the current investigations of explaining and forecasting inflation, the theoretical connection between investor attention and inflation, the research question, and the research objective, this paper proposes three hypotheses:

H1: Investor attention can empirically explain inflation.

H2: Investor attention can empirically forecast inflation.

H3: Investor attention affects inflation through its influence on inflation expectations.

The originality of this paper lies in its connection of investor attention and inflation, thus providing new empirical insight into macro inflation with respect to micro-investor attention. To the best of our knowledge, this paper makes the following contributions to the literature regarding investor attention and inflation. First, this paper may represent the first attempt to empirically explain and forecast inflation from the aspect of investor attention, thus extending the application of emerging investor attention to conventional macroeconomics. Second, this paper provides additional empirical evidence that inflation is influenced by not only conventional factors but also emerging factors, for example, investor attention. The empirical process can be summarized as follows. First, this paper constructs a VAR model and implements the corresponding Granger causality test to explore the explanatory power of investor attention on inflation. Second, based on the models for explaining inflation with investor attention, we construct predictive models to further compare and evaluate the forecast accuracy with commonly used techniques in inflation forecasting during short and long horizons. Third, this paper updates the data and model specifications to implement robustness checks to ensure the rigor required of an academic research paper. Finally, given the connections between investor attention and inflation expectations, this study empirically tests the correlation between inflation expectations and investor attention.

This study is structured as follows. Section 2 provides a brief literature review on inflation and investor attention. Sections 3 and 4 present the data and methodologies, respectively. The empirical results are shown in Section 5. Section 6 describes the robustness checks. Section 7 further discusses investor attention and inflation expectations. Section 8 concludes the paper.

Literature review

This paper focuses on inflation from two perspectives, i.e., inflation explanation and forecasting; in fact, numerous relevant investigations have been performed. For example, Canova and Ferroni (2012) attribute variations in inflation to monetary authorities. Murphy (2014) and Friedrich (2016) explain inflation by the Phillips curve and its derivative models. Hakkio (2009), Monacelli and Sala (2009), Ciccarelli and Mojon (2010), and Mumtaz and Surico (2012) argue that several factors can influence inflation, such as industrial production, unemployment rates, and nominal wages. Altissimo et al. (2009) investigate inflation from the perspective of the aggregation mechanism, and Forbes et al. (2021) use a ‘trendy’ approach to understand inflation dynamics. More recently, Bouras et al. (2023) and Liu and Ma (2023) attribute inflation to the cost of firms and the exchange rate, respectively. Tillmann (2024) argues that supply chain pressure contributes to inflation. Inflation forecasting has also been a focus of research and is a popular topic in academia. Specifically, econometric and statistical forecasting models are preferred by researchers. For example, Marcellino et al. (2003), Ang et al. (2007), Clements and Galvão (2013) select the VAR model, and Forni et al. (2003), Eickmeier and Ziegler (2008), and Hall et al. (2023) adopt the dynamic factor analysis to predict inflation issues. Nakamura (2005), Szafranek (2019), and Almosova and Andresen (2023) use a neural network, and Wright (2009) and Hauzenberger et al. (2023) employ the Bayesian method and nonlinear dimension reduction techniques, respectively. Survey-based inflation forecasts have also been studied (Croushore, 2010; Faust and Wright, 2013; Huber et al. 2023). Moreover, due to the lead-lag relationship in the Phillips curve model, Chletsos et al. (2016), McKnight et al. (2020), and Bańbura and Bobeica (2023) forecast inflation based on the Phillips curve. Recently, advanced machine learning (ML) technology has been introduced to inflation forecasting. For example, Medeiros et al. (2021), Aras and Lisboa (2022), and Araujo and Gaglianone (2023) demonstrate the advantages of the ML model for predicting inflation in the United States, Turkey and Brazil, respectively; Rodríguez-Vargas (2020) uses two variants of k nearest neighbour, random forest, extreme gradient enhancement, and long short-term memory (LSTM) networks to evaluate inflation forecasts in Costa Rica.

Some novel factors have also been introduced in research on inflation. For example, Chen et al. (2014) introduce commodity price aggregates to inflation forecasts. Bec and De Gaye (2016) and Balcilar et al. (2017) incorporate metal price series and oil price forecast errors in predictive models for inflation and argue that such additions benefit inflation forecasting. Aparicio and Bertolotto (2020) combine online price indices and inflation and show that online price indices can forecast inflation well. Clements and Reade (2020), Rambaccussing and Kwiatkowski (2020), Mazumder (2021), and Simionescu (2022) investigate inflation from the perspective of behavioural finance, i.e., investor sentiment; however, the results seem contradictory. As the prices of agricultural products are important components of the whole price level, researchers have introduced agricultural products to inflation forecasts (Tule et al. 2019; Fasanya and Awodimila, 2020). Some factors that seem to be almost impossible to relate to inflation, such as climate variables and carbon market returns, have also been shown to significantly improve the accuracy of inflation forecasts (Boneva and Ferrucci, 2022; Xu et al. 2023). The list is far from exhaustive and exemplifies how active the field of connecting novel factors and inflation has been in recent years.

Behavioural finance has been shown to be an important factor in diverse financial and economic aspects and has become a research hotspot exhibiting numerous academic achievements in recent years (Adra and Barbopoulos, 2018; Audrino et al. 2020). Current investigations are based on two main aspects. The first focus relies on investor sentiments. For example, Fu et al. (2015) derive a sentiment-adjusted Markowitz efficient frontier; Kim and Ryu (2021) argue that investor sentiment is a determinant of investors’ trading decisions and behaviours; and Shen et al. (2023) investigate the effect of investor sentiment on new energy stock returns as well as value at risks (VaR) before and during COVID-19. Ryu et al. (2023) investigate the effects of sentiment on mispricing. Bashir et al. (2024) reveal a positive significant relation between stock price crash risk and investor sentiment. Another focus is investor attention. For example, Vozlyublennaia (2014) and Zhang et al. (2021) argue that investor attention can not only explain but also accurately forecast the stock market. Taking investor attention as a common financial variable, Han et al. (2018) and Wu et al. (2019) prove the importance of investor attention in the foreign exchange market, and Li et al. (2019), Chen et al. (2020), Zhang et al. (2022), Zhou et al. (2022), and Zhou et al. (2023) argue that investor attention matters in the crude oil market, internet financial market, commodity futures market and carbon market, respectively. Investor attention has also been proven to be an important pricing factor in the cryptocurrency market (Ibikunle et al. 2020; Zhu et al. 2021; Smales, 2022; Wan et al. 2023). Recently, investor attention and corporate ESG performance have been combined, and the results indicate that investor attention can significantly improve the ESG standards of listed companies (Zhang and Zhang, 2024). Contagion spillover based on investor attention has also been studied by scholars. For example, Li (2024) investigates the role of investor attention on the price of petroleum products (APPP) in forecasting Chinese stock market volatility.

As seen above, several novel factors are extending to research on inflation. However, relevant investigations are quite limited. Behavioural finance has developed rapidly and has shown to be important in finance and economics. On the one hand, investor sentiment, generated from behavioural finance, shows different empirical results regarding inflation. On the other hand, no investigation has attempted to link investor attention and inflation despite the fact that the two issues are naturally connected. This research gap sparks our interest in exploring the role of investor attention in inflation determination. Thus, in this paper, these connections are comprehensively researched. Specifically, this paper investigates inflation explaining and forecasting from the perspective of investor attention to enrich understanding in both behavioural finance and macroeconomics.

Data

This paper selects the Google Search Volume Index (GSVI) from Google Trends to represent novel investor attention rather than other indicators, i.e., extreme return, abnormal trading volume, advertising expenditure, and media coverage, as GSVI shows advantages in terms of timeliness and information comprehensiveness (Zhang et al. 2022). As we aim to analyze the connection between investor attention and inflation in the United States, we set the search area to ‘America’ and directly searched for the keyword ‘inflation’ from the beginning of the GSVI in January 2004 to July 2020 to obtain data on monthly investor attention. For inflation, this paper chooses the consumer price index (CPI) inflation, as this indicator has a relatively high frequency and sufficient data for empirical investigation at a monthly frequency (Liu and Smith, 2014). Specifically, this study chooses the seasonally adjusted monthly CPI inflation for all urban consumers; as this indicator is officially more useful, we downloaded the related data freely from the Federal Reserve Economic Data. All the two series are transferred to logarithmic differences and are named with \({{Att}}_{t}\) and \({{Inf}}_{t}\) for further empirical research.

Some basic information on investor attention (\({{Att}}_{t}\)) and the CPI inflation rate (\({{Inf}}_{t}\)) is shown in Fig. 1 and Table 1, respectively. As shown in Table 1, the mean CPI inflation rate is positive, while the mean investor attention is negative, suggesting that investor attention and the inflation rate may be negatively connected. Furthermore, as we aim to analyse the selected data under the framework of the VAR model, an ADF-KPSS-PP joint stationary test is implemented to avoid the pseudo-regression phenomenon. The results show that the time series for the inflation rate or investor attention is stationary and can be directly used for VAR modelling. In addition, as shown in Fig. 1, the extreme value of the CPI inflation rate always appears after the peak value in investor attention, which may further imply that investor attention affects the inflation rate. From both Fig. 1 and Table 1, it is obvious that the time series for investor attention fluctuates more than that for the inflation rate.

Fig. 1
figure 1

Variation of investor attention and actual inflation from January 2004 to July 2020.

Table 1 Basic information on CPI inflation rate and investor attention.

In this paper, as we analyze inflation explaining and forecasting, the full sample is divided into two parts. The first part, from January 2004 to August 2016, is used as an in-sample period for inflation explained through investor attention, and the remaining period, from September 2016 to July 2020, is used for out-of-sample forecasting.

Methodologies

VAR and Granger causality test

The VAR model is a linear model that allows the variables to be depicted by several lagged indicators and is widely used in exploring the dynamic connections between financial variables (Guidolin and Hyde, 2012; Zhang and Lin, 2019). Thus, this paper adopts the widely used VAR model to explore the relationships between investor attention and the CPI inflation rate. Specifically, the VAR model used in this paper is specified by Eqs. (1) and (2) as follows:

$${{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+\cdots +{\alpha }_{n1}{{Inf}}_{t-n}+{\beta }_{11}{At}{t}_{t-1}+\cdots +{\beta }_{n1}{At}{t}_{t-n}+{\varepsilon }_{t}$$
(1)
$${At}{t}_{t}={\alpha }_{02}+{\alpha }_{12}{{Inf}}_{t-1}+\cdots +{\alpha }_{n2}{{Inf}}_{t-n}+{\beta }_{12}{At}{t}_{t-1}+\cdots +{\beta }_{n2}{At}{t}_{t-n}+{e}_{t}$$
(2)

where \({{Inf}}_{t}\) represents the inflation rate at time \(t\), \({{Att}}_{t}\) refers to the current investor attention, n is the lag length for the VAR model, and \((t-i)\) represents the operator for the time lag. The linear Granger causality test examines the joint significance of all the lagged terms of one variable in a certain equation in the framework of the VAR model. Specific to the above VAR model represented by Eqs. (1) and (2), the interested Granger causality test is to determine whether the coefficients of (\({\beta }_{11},\ldots ,{\beta }_{n1}\)) are jointly equal to 0. Specifically, the significance of the Chi-square (\({\chi }^{2}\)) statistic is used to evaluate the results of the Granger causality test.

Linear regression models for inflation explanation and forecasting

As documented by previous studies, the oil market is a crucial external factor in inflation determination (Bec and De Gaye, 2016). Thus, in this paper, we control for non-negligible factors in the regression model to fully understand the role of investor attention in inflation. Specifically, according to Aparicio and Bertolotto (2020) and Zhou et al. (2023), the factor is introduced to the regression model based on the VAR model. The detailed regression model is shown in Eq. (3). Notably, the regression model incorporates the interaction terms between investor attention and the oil market as well:

$$\begin{array}{l}{{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+\ldots +{{\rm{\alpha }}}_{{\rm{n}}1}{{Inf}}_{{\rm{t}}-{\rm{n}}}\\\qquad\quad+\,{{{\rm{\beta }}}_{11}{Att}}_{{\rm{t}}-1}+\ldots +{{\rm{\beta }}}_{{\rm{n}}1}{{Att}}_{{\rm{t}}-{\rm{n}}}+{\lambda }_{11}{{oil}}_{t-1}+\ldots \\\qquad\quad+\,{\lambda }_{n1}{{oil}}_{t-n}+{\rho }_{11}{{Att}}_{t-1}{\times {oil}}_{t-1}+\ldots \\\qquad\quad+\,{\rho }_{n1}{{Att}}_{t-n}\times {{oil}}_{t-n}+{e}_{t}\end{array}$$
(3)

where \({{oil}}_{t-i}\) represents the lagged oil market return and \({{Att}}_{t-i}\times {{oil}}_{t-i}\) represents the lagged interaction term between investor attention and the oil market return. As Eqs. (1) and (3) contain the ‘lead-lag’ relationship between the inflation rate and other variables, this paper extends these two equations and introduces the following out-of-sample forecasting models in Eqs. (4) and (5):

$$\widehat{{{Inf}}_{t}}=\widehat{{\alpha }_{01}}+\widehat{{\alpha }_{11}}{{Inf}}_{{\rm{t}}-1}+\ldots +\widehat{{{\rm{\alpha }}}_{{\rm{n}}1}}{{Inf}}_{{\rm{t}}-{\rm{n}}}+{\widehat{{{\rm{\beta }}}_{11}}{Att}}_{{\rm{t}}-1}+\ldots +\widehat{{{\rm{\beta }}}_{{\rm{n}}1}}{{Att}}_{t-n}$$
(4)
$$\begin{array}{l}\widehat{{{Inf}}_{t}}=\widehat{{\alpha }_{01}}+\widehat{{\alpha }_{11}}{{Inf}}_{t-1}+\ldots +\widehat{{\alpha }_{n1}}{{Inf}}_{t-n}\\\qquad\quad+\,{\widehat{{\beta }_{11}}{Att}}_{t-1}+\ldots +\widehat{{\beta }_{n1}}{A}_{t-n}+\widehat{{\lambda }_{11}}{{oil}}_{t-1}+\ldots \\\qquad\quad+\,\widehat{{\lambda }_{n1}}{{oil}}_{t-n}+\widehat{{\rho }_{11}}{{Att}}_{t-1}\times {{oil}}_{t-1}+\ldots \\\qquad\quad+\,\widehat{{\rho }_{n1}}{{Att}}_{t-n}{\times {oil}}_{t-n}\end{array}$$
(5)

where \(\widehat{{{Inf}}_{t}}\) is the predicted inflation rate. To accurately forecast the inflation rate, in this study, the rolling window forecasting method is selected. In other words, in the case of out-of-sample forecasting, the window size remains fixed when the estimation window rolls forwards (Zhou et al. 2022, Zhou et al. 2023).

Forecasting evaluation for out-of-sample data

In this paper, as we explore the role of investor attention in inflation forecasting, several evaluation indicators must be introduced. Following previous studies (Zhou et al. 2022; Zhou et al. 2023), this paper compares and assesses the accuracy of different predictive models by calculating the out-of-sample R squared (\({R}_{{oos}}^{2}\)), mean squared forecast error (MSFE), and MSFE-adjusted statistics. \({R}_{{oos}}^{2}\) is obtained by Eq. (6):

$${R}_{{oos}}^{2}=1-\frac{{\sum }_{k=t+1}^{T}{({{Inf}}_{k}-\hat{{{Inf}}_{k}})}^{2}}{{\sum }_{k=t+1}^{T}{({{Inf}}_{k}-\bar{{{Inf}}_{k}})}^{2}}$$
(6)

where T is the length of the full sample and \(t\) is the size of the rolling window. \({{Inf}}_{k}\) is the actual inflation rate. \(\hat{{{Inf}}_{k}}\) represents the forecasted value of the inflation rate from the predictive model. \(\widehat{{{Inf}}_{k}}\) denotes the predicted inflation rate of the benchmark forecasting model. A positive \({R}_{{oos}}^{2}\) indicates that the predictive model outperforms the benchmark model. Commonly, benchmark forecasts of inflation are obtained by three methods, i.e., forecasts from the autoregression model (AR), random walk (RW) model and Federal Reserve Greenbook (Adebiyi, 2007; Álvarez-Díaz and Gupta, 2016; Rossi and Sekhposyan, 2016). In this paper, we do not select the Federal Reserve’s Greenbook despite the fact that the forecast is made by the experts of monetary policy authority. The reason is as follows. The estimations of the Federal Reserve’s Greenbook are released to the public with a 5-year delay; if we choose the indicator as the benchmark, given the appearance of the GSVI in 2004, the selection may cause the data to be outdated and the sample size to be small. Thus, the remaining two models, i.e., AR and RW, are selected as the benchmarks. The MSFE can be obtained from Eq. (7):

$${MSFE}=\frac{{\sum }_{k=t+1}^{T}{({{Inf}}_{k}-\hat{{{Inf}}_{k}})}^{2}}{T-t}$$
(7)

Based on MSFE, Clark and McCracken (2001) developed the MSFE-adjusted statistic. The MSFE-adjusted statistic can further identify whether the out-of-sample forecasts are significant. Specifically, the MSFE-adjusted statistic is measured by Eq. (8):

$${MSF}{E}_{{adj}}={{MSFE}}_{a}-{{MSFE}}_{b}+\frac{{\sum }_{k=t+1}^{T}{(\widehat{{{Inf}}_{k,a}}-\widehat{{{Inf}}_{k,b}})}^{2}}{T}$$
(8)

where \({{MSFE}}_{a}\) and \({{MSFE}}_{b}\) denote the MSFE statistics of the predictive model and the benchmark model, respectively. \(\hat{{{Inf}}_{k,a}}\) and \(\hat{{{Inf}}_{k,b}}\) represent the forecast values of the inflation rate from the predictive model and the benchmark model, respectively.

Empirical results

Explanation of inflation

The basic empirical process shows that the optimal lag length in the VAR model is 1. In other words, setting the lag length to 1 is sufficient to ensure the stability of the VAR model. Based on this lag length, this paper estimates the VAR model and implements the corresponding Granger causality test for the in-sample period. The related estimation results are shown in Table 2. To further illustrate the stability of the VAR model, the AR root test is implemented, and the results are shown in Fig. 2. The AR root test shows that all the characteristic roots are in the unit circle, indicating that the established VAR model has good stability.

Table 2 VAR estimation results and Granger causality test.
Fig. 2
figure 2

A graphic description of the AR root test under the framework of VAR(1) model.

Two interesting discoveries are shown in Table 2. First, current investor attention on inflation indeed has a negative impact on the inflation rate in the next observation period, as the value of \({{Att}}_{t-1}\) in the equation for the inflation rate in the VAR model is significantly negative. Second, the results from the Granger causality test show that investor attention to inflation does indeed Granger cause changes in the inflation rate. The VAR model allows researchers to understand the inflation rate’s reaction to the shock from investor attention under the framework of the impulse response function (IRF). Thus, this paper implements the related impulse response analysis and shows the results in Figs. 3, 4. As shown in Fig. 3, once the inflation receives one unit shock from investor attention, the impact may last approximately 7 months.

Fig. 3
figure 3

Response of inflation rate to investor attention from January 2004 to August 2016 under VAR model.

Fig. 4
figure 4

Response of investor attention to the inflation rate from January 2004 to August 2016 under VAR model.

Equation (3) controls for the factor of the oil market and reinvestigates the relationship between investor attention and the inflation rate. According to Wang et al. (2023), the Brent oil futures market is an important market in the global oil market. Thus, this paper collects the returns of Brent oil futures to represent the oil market. The estimation results of Eq. (3) are shown in Table 3. As shown, after controlling for the oil market factor in the last observation period, investor attention still has a significant negative impact on the current inflation rate.

Table 3 Estimation results of Eq. (3).

In summary, investor attention has a negative impact on the inflation rate. This phenomenon may be attributed to the following reasons. First, when investor attention on inflation increases, a society may suffer from inflation in the current period, which may indicate that the public is concerned with currency devaluation in the future (Borensztein and De Gregorio, 1999). Therefore, investors may increase expenditures in the current period by converting paper money to commodities to purchase more products with the same amount of money. Thus, in the next period, total consumption demand in a society may decrease. According to the basic theory of demand and supply in macroeconomics, the price level will decrease in the next observation period. Thus, the inflation rate decreases. Second, according to the investor recognition hypothesis and limited attention theory (Merton, 1987), investors are aware of only a subset of the available assets in informationally incomplete markets; for neglected assets, a return premium is needed. Thus, investor attention should be negatively connected with asset returns. In addition, numerous investigations have empirically documented that an increase in investor attention is followed by a decrease in returns (Smales, 2021; Piñeiro-Chousa et al. 2020). This result implies that prices are decreasing; in other words, the price decreases, resulting in a negative correlation between investor attention and the inflation rate. The results on the significance of investor attention further demonstrate the importance in macroeconomics of considering not only traditional factors but also investor psychology and behaviour when investigating variations in the inflation rate.

All the above analyses show that investor attention has excellent explanatory power regarding the macro inflation rate. Furthermore, due to their significant explanatory power, the results can help authorities guide public attention to stabilize the inflation rate. However, this remarkable explanatory power is not enough to illustrate the crucial role of investor attention in inflation determination, as Welch and Goyal (2008) argue that out-of-sample tests seem to be more precise. Thus, the role of investor attention in inflation determination deserves further investigation. It is also worthwhile to explore whether investor attention can be used to forecast the inflation rate in the out-of-sample period. We show these results in the subsequent subsection.

Inflation forecasting

This paper first implements short-horizon forecasting to explore the predictive power of investor attention for the inflation rate, similar to the findings of numerous studies (Zhu et al. 2021; Zhang et al. 2022; Zhou et al. 2022; Zhou et al. 2023). Specifically, this paper forecasts the inflation rate for the next month based on current information about the inflation rate and investor attention. We set the lag length of the AR benchmark model to 1, as other lag lengths do not show an advantage in forecasting (Aparicio and Bertolotto, 2020). The out-of-sample forecasts are implemented based on Eqs. (4) and (5), and the accuracy evaluation is presented in Table 4. As shown in Table 4, the predictive models with investor attention significantly outperform the two benchmark models in all cases, as all the \({R}_{{oos}}^{2}\) values are greater than 0.05 and the MSFE-adjusted statistics are significant.

Table 4 Forecast evaluation for the basic out-of-sample forecast.

The above results indicate that investor attention is a nonnegligible factor in inflation determination, as investor attention can be used to explain and forecast inflation. However, as Zhou et al. (2023) note, models that work well for short-horizon forecasts may not perform well for long-horizon forecasts. Thus, it is also useful to explore whether investor attention can be used to forecast inflation over long horizons. Inspired by Zhu et al. (2021), Zhou et al. (2022) and Zhou et al. (2023), we implement relevant investigations. Specifically, we forecast the inflation rate two or three months ahead and compare the forecast performance with that of the AR (1) and RW benchmark models. The results are presented in Table 5. As shown, in the long-horizon forecasts, the predictive models incorporating investor attention significantly outperform the AR (1) and RW benchmark models, as all the \({R}_{{oos}}^{2}\) values are larger than 0 and the MSFE-adjusted statistics are significant. The empirical results for long-horizon forecasts further confirm the importance of investor attention to inflation.

Table 5 Forecast evaluation for the long horizons.

Robustness checks

The above results demonstrate the important role of investor attention in inflation. However, the results are obtained through investor attention on certain keywords of ‘inflation’ and variable or certain model specifications, which may lack the necessary rigor for an academic research paper. Thus, this paper implements two robustness checks. The first check is to update investor attention with other keywords searched by Google Trends. The second check is to alter the variable or model specifications. The related robustness checks are shown in the following subsections.

Update keywords

We search for another keyword, ‘monetary policy’, which is implemented by the central bank and is closely related to inflation. Based on the two keywords ‘inflation’ and ‘monetary policy’, we reperformed all the above empirical investigations to test our empirical results that investor attention can explain and forecast inflation. The estimation results are shown in Table 6 and Table 7.

Table 6 Robustness checks for inflation explaining after altering the investor attention.
Table 7 Robustness checks for inflation forecasting.

As shown in Table 6, investor attention still has a negative impact on inflation and is still the Granger cause of inflation. In other words, the conclusion that investor attention can explain the inflation rate does not change.

Table 7 shows several interesting findings. First, if the RW model is selected as the benchmark model, both predictive models can outperform the benchmarks. Second, if the AR (1) model were selected as the benchmark, the forecasting results would vary. If the GSVI is represented by the sum of inflation and monetary policy, the predictive model can outperform the benchmark model; furthermore, the forecast accuracy is even greater than the results shown in Table 4 and Table 5. However, if the GSVI is represented by monetary policy, the predictive models perform worse than the benchmark. In summary, Table 7 shows that updating the search term does not change the conclusion that the GSVI is a crucial factor in inflation determination.

Update model specifications

In this section, the variables and model specifications used to illustrate the impacts of investor attention on the inflation rate are adjusted to ensure adequate rigor for an academic research paper. Specifically, we make three changes. The first modification involves adding another control variable, i.e., the real interest rate, as the real interest rate is also an important factor in explaining inflation (Lanne, 2006). We collect real interest rate data from Federal Reserve Economic Data and update the model specification as follows in Eqs. (9) and (10). The second modification is to consider the higher moment of investor attention, as high-order moments are proven to be crucial factors for financial variables (Zhou et al. 2022). The model specification is shown in Eq. (11). The third modification involves changing the regression model to an interactive model according to Vozlyublennaia (2014); the model specification is shown in Eqs. (12)-(13).

$$\begin{array}{l}{{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+{{\beta }_{11}{Att}}_{t-1}\\\qquad+\,{\gamma }_{n1}{{oil}}_{t-1}+{\delta }_{n1}{{Rate}}_{t-1}+{e}_{t}\end{array}$$
(9)
$${{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+{{\beta }_{11}{Att}}_{t-1}+{\gamma }_{n1}{{oil}}_{t-1}+{\delta }_{n1}{{Rate}}_{t-1}+{\eta }_{n1}{{Att}}_{t-1}\times {{oil}}_{t-1}+{\lambda }_{n1}{{Att}}_{t-1}{\times {Rate}}_{t-1}+{e}_{t}$$
(10)
$${{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+{{\beta }_{11}{Att}}_{t-1}+{\gamma }_{n1}{{Att}}_{t-1}^{2}+{e}_{t}$$
(11)
$${{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+{{\beta }_{11}{Att}}_{t-1}+{\gamma }_{n1}{{Inf}}_{t-1}\times {{Att}}_{t-1}+{e}_{t}$$
(12)
$${{Inf}}_{t}={\alpha }_{01}+{\alpha }_{11}{{Inf}}_{t-1}+{{\beta }_{11}{Att}}_{t-1}+{\gamma }_{n1}{{Att}}_{t-1}\times D({{Inf}}_{t-1} < 0)+{e}_{t}$$
(13)

The detailed regression results of the above equations are provided in Table 8. As shown in Table 8, the coefficient for investor attention is still significantly negative. Thus, we may conclude that changes in the model specification do not change our conclusion that investor attention has excellent explanatory power for inflation. In summary, the robustness checks in subsections 6.1 and 6.2 do not change our conclusion that investor attention has a nonnegligible influence on the inflation rate. Thus, H1 and H2 hold.

Table 8 Robustness checks for model specifications.

Further discussion: investor attention and inflation expectations

Investor attention reflected by searching is a key step in obtaining information to form inflation expectations in the current society, and inflation expectations are a crucial factor for determining actual inflation under the NKPC (Friedman, 1968; Chen and Lo, 2019; Larsen et al. 2021). Thus, it is natural to link investor attention and inflation expectations to better understand the impact of investor attention on inflation. However, relevant studies are lacking. This paper collects data on inflation expectations made by the University of Michigan, which is a survey-based inflation expectation and has been widely used in previous studies (Luoma and Luoto, 2009; Malmendier and Nagel, 2016). The measurement of inflation expectations has several advantages. For example, the survey covers a wide range of daily life for American households and can be used to evaluate the dynamics of inflation expectations and potential consumption in the future. In addition, the measurement is considered the most representative indicator of inflation and consumer behaviour. This paper visualizes the connection between investor attention and inflation expectations in Eq. (14). The model specification is based on the following considerations. According to Friedman (1968), during the formation of inflation expectations, information processing takes a certain amount of time; thus, time lags are introduced. Information processing capabilities vary for individuals; considering the sample of our dataset, twelve lags are introduced.

$${{Exp}}_{t}=c+\mathop{\sum}\limits_{i=1}^{12}{{\beta}_{i}{Att}}_{t-i}+{\varepsilon }_{t}$$
(14)

In Eq. (14), \({{Exp}}_{t}\) refers to the inflation expectation, \({Att}\) is the operator for investor attention, and i is the time lag. We estimate Eq. (14) and show the results in Table 9. As shown below, the coefficients for investor attention are significant. However, these results are not sufficient to conclude that investor attention affects inflation expectations, as investor attention is regarded as a financial time series and potential multicollinearity exists. Thus, this paper further implements a variance inflation factor (VIF) test, and the results are shown in Table 10. According to Table 10, the regression model does not have serious multicollinearity problems because the values for the VIF tests are smaller than the commonly used criteria, i.e., 5. Thus, the results in Table 9 are significant, investor attention is a crucial external factor for inflation expectations, and H3 is supported.

Table 9 Estimation results of Eq. (14).
Table 10 VIF test.

Conclusions

The novelty of this paper is its confirmation of the important role of investor attention in inflation, which broadens the research fields of both behavioural finance and macroeconomics. The empirical process and results can be summarized as follows. First, we select several linear model specifications to conclude that investor attention and inflation are negatively connected. Second, we extend models of inflation explaining out-of-sample forecasts. The empirical results show that predictive models that incorporate investor attention can outperform the AR (1) and RW benchmark models in both short and long horizons. Third, this paper implements robustness checks, and investor attention can still explain and forecast inflation. Fourth, this paper proves that investor attention affects inflation through inflation expectations. In summary, investor attention matters in inflation determination.

The results shed light on several perspectives. For example, on the one hand, central banks may guide public opinions to control inflation to some extent; on the other hand, economic participants may adopt simple models to forecast inflation to avoid potential losses. However, deficiencies exist. First, adopting sophisticated models to further test the linear or nonlinear roles of investor attention is of interest. Second, this paper constructs a linear regression model for inflation expectation and investor attention. The information process is complex, and a linear model may not be applicable, considering alternative models and cumulative values of investor attention are also interesting issues. At a minimum, the two abovementioned deficiencies deserve further investigation.