Introduction

The empirical literature has firmly established the negative effect of poor exchange rate policy on economic growth and development. However, little or no attention has been paid to the relative importance of the exchange rate as a determinant of economic growth. Meanwhile, factor priority is vital for the examination of the determinants of economic growth.

Meanwhile, the need for desirable exchange rate policies become one of the major macroeconomic objectives of all countries. Hence, empirical studies have focused on the effect of undervaluation and/or overvaluation of currencies on sustainable economic growth. For instance, Rodrik (2008) found that exchange rate undervaluation stimulates economic growth in developing countries. He also claimed that his findings are robust for different measures of the exchange rate and different estimation techniques. The critical questions needed to be answered about the findings of Rodrik’s work are; how relevant is undervaluation (or other measures of the exchange rate) as a stimulant of economic growth? Would the findings be valid for all forms of the functional relationship between the exchange rate and economic growth? Does the priority of the exchange rate remain the same for economic growth in developed, emerging (such as BRICS) and underdeveloped countries? To the best of the knowledge of the authors, all these questions are still begging for answers despite the plethora of studies regarding the effect of the exchange rate on economic growth.

Earlier, Easterly (2003) in his cynical studies of the cross-country growth literature, argued that high overvaluation would have a negative impact on economic growth which has been consistently intensified by economic and political instability. He also observed that the problem of overvaluation of currencies has been related to foreign currency insufficiency, unsustainable current account, fluctuation in macroeconomics dynamics imbalance in payments and deficit. These problems in one way or the other have a damaging impact on economic growth.

Since then, there have been numerous studies on the relationship between the real exchange rate and economic growth. Some of the studies focus on the impact of exchange rate regime on economic growth (see Levy-Yeyati and Sturzenegger, 2003; Harms and Kretschmann, 2009; Sokolov et al., 2011; Ma and McCauley, 2011; De Vita and Kyaw, 2011; Benhima, 2012); while others focus on the linkages between real exchange rate and economic growth (see Tharakan, 1999; Rodrik, 2008; Tarawalie, 2010; Abida, 2011; Elbadawi et al., 2012; Glüzmann et al., 2012; and Vieira et al., 2013). Most of these studies focus on finding the relationship between the variables of interest for various economies using different econometric techniques. The estimation techniques employed in most of the studies are the parametric approaches including both dynamic and system generalized method of moments (GMM) and other parametric estimation techniques.

However, no paper exists on the relationship between real exchange rate and economic growth that focuses on the prioritization of the impact of exchange rate measures on economic growth using the nonparametric technique. In other words, to the best of the authors’ knowledge, no researcher in the field of economics has ever used these machining learning techniques to investigate the relationship between real exchange rate and economic growth. It is on this premise that we employ the machining learning approach of particle swarm optimization (PSO), genetic algorithm (GA), and feature selection to evaluate the validity of Rodrik’s findings for Germany, Slovakia, and South Africa, as developed, developing and emerging economies, respectively.

Therefore, the contribution of this study is twofold: First, evaluate the validity of the findings of Rodrik’s work in developed, developing, and emerging economies using the nonparametric estimation techniques of machining learning approach. Secondly, to determine the optimal model between two major variants of machining learning approach (PSO and GA techniques) for countries under observation. These are a great addition to the literature of exchange rate–growth relationships.

The layout of the paper is scheduled as follows: in section “Data and variables” we discuss the data and variables of interest. In section “Methodology” we discuss in detail the methodology employed (machining learning approach of PSO, GA, and feature selection). Section “Results and discussions” contains the results and empirical findings. Finally, we conclude in section “Conclusion”.

Data and variables

Our empirical investigation uses annual data for three selected countries, namely Germany, Slovakia, and South Africa. Thus, this study introduced machining learning approach (PSO, GA, and feature selection techniques) to evaluate the relative primacy of the exchange rate among other factors for sustainable economic growth in Germany, South Africa, and Slovakia. The selection of these countries has been done so to represent countries with different economic structures. Germany is a major developed country with a flexible exchange rate regime and with huge trade surpluses. More specifically, it is the strongest economy in Europe and the fourth strongest economy in the world in nominal terms. On the other hand, both South Africa is a middle-income, developing country with also floating exchange rate regimes. However, South Africa is the only BRICS member and second-largest economy and by far the largest GDP per capita in sub-Saharan Africa. On the other hand, Slovakia is a high-income country, which became a European Union member in 2004 and adopted the Euro in 2009. It recorded a 10.4% GDP growth between 2000 and 2008, the highest economic growth in Europe during the period. It is grown at a higher rate and more opened than the German economy. As a result, it is called the Talta Tiger. These significant dynamics of the Slovakian economy informed its choice in this study.

The variables under observation are GDP growth proxy for economic growth, undervaluation as a proxy for real exchange rate and GDP per capita, as specified and use in the Rodrik model. We obtained all data from the World Bank Database 2016 (online) over the period of 1990–2016. The variables are transformed into their natural logarithm to harmonize the units of measurement and avoid potential heteroscedasticity.

Methodology

In this section, we discuss the techniques of the machining learning approach. The discussion is important because of the relative novelty of the techniques in the field of Economics.

Rodrik model

Rodrik (2008) considers the maximum of 188 countries and 11, five years periods from 1950 to 1954 till 2000–2004. The estimation of the relationship between growth and RER undervaluation is following from:

$${\mathrm{Growth}}_{it} = {\alpha} + {\beta}\,{\mathrm{lnRGDP}}_{it - 1} + {\updelta}\,{\mathrm{lnUN}}_{it} + f_i + f_t + \mu _{it}$$
(1)

where annual GDP growth is the dependent variable, RGDP is the initial income per capita, fi and ft are the full set of country dummies, and the UN is the index of undervaluation. He used the Balassa–Samuelson effect to calculate the undervaluation. General agreement has been made regarding the testing of external and internal Balassa–Samuelson effects with an intense reservation on the use of purchasing power parity (PPP) in the tradeable sector. The debate is ongoing regarding the alternative model among the international economists and possible refinement of the econometric methodology has been advocated.

$${\mathrm{lnUN}}_{it} = {\mathrm{lnRER}}_{it} - \widehat {{\mathrm{lnRER}}}_{it}$$
(2)

where \(\widehat {{\mathrm{RER}}_{{it}}}\), is the predicted value-form of the following equation:

$${\mathrm{lnRER}}_{it} = {\alpha} + {\beta}\,{\mathrm{lnRGDP}}_{it} + f_t + \mu _{it}$$
(3)

This estimated equation is the Balassa–Samuelson effect.

Machining learning approach

Recently, researchers, though mostly in engineering (see Liu et al., 2002; Ceylam and Ozturk, 2004; Yusup et al., 2012) and few in the field of applied economics (see Canyurt et al., 2005, 2006, 2008; Assareh et al., 2010; Amjadi et al., 2010; Yu et al., 2012; Saliminezhad and Lisaniler, 2018) have been carrying out research using machining learning approach. They found that the multilayer perceptron (MLPI outperforms linear regression. Several evolutionary approaches such as ACO, ABC, GA, PSO, and SA for optimization process parameters have been employed in machining learning due to the nonlinear nature of variables. The estimation of the exchange rate–growth relationship has been modeled using a different form of econometric equations. But none of these studies, including Rodrik (2008), considered the functional forms of their models. Often, relationships between variables are linear or nonlinear. Thus, it becomes paramount to first examine the functional forms of these models for better analysis, whether it is in a linear or non-linear form (see Bishop, 1986). Meanwhile, the nonlinear form of equations can be better handled by nonparametric techniques because the assumptions about the distribution of the variables are not needed. The result of the LM test for nonlinearity (popularly called Ramsey Reset Test) proposed by Ramsey (1969, 1974) conducted in this study shows that the variables employed are in nonlinear form. Thus, the PSO, GA, and feature selection techniques are used for the study. These nonparametric techniques do not need assumptions on the distribution of the variables. Also, the techniques check the priority of each variable and reduce the cost of estimation. The techniques are discussed in detail as follows.

Feature selection (multilayer artificial neural network—ANN)

The purpose of feature selection is to process multifaceted data by discovering the suitable characteristics and get rid of the immaterial ones. Proper feature selection can cause a decrease in measurement costs and also provide a sound understanding of the genuine domain. Feature selection algorithms can be divided into four groups based on different selection strategies, which are embedded, hybrid, filter, and wrapper methods.

The embed techniques carry out feature selection through the process of training and obtaining model fitting of a particular mechanism concurrently. Rakotomamonjy (2003) in his analysis using SVM-based criteria to choose between relevant variables while discarding the irrelevant ones. This choice was based on SVM iterative to execute feature selection. The wrapper techniques harmonize a classifier such as SVM proposed by Bradley and Mangasarian (1998) LDA by Tang et al. (2014), KNN by Prasartvit et al. (2013) to choose between a set of features that possess the most discerning power. Other techniques of wrapper could be any combination of a particular given classifier and search strategy.

The filter techniques independently evaluate the important feature by studying the features of training data through specific statistical method and present the feature selection process of any classifier (Hybrid techniques, on the other hand, was proposed by seeing Uğuz, 2011; Ghamisi et al., 2014; and Xue et al., 2012). All the aforementioned techniques follow the pattern of the single-neuron model which is powerful in learning patterns but cannot learn all types of the pattern (Banerjee et al., 2003). Therefore, the multilayer ANN models are employed in this study. It has an intermediate hidden layer which enables it to learn all kinds of patterns and produce a good prediction. In the multilayer model, the inputs are first transformed into the hidden units whose outputs become the inputs of the output units (Banerjee et al., 2003). The output units finally produce outputs or predictions. The process is mathematically expressed as follows:

$${\mathrm{net}}_{j} = {b}_{j} + {\sum } {\gamma}_{{ji}}{x}_{i}$$
(4)

where netj is the net value for the hidden unit (j), (xi) is the input of the unit (i) and (γji) is a weight linking the input unit (i) to the hidden unit (j). The output of the hidden layer unit (j), becomes

$${G}_{j} = {G}\left( {{\mathrm{net}}_{j}} \right)$$
(5)

Following the same process, the net value for the output unit (h) is

$${\mathrm{net}}_{h} = {b}_{h} + {\sum } {\beta}_{{hj}}{G}_{j}$$
(6)

(βhj) is the weight linking the hidden unit (j) to the output unit (h) while (Gj) is the output of the hidden unit (j), which serves as input for the output unit (h). in that process, the final output of the output layer unit is

$${F}_{h} = {f}\left( {{b}_{h} + {\sum }{\beta}_{{hj}}{G}\left( {{b}_{j} + {\sum } {\gamma}_{{ji}}{x}_{i}} \right)} \right)$$
(7)

Then, using the generalized delta rule, the weight adjustment process (learning) for each output layer takes the form

$${\beta}_{{hj}}\left( {{t} + 1} \right) = {\beta}_{{hj}}\left( {t} \right) + {\eta}\overline {V}$$
(8)

(η) is the learning rate (conventionally <1), and (hj) is the gradient vector associated with the weight (βhj).

PSO techniques

Eberhart and Kennedy (1995) introduced PSO to explain the problem related to continuous optimization. According to Li et al. (2008), the swarm is made up of volume-less pieces with stochastic rates, each of which depicts a workable or realizable solution. This makes it possible for the algorithm to discover the optimal solution by moving the particles in the solution space.

$$\begin{array}{l}v_{{id}}\left( {t\,+\,1} \right)=\omega v_{{id}}\left( {t} \right)+{c}_{i}{r}_{{id}}\left( {t} \right)\left[ {{y}_{{id}}\left( {t} \right)-{x}_{{id}}\left( {t} \right)} \right] \\ +\,{c}_2{r}_{2{d}}\left({t} \right)\left[ {\widehat {y}_{d}\left( {t} \right)-{x}_{{id}}\left( {t} \right)} \right]\end{array}$$
(9)

where ω is the inertia weight factor, c1c2 is acceleration constant, in general r1d, r2d are random numbers in the range [0,1], y is position based on Pbest; \(\widehat y\) is position based on Gbest, vid is velocity of particle i in dimension d, xid is position of particle i in dimension d.

Generally, ω is a function specified as

$$\omega =\omega _{{\mathrm{max}}} - \frac{{\omega _{{\mathrm{max}}}-\omega _{{\mathrm{min}}}}}{{{{\mathrm{Iter}}}_{{\mathrm{max}}}}} \times {{\mathrm{Iter}}}$$
(10)

where ωmax, ωmin: Initial and final interties parameter weights, Itermax: maximum iteration number, Iter: Current iteration number. As the velocity of a particle is determined, its position becomes

$${x}_{{id}}\left( {{t} + 1} \right) = {x}_{{id}}\left( {t} \right) + {v}_{{id}}\left( {{t} + 1} \right)$$
(11)

In this study, empirically, Eq. (8) will be

$${\mathrm{EXR}}_{{id}}\left( {{t} + 1} \right) = {\mathrm{EXR}}\left( {t} \right) + {v}_{{id}}\left( {{t} + 1} \right)$$
(12)

where EXR is the exchange rate. Eq. (9) determines the position of the exchange rate as a determinant of economic growth.

The PSO is applied in this study because of its numerous advantages. The application of PSO is quite easy and requires just a few lines of programming code. To implement these programming codes, complicated mathematical operators are not required. Thus, it is computationally reasonable with regard to both speed and memory requirements. Also, it produces robust and high-quality solutions for continuous nonlinear optimization problems within shorter estimation time. PSO combines the characteristics of both the evolution strategies (ES) and GA and it has more stable convergence features than other stochastic techniques. (see Zuperl et al., 2007; Phanthuna et al., 2013; Anand and Suganthi, 2020). Succinctly, we used this technique due to its advantage in multi-objective optimization, the ability to reduce the cost of estimation and also checks the priority of each variable via permutation and combination.

Genetic algorithm

The GA is an optimization forecasting model. GA are mathematical tools that use broad sorts of applications. GAs are optimization methods and random search control by natural genetics and evolution principles. According to Goldberg (1989), GA is adaptive in nature, involves robust search processes, efficient, uses a huge amount of implied parallelism and generates nearly related optimal solutions. It is mostly used in the field of engineering, where it is becoming more famous. However, GA is also used in energy demand estimation. It is basically used to optimize the weight of various demand equations by making use of available data as a proxy for economic indicators. GA is also used in optimizing weights of equations to enhance the estimation of energy projections for the future. Canyurt et al. (2005) use various economic indicators for Turkey to study natural gas, oil, energy demand for coal and future household’s energy demand. Amjadi et al. (2010) used GA techniques based on certain economic indicators to estimate electricity needs for Iran. Assareh et al. (2010) also employed GA methods to evaluate for Iran, oil demand based on specific socio-economic indicators, using linear and exponential form of GA, while Ceylan and Ozturk (2004) and Haldenbilen and Ceylan (2005) in their respective study on transportation energy demand and total energy demand for Turkey using GA technique estimations.

GA encompasses selection, crossover, and mutation process for the selection of a member. A member is selected on the basis of its fitness which is measured by the probability as follows:

$${p}_{i} = {f}_i/{\sum \limits_{k = 1}^n} {{f}_k}$$
(13)

where Pi is the probability of the ith member in the population. The members with the best fitness value can participate several times while those with worse fitness value are deleted.

Results and discussion

In this section, we report the findings from the machining iteration carried out on the dataset. To determine the most optimized model that could improve the Rodrik model, between PSO and GA machining estimation, we present the estimation results for PSO and GA in Table 1 from 1000 iteration. In Table 1, the PSO model for Germany shows that the PSO is appropriate for a developed economy like Germany because the standard error generated through the machining estimation of PSO is minimal 0.084536 while the standard error generated though the GA is 0.15645. By comparing these results, we conclude that the PSO with optimization produced a better and reliable model for a developed economy like Germany.

Table 1 Result of the PSO and GA.

The implication of this is that, for Germany as a developed country, improving the Rodrik model through the PSO is a better and preferred model when compared to the GA model.

Based on the result obtained, in Table 2, the most relevant independent variable according to FS that influences economic growth in Germany is GDP per capita. The GDP per capita estimate obtained through the FS estimate is 0.34 is positive and statistically significant with the standard error of 0.009. That is, GDP per capita has a direct impact on the economic growth of Germany. An increase in GDP per capita would increase economic growth by 0.34. This implies that using Rodrik model for Germany, where he specified GDP per capita and undervaluation as a function of economic growth, GDP per capita play most influential impact on the economic growth of Germany, while undervaluation has no significant impact on economic growth, this is in support of RodrikFootnote 1 conclusion for developed countries. According to Rodrik, he argued that undervaluation of the real exchange rate is not a good policy measure for the government or policymakers in developed countries to embark upon in the quest for enhanced sustainable economic growth.

Table 2 Result of feature selection.

In the case of Slovakia, which is also a developed economy. Based on our result, we found that PSO is a preferred model when compared with the GA model for Slovakia. That is, the PSO with the optimization model produced a minimal standard error estimate of 0.81139 compared to the GA optimization standard error estimate of 1.4759. That is, for Slovakia, the PSO model is better and reliable for improving the Rodrik model than the GA model. We could not fit our model for a developing country of Slovakia based on a data problem. However, the selected feature (FS) for Slovakia is the real exchange rate. This implies that real exchange is directly related to economic growth. An increase in the real exchange rate would increase economic growth by 0.47. By implication, GDP per capita does not have a huge influencing impact on the economic growth of Slovakia. The findings do not conform to Rodrik’s (2008) findings that undervaluation does not stimulate economic growth in developed countries. The probable reason is the fact that Slovakia surpassed Germany in terms of trade openness, merchandise trade growth, and GDP growth over the years. For instance, the Slovakian economy grew at 4.1% with 11.6% growth in merchandised trade and trade openness of about 190% of the GDP in 2018 (UNCTAD, 2019). On the other hand, the trade openness of Germany was 87% of GDP with 7.8% growth in merchandised trade and 1.4% economic growth in 2018.

For South Africa, both the PSO and GA models can improve the Rodrik model. The PSO estimate is 0.9148, while that of GA is 0.081297. This by implication means that the GA model performs better for an emerging economy of South Africa than the PSO model with optimization. Thus, we conclude that, though both models are appropriate in improving the Rodrik hypothesis for South Africa, the GA model would provide a more reliable and efficient result, when compared with the PSO coefficient estimate. The GA model provides the least standard error estimate than the PSO model. Based on this fact, the GA coefficient estimate is more efficient and reliable than that of the PSO model for South Africa.

Similar to Germany, the selected feature (FS) for South Africa is also GDP per capita with a positive statistically significant coefficient estimate of 0.44 and a standard error estimate of 0.007. This also implies that GDP per capita has a direct impact on the economic growth of South Africa. An increase in GDP per capita would increase the economic growth of South Africa by 0.44. Thus, using the Rodrik model for South Africa, GDP per capita is the most crucial variable and it has a significant impact on growth than undervaluation, which has no significant impact on economic growth. This does not support Rodrik’s conclusion for developing countries (see the “Conclusion” section for explanation)

Conclusion

In his 2008 study, Rodrik stressed the importance of the relationship between the real exchange rate and economic growth to policymakers. He argued that the undervaluation of currency influences and stimulate economic growth, particularly for developing countries. Since then several studies have been carried out to analyze the relationship. However, all of these studies have used one parametric estimation of econometric techniques or another, but none have been able to investigate the relationship using the machining learning approach (nonparametric approach). To the author’s best knowledge, the current study is the first to apply the machining learning techniques on this topic using time series analysis of developed and developing economies over the period of 1990–2016. We investigated the validity of undervaluation (that is, the higher rate of the real exchange rate) for developed and developing economies (Germany, South Africa, and Slovakia). Our findings confirm that the undervaluation of currency stimulates growth in Slovakia but does not significantly affect economic growth in Germany and South Africa. In the case of South Africa and Slovakia, the findings do not conform to Rodrik’s (2008) findings. Slovakia is a developed country. Yet, undervaluation is the most important policy that stimulates economic growth in the country. On the other hand, our findings reveal that undervaluation does not stimulate economic growth in South Africa, a developing (emerging) economy. The plausible explanation for these opposite findings is the differences in the size and dynamics of tradable sectors of the countries. Although both are developed countries, Slovakia surpassed Germany in terms of trade openness, merchandise trade growth, and GDP growth over the years. For instance, the Slovakian economy grew at 4.1% with 11.6% growth in merchandised trade and trade openness of about 190% of the GDP in 2018 (UNCTAD, 2019). On the other hand, the trade openness of Germany was 87% of GDP with 7.8% growth in merchandised trade and 1.4% economic growth in 2018 (UNCTAD, 2019). Since, Rodrik explained that the channel of transmission of the effect of the undervaluation is the tradable sector, the differences in the findings of Germany and Slovakia might be because of the wide gap in their growth and trade openness. Similarly, the tradable sector in South Africa is relatively underdeveloped with 0.8% GDP growth, 5.6% growth in merchandised trade and trade openness of about 60% of GDP. The study also gives useful insight that will enable policymakers to achieve real exchange rate and economic growth targets for developed and developing economies. In short, this study concludes that the exchange rate takes precedence over other factors as a determinant of economic growth in developing and developed economies with the high rate of trade openness while income per capita is the most important determinant of economic growth at the lower rate of trade openness. We, therefore, recommend that highly opened economies should focus on viable exchange rate policies, such as undervaluation of currency to enhance sustained economic growth. On the other hand, relatively less open economies should focus on policies that improve income per capita rather than exchange rate policies.