Introduction

Consider the following linear regression model:

$$\begin{aligned} y=X\beta +\varepsilon , \end{aligned}$$
(1)

where y is the \(n \times 1\) response vector, X is the \(n \times p\) full column rank matrix of explanatory variables, \(\beta\) is the \(p \times 1\) vector of unknown coefficients, and \(\varepsilon\) is the n-dimensional random error vector with \(E(\varepsilon )=0\) and \({\text {Cov}}(\varepsilon )=\sigma ^{2} I\), where \(\sigma ^{2}>0\) is the error variance.

The most commonly used estimator of the unknown coefficient vector \(\beta\) is the ordinary least squares (OLS) estimator:

$$\begin{aligned} \hat{\beta }_{OLS}=\left( X^{\prime } X\right) ^{-1} X^{'}y \end{aligned}$$
(2)

It follows easily from formula (2) that \(E(\hat{\beta }_{OLS})=\beta\), and the OLS estimator has been widely used because of its unbiasedness and concise form. However, ill-conditioning of the design matrix X, caused by nearly linearly dependent predictors, often makes the OLS estimates unstable.
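To make this instability concrete, the following small numpy sketch (not from the original paper; the synthetic data, seed and variable names are arbitrary) fits the OLS estimator (2) on a design with two nearly linearly dependent columns and shows the inflated coefficients:

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 3
X = rng.standard_normal((n, p))
X[:, 2] = X[:, 0] + 1e-3 * rng.standard_normal(n)   # near-collinear column
beta_true = np.array([1.0, 0.5, -0.5])
y = X @ beta_true + 0.1 * rng.standard_normal(n)

# OLS estimator (2): (X'X)^{-1} X'y
beta_ols = np.linalg.solve(X.T @ X, X.T @ y)
print(np.linalg.cond(X.T @ X))   # very large condition number: X'X is ill-conditioned
print(beta_ols)                  # estimates of the collinear coefficients are unstable
```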

Massy1 proposed the principal component estimator. Hoerl and Kennard2 obtained the ridge estimator by introducing a ridge parameter k into the \(X'X\) matrix. Swindel3 proposed a modified ridge estimator incorporating prior information, while Lukman et al.4 proposed a two-parameter form of the ridge estimator called the modified ridge-type estimator (MRT). Liu5 obtained a linearized form of the ridge estimator known as the Liu estimator. Akdeniz and Kaciranlar6 proposed the generalized Liu estimator. Liu7 obtained a two-parameter form of the Liu estimator.

Many scholars have found that combining two estimators can yield a new estimator with good statistical properties. Baye and Parker8 proposed the r–k estimator by combining the ridge estimator and the principal component estimator. Kaciranlar and Sakallioglu9 proposed the r–d estimator by combining the Liu estimator and the principal component estimator. Ozkale and Kaciranlar10 proposed a two-parameter estimator by combining the James–Stein shrinkage estimator and the modified ridge estimator of Swindel. Batah et al.11 proposed a modified r–k estimator combining the unbiased ridge estimator and the principal component estimator. Yang and Chang12 proposed another two-parameter estimator based on the ridge estimator and the Liu estimator. Lukman et al.13 proposed a new estimator by combining the modified ridge-type estimator (MRT) and the principal component estimator. Kibria and Lukman14 proposed the Kibria–Lukman estimator by combining the ridge estimator and the Liu estimator.

In practice, in addition to the sample information given by model (1), additional information about the parameters, such as deterministic or stochastic restrictions on the unknown coefficients, may be available. Incorporating such information can also help overcome the multicollinearity problem. Theil and Goldberger15 and Theil16 proposed the mixed estimator by combining the sample information with the restrictions. Schiffrin and Toutenburg17 proposed the weighted mixed estimator to account for the different importance of the sample information and the prior information.

In recent years, biased estimation has often been combined with estimation methods that use prior information, giving a broader class of biased estimators. Hubert and Wijekoon18 proposed a stochastic restricted Liu estimator by combining the Liu estimator and the mixed estimator. Yang and Xu19 obtained another stochastic mixed Liu estimator, and in the same year Yang and Chang further studied the stochastic mixed Liu estimator and obtained the weighted mixed Liu estimator. Yang and Li12 proposed another stochastic mixed ridge estimator. Ozbay and Kaciranlar20 combined the two-parameter estimator and the mixed estimator and proposed a two-parameter mixed estimator.

In this paper, a new mixed KL estimator under stochastic restrictions is proposed, and its superiority over several existing estimators under certain conditions is established theoretically. These theoretical results are then verified and analyzed through a numerical example and a simulation study.

The proposed estimator

Hoerl and Kennard2 proposed the ridge estimator (RE):

$$\begin{aligned} \hat{\beta }_{R E}=\left( X^{\prime } X+k I\right) ^{-1} X^{\prime } y \end{aligned}$$
(3)

where \(k>0\) is the ridge parameter. In fact, the ridge estimator is obtained by minimizing the following objective function:

$$\begin{aligned} (y-X \beta )^{\prime }(y-X \beta )+k\left( \beta ^{\prime } \beta -c\right) \end{aligned}$$

where c is a constant and k is the Lagrange multiplier.

Kibria and Lukman14 proposed the Kibria–Lukman (KL) estimator:

$$\begin{aligned} \hat{\beta }_{K L}=\left( X^{\prime } X+k I\right) ^{-1}\left( X^{\prime } y-k \hat{\beta }\right) \end{aligned}$$
(4)

where \(k>0\) is the biasing parameter and \(\hat{\beta }\) denotes the OLS estimator. The KL estimator is obtained by solving the following minimization problem:

$$\begin{aligned} (y-X \beta )^{\prime }(y-X \beta )+k\left[ (\beta +\hat{\beta })^{\prime }(\beta +\hat{\beta })-c\right] \end{aligned}$$
(5)

where c is a constant and k is the Lagrange multiplier.
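As a hedged illustration of Eqs. (3) and (4), the numpy sketch below computes the ridge and KL estimators for a user-supplied biasing parameter k; the function names are my own:

```python
import numpy as np

def ridge_estimator(X, y, k):
    """Ridge estimator (3): (X'X + kI)^{-1} X'y."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + k * np.eye(p), X.T @ y)

def kl_estimator(X, y, k):
    """KL estimator (4): (X'X + kI)^{-1} (X'y - k * beta_ols)."""
    p = X.shape[1]
    beta_ols = np.linalg.solve(X.T @ X, X.T @ y)
    return np.linalg.solve(X.T @ X + k * np.eye(p), X.T @ y - k * beta_ols)
```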

Consider the following stochastic restrictions:

$$\begin{aligned} r=R \beta +e, e \sim \left( 0, \sigma ^{2} \psi \right) , \end{aligned}$$
(6)

where r is a known \(j \times 1\) stochastic vector, R is a known \(j \times p\) matrix of full row rank, e is a \(j \times 1\) random error vector that is independent of \(\varepsilon\), and \(\psi\) is a known positive definite matrix.

Theil and Goldberger15 and Theil16 proposed the mixed estimator by integrating sample information and constraints. The derivation idea is to rewrite models (1) and (6) into a new linear model:

$$\begin{aligned} \left( \begin{array}{l} y \\ r \end{array}\right) =\left( \begin{array}{l} X \\ R \end{array}\right) \beta +\left( \begin{array}{l} \varepsilon \\ e \end{array}\right) \end{aligned}$$

Setting \(\tilde{y}=\left( \begin{array}{l}y \\ r\end{array}\right) , \tilde{X}=\left( \begin{array}{l}X \\ R\end{array}\right) , \tilde{\varepsilon }=\left( \begin{array}{l}\varepsilon \\ e\end{array}\right)\), the above model can be written as

$$\begin{aligned} \tilde{y}=\tilde{X} \beta +\tilde{\varepsilon } \end{aligned}$$
(7)

By applying the least squares estimator to the new linear model (7), the mixed estimator (ME) of the parameter \(\beta\) is obtained:

$$\begin{aligned} \hat{\beta }_{M E}=\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{\prime } y+R^{\prime } \psi ^{-1} r\right) \end{aligned}$$
(8)
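A minimal sketch of the mixed estimator (8), assuming r, R and \(\psi\) are given as in (6); equivalently it could be obtained by generalized least squares on the augmented model (7). The function name is my own:

```python
import numpy as np

def mixed_estimator(X, y, R, r, psi):
    """Mixed estimator (8): (X'X + R' psi^{-1} R)^{-1} (X'y + R' psi^{-1} r)."""
    psi_inv = np.linalg.inv(psi)
    lhs = X.T @ X + R.T @ psi_inv @ R
    rhs = X.T @ y + R.T @ psi_inv @ r
    return np.linalg.solve(lhs, rhs)
```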

Combining the mixed estimator and the ridge estimator yields the stochastic mixed ridge estimator (MRE):

$$\begin{aligned} \hat{\beta }_{MRE}=\left( X^{\prime } X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{\prime } y+R^{\prime } \psi ^{-1} r\right) \end{aligned}$$
(9)
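A corresponding sketch of the stochastic mixed ridge estimator (9) (again, the function name is my own):

```python
import numpy as np

def mre_estimator(X, y, R, r, psi, k):
    """Stochastic mixed ridge estimator (9)."""
    p = X.shape[1]
    psi_inv = np.linalg.inv(psi)
    lhs = X.T @ X + k * np.eye(p) + R.T @ psi_inv @ R
    rhs = X.T @ y + R.T @ psi_inv @ r
    return np.linalg.solve(lhs, rhs)
```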

The estimator proposed in this paper is obtained by solving the following minimization problem:

$$\begin{aligned} \Phi ^{*}=(y-X \beta )^{\prime }(y-X \beta )+k\left[ (\beta +\hat{\beta })^{\prime }(\beta +\hat{\beta })-c\right] +(r-R \beta )^{\prime } \psi ^{-1}(r-R \beta ) \end{aligned}$$
(10)

where c is a constant and k is the Lagrange multiplier.

Differentiating \(\Phi ^{*}\) with respect to \(\beta\) and setting the derivative to zero yields the normal equations:

$$\begin{aligned}&X^{'}X \beta -X^{'}y+k(\beta +\hat{\beta })+R^{\prime } \psi ^{-1} R \beta -R^{\prime } \psi ^{-1} r=0 \end{aligned}$$
(11)
$$\begin{aligned} (\beta +\hat{\beta })^{\prime }(\beta +\hat{\beta })=c \end{aligned}$$
(12)

From Eqs. (11) and (12), we obtain the mixed KL estimator:

$$\begin{aligned} \hat{\beta }_{MKL}=\left( X^{'}X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) , k>0 \end{aligned}$$
(13)

It can be seen from Eq. (13) that the mixed estimator, the KL estimator and the OLS estimator are special cases of the mixed KL estimator. Namely:

When \(k=0\), \(\hat{\beta }_{MKL}=\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{\prime } y+R^{\prime } \psi ^{-1} r\right) =\hat{\beta }_{ME}\) is the mixed estimator;

When \(R=0\), \(\hat{\beta }_{MKL}=(X^{'}X+k I)^{-1}(X'y-k \hat{\beta })=\hat{\beta }_{K L}\) is the \(\mathrm {KL}\) estimator;

When \(k=0, R=0\), \(\hat{\beta }_{MKL}=(X^{'}X)^{-1}X^{'}y=\hat{\beta }_{OLS}\) is the OLS estimator.
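The following hedged numpy sketch implements the mixed KL estimator (13); with \(k=0\) it reduces to the mixed estimator, and with \(R=0\) (so that the restriction terms vanish) to the KL estimator, as noted above:

```python
import numpy as np

def mkl_estimator(X, y, R, r, psi, k):
    """Mixed KL estimator (13):
    (X'X + kI + R' psi^{-1} R)^{-1} (X'y - k*beta_ols + R' psi^{-1} r)."""
    p = X.shape[1]
    beta_ols = np.linalg.solve(X.T @ X, X.T @ y)
    psi_inv = np.linalg.inv(psi)
    lhs = X.T @ X + k * np.eye(p) + R.T @ psi_inv @ R
    rhs = X.T @ y - k * beta_ols + R.T @ psi_inv @ r
    return np.linalg.solve(lhs, rhs)
```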

The performance of the new estimator

If \(\hat{\beta }\) is an estimator of \(\beta\), then the mean square error matrix \((\mathrm {MSEM})\) of \(\hat{\beta }\) is given by:

$$\begin{aligned} {\text {MSEM}}(\hat{\beta })=E(\hat{\beta }-\beta )(\hat{\beta }-\beta )^{\prime }={\text {Cov}}(\hat{\beta })+{\text {Bias}}(\hat{\beta }) {\text {Bias}}(\hat{\beta })^{'} \end{aligned}$$

where \({\text {Cov}}(\hat{\beta })\) is the covariance matrix of \(\hat{\beta }\), and \({\text {Bias}}(\hat{\beta })=E(\hat{\beta })-\beta\) is the bias vector. For two estimators \(\hat{\beta }_{1}\) and \(\hat{\beta }_{2}\), \(\hat{\beta }_{2}\) is said to be superior to \(\hat{\beta }_{1}\) under the MSEM criterion if and only if:

$$\begin{aligned} \Delta \left( \hat{\beta }_{1}, \hat{\beta }_{2}\right) ={\text {MSEM}}\left( \hat{\beta }_{1}\right) -{\text {MSEM}}\left( \hat{\beta }_{2}\right) \ge 0 \end{aligned}$$
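In numerical work, this MSEM comparison can be checked by testing whether the difference matrix is positive semidefinite, for example via its eigenvalues; a minimal sketch (the tolerance is arbitrary):

```python
import numpy as np

def is_superior_msem(msem_1, msem_2, tol=1e-10):
    """True if MSEM(beta_1) - MSEM(beta_2) >= 0, i.e. beta_2 is superior under MSEM."""
    delta = msem_1 - msem_2
    delta = (delta + delta.T) / 2           # symmetrise against rounding error
    return bool(np.all(np.linalg.eigvalsh(delta) >= -tol))
```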

Lemma 3.1

Let M and N be \(n \times n\) matrices with \(M>0\) and \(N \ge 0\). Then \(M>N \Leftrightarrow \lambda _{1}\left( N M^{-1}\right) <1\), where \(\lambda _{1}\left( N M^{-1}\right)\) is the largest eigenvalue of the matrix \(N M^{-1}\).

The expectation, bias, covariance and mean square error matrix of the mixed KL estimator \(\hat{\beta }_{MKL}\) are calculated as follows:

$$\begin{aligned} E\left( \hat{\beta }_{MKL}\right)= \, & {} E\left[ \left( X^{\prime } X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \right] \nonumber \\= \, & {} A_{k} E\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \nonumber \\= \, & {} A_{k} E\left( X^{'}y+k \hat{\beta }-2 k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \nonumber \\= \, & {} A_{k}\left( A_{k}^{-1}-2 k I\right) \beta \nonumber \\= \, & {} \beta -2 k A_{k} \beta \end{aligned}$$
(14)

where \(A_{k}=\left( X^{'}X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\) and, throughout, \(S=X^{\prime }X\).

Bias vector: \({\text {Bias}}\left( \hat{\beta }_{MKL}\right) =E\left( \hat{\beta }_{MKL}\right) -\beta =-2 k A_{k} \beta\).

$$\begin{aligned} {\text {Cov}}\left( \hat{\beta }_{MKL}\right)= \, & {} {\text {Cov}}\left[ \left( X^{\prime } X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \right] \nonumber \\= \, & {} {\text {Cov}}\left[ A_{k}\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \right] \nonumber \\= \, & {} A_{k} {\text {Cov}}\left( X^{'}y-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) A_{k}\nonumber \\= \, & {} A_{k}\left( \sigma ^{2} X^{\prime } X-k \sigma ^{2} S^{-1}+\sigma ^{2} R^{\prime } \psi ^{-1} R\right) A_{k}\nonumber \\= \, & {} \sigma ^{2} A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k} \end{aligned}$$
(15)

Therefore,

$$\begin{aligned} {\text {MSEM}}(\hat{\beta }_{MKL})= \, & {} {\text {Cov}}(\hat{\beta }_{MKL})+{\text {Bias}}(\hat{\beta }_{MKL}) {\text {Bias}}(\hat{\beta }_{MKL})^{'}\nonumber \\= \, & {} \sigma ^{2} A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}+4 k^{2} A_{k} \beta \beta ^{\prime } A_{k}\nonumber \\= \, & {} \sigma ^{2} A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}+b_{1} b_{1}^{\prime } \end{aligned}$$
(16)

where \(b_{1}=-2 k A_{k} \beta .\)
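Equation (16) can be transcribed directly into code; the sketch below assumes the true \(\beta\) and \(\sigma ^{2}\) are known, as they are in a simulation:

```python
import numpy as np

def msem_mkl(X, R, psi, k, beta, sigma2):
    """Mean square error matrix of the mixed KL estimator, Eq. (16)."""
    p = X.shape[1]
    S = X.T @ X
    Q = R.T @ np.linalg.inv(psi) @ R
    A_k = np.linalg.inv(S + k * np.eye(p) + Q)
    cov = sigma2 * A_k @ (S - k * np.linalg.inv(S) + Q) @ A_k
    b1 = -2 * k * A_k @ beta
    return cov + np.outer(b1, b1)
```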

By substituting \(k=0\) into Eq. (16), the mean square error matrix of the mixed estimator can be obtained:

$$\begin{aligned} {\text {MSEM}}(\hat{\beta }_{ME})= \, & {} \sigma ^{2}\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) \left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) ^{-1}\nonumber \\= \, & {} \sigma ^{2}\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) ^{-1}\nonumber \\= \, & {} \sigma ^{2} M^{-1} \end{aligned}$$
(17)

where \(M=X^{\prime } X+R^{\prime } \psi ^{-1} R\).

By substituting \(R=0\) into Eq. (16), the mean square error matrix of the KL estimator can be obtained:

$$\begin{aligned} {\text {MSEM}}\left( \hat{\beta }_{KL}\right)= \, & {} \sigma ^{2}\left( X^{\prime } X+k I\right) ^{-1}\left( X^{\prime } X-k S^{-1}\right) \left( X^{\prime } X+k I\right) ^{-1}\nonumber \\\, &\,+4 k^{2}\left( X^{\prime } X+k I\right) ^{-1} \beta \beta ^{\prime }\left( X^{\prime } X+k I\right) ^{-1}\nonumber \\= \, & {} \sigma ^{2} S_{k}^{-1}\left( X^{\prime } X-k S^{-1}\right) S_{k}^{-1}+4 k^{2} S_{k}^{-1} \beta \beta ^{\prime } S_{k}^{-1}\nonumber \\= \, & {} \sigma ^{2} S_{k}^{-1}\left( X^{\prime } X-k S^{-1}\right) S_{k}^{-1}+b_{2} b_{2}^{\prime } \end{aligned}$$
(18)

where \(S_{k}=X^{\prime } X+k I, b_{2}=-2 k S_{k}^{-1} \beta\).

By substituting \(k=0, R=0\) into Eq. (16), the mean square error matrix of the OLS estimator can be obtained:

$$\begin{aligned} {\text {MSEM}}\left( \hat{\beta }_{\text{ OLS } }\right) =\sigma ^{2} S^{-1} \end{aligned}$$
(19)

The mean square error matrix of the mixed ridge estimator is obtained similarly:

$$\begin{aligned} E\left( \hat{\beta }_{MRE}\right)= \, & {} E\left[ \left( X^{\prime } X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{'}y+R^{\prime } \psi ^{-1} r\right) \right] \nonumber \\= \, & {} A_{k} E\left( X^{\prime } y+R^{\prime } \psi ^{-1} r\right) \nonumber \\= \, & {} A_{k} E\left( X^{'}y+k \hat{\beta }-k \hat{\beta }+R^{\prime } \psi ^{-1} r\right) \nonumber \\= \, & {} A_{k}\left( A_{k}^{-1}-kI\right) \beta \nonumber \\= \, & {} \beta -k A_{k} \beta \end{aligned}$$
(20)

Deviation vector: \({\text {Bias}}\left( \hat{\beta }_{MRE}\right) =E\left( \hat{\beta }_{MRE}\right) -\beta =-k A_{k} \beta .\)

$$\begin{aligned} \begin{aligned} Cov\left( \hat{\beta }_{MRE}\right)&=\, {\text {Cov}}\left[ \left( X^{\prime } X+k I+R^{\prime } \psi ^{-1} R\right) ^{-1}\left( X^{\prime } y+R^{\prime } \psi ^{-1} r\right) \right] \\&=\,{\text {Cov}}\left[ A_{k}\left( X^{'}y+R^{\prime } \psi ^{-1} r\right) \right] \\&=A_{k} {\text {Cov}}\left( X^{'}y+R^{\prime } \psi ^{-1} r\right) A_{k}\\&=A_{k}\left( \sigma ^{2} X^{\prime } X+\sigma ^{2} R^{\prime } \psi ^{-1} R\right) A_{k} \\&=\,\sigma ^{2} A_{k}\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) A_{k} \end{aligned} \end{aligned}$$

Therefore,

$$\begin{aligned} {\text {MSEM}}\left( \hat{\beta }_{MRE}\right) =\sigma ^{2} A_{k}\left( X^{\prime } X+R^{\prime } \psi ^{-1} R\right) A_{k}+k^{2} A_{k} \beta \beta ^{\prime } A_{k}. \end{aligned}$$
(21)

Comparison between mixed KL estimator and mixed estimator

From Eqs. (16) and (17), we consider the difference

$$\begin{aligned} \Delta _{1}= \, & {} {\text {MSEM}}\left( \hat{\beta }_{M E}\right) -{\text {MSEM}}\left( \hat{\beta }_{M K L}\right) \nonumber \\= \, & {} \sigma ^{2} M^{-1}-\sigma ^{2} A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}-b_{1} b_{1}^{\prime }\nonumber \\= \, & {} \sigma ^{2} M^{-1}-\sigma ^{2} A_{k}\left( M-k S^{-1}\right) A_{k}-b_{1} b_{1}^{\prime }\nonumber \\= \, & {} \sigma ^{2}\left[ M^{-1}-A_{k}\left( M-k S^{-1}\right) A_{k}\right] -b_{1} b_{1}^{\prime } \end{aligned}$$
(22)

Because

$$\begin{aligned}&M^{-1}-A_{k}\left( M-k S^{-1}\right) A_{k}\\&\quad =A_{k} A_{k}^{-1} M^{-1} A_{k}^{-1} A_{k}-A_{k}\left( M-kS^{-1}\right) A_{k}\\&\quad =A_{k}\left[ A_{k}^{-1} M^{-1} A_{k}^{-1}-\left( M-k S^{-1}\right) \right] A_{k} \\&\quad =A_{k}\left[ (M+k I) M^{-1}(M+k I)-\left( M-k S^{-1}\right) \right] A_{k} \\&\quad =A_{k}\left( M+2 k I+k^{2} M^{-1}-M+k S^{-1}\right) A_{k} \\&\quad =A_{k}\left( 2 k I+k^{2} M^{-1}+k S^{-1}\right) A_{k}, \end{aligned}$$

Since \(k>0\), we have \(M^{-1}-A_{k}\left( M-k S^{-1}\right) A_{k}>0\), and Theorem 3.2 follows.

Theorem 3.2

The necessary and sufficient condition for the mixed KL estimator \(\hat{\beta }_{MKL}\) to be superior to the mixed estimator \(\hat{\beta }_{M E}\) under the MSEM criterion is:

$$\begin{aligned} \sigma ^{-2} b_{1}^{\prime }\left[ M^{-1}-A_{k}\left( M-k S^{-1}\right) A_{k}\right] ^{-1} b_{1} \le 1 \end{aligned}$$
(23)
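Condition (23) can be evaluated numerically as follows; this is a sketch only, and in practice \(\beta\) and \(\sigma ^{2}\) would be replaced by estimates:

```python
import numpy as np

def condition_23(X, R, psi, k, beta, sigma2):
    """Left-hand side of (23); MKL is superior to ME under MSEM iff this is <= 1."""
    p = X.shape[1]
    S = X.T @ X
    Q = R.T @ np.linalg.inv(psi) @ R
    M = S + Q
    A_k = np.linalg.inv(M + k * np.eye(p))
    b1 = -2 * k * A_k @ beta
    D = np.linalg.inv(M) - A_k @ (M - k * np.linalg.inv(S)) @ A_k
    return float(b1 @ np.linalg.solve(D, b1) / sigma2)
```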

Comparison between mixed KL estimator and KL estimator

From Eqs. (16) and (18), we consider the difference

$$\begin{aligned} \begin{aligned} \Delta _{2}&={\text {MSEM}}\left( \hat{\beta }_{KL}\right) -{\text {MSEM}}\left( \hat{\beta }_{MKL}\right) \\&=\sigma ^{2} S_{k}^{-1}\left( S-k S^{-1}\right) S_{k}^{-1}+b_{2} b_{2}^{\prime }-\sigma ^{2} A_{k}\left( S-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}-b_{1} b_{1}^{\prime } \\&=\sigma ^{2}\left[ S_{k}^{-1}\left( S-k S^{-1}\right) S_{k}^{-1}-A_{k}\left( S-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}\right] +b_{2} b_{2}^{\prime }-b_{1} b_{1}^{\prime } \end{aligned} \end{aligned}$$
(24)

Because

$$\begin{aligned}&S_{k}^{-1}\left( S-k S^{-1}\right) S_{k}^{-1}-A_{k}\left( S-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k} \\&\quad =A_{k}\left[ A_{k}^{-1} S_{k}^{-1}\left( S-k S^{-1}\right) S_{k}^{-1} A_{k}^{-1}-\left( S-k S^{-1}+R^{\prime } \psi ^{-1} R\right) \right] A_{k} \\&\quad =A_{k}\left[ \left( S_{k}+R^{\prime } \psi ^{-1} R\right) S_{k}^{-1}NS_{k}^{-1}\left( S_{k}+R^{\prime } \psi ^{-1} R\right) -\left( N+R^{\prime } \psi ^{-1} R\right) \right] A_{k} \\&\quad =A_{k}\left[ \left( S_{k}+Q\right) S_{k}^{-1} N S_{k}^{-1}\left( S_{k}+Q\right) -(N+Q)\right] A_{k} \\&\quad =A_{k}\left[ \left( I+Q S_{k}^{-1}\right) N\left( I+S_{k}^{-1} Q\right) -(N+Q)\right] A_{k} \\&\quad =A_{k}\left[ N+N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q-(N+Q)\right] A_{k} \\&\quad =A_{k}\left( N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q-Q\right) A_{k} \\&\quad =A_{k} B A_{k}, \end{aligned}$$

where \(N=S-kS^{-1}\), \(Q=R^{\prime } \psi ^{-1} R\) and \(B=N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q-Q\).

If \(k<\min \nolimits _{i=1}^{p}\lambda _{i}^{2}\), where the \(\lambda _{i}\) are the eigenvalues of \(S\), then \(N>0\). By Lemma 3.1, \(B>0\) if and only if \(\lambda _{1}\left[ Q\left( N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q\right) ^{-1}\right] <1\).

Provided that \(k<\min \nolimits _{i=1}^{p}\lambda _{i}^{2}\) and \(\lambda _{1}\left[ Q\left( N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q\right) ^{-1}\right] <1\), the following conclusion is obtained:

\(\Delta _{2} \ge 0\) if and only if \(b_{1}^{\prime }\left( \sigma ^{2} A_{k} B A_{k}+b_{2} b_{2}^{\prime }\right) ^{-1} b_{1} \le 1\). This gives Theorem 3.3.

Theorem 3.3

When \(k<\min \limits _{i=1}^{p}\lambda _{i}^{2}\) and \(\lambda _{1}\left[ Q\left( N S_{k}^{-1} Q+Q S_{k}^{-1} N+Q S_{k}^{-1} N S_{k}^{-1} Q\right) ^{-1}\right] <1\), the necessary and sufficient condition for the mixed KL estimator \(\hat{\beta }_{MKL}\) to be superior to the KL estimator \(\hat{\beta }_{KL}\) under the MSEM criterion is:

$$\begin{aligned} b_{1}^{\prime }\left( \sigma ^{2} A_{k} B A_{k}+b_{2} b_{2}^{\prime }\right) ^{-1} b_{1} \le 1 \end{aligned}$$
(25)

Comparison between mixed KL estimator and OLS estimator

From Eqs. (16) and (19), we consider the difference

$$\begin{aligned} \Delta _{3}= & {} {\text {MSEM}}\left( \hat{\beta }_{OLS}\right) -{\text {MSEM}}\left( \hat{\beta }_{MKL}\right) \nonumber \\= & {} \sigma ^{2} S^{-1}-\sigma ^{2} A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}-b_{1} b_{1}^{\prime }\nonumber \\= & {} \sigma ^{2}\left[ S^{-1}-A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}\right] -b_{1} b_{1}^{\prime } \end{aligned}$$
(26)

Because

$$\begin{aligned}&S^{-1}-A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}\\&\quad =A_{k} A_{k}^{-1} S^{-1} A_{k}^{-1} A_{k}-A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k} \\&\quad =A_{k}\left[ A_{k}^{-1} S^{-1} A_{k}^{-1}-\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) \right] A_{k} \\&\quad =A_{k}\left[ \left( S+k I+Q\right) S^{-1}\left( S+k I+Q\right) -\left( S-k S^{-1}+Q\right) \right] A_{k} \\&\quad =A_{k}\left[ \left( I+k S^{-1}+QS^{-1}\right) \left( S+k I+Q\right) -\left( S-k S^{-1}+Q\right) \right] A_{k} \\&\quad =A_{k}\left[ S+k I+Q+\left( I+k S^{-1}+QS^{-1}\right) \left( k I+Q\right) -\left( S-k S^{-1}+Q\right) \right] A_{k} \\&\quad =A_{k}\left[ k I+k S^{-1}+\left( I+k S^{-1}+QS^{-1}\right) \left( k I+Q\right) \right] A_{k} \\&\quad =A_{k}\left( 2 k I+k S^{-1}+k^{2} S^{-1}+Q+k S^{-1} Q+k Q S^{-1}+Q S^{-1} Q\right) A_{k} \\&\quad =A_{k}\left[ 2 k I+k S^{-1}+k^{2} S^{-1}+Q+k\left( S^{-1} Q+Q S^{-1}\right) +Q S^{-1} Q\right] A_{k} \\&\quad =A_{k}\left[ 2 k I+k S^{-1}+k^{2} S^{-1}+Q+k C+Q S^{-1} Q\right] A_{k} \end{aligned}$$

where \(C=S^{-1} Q+Q S^{-1}\).

Because \(C=C^{\prime }\) and \(\lambda _{i}\left( S^{-1} Q\right) =\lambda _{i}\left( S^{-\frac{1}{2}} Q S^{-\frac{1}{2}}\right) \ge 0\), we have \(C \ge 0\). Since \(k>0\), it follows that \(2 k I+k S^{-1}+k^{2} S^{-1}+Q+k C+Q S^{-1} Q>0\), that is, \(S^{-1}-A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}>0\), and Theorem 3.4 follows.

Theorem 3.4

The necessary and sufficient condition for the mixed KL estimator \(\hat{\beta }_{MKL}\) to be superior to the OLS estimator \(\hat{\beta }_{OLS}\) under the MSEM criterion is:

$$\begin{aligned} \sigma ^{-2} b_{1}^{\prime }\left[ S^{-1}-A_{k}\left( X^{\prime } X-k S^{-1}+R^{\prime } \psi ^{-1} R\right) A_{k}\right] ^{-1} b_{1} \le 1 \end{aligned}$$
(27)

Comparison between mixed KL estimator and mixed ridge estimator

From Eqs. (16) and (21), we consider the difference

$$\begin{aligned} \Delta _{4}= & {} {\text {MSEM}}\left( \hat{\beta }_{M R E}\right) -{\text {MSEM}}\left( \hat{\beta }_{M K L}\right) \nonumber \\= & {} \sigma ^{2} A_{k}\left( S+Q\right) A_{k}+k^{2} A_{k} \beta \beta ^{\prime } A_{k}-\sigma ^{2} A_{k}\left( S-k S^{-1}+Q\right) A_{k}-4 k^{2} A_{k} \beta \beta ^{\prime } A_{k}\nonumber \\= & {} \sigma ^{2} A_{k} M A_{k}-\sigma ^{2} A_{k}\left( M-k S^{-1}\right) A_{k}-3 k^{2} A_{k} \beta \beta ^{\prime } A_{k}\nonumber \\= & {} \sigma ^{2} A_{k}\left[ M-\left( M-k S^{-1}\right) \right] A_{k}-3 k^{2} A_{k} \beta \beta ^{\prime } A_{k}\nonumber \\= & {} k \sigma ^{2} A_{k} S^{-1} A_{k}-3 k^{2} A_{k} \beta \beta ^{\prime } A_{k}\nonumber \\= & {} k A_{k}\left( \sigma ^{2} S^{-1}-3 k \beta \beta ^{\prime }\right) A_{k} \end{aligned}$$
(28)

Theorem 3.5

The necessary and sufficient condition for the mixed KL estimator \(\hat{\beta }_{MKL}\) to be superior to the mixed ridge estimator \(\hat{\beta }_{MRE}\) under the MSEM criterion is:

$$\begin{aligned} 3 k \sigma ^{-2} \beta ^{\prime } S \beta \le 1 \end{aligned}$$
(29)

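Condition (29) is a scalar inequality and is easy to check; a sketch (again, \(\beta\) and \(\sigma ^{2}\) are replaced by estimates in practice):

```python
import numpy as np

def condition_29(X, k, beta, sigma2):
    """Left-hand side of (29); MKL is superior to MRE under MSEM iff this is <= 1."""
    return float(3 * k * beta @ (X.T @ X) @ beta / sigma2)
```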

Numerical example and simulation study

To further illustrate the theoretical results, this section verifies and analyzes them through a numerical example and a simulation study.

The example uses the data on research and development expenditure as a percentage of GNP for several countries from 1972 to 1986, previously analyzed by Gruber21 and by Akdeniz and Erol22, in which \(x_{1}\) represents France, \(x_{2}\) West Germany, \(x_{3}\) Japan, \(x_{4}\) the former Soviet Union, and y the United States. See Table 1 for the data.

Table 1 1972–1986 research and development expenditure as a percentage of GNP.

The data in Table 1 are processed as follows

$$\begin{aligned} X=\left( \begin{array}{cccc} 7 &{} 26 &{} 6 &{} 60 \\ 1 &{} 29 &{} 15 &{} 52 \\ 11 &{} 56 &{} 8 &{} 20 \\ 11 &{} 31 &{} 8 &{} 47 \\ 7 &{} 52 &{} 6 &{} 33 \\ 11 &{} 55 &{} 9 &{} 22 \\ 3 &{} 71 &{} 17 &{} 6 \\ 1 &{} 31 &{} 22 &{} 44 \\ 2 &{} 54 &{} 18 &{} 22 \\ 21 &{} 47 &{} 4 &{} 26 \\ 1 &{} 40 &{} 23 &{} 34 \\ 11 &{} 66 &{} 9 &{} 12 \\ 10 &{} 68 &{} 12 &{} 12 \end{array}\right) , y=\left( \begin{array}{c} 78.5 \\ 74.3 \\ 104.3 \\ 87.6 \\ 95.9 \\ 109.2 \\ 102.7 \\ 72.5 \\ 93.1 \\ 115.9 \\ 83.8 \\ 113.3 \\ 109.4 \end{array}\right) \end{aligned}$$

First, the eigenvalues of \(X^{\prime } X\) are \(\lambda _{1}=302.9626\), \(\lambda _{2}=0.7283\), \(\lambda _{3}=0.0446\) and \(\lambda _{4}=0.0345\); the OLS estimate of \(\sigma ^{2}\) is \(\hat{\sigma }^{2}=0.0015\), and the \(\mathrm {OLS}\) estimate of \(\beta\) is \(\hat{\beta }_{O L S}=(0.6455,0.0896,0.1436,0.1526)^{\prime }\).

The biasing parameter k can be chosen by the method proposed by Kibria and Lukman14; alternatively, the generalized cross-validation (GCV) or cross-validation (CV) criteria can be used, see Arashi et al.23, Roozbeh24, and Roozbeh et al.25. In this paper we use the method proposed by Kibria and Lukman14, which is given as follows:

$$\begin{aligned} \hat{k}_{i}=\frac{\hat{\sigma }^{2}}{2 \hat{\alpha }_{i}^{2}+\left( \hat{\sigma }^{2} / \lambda _{i}\right) } \end{aligned}$$
(30)

where \(\hat{\alpha }_{i}\) is the i-th element of \(\hat{\alpha }=T^{\prime } \hat{\beta }_{OLS}\), T being the matrix of normalized eigenvectors of \(X^{\prime } X\), and \(\lambda _{i}\) is the corresponding eigenvalue; we take \(k=\hat{k}_{\min }=\min _{i} \hat{k}_{i}\).
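A hedged sketch of this choice of k, with \(\hat{\alpha }_{i}\) and \(\lambda _{i}\) taken from the eigendecomposition of \(X'X\) as described above:

```python
import numpy as np

def kl_biasing_parameter(X, y):
    """Candidate parameters k_i of Eq. (30); k is taken as their minimum."""
    n, p = X.shape
    S = X.T @ X
    beta_ols = np.linalg.solve(S, X.T @ y)
    sigma2_hat = np.sum((y - X @ beta_ols) ** 2) / (n - p)   # OLS estimate of sigma^2
    lam, T = np.linalg.eigh(S)           # eigenvalues lambda_i and eigenvectors of X'X
    alpha_hat = T.T @ beta_ols           # canonical-form coefficients alpha_hat_i
    k_i = sigma2_hat / (2 * alpha_hat ** 2 + sigma2_hat / lam)
    return float(k_i.min())
```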

Consider the following stochastic restriction (see Roozbeh et al.26 and Roozbeh and Hamzah27):

$$\begin{aligned} r=R \beta +e, R=\left( \begin{array}{llll} 1&-2&-2&-2 \end{array}\right) , r=1, e \sim \left( 0, \hat{\sigma }^{2}\right) \end{aligned}$$

Table 2 presents the estimated MSE values of the mixed estimator, the KL estimator, the OLS estimator, the mixed ridge estimator and the mixed KL estimator proposed in this paper.

Table 2 Estimated MSE.

As can be seen from Table 2:

When k takes \(\hat{k}_{\min }=0.018\), the MSE of the mixed KL estimator \(\hat{\beta }_{M K L}\) is smaller than that of the mixed estimator, the KL estimator, the OLS estimator and the mixed ridge estimator. This is consistent with the theoretical results of this paper and suggests that adding stochastic restrictions can yield a better estimate under certain conditions. Thus, in practice, stochastic restrictions can be used to mitigate multicollinearity.

Next, we carry out a Monte Carlo simulation study.

First, the generation of the parameters and data used in the simulation is briefly described.

The explanatory variables are generated by the same method as McDonald and Galarneau28 and Gibbons29, that is, by the following equation:

$$\begin{aligned} x_{i j}=\left( 1-\rho ^{2}\right) ^{1 / 2} z_{i j}+\rho z_{i p}, \quad i=1,2, \ldots , n, \quad j=1,2, \ldots , p \end{aligned}$$

where the \(z_{i j}\) are independent standard normal random numbers and \(\rho\) is a given constant; \(\rho ^{2}\) is the theoretical correlation between any two explanatory variables, so it reflects the degree of multicollinearity in the model. In this simulation we consider \(\rho =0.85,0.9,0.99\) and set \(p=3, r=1, R=\left( \begin{array}{lll}1&-2&-2\end{array}\right) , e \sim (0, \sigma ^{2})\), with \(n=30,50,70,100\).

For a given design matrix X, the normalized eigenvector corresponding to the largest eigenvalue of \(X^{\prime } X\) is taken as the true value of the parameter vector \(\beta\).

The data corresponding to the response variable is generated by the following equation:

$$\begin{aligned} y_{i}=\beta _{1} x_{i 1}+\beta _{2} x_{i 2}+\ldots +\beta _{p} x_{i p}+\varepsilon _{i}, i=1,2, \ldots , n \end{aligned}$$

where the \(\varepsilon _{i}\) are independent random errors with mean zero and variance \(\sigma ^{2}\); we consider \(\sigma ^{2}=0.1,1,5,10\).
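A hedged sketch of one Monte Carlo replication of this design follows (the replication count, seed and the particular value of k are arbitrary; the MSE reported in the tables corresponds to the average squared error over replications):

```python
import numpy as np

rng = np.random.default_rng(2023)

def one_replication(n=30, p=3, rho=0.9, sigma2=1.0, k=0.1):
    """Generate X, beta, y and the restriction, and return the squared error of the MKL estimator."""
    # x_ij = sqrt(1 - rho^2) z_ij + rho z_ip (McDonald-Galarneau scheme)
    Z = rng.standard_normal((n, p + 1))
    X = np.sqrt(1 - rho ** 2) * Z[:, :p] + rho * Z[:, [p]]
    # true beta: normalized eigenvector of the largest eigenvalue of X'X
    _, T = np.linalg.eigh(X.T @ X)
    beta = T[:, -1]
    y = X @ beta + np.sqrt(sigma2) * rng.standard_normal(n)
    # stochastic restriction as in the text: R = (1 -2 -2), r = 1, psi = 1
    R = np.array([[1.0, -2.0, -2.0]])
    r = np.array([1.0])
    psi_inv = np.array([[1.0]])
    # mixed KL estimator (13)
    beta_ols = np.linalg.solve(X.T @ X, X.T @ y)
    lhs = X.T @ X + k * np.eye(p) + R.T @ psi_inv @ R
    rhs = X.T @ y - k * beta_ols + R.T @ psi_inv @ r
    beta_mkl = np.linalg.solve(lhs, rhs)
    return float(np.sum((beta_mkl - beta) ** 2))

# estimated MSE of the MKL estimator over 1000 replications
print(np.mean([one_replication(rho=0.99, sigma2=1.0) for _ in range(1000)]))
```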

The simulation results are reported in Tables 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 and 18.

Table 3 Estimated MSE when \(\sigma ^{2}=0.1, n=30\).
Table 4 Estimated MSE when \(\sigma ^{2}=0.1, n=50\).
Table 5 Estimated MSE when \(\sigma ^{2}=0.1, n=70\).
Table 6 Estimated MSE when \(\sigma ^{2}=0.1, n=100\).
Table 7 Estimated MSE when \(\sigma ^{2}=1, n=30\).
Table 8 Estimated MSE when \(\sigma ^{2}=1, n=50\).
Table 9 Estimated MSE when \(\sigma ^{2}=1, n=70\).
Table 10 Estimated MSE when \(\sigma ^{2}=1, n=100\).
Table 11 Estimated MSE when \(\sigma ^{2}=5, n=30\).
Table 12 Estimated MSE when \(\sigma ^{2}=5, n=50\).
Table 13 Estimated MSE when \(\sigma ^{2}=5, n=70\).
Table 14 Estimated MSE when \(\sigma ^{2}=5, n=100\).
Table 15 Estimated MSE when \(\sigma ^{2}=10, n=30\).
Table 16 Estimated MSE when \(\sigma ^{2}=10, n=50\).
Table 17 Estimated MSE when \(\sigma ^{2}=10, n=70\).
Table 18 Estimated MSE when \(\sigma ^{2}=10, n=100\).

Based on Tables 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 and 18, the following conclusions are drawn:

  (1) The mean square error of all estimators increases as \(\rho\) increases and decreases as n increases.

  (2) The mixed KL estimator always has the smallest MSE for all given n and \(\sigma ^{2}\) when k takes \(\hat{k}_{\min }\). This is consistent with Theorems 3.2–3.5 of this paper: under certain conditions, the mixed KL estimator \(\hat{\beta }_{MKL}\) is better than the mixed estimator \(\hat{\beta }_{M E}\), the KL estimator \(\hat{\beta }_{K L}\), the least squares estimator \(\hat{\beta }_{OLS}\) and the mixed ridge estimator \(\hat{\beta }_{M R E}\) under the MSE criterion.

  (3) Under the same conditions, the mixed estimator \(\hat{\beta }_{ME}\), the mixed ridge estimator \(\hat{\beta }_{MRE}\) and the mixed KL estimator \(\hat{\beta }_{MKL}\) are better than the unrestricted least squares estimator \(\hat{\beta }_{OLS}\) under the MSE criterion, and the mixed KL estimator \(\hat{\beta }_{M K L}\) is better than the unrestricted KL estimator \(\hat{\beta }_{K L}\) under the MSE criterion.

Conclusions

In this paper, a new mixed KL estimator that incorporates prior information about the parameters in addition to the sample information of the linear model is proposed, and the properties of the new estimator are discussed. The necessary and sufficient conditions for the mixed KL estimator to be superior to the mixed estimator, the KL estimator, the OLS estimator and the mixed ridge estimator under the mean square error matrix criterion are derived and proved. The theoretical results are then verified through a numerical example and a simulation study.