An enhanced estimator of finite population variance using two auxiliary variables under simple random sampling

In this article, we have suggested a new improved estimator for estimation of finite population variance under simple random sampling. We use two auxiliary variables to improve the efficiency of estimator. The numerical expressions for the bias and mean square error are derived up to the first order approximation. To evaluate the efficiency of the new estimator, we conduct a numerical study using four real data sets and a simulation study. The result shows that the suggested estimator has a minimum mean square error and higher percentage relative efficiency as compared to all the existing estimators. These findings demonstrate the significance of our suggested estimator and highlight its potential applications in various fields. Theoretical and numerical analyses show that our suggested estimator outperforms all existing estimators in terms of efficiency. This demonstrates the practical value of incorporating auxiliary variables into the estimation process and the potential for future research in this area.

The use of auxiliary information can increase the precision of population parameter estimates in survey sampling, assuming that it is strongly associated with the study variable.To get precise estimates of the unknown population characteristics, auxiliary information is essential in parameter selection and at estimation stage.Estimating population variance is specifically important in populations that are likely to be skewed, and the accuracy of the estimation has been the focus of many research studies.By reducing the dispersion of the estimate obtained from various samples, we can obtain a more accurate representation of the population.Variance is a feature that can be used to describe a statistical population, and brilliant efforts have been made to estimate this characteristic as accurately as possible.The goal is to reduce the scattering of the calculated values, and appropriate use of auxiliary variables in survey sampling results in significant decrease in the variance of the estimator of unknown population parameter(s).In many situations, it is necessary to estimate the population variance of the study variable.The variance estimate for the population is crucial in a number of disciplines, such as agriculture, health, biology, and business, where we confront populations that are likely to be skewed.Everywhere in the world, there are differences in the things we do every day.The idea that no two things or persons are exactly alike is one that is widely held.For instance, a farmer must be well aware of how the geographical climate varies over time in order to decide how to sow his land.A clinician must have a thorough understanding of the variations in human hypertension and fever in order to administer the appropriate care.
The estimation of population variance has been a matter of interest for several researchers over the years.Garcia and Cebrian 1 suggested the ratio estimator for the population variance.Upadhyaya et al. 2 recommended a broad class of estimators for variance of the ratio estimator.Chandra and Singh 3 discussed a family of estimators for population variance.Arcos et al. 4 suggested incorporating the auxiliary information available in variance estimation.Kadilar and Cingi 5 discussed improvement in variance estimation in simple random sampling.Grover 6 discussed a correction note on improvement in variance using auxiliary information.Sharma and Singh 7 discussed a generalized class of estimators for finite population variance.Singh and Solanki 8 suggested improved estimation of finite population variance using auxiliary information.Yadav and Kadilar 9 recommended a two parameter variance estimator using auxiliary information.Adichwal et al. 10 proposed generalized class of estimators for population variance using auxiliary attribute.Adichwal et al. 11 recommended generalized class of estimators for population variance using information on two auxiliary variables.Lone and Tailor 12 suggested estimation of population variance in simple random sampling.Singh and Khalid 13 discussed effective estimation strategy of population variance in two-phase successive sampling under random non-response.Audu et al. 14 proposed difference-cum-ratio estimators for estimating finite population coefficient of variation in simple random sampling.Shahzad et al. 15 suggested a new variance estimators for calibration approach under stratified random sampling.Singh and Khalid 16 suggested a composite class of estimators to deal with the issue of variance estimation under the situations of random non-response in two-occasion successive sampling.Zaman and Bulut 17 suggested a new class of robust ratio estimators for finite population variance.Shahzad et al. 18 discussed use of calibration constraints and linear moments for variance estimation under stratified adaptive cluster sampling.Ashutosh et al. 19 suggested calibration Approach for Variance Estimation of Small Domain.Ahmad 20 discussed improved estimation of population variance under stratified random sampling.Ahmad 21 discussed improved variance estimator using dual auxiliary variable under simple random sampling.Ahmad et al. 22 suggested enhanced generalized class of estimators under simple random sampling.
Numerous authors have contributed to the estimation of population variance as mentioned above.In this article, we have suggested an enhanced estimator, which has valuable contribution and highlighted the importance of auxiliary variables in estimating of population variance.By using actual data and a simulation study, the suggested estimator has minimum mean square error and higher percentage relative efficiency as compared to recent well known existing estimators.
The primary goal of the current work is given by: 1.To propose a new estimator for finite population variance using two auxiliary variables under simple random sampling.2. The properties i.e. bias and mean squared error of the propose estimator is derived up to the first order of approximation.3. Through the use of the actual data and a simulation study, the application of the propose estimator is highlighted.4. A comparison of estimators is done with existing estimators in terms of minimum mean square error.5. Our proposed estimator provides a novel and valuable contribution to the field of population variance estimation, and we believe it has the potential to be useful in a wide range of applications.

Notations and symbols
Consider ω ) is a population comprised of size N, consider a sample of size n is chosen from ω ⌣ by using simple random sampling without replacement (SRSWOR).Let Y be the population study variable, and X 1 and X 2 denote the two auxiliary variables respectively.Consider population variance of Y, X 1 , and X 2 are denoted by S 2 y , S 2 x1 , and , be the sample vari- , be the population variances.Let y , x 1 , and x 2 be the sample mean.
To derive the bias and mean square error, we consider the following error terms: where ¯ * lmn = (¯ lmn − 1) , and = 1 n − 1 N .wherel, m, and n be the positive numerals and υ 200 , υ 020 , andυ 002 are the second-order moments about means of Y, X 1 , and X 2 , and ¯ lmn is the moment ratio.
The article has been structured as follows: In Section "Introduction", we have provided a brief overview of the topic and highlighted the importance of estimating population variance.In Section "Notations and symbols", we review some of the existing estimators for estimation of population variance.In Section "Existing estimators", we proposed a new estimator for population variance under simple random sampling.In Section "Proposed estimator", we will compare the proposed estimator with the existing estimators in terms of MSE.In Section "Efficiency assessment", we will present a numerical study to illustrate the performance of the proposed estimator and compare it with the existing estimators.In Section "Numerical study", we will perform simulations to validate the results of the numerical study and provide additional insights into the performance of the estimators.In Section "Simulation study", we will discuss the results of our study, the limitations of the proposed estimator, and the scope for future research in this field.In Section "Discussion", we will summarize the main findings of our study and provide concluding remarks on the estimation of population variance under simple random sampling.

Existing estimators
In this section, we study different variance estimators based on simple random sampling that are available in the literature.
(i) The usual estimator T 0 , is given by: The variance of T 0 , is given by: (ii) Isaki 23 suggested a ratio estimator, which is given by: The bias and mean square error of T 1 are given as: and (iii) The difference type estimator is given by: where k is the constant, i.e. k = The minimum variance at the optimal value of k is given by: where ρ = .
(iv) The difference type estimator given by: where ω (1) (5) www.nature.com/scientificreports/ The bias and MSE of T 4 and T 5 are given by: and (vi) Grover and Kaur 25 , suggested the following estimator, which is given by: The bias of T 6 , is given by: The minimum values of ω

Proposed estimator
The use of auxiliary variables can improve the precision and efficiency of an estimator during both the design and estimation procedures.Taking motivation from the work of Ahmad et al. 26 , we propose an enhanced estimator for estimation of finite population variance using two auxiliary variables under simple random sampling.Our suggested variance estimator based on simple random sampling is more efficient and flexible as compared to all the existing estimators considered in this study, which is given by: where and ω ⌣ 7 are unknown constants.

After expanding the above equation, we have
The bias and mean square error of T prop , is given by: (10) (13) (15)  where

Efficiency assessment
In this section, we compare the proposed estimator with existing estimators in terms of mean square error.

Numerical study
In this section, we conduct a mathematical exploration to compare the performance of our suggested estimator with existing estimators.We consider four actual datasets that are sampled using simple random sampling, and their descriptions are provided in Table 1.To compare the performance of the estimators, we use the concept of percentage relative efficiency (PRE), which is a widely used measure in statistics.
To calculate the PRE, we use the following expression: where (u) = (0, 1, The expression of mean square error (MSE) using four actual data sets are given in Table 2.The conditional values of estimators are given in Table 3.The expression of percentage relative efficiency (PRE) using four actual data sets are given in Table 4.
Population 1: [Source: Murthy 27 ] Y: Output of the workshop, X 1 =Fixed capital, X 2 = Number of worker.Population 2: [Source: Cochran 28 ] Y: Food cost, X 1 : size of the family, and X 2 = Income.Population 3: [Source: Singh 29 ] Y: Estimated fish caught in the year 1995, X 1 : Estimated fish caught in the year 1994, X 2 = Estimated fish caught in the year 1993.

Simulation study
This portion shows the algebraic method for the evaluation of T prop with the existing counterparts T 0 , T 1 , T 2 , T 3 , T 4 , T 5, and T 6 .The algorithm for simulation study to demonstrate the performance of numerous estimators of T 0 is as follows: 1. Produce two independent random variables X from N µ, σ 2 and Z from N µ 1 , σ 2 1 using the box-Muller method.2. Set Y = ρX + 1 − ρ 2 , where 0 < ρ = 0.5, 0.6, 0.7, 0.8 > 1.
3. Return the pair (Y, X, Z). 4. Let the population-I with the parameters Î¼ = 3, σ = 2, µ 1 = 5 and σ 1 = 3 and in step-1 and repeat steps 1-3 for 500 times.This population will have different variances for X and Z. 5. Likewise, produce the population-II with the parameters Î¼ = 2, σ = 3, µ 1 = 4 and σ 1 = 3 in step-1 and repeat steps 1-3 for 500 times.6.From the population of size N = 50, draw 500 SRSWOR ( y i ,x i ) (i = 1, 2,3,…,n) of size n = 10, 15 and 20. 7. The Average MSE of the estimators is defined by: 8.The percentage relative efficiency of estimators as compared to usual estimator T 0 is defined by: 9.The Average PRE of the estimators is defined by The results of the simulation study are presented in Tables 5 and 6.

Discussion
As stated before we take four data sets and simulation, to notice the performance of the proposed enhanced variance estimator based on simple random sampling using two auxiliary variables.The proposed estimator is compared with existing estimators concerning MSE and PRE.The summary statistic of actual data sets is presented in Table 1.The MSE of estimators are given in Table 2 and the conditional values of estimators are given Table 2. Mean square error using population's I-IV. in Table 3. From the mathematical results using actual data, and PRE are given in Table 4, it is observed that the proposed estimator is efficient in terms of efficiency.We evaluated the efficiency of our proposed variance estimator in simple random sampling using a simulation study.From Tables 5 and 6, it can be concluded that for different values of the correlation coefficient ρ = 0.5, 0.6, 0.7, 0.8 and the sample size n = 10, 15, 20 the perfor- mance of proposed estimator T prop is more efficient in comparison with the usual estimator and other existing estimators.The result of the simulation study clearly shows that the average PRE of the proposed estimator T prop is higher as compared to the average PRE of the existing estimator for both the simulated data sets I and dataset II.Hence it can be recommended to use it further.

Conclusion
In this article, we have suggested a novel variance estimator using two auxiliary variables under simple random sampling.We conducted a comprehensive comparison of our suggested estimator with several existing counterparts using four real data sets.We also obtained the properties of the proposed estimator under the first order of approximation and carried out a simulation study to test its robustness and generalizability.The results showed that the suggested estimator outperformed to the existing estimators in terms of efficiency.Future research could focus on extending our proposed estimator to two-phase sampling designs, and to situations with non-response and measurement error, where information from auxiliary variables can be utilized to estimate population variance more accurately.Additionally, it would be interesting to investigate the performance of our proposed estimator in more complex survey settings, such as clustered and stratified sampling.

⌣ 1 and ω ⌣ 2
is the unidentified constants: and The bias and minimum MSE at the optimal value of ω ⌣ 1 and ω ⌣ 2 , are given by: and (v) The exponential ratio and product type estimators by Singh et al. 24 :

⌣ 3 and ω ⌣ 4 ⌣ 3 and ω ⌣ 4 ,
are given by: andThe minimum MSE at the optimum values of ω are given by:

Table 1 .
Summary statistics of population's I-IV.

Table 4 .
Percentage relative efficiency w.r.t T 0 of population's I-IV.

Table 5 .
Evaluation of T prop with other existing counterparts for Population-I.

Table 6 .
Evaluation of T prop with other existing counterparts for Population-II.