A New Class of L-Moments Based Calibration Variance Estimators

2021-12-16 06:40UsmanShahzadIshfaqAhmadIbrahimMufrahAlmanjahieNadiaAlNoorandMuhammadHanif
Computers Materials&Continua 2021年3期

Usman Shahzad,Ishfaq Ahmad,Ibrahim Mufrah Almanjahie,Nadia H.Al Noor and Muhammad Hanif

1Department of Mathematics and Statistics,International Islamic University,Islamabad,46000,Pakistan

2Department of Mathematics and Statistics,PMAS-Arid Agriculture University,Rawalpindi,46300,Pakistan

3Department of Mathematics,King Khalid University,Abha,62529,Saudi Arabia

4Statistical Research and Studies Support Unit,King Khalid University,Abha,62529,Saudi Arabia

5Department of Mathematics,College of Science,Mustansiriyah University,Baghdad,10011,Iraq

Abstract:Variance is one of the most important measures of descriptive statistics and commonly used for statistical analysis.The traditional second-order central moment based variance estimation is a widely utilized methodology.However, traditional variance estimator is highly affected in the presence of extreme values.So this paper initially, proposes two classes of calibration estimators based on an adaptation of the estimators recently proposed by Koyuncu and then presents a new class of L-Moments based calibration variance estimators utilizing L-Moments characteristics (L-location, Lscale,L-CV)and auxiliary information.It is demonstrated that the proposed L-Moments based calibration variance estimators are more efficient than adapted ones.Artificial data is considered for assessing the performance of the proposed estimators.We also demonstrated an application related to apple fruit for purposes of the article.Using artificial and real data sets,percentage relative efficiency (PRE) of the proposed class of estimators with respect to adapted ones are calculated.The PRE results indicate to the superiority of the proposed class over adapted ones in the presence of extreme values.In this manner,the proposed class of estimators could be applied over an expansive range of survey sampling whenever auxiliary information is available in the presence of extreme values.

Keywords: L-moments; variance estimation; calibration approach; stratified random sampling

1 Introduction

All around, it was considered that the utilization of auxiliary knowledge (information) in the test review (sampling survey) configuration brings about effective estimators of population parameters.The literature on test review portrays an incredible assortment of strategies for utilizing auxiliary information.Ratio technique is the acceptable delineation in this specific situation,see [1-5].In some cases when some population parameters, as the mean, coefficient of variation,the standard deviation of the auxiliary variable have been known many authors including [6,7]imagined an enormous number of improved estimators for the population mean of the variate of interest.This urge researchers to utilize the characteristics of the auxiliary variable (mean, coefficient of variation, standard deviation, median and quartiles) in order to improve the population variance estimate of the study variable.

Variations are available in our life.For example, a doctor needs a full comprehension of varieties in the level of human circulatory strain, internal heat level, and heartbeat rate for satisfactory medicine.A maker needs steady information on the degree of varieties in individuals’response to his item to have the option to realize whether to decrease or increment his cost or improve the nature of his item.An agriculturist requests a sufficient comprehension of the varieties in climatic factors particularly all around (or from time to time) to have the option to anticipate when, where, and how to plant his harvest.A lot of circumstances can be experienced where the population variance estimate of the study variable assumes a huge significance.Consequently,different authors, for example, [8,9] have given their consideration to the improved estimation of population variance of the study variable with the utilization of auxiliary information.However,all these quoted estimators based on traditional second-order central moment based variance estimation which is highly affected in presence of extreme values.So, in the current article, we have characterized a class of estimators for the population variance of the study variable based on L-Moments and calibration approach that is less effective to extreme values.

An experimental investigation utilizing apple production data sets was led, and we got good outcomes, numerically.The analysis of apple fruit production has great importance in food sciences.In view of the applications here, this study well provide a significant premise for future food sciences, medical researches, and many engineering applications.

The rest of the article is arranged in the following major sections.In Section 2, we present some classes of calibration based variance estimators according to the traditional second-order central moment.In Section 3, we provide L-Moments characteristics (L-location, L-scale, LCV) and, propose a new class of calibration based variance estimators under stratified random sampling scheme.Results discussion is documented in Section 4.Finally, some conclusions are given in Section 5.

2 Adapted Classes of Estimators

Consider a finite population Ωa={va1,va2,...,van} of sizeN.Suppose(Y,X)be the study and auxiliary variables associated with Ωa.Let Ωais stratified intoLstrata with thehth stratum containingNhunits, whereh= 1,2,...,L, such thatNh=N.A simple random sample(srs) of sizenhis drawn without replacement from thehth stratum such that=n.Further,(xhi,yhi)are the observed values ofYandXinhth stratum, wherei=1,2,...,Nhandh=1,2,...,L.In light of this stratified sampling design, taking motivation from [1], we present the first calibrated class of variance estimators as given below

whereis the sample variance of the study variable inhth stratum.Further,are calibrated weights that are selected in such a way that the sum of the chi-square type distance measure

is minimum, subject to the calibration constraints

Note thatWh=denoting the traditional stratum weight,are the sample and population characteristics of the auxiliary variable inhth stratum and, Ψhare appropriately selected weights that express the form of the estimator.Some suitable choices for Ψhare provided in Tab.1.

Minimization of Eq.(2), subject to the calibration constraints set out in Eqs.(3) and (4), the optimum weights shall be calculated by

and thus

where

The whole classT(SHa)is provided in Tab.1.

Taking motivation from [10], we adapt the following calibration estimator of the population variance given by

Table 1: First adapted family of estimators

subject to the following constraints

Minimization of Eq.(8), subject to the calibration constraints set out in Eqs.(9) and (10),the optimum weights shall be calculated by

where

and thus

where

The whole classT(SHb)is provided in Tab.2.

Table 2: Second adapted family of estimators

3 L-Moments and Propose Class of Estimators

The occurrence of extreme values, for example, in agriculture engineering, hydrology, food sciences.Meteorology and climatology, among others, observed nowadays in various parts of the world may influence adversely on human culture.Therefore, it is essential to appraise the accurate measurements of the numerical data in the presence of extreme values.As we have depicted before that variance is one of the significant measures of data description.Variance estimation dependent on traditional moments which is profoundly influenced by extreme values.Hence, a quantifiable procedure is necessitated that will catch the extreme values in the assessment, however less influenced by their quality.An elective strategy that has the vital ability to settle this issue is the L-Moments.Because L-Moments are more robust than traditional moments in the presence of extreme values [11].

The general form of the population L-Moments for the auxiliary variable with respect tohth stratum are as given below

λ1xh=E(X1:1)

λ2xh=(2)-1E(X2:2-X1:2)

λ3xh=(3)-1E(X3:3-2X2:3+X1:3)

λ4xh=(4)-1E(X4:4-3X3:4+3X2:4+X1:4)

Corresponding to these, we can write the sample based L-Moments such asbased on sample observations, respectively.Similarly, we can write the general form of the L-Moments for study variable by replacing x with y.For more details regarding L-Moments,interested readers may refer to [12].Some notations for upcoming proposed work in light of L-Moments with respect to hthstratum are

Sxmh=λ2xh,sxmh=are the population and sample standard deviation (L-scale) of auxiliary variable based on L-Moments.

Symh=λ2yh,symh=are the population and sample standard deviation (L-scale) of study variable based on L-Moments.

Cxmh=are the population and sample coefficient of variation (L-CV) of the auxiliary variable based on L-Moments.

Cymh=are the population and sample coefficient of variation (L-CV) of the study variable based on L-Moments.

The calibration estimation is a method of adjusting the original design weights to expand the accuracy of estimates by utilizing auxiliary information.Calibration based estimators utilize calibrated/adjusted weights that are planned to minimize a given measure of distance to the original weights while fulfilling a group of constraints with the auxiliary information.The authors in [13] are pioneer of calibration estimation.After that much of the work has been done in this filed such as [1,10,14].These results motivated us to define the following calibrated estimator of variance as

whereis the sample variance of the study variable inhth stratum.Further,are calibrated weights that are selected in such a way that the sum of the chi-square type distance measure

is minimum, subject to the calibration constraints

Note thatdenotes the traditional stratum weight,are the sample and population characteristics of the auxiliary variable in hthstratum and, Ψhare appropriately selected weights that express the form of the estimator.In most situations, Ψh=1.Minimization of Eq.(13), subject to the calibration constraints set out in Eqs.(14) and (15), the optimum weights shall be calculated by

and thus

where

The whole classT(Pai)is provided in Tab.3.

Table 3: Propose class of estimators

4 Results and Discussion

In this section, the performance of the suggested estimators is investigated through a simulation study.

4.1 Simulation Design

The simulation design is organized as follows: A random variableXhand random variableYhare defined as follows:

Figure 1: Pop-1, h=1

where

X1~Gamma(2.6,3.8), for h=1

X2~Gamma(2.0,3.1), for h=2

X3~Gamma(1.5,2.7), for h=3

X4~Gamma(2.9,3.1), for h=4

Figure 2: Pop-1, h=2

Figure 3: Pop-1, h=3

Further, we assume thatp=1.6,v=5,R=2 and ε has a standard normal distribution in Eq.(19).We consider the population of size N=1000 forhth stratum.The overall sample of size nh=400 is selected for purposes of the article.Using equal allocation of a sample of size 100 is selected from hthstratum.A scatter plot for each stratum is provided (see Figs.1-4).These figures clearly show the presence of extreme values and hence suitable for our proposed estimators.

Figure 4: Pop-1, h=4

Table 4: PRE of proposed family with respect to T(SHa) for artificial data

The steps of simulation study can be carried out as:

Step-1: A sample of sizenhis selected through SRSWOR from thehth stratum.

Step-2: Using Step-1, we find the value of variance estimate (say)where

Step-3: The Step-1 and Step-2 are replicated withK=5000 times.Thus,are obtained.

Step-4: The mean square error (MSE) of the estimators is computed as

Step-5: The PRE is computed asPRE×100 and the results are provided in Tabs.4 and 5.

Table 5: PRE of proposed family with respect to T(SHb) for artificial data

4.2 Real Life Data

To illustrate the behavior of the proposed estimators in this article, we consider a data set of apple fruit, used by [15], where

X=number of apple trees(1 unit=100 trees),

Y=level of apple production(1 unit=100 tonnes).

It is worth mentioning that we consider 477 villages in 4 strata, respectively (like 1: Marmarian, 2: Agean, 3: Mediterranean, and 4: Central Anatolia) in 1999.We draw a scatter plot for each stratum (see Figs.5-8).The scatter plot of each stratum representing extreme values.Some important characteristics of the data are provided in Tab.6.The PRE of the estimators is calculated using defined five steps and available in Tabs.7 and 8.

Figure 5: Pop-2, h=1

Figure 6: Pop-2, h=2

Figure 7: Pop-2, h=3

Figure 8: Pop-2, h=4

However, this investigation can assist with revealing insight into the value of the proposed class.Actually, it gives a reasonable sign that more mind-boggling options than the traditional moments based variance estimators can be helpful when the data is contaminated with extreme values.Because each PRE result of the proposed estimator is greater than 100.Which means that the proposed class beats the adapted ones thoroughly.Based on real and artificial numerical illustration, we feel confident that the same could hold in other contexts of food sciences and agricultural engineering.

Table 6: Characteristics of apple data

Table 7: PRE of proposed family with respect to T(SHa) for real life data

Table 8: PRE of proposed family with respect to T(SHb) for real life data

5 Conclusions

Usually, in test reviews, it uses traditional moments-based techniques to obtain improved designs and more accurate estimators.However, in the presence of extreme values, traditional moments-based techniques may provide misleading results.In this paper, a calibration approach is utilized and developed two adapted families of estimators.Then a new class of L-Moments based variance estimators has been proposed which provides a better estimate of variance in the presence of extreme values.The performance of adapted and proposed estimators has been assessed through artificial and apple fruit data set.Based on satisfactory results, we recommend survey practitioners utilize the proposed class of variance estimators in variou environmental researches.

Acknowledgement:The authors are thankful to the reviewers for their valuable suggestions.

Funding Statement:The authors are grateful to the Deanship of Scientific Research at King Khalid University, Kingdom of Saudi Arabia for funding this study through the research groups program under project number R.G.P.2/67/41.Ibrahim Mufrah Almanjahie received the grant.

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.