Estimation of Population Variance Using the Coefficient of Kurtosis and Median of an Auxiliary Variable under Simple Random Sampling

In this study we have proposed a modified ratio type estimator for population variance of the study variable y under simple random sampling without replacement making use of coefficient of kurtosis and median of an auxiliary variable x. The estimator’s properties have been derived up to first order of Taylor’s series expansion. The efficiency conditions derived theoretically under which the proposed estimator performs better than existing estimators. Empirical studies have been done using real populations to demonstrate the performance of the developed estimator in comparison with the existing estimators. The proposed estimator as illustrated by the empirical studies performs better than the existing estimators under some specified conditions i.e. it has the smallest Mean Squared Error and the highest Percentage Relative Efficiency. The developed estimator therefore is suitable to be applied to situations in which the variable of interest has a positive correlation with the auxiliary variable.


Introduction
It is notable that the appropriate use of auxiliary information in probability sampling designs yields considerable reduction in the variance of the estimators of population parameters namely, population mean, median,variance,regression coefficient and population correlation coefficient.[1] was the first to show the contribution of known auxiliary information in improving the efficiency of the estimator of the population mean Y in survey sampling.
Survey samplings now touch almost every field of scientific study, including demography, education, energy, transportation, health care, economics, forestry, sociology, politics and so on.In fact it is not an exaggeration to say that much of the data that are statistically analyzed are collected in surveys.It is imperative to note that as the demand in use of surveys increase, the need for more effective methods of analyzing and interpreting the resulting data is inevitable.Measure of precision being a prime requirement of good surveys and appear now in most analysis hence the need to be obtained for almost each estimate derived from the survey data.
On regular instances we encounter surveys in which an auxiliary variable x is relatively cheap (with regard to time and money) to monitor than the study variable y.Use of auxiliary information can increase the precision of an estimator when the study variable y is highly correlated with auxiliary variable x.
In reality such situations do occur when information is available in the form of auxiliary variable, which is highly correlated with study variable, for example, number of trees in an orchard and the yield of fruits.
The most common and widely used measure of precision is the variance of the survey estimator.In reality population variances are always not known but must be estimated from the survey data themselves.In this study we are interested in the estimation of population variance using known auxiliary information under simple random sampling without replacement (SRSWOR) sampling scheme.
The precision of estimators under this situation is always increased, the ratio, product and regression estimators gives better outcome than those of simple random samplings.

Consider a finite population
of N distinct identifiable units.Let Y be our study variable and X be its corresponding auxiliary variable.
Suppose we take a random sample of size n from this bivariate population

( )
, Y X that is ( ) Without Replacement (SRSWOR) method.Let Y and X be the population means of the study and auxiliary variable respectively and their corresponding sample means be y and x .
This study focuses on improving the efficiency in the estimation of ( ) using the coefficient of kurtosis and median.
We define the following notations that we will use throughout the article.For the population observations we have; Also we define the following from the sample observations: In general, we define the following parameters: Thus we note the following; is the coefficient of variation for the study variable y, is the coefficient of variation for the auxiliary variable x and Many authors have come up with more precise estimators by employing prior knowledge of certain population parameter(s).[2] for example attempted use of the coefficient of variation of study variable but prove inadequate for in practice, this parameter is unknown.Motivated by [2] work, [3] [4] and [5] used the known coefficient of variation but now that of the auxiliary variable for estimating population mean of study variable.Reasoning along the same path [6] used the prior value of coefficient of kurtosis of an auxiliary variable in estimating the population variance of the study variable y.
Kurtosis in most cases is not reported or used in many research articles, in spite of the fact that fundamentally speaking every statistical package provides a measure of kurtosis.This maybe attributed to the likelihood that kurtosis is not well understood or its importance in various aspects of statistical analysis has not been explored fully.Kurtosis can simply be expressed as T. K. where E -the expectation operator, µ -the mean, 4 µ -the fourth moment about the mean and σ -the standard deviation.
Median being the middlemost value in a distribution (when the values are arranged in ascending or descending order) has the advantage of being less affected by the outliers and skewed data, thus is preferred to the mean especially when the distribution is not symmetrical.We can therefore utilize the median and the coefficient of kurtosis of the auxiliary variable to derive a more precise ratio type population variance.

Existing Population Variance Estimators
In this section we have reviewed some finite population variance estimators existing in literature which will help in the construction and development of the proposed estimator.Notably, when auxiliary information is not available the usual unbiased estimator to the population variance is The bias and MSE of 1 t ( ) ( ) ( ) where 1 0 Ψ = Population variance, 2 y S estimation using auxiliary information was considered by [7], and proposed ratio type population variance estimator, given by The bias and Mean Squared Error of Isaki's estimator, where The estimator, 3 t bias and MSE obtained as ( ) { } ( ) where The biases and MSE of their estimators, ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) T. K.
The bias and MSE of their estimator 8 t , ( ) ( ) where, [12] using the known quartiles (upper and lower quartile 3 Q and 1 Q respectively) of the auxiliary variable x suggested The biases and MSE of their estimators 9 t and 10 t as follows ( ) ( ) where .Motivated by [10] and [11] [13] considered the estimation of finite population variance using known coefficient of variation and median of an auxiliary variable, proposed an estimator.
The bias and MSE obtained to be, ( ) ( ) where

Proposed Estimator
Motivated by the works of [14] [9] [15] [13] [10] and [16] in the improvement of the performance of the population variance estimator of the study variable using known population parameters of an auxiliary variable.We propose the following modified ratio type population variance estimator using a known value of population coefficient of kurtosis x κ and median x M of an auxiliary variable.
where ( ) Expanding the right hand side of (42) and multiplying out we have Neglecting terms of s ξ ′ having power greater than two we have ( ) ( ) Taking the expectation on both sides of (44) we have our bias Squaring both sides of (44) and neglecting terms of s ξ ′ having power greater than two we have Taking the expectation on both sides of (47) We get the ( )

Theoretical Comparison
The theoretical conditions under which the proposed modified ratio type estimators Using Equations ((49) and ( 50)) we have that

Empirical Studies
Using the data from Population I (Source:[ [21], 228]) and Population II (source: [22]).We assess the performance of the proposed estimator when simple random sampling without replacement (SRSWOR) scheme is used with that of sample variance and existing estimators.We apply the proposed and existing estimators to this data set and the data statistics are given below: Using the above summary values we have the results in Using formula (54) and (57) we compute the Percent Relative Efficiencies and tabulate the results in Table 2.
Percentage Relative efficiency being a robust statistical tool that is used to How to cite this paper: Milton, T.K., Odhiambo, R.O. and Orwa, G.O. (2017) Estimation of Population Variance Using the Coefficient of Kurtosis and Median of an Auxiliary Variable under Simple Ran- initiated the use of coefficient of kurtosis in estimating population variance of a study variable y.Later, the coefficient of kurtosis was used by[3] [5][8] in the estimating the population mean.[9] using the known information on both 2 x S and x κ suggested modified ratio type population variance estimator for 2 y S as T. K. Milton et al.DOI: 10.4236/ojs.2017.76066948 Open Journal of Statistics

[ 10 ]
suggested four modified ratio type variance estimators using known values of x C and x κ ,