Mixture Regression-Cum-Ratio Estimator Using Multi-Auxiliary Variables and Attributes in Single-Phase Sampling

In this paper, we have proposed a class of mixture regression-cum-ratio estimator for estimating population mean by using information on multiple auxiliary variables and attributes simultaneously in single-phase sampling and analyzed the properties of the estimator. An empirical was carried out to compare the performance of the proposed estimator with the existing estimators of finite population mean using simulated population. It was found that the mixture regression-cumratio estimator was more efficient than ratio and regression estimators using one auxiliary variable and attribute, ratio and regression estimators using multiple auxiliary variables and attributes and regression-cum-ratio estimators using multiple auxiliary variables and attributes in singlephase sampling for finite population.


Introduction
The work of Neyman [1] may be referred to as the initial works where auxiliary information has been used.Watson [2] used the regression estimator of leaf area on leaf weight to estimate the average area of the leaves on a plant.Cochran [3] used auxiliary information in single-phase sampling to develop the ratio estimator for estimation of population mean.In the ratio estimator, the study variable and the auxiliary variable had a high positive correlation and the regression line was passing through the origin.Hansen and Hurwitz [4] also suggested the use of auxiliary information in selecting the sample with varying probabilities.
Olkin [5] was the first person to use information on more than one supplementary character, which is positively correlated with the variable under study, using a linear combination of ratio estimator based on each auxiliary variable.Shukla [6] proposed that regression estimator using multiple auxiliary was more efficient than regression estimator using single auxiliary variable.Raj [7] suggested a method of using multi-auxiliary information in sample survey.Singh [8] proposed a ratio-cum-product estimator and its multi-variable expression which were more efficient than ratio, product and mean per unit estimators.
Jhajj, Sharma and Grover [9] proposed a family of estimators using information on auxiliary attribute.They used known information of population proportion possessing an attribute that is highly correlated with study variable Y.The attribute is normally used when the auxiliary variable is not available e.g. an amount of milk produced and a particular breed of cow or an amount of yield of wheat and a particular variety of wheat.Rajesh, Pankaj, Nirmala and Florentins [10] used the information on auxiliary attribute in ratio estimator in estimating population mean of the variable of interest using known attributes such as coefficient of variation, coefficient kurtosis and point bi-serial correlation coefficient.The estimator performed better than the usual sample mean and Naik and Gupta [11] estimator.Rajesh, Pankaj, Nirmala and Florentins [10] also used the auxiliary attribute in regression, product and ratio type exponential estimator following the work of Bahl and Tuteja [12].
Hanif, Haq and Shahbaz [13] [14] proposed a general family of estimators using multiple auxiliary attribute in single and double phase sampling.The estimator had a smaller MSE compared to that of Jhajj, Sharma and Grover [9].They also extended their work to ratio estimator which was generalization of Naik and Gupta [11] estimator in single and double phase sampling with full information, partial information and no information.
The concept of double sampling was first proposed by Neyman [1] in sampling human populations when the mean of auxiliary variable was unknown.It was later extended to multiphase by Robson [15].In most surveys the auxiliary information is always available and every form of auxiliary information should be used in developing sampling strategies.Samiuddin and Hanif [16] introduced the following approach using auxiliary variable.
1) Full information case: information for all auxiliary variables is available.
2) No information case: information for all auxiliary variables is not available.
3) Partial information case: information for some auxiliary variable is available for all population units.Ahmad [17] generalized multivariate ratio and regression estimators for multi-phase sampling.Zahoor, Muhhamad and Munir [18] suggested a generalized regression-cum-ratio estimator for two-phase sampling using multiple auxiliary variables in full, partial and no information case.Kung'u and Odongo [19] and [20] proposed ratio-cum-product estimators using multiple auxiliary attributes in single phase sampling and two-phase sampling using multiple auxiliary attributes in full, partial and no information case.Moeen, Shahbaz and HanIf [21] proposed a class of mixture ratio and regression estimators for single-phase sampling for estimating population mean by using information on auxiliary variables and attributes simultaneously.
In this paper, we will incorporate both multiple auxiliary variables and attributes in regression-cum-ratio estimator to form mixture regression-cum-ratio estimator in single-phase sampling and also incorporate Arora and Bansi [22] approach in writing down the mean squared error.

Notation and Assumption
The following notation will be used in this project.Consider a population of N units.Let Y be the study va- riable for which we want to estimate the population mean and 1 2 , , , k X X X  are k auxiliary variables and , , , t τ τ τ  are t auxiliary attributes.For single-phase sampling design let n be sample sizes for first phase while j x and j r denote the th j auxiliary variables and auxiliary attribute, and y denote the variable of interest from first phase.Let In defining the attributes we assume complete dichotomy so that; 1, if unit of population possess auxiliary attribute 0, otherwise = ∑ be the total number of units in the population and sample respectively pos- sessing attribute j τ .Let is the bi-serial correlation coefficient between study variable and auxiliary variables.Then for simple random sampling without replacement for both first and second phases we write by using phase wise operation of expectations as: Arora and Lai The following notations will be used in deriving the mean square errors of proposed estimators

Mean per Unit in Single-Phase Sampling
The sample mean y using simple random sampling without replacement is given by, While its variance is given, ( ) x n = = ∑ be the unbiased estimator of population means Y and X respectively.

Ratio and Regression Estimator Using Auxiliary Variable
Then the classical ratio estimator by Cochran [3] and regression estimator by Watson [2] are defined respectively by, ( ) where X , the population mean of the auxiliary variable X is known where

Ratio and Regression Estimator Using Multiple Auxiliary Variables
In case of multiple auxiliary variables, the ratio and regression estimators Ahmad [17] are given by,

Ratio and Regression Estimator Using Auxiliary Attribute
In order to have an estimate of the population mean Y the study variable y, assuming the knowledge of the population proportion P, Naik and Gupta [11] defined ratio and estimators of population when the prior information of population proportion of units, possessing the same attribute is variable.Using (1.8) and (1.9) Naik and Gupta [11] proposed following estimators: The minimum MSE of R t and Re t up to the first order of approximation are ( ) ( )

Ratio and Regression Estimator Using Multiple Auxiliary Attributes.
The ratio and regression estimators by Hanif, Haq and Shahbaz [14] for single-phase sampling using information on multiple auxiliary attributes are given by, The MSE of the ( ) t up to the first order of approximation are, ( )

Regression-Cum-Ratio Estimator Using Multiple Auxiliary Attributes
The regression-cum-ratio estimator using multiple auxiliary attributes is given by, ( ) (1.26)

Mixture Ratio and Regression Using Multiple Auxiliary Variables and Attributes
The mixture ratio estimator based on multiple auxiliary variables and attributes by Moeen, Shahbaz and HanIf [21] is given by: The minimum MSE of RM t and Re M t up to the first order of approximation are In general these estimators have a bias of order 1 n − .Since the standard error of the estimates is of order 1 n , the quantity bias/s.e is of order 1 n and becomes negligible as n becomes large.In practice, this quantity is usually unimportant in samples of moderate and large sizes.
In this paper, we have combined mixture ratio and mixture regression estimator to form mixture regressioncum-ratio estimator under single-phase sampling and studied the properties of the proposed estimator.

Mixture Regression-Cum-Ratio Estimator Using Multi-Auxiliary Variables and Attributes in Single-Phase Sampling
If we estimate a study variable when information on all auxiliary variables is available from population, it is utilized in the form of their means.By taking the advantage of mixture regression-cum-ratio estimator technique for single-phase sampling, a generalized estimator for estimating population mean of study variable Y with the use of multi auxiliary variables and attributes is suggested as: ( ) Ignoring the second and higher terms for each expansion of product and after simplification, we write, The mean squared error of MRR t is given by, ( ) ( ) We differentiate the Equation (2.3) partially with respect to ( ) , , j j q q r λ = + + +  and ( ) , , 2, , Taking expectation of (3.49), we get, ( ) (2.12) All the results were obtained after carrying out several random sample and taking the average.In order to evaluate the efficiency gain we could achieve by using the proposed estimators, we have calculated the variance of mean per unit and the mean squared error of all estimators we have considered.We have then calculated percent relative efficiency of each estimator in relation to variance of mean per unit.We have then compared the percent relative efficiency of each estimator, the estimator with the highest percent relative efficiency is considered to be the most efficient than the other estimator.The percent relative efficiency is calculated using the following formulae.The Table 1 shows percent relative efficiency of mean per unit, ratio and regression estimators using one auxiliary variable and attribute, ratio and regression estimators using two auxiliary variables and attributes and regression-cum-ratio estimators using four auxiliary variables and attributes and mixture regression-cum-ratio estimator using multiple auxiliary variables and attributes with respect to mean per unit estimator for singlephase sampling.It is observed that our proposed mixture regression-cum-ratio estimator using multiple auxiliary variables and attributes using multiple auxiliary variables and attributes is the most efficient of the twelve estimators since it has the highest percent relative efficiency.

Conclusion
According to Table 1, the proposed mixture regression-cum-ratio estimator using multiple auxiliary variables and attributes using multiple auxiliary variables and attributes has the highest percent relative efficiency compared to mean per unit, ratio and regression estimators using one auxiliary variable and attribute, ratio and regression estimators using two auxiliary variables and attributes and regression-cum-ratio estimators using four auxiliary variables and attributes in single-phase sampling for finite population.This means that the mixture regression-cum-ratio estimator using multiple auxiliary variables and attributes using multiple auxiliary variables and attributes is the most efficient estimator compared to the estimators that utilize auxiliary variables and attributes.The proposed mixture regression-cum-ratio estimator using multiple auxiliary variables and attributes using multiple auxiliary variables and attributes in single-phase sampling is recommended to estimate the finite population mean as it outperforms all the other namely mean per unit, ratio and regression estimators using one auxiliary variable and attribute, ratio and regression estimators using two auxiliary variables and attributes and regression-cum-ratio estimators using four auxiliary variables and attributes in single-phase sampling.
sampling error and are very small.We assume that ( ) ( ) ( ) 0 proportion of units possessing a specific attributes j τ and y is the mean of the main variable at second phase.The coefficient of variations are given by

ρ
Denotes the multiple coefficient of determination of y on 1 optimum values of ratio and regression estimator respectively.The minimum MSE of 1R t and Re1t up to the first order of approximation are, of ratio and regression estimator respectively.


are the optimum values to the first order of approximation.The minimum MSE of Re 2 R t up to the first order of approximation are,

Table 1 .
Relative efficiency of existing and proposed estimators with respect to mean per unit estimator for single-phase sampling.