Effect of Sample Size on the Control Limits of Exponentially Weighted Moving Average Distance Square Scheme ()

Subject Areas: Mathematical Statistics

1. Introduction
The exponentially weighted moving average (EWMA) chart was introduced for monitoring the sample mean of a quality parameter by Robrts in 1959 [1] . This chart is more sensitive in detecting any small shift in mean compared to the conventional Shewhart control chart which was introduced by Shewhart in 1939 [2] . In EWMA chart, EWMA Variable At is plotted against the sample number t(
), where
(1)
Normally A0, is considered as the target mean µ0, lm is a constant such that 0 < lm < 1 and it is selected based on the shift in the mean to be detected quickly for any process for a given in-control average run length (ARL).
is the tth sample mean of the quality parameter, to be monitored. This chart issues an out-of-control signal if At is greater than the upper control limit (UCL) or lower than the lower control limit (LCL). For designing this chart, lm values and control limits for detecting different shifts in mean under different sample sizes and in- control ARLs, can be found in Crowder (1989) [3] .
Chang and Gan introduced an EWMA chart for monitoring sample variance of a quality parameter in 1993 [4] . In their chart, the EWMA variable Bt is plotted against the sample number t (
), where
(2)
Normally, the value of Bt is taken as
and
is the sample variance of the quality parameter
interested. lv is a positive constant which has the possible values of 0 < lv < 1 and it is based on the shift in the variance to be detected quickly for a given in-control ARL. Like in the EWMA chart for monitoring sample mean, this chart also issues an out-of-control signal if Bt is greater than UCL or the lower than the LCL.
The above discussed two EWMA charts are used for monitoring the process mean and variance independently. In 1997, it was understood that monitoring the sample mean and variance was a bivariate problem and these two had to be monitored jointly [5] . A joint monitoring scheme for monitoring mean and variance simultaneously using EWMA technique was introduced by Gan [6] . Another two joint monitoring schemes, called max EWMA scheme and EWMA semicircle scheme for joint monitoring of process mean and variance were proposed by Chen et al. [7] [8] . The scheme parameters to design these joint monitoring schemes differ based on the sample sizes. This issue restricts the users in selecting a convenient sample size.
Subsequently, a new joint monitoring scheme named EWMA distance square scheme (EWMAD2) was introduced with a claim that its control limits were independent of sample size [9] . This scheme uses the standardized sample mean Ut and variance Vt such that
(3)
and
(4)
(5)
the chi-square distribution with v degrees of freedom and ɸ(.) is the cumulative distribution function of a standard normal random variable [10] . St is the sample standard deviation, σ0 is the population standard deviation and n is the sample size.
A statistics
is defined by
(6)
The EWMAD2 scheme is obtained by plotting the EWMAD2 statistic Ct against the sample number t where
(7)
and
is a constant selected based on the shift in
, to be detected quickly. The opti-
mum
values for detecting various shifts in
, for the selected in-control ARLs can be read in Razmy (2005) [9] . This research paper examines the effect of sample size on the control limits of the EWMAD2 scheme for different ARLs.
2. Methodology
For this study, commonly used in-control ARLS100, 250, 300, 370, 500 and 1000 were selected. The sample sizes (n) studied were 5, 10, 50, 100 and 150. Samples were simulated in SAS using proc RANNO with sample sizes n. For each sample, the statistics
was calculated from the variables ut and vt. The statics Ct was obtained for different values of
(
).
Initially arbitrary control limits (CL) were assumed for each combination of
and in-control ARLs. Then, if C1 < CL, then the second sample was simulated. This procedure was continued till Ct > CL where Ct is an out-of-control point. Each time, the number of samples generated till to find an out-of-control point was recorded and it is the run length. In the same way 100,000 runs were performed and the ARLs were found. Subsequently, the assumed CL values were adjusted till to obtain the required in-control ARLS of 100, 250, 300, 370, 500 and 1000 for different
values. By this procedure, the CLs for different combinations of in-control ARLs, sample sizes and
values were found. In all cases, simulations were run until the standard error of the ARL was less than 1%of the pre-specified ARL.A sample program with sample size 5, is given in the Appendix.
3. Results and Discussion
The obtained CLs were plotted against the lambda value for the selected in-control ARLS and sample sizes. From the Figures 1-5, it could be observed that the control limits was increased with
. It is obvious that the CL increases with in-control ARLs for a given
value. The main observation is, for given
and in-con- trol ARL, the CL limit is constant with any sample size. This proves the claim that in the EWMAD2 scheme, CLs are independent of sample size and this property eases the procedure of designing this scheme with unrestricted sample sizes.
![]()
Figure 1. Control limits for selected in-control ARLS, n = 5.
![]()
Figure 2. Control limits for selected in-control ARLS, n = 10.
![]()
Figure 3. Control limits for selected in-control ARLS, n = 50.
![]()
Figure 4. Control limits for selected in-control ARLS, n = 100.
![]()
Figure 5. Control limits for selected in-control ARLS, n = 150.
Acknowledgements
The outputs for numerous programs were recorded and arranged by Ms. Iresha Dilhani.
Appendix
data raw;
mu0 = 0;
sigma = 1.0;
sigmasq = sigma*sigma;
nnn = 5;
nnnsqrt = sqrt(nnn);
nnn1 = nnn -1;
totalrun = 100,000;
maxsamp = 1,000,000;
seed = 86,924,569;
lam = 0.7;
onelm= 1.0 − lam;
bigh = 7.0856;
dobigd = 0;
dosmalld = 1;
actuals = smalld * sigma;
actualm = mu0 + bigd * sigma/nnnsqrt;
message = 'OK';
dorunnum = 1tototalrun;
qqq = 2;
runlen = 0;
do sample = 1tomaxsamp;
x1 = actualm + actuals * rannor (seed);
x2 = actualm + actuals * rannor (seed);
x3 = actualm + actuals * rannor (seed);
x4 = actualm + actuals * rannor (seed);
x5 = actualm + actuals * rannor (seed);
xsum = x1 + x2 + x3 + x4 + x5;
x2sum = x1*x1 + x2*x2 + x3*x3 + x4*x4 + x5*x5;
xbar = xsum/nnn;
s = x2sum − xsum*xsum/nnn;
s = s/nnn1;
uu = (xbar − mu0)/sigma*nnnsqrt;
uuu = uu*uu;
h = nnn1*s/sigmasq;
hhh= CDF('CHISQUARE', h, nnn1);
ifhhh > 0.0000000000001 and hhh < 0.999999999999999then
vv = probit(hhh); else
vv = 8.21;
vvv = vv*vv;
rrr = uuu + vvv;
runlen = runlen + 1;
qqq = onelm * qqq + lam * rrr;
ifqqq > bighthendo;
output;
goto DONE1;
end;
keep lam runlenbigdsmalldbighxbarqqqrrrvvvuuu message ;
end;
Message = 'Maximum # of sample Exceeded';
output;
DONE1:
end;
end;
end;
procmeans data = raw;
by bigdsmalld;
run;