Bootstrap Confidence Intervals for Proportions of Unequal Sized Groups Adjusted for Overdispersion

Group testing is a method of pooling a number of units together and performing a single test on the resulting group. It is an appealing option when few individual units are thought to be infected leading to reduced costs of testing as compared to individually testing the units. Group testing aims to identify the positive groups in all the groups tested or to estimate the proportion of positives (p) in a population. Interval estimation methods of the proportions in group testing for unequal group sizes adjusted for overdispersion have been examined. Lately improvement in statistical methods allows the construction of highly accurate confidence intervals (CIs). The aim here is to apply group testing for estimation and generate highly accurate Bootstrap confidence intervals (CIs) for the proportion of defective or positive units in particular. This study provided a comparison of several proven methods of constructing CIs for a binomial proportion after adjusting for overdispersion in group testing with groups of unequal sizes. Bootstrap resampling was applied on data simulated from binomial distribution, and confidence intervals with high coverage probabilities were produced. This data was assumed to be overdispersed and independent between groups but correlated within these groups. Interval estimation methods based on the Wald, the Logit and Complementary log-log (CLL) functions were considered. The criterion used in the comparisons is mainly the coverage probabilities attained by nominal 95% CIs, though interval width is also regarded. Bootstrapping produced CIs with high coverage probabilities for each of the three interval methods.

Cite this paper

Mwangi, O. , Islam, A. and Luke, O. (2015) Bootstrap Confidence Intervals for Proportions of Unequal Sized Groups Adjusted for Overdispersion. Open Journal of Statistics, 5, 502-510. doi: 10.4236/ojs.2015.56052.

 [1] Dorfman, R. (1943) The Detection of Defective Members of Large Populations. The Annals of Mathematical Statistics, 14, 436-440. http://dx.doi.org/10.1214/aoms/1177731363 [2] Fletcher, J., Russell, A. and Butler, R. (1999) Seed-Borne Cucumber Mosaic Virus in New Zealand Lentil Crops: Yield Effects and Disease Incidence. New Zealand Journal of Crop and Horticulture Science, 27, 197-204.http://dx.doi.org/10.1080/01140671.1999.9514097 [3] Worlund, D. and Taylor, G. (1983) Estimation of Disease Incidence in Fish Populations. Canadian Journal of Fisheries and Aquatic Science, 40, 2194-2197. http://dx.doi.org/10.1139/f83-254 [4] Swallow, W.H. (1985) Group Testing for Estimating Infection Rates and Probabilities of Disease Transmission. Phytopathology, 75, 882-889. http://dx.doi.org/10.1094/phyto-75-882 [5] Kline, R.L., Bothus, T., Brookmeyer, R., Zeyer, S. and Quinn, T. (1989) Evaluation of Human Immunodeficiency Virus Seroprevalence in Population Surveys Using Pooled Sera. Journal of Clinical Microbiology, 27, 1449-1452. [6] Liu, S.-C., Chiang, K.-S., Lin, C.-H. and Deng, T.-C. (2010) Confidence Interval Procedures for Proportions Estimated by Group Testing with Groups of Unequal Size Adjusted for Overdispersion. Journal of Applied Statistics, 38, 1467-1482. http://dx.doi.org/10.1080/02664763.2010.505953 [7] Paul, S. and Islam, A.S. (1998) Joint Estimation of the Mean and Dispersion Parameters in the Analysis of Proportions: A Comparison of Efficiency and Bias. Canadian Journal of Statistics, 26, 83-94. http://dx.doi.org/10.2307/3315675 [8] Saha, K.K. and Paul, S.R. (2005) Bias-Corrected Maximum Likelihood Estimator of the Negative Binomial Dispersion Parameter. Biometrics, 61, 179-185. http://dx.doi.org/10.1111/j.0006-341X.2005.030833.x [9] Williams, D.A. (1975) The Analysis of Binary Responses for Toxicological Experiments Involving Reproduction and Teratogenicity. Biometrics, 31, 946-952. http://dx.doi.org/10.2307/2529820 [10] Saha, K.K. (2008) Semi-Parametric Estimation for the Dispersion Parameter in the Analysis of over- or Underdispersed Count Data. Journal of Applied Statistics, 35, 1383-1397. http://dx.doi.org/10.1080/02664760802382459 [11] Hepworth, G. (2005) Confidence Intervals for Proportions Estimated by Group Testing with Groups of Unequal Size. Journal of Agricultural, Biological, and Environmental Statistics, 10, 478-497. http://dx.doi.org/10.1198/108571105X81698 [12] Efron, B. (1979) Bootstrap Methods: Another Look at the Jackknife. The Annals of Statistics, 7, 1-26.http://dx.doi.org/10.1214/aos/1176344552 [13] Siele, D. (2012) Interval Estimation for the Beta Binomial Dispersion Parameter. MSc Thesis, Egerton University, Njoro. [14] Saha, K. and Sen, D. (2009) Improved Confidence Interval for the Dispersion Parameter in Count Data Using Profile Likelihood. Technical Report No. 4/09. [15] Williams, D.A. (1982) Extra-Binomial Variation in Logistic Linear Models. Journal of the Royal Statistical Society. Series C (Applied Statistics), 31, 144-148. http://dx.doi.org/10.2307/2347977 [16] Crowder, M.J. (1978) Beta-Binomial Anova for Proportions. Applied Statistics, 27, 34-37. Mathematical Statistics, 14, 436-440. [17] Hepworth, G. (1996) Exact Confidence Intervals for Proportions Estimated by Group Testing. Biometrics, 52, 1134-1146. http://dx.doi.org/10.2307/2533075 [18] Loyer, M.W. and Hamilton, M.A. (1984) Interval Estimation of the Density of Organisms Using a Serial-Dilution Experiment. Biometrics, 40, 907-916. http://dx.doi.org/10.2307/2531142 [19] Efron, B. and DiCiccio, T. (1996) Bootstrap Confidence Intervals. Statistical Science, 11, 189-228.http://dx.doi.org/10.1214/ss/1032280214