Analysis of Complex Correlated Interval-Censored HIV Data from Population Based Survey

DOI: 10.4236/ojs.2015.52015   PDF   HTML   XML   2,009 Downloads   2,520 Views  


In studies of HIV, interval-censored data occur naturally. HIV infection time is not usually known exactly, only that it occurred before the survey, within some time interval or has not occurred at the time of the survey. Infections are often clustered within geographical areas such as enumerator areas (EAs) and thus inducing unobserved frailty. In this paper we consider an approach for estimating parameters when infection time is unknown and assumed correlated within an EA where dependency is modeled as frailties assuming a normal distribution for frailties and a Weibull distribution for baseline hazards. The data was from a household based population survey that used a multi-stage stratified sample design to randomly select 23,275 interviewed individuals from 10,584 households of whom 15,851 interviewed individuals were further tested for HIV (crude prevalence = 9.1%). A further test conducted among those that tested HIV positive found 181 (12.5%) recently infected. Results show high degree of heterogeneity in HIV distribution between EAs translating to a modest correlation of 0.198. Intervention strategies should target geographical areas that contribute disproportionately to the epidemic of HIV. Further research needs to identify such hot spot areas and understand what factors make these areas prone to HIV.

Share and Cite:

Zuma, K. and Mafoko, G. (2015) Analysis of Complex Correlated Interval-Censored HIV Data from Population Based Survey. Open Journal of Statistics, 5, 120-126. doi: 10.4236/ojs.2015.52015.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] McDougal, S.J., Pilcher, C.D., Parekh, B.S., Gershy-Damet, G., Branson, Bernard. M., Marsh, K., et al. (2005) Surveillance for HIV-1 Incidence Using Tests for Recent Infection in Resource-Constrained Countries. AIDS, 19, S25-S30.
[2] Kennedy, S. Dobbs, T., Parekh, B.S., Pau, C.-P., Byers, R., Green, T., et al. (2002) Quantitative Detection of Increasing HIV Type 1 Antibodies after Seroconversion: A Simple Assay for Detecting Recent HIV Infection and Estimating Incidence. AIDS Research and Human Retroviruses, 18, 295-307.
[3] Cox, D.R. (1972) Regression models and Life Tables (with Discussion). Journal of the Royal Statistical Society. Series B, 34, 187-220.
[4] Finkelstein, D.M. (1986) A Proportional Hazards Model for Interval-Censored Failure Time Data. Biometrics, 42, 845-854.
[5] Shisana, O., Rehle, T., Simbayi, L., Parker, W., Zuma, K., Bhana, A., et al. (2005) South African National HIV Prevalence, HIV Incidence, Behaviour and Communication Survey. Human Sciences Research Council Publishers, Cape Town.
[6] Vaupel, J.W., Manton, K.G. and Stallard, E. (1987) The Impact of Heterogeneity in Individual Frailty on the Dynamics of Mortality. Demography, 16, 439-454.
[7] Clayton, D.G. and Cuzick, J. (1985) Multivariate Generalisations of the Proportional Hazards Model (with Discussion). Journal of the Royal Statistical Society. Series A, 148, 82-117.
[8] Klein, J.P. (1992) Semiparametric Estimation of Random Effects Using the Cox Model Based on the EM Algorithm. Biometrics, 48, 795-806.
[9] Sastry, N. (1997) A Nested Frailty Model for Survival Data, with an Application to the Study of Child Survival in Northeast Brazil. Journal of the American Statistical Association, 92, 426-435.
[10] Clayton, D.G. (1991) A Monte Carlo Method for Bayesian Inference in Frailty Models. Biometrics, 47, 467-485.
[11] Bolstad, W.M. and Manda, S.O. (2001) Investigating Child Mortality in Malawi Using Family and Community Random Effects: A Bayesian Analysis. Journal of the American Statistical Association, 96, 12-19.
[12] Zuma, K. (2007) A Bayesian Analysis of Correlated Interval-Censored Data. Communications in Statistics-Theory and Methods, 36, 725-730.
[13] Huang, J. and Wellner, J.A. (1997) Interval Censored Survival Data: A Review of Recent Progress. In: Lin, D.Y. and Fleming, T.R., Eds., Proceedings of the 1st Seattle Symposium in Biostatistics: Survival Analysis, Springer-Verlag, New York, 123-169.
[14] Rucker, G. and Messerer, D. (1988) Remission Duration: An Example of Interval-Censored Observations. Statistics in Medicine, 7, 1139-1145.
[15] Odell, P.M., Anderson, K.M. and D’Agostino, R.B. (1992) Maximum Likelihood Estimation for Interval-Censored Data Using a Weibull-Based Accelerated Failure Time Model. Biometrics, 48, 951-959.
[16] Guo, S.W. and Lin, D.Y. (1994) Regression Analysis of Multivariate Grouped Survival Data. Biometrics, 50, 632-639.
[17] Wei, L.J., Lin, D.Y. and Weissfeld, L. (1989) Regression Analysis of Multivariate Incomplete Failure Time Data by Modeling Marginal Distributions. Journal of the American Statistical Association, 84, 1065-1073.
[18] Dempster, A.P., Laird, N.M. and Rubin, D.B. (1977) Maximum Likelihood from Incomplete Data via the EM Algorithm (with Discussion). Journal of the Royal Statistical Society. Series B, 39, 1-38.
[19] Rehle, T., Shisana, O., Pillay, V., Zuma, K., Puren, A. and Parker, W. (2007) National HIV Incidence Measures? New Insights into the South African Epidemic. South African Medical Journal, 97, 194-199.
[20] Bellamy, S.L., Li, Y., Ryan, L.M., Lipsitz, S., Canner, M.J. and Wright, R. (2004) Analysis of Clustered and Interval Censored Data from a Community-Based Study in Asthma. Statistics in Medicine, 23, 3607-3621.
[21] Fraser-Hurt, N., Zuma, K., Njuho, P., Hosegood, V., Gorgens, M., Chikwava, F., et al. (2011) The HIV Epidemic in South Africa: What Do We Know and How Has It Changed? SANAC, Pretoria.
[22] Tanser, F., Barnighausen, T., Cooke, G.S. and Newell, M. (2009) Localized Spatial Clustering of HIV Infections in a Widely Disseminated Rural South African Epidemic. International Journal of Epidemiology, 38, 1008-1016.
[23] Barnighausen, T., Tanser, T., Gqwede, Z., Mbizana, C., Herbst, K. and Newell, M. (2008) High HIV Incidence in a Community with High HIV Prevalence in Rural South Africa: Findings from a Prospective Population-Based Study. AIDS, 22, 139-144.
[24] Abramowitz, M. and Stegun, I. (1972) Handbook of Mathematical Functions. Dover Publications, New York.
[25] Brillinger, D.R. and Preisler, M.K. (1983) Maximum Likelihood Estimation in a Latent Variable Problem. In: Karlin, S., Ameya, T. and Goodman, L.A., Eds., Studies in Econometrics, Time Series and Multivariate Statistics, Academic Press, New York, 31-65.
[26] Bock, R.D. and Aitkin, M. (1981) Marginal Maximum Likelihood Estimation of Item Parameters: Application of an EM Algorithm. Psychometrika, 46, 443-459.
[27] Anderson, D.A. and Aitkin, M. (1985) Variance Component Models with Binary Response: Interviewer Variability. Journal of the Royal Statistical Society. Series B, 47, 203-210.

comments powered by Disqus

Copyright © 2020 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.