L0 Regularization for the Estimation of Piecewise Constant Hazard Rates in Survival Analysis

Full-Text HTML XML Download Download as PDF (Size:2764KB) PP. 377-394
DOI: 10.4236/am.2017.83031    545 Downloads   655 Views  

ABSTRACT

In a survival analysis context, we suggest a new method to estimate the piecewise constant hazard rate model. The method provides an automatic procedure to find the number and location of cut points and to estimate the hazard on each cut interval. Estimation is performed through a penalized likelihood using an adaptive ridge procedure. A bootstrap procedure is proposed in order to derive valid statistical inference taking both into account the variability of the estimate and the variability in the choice of the cut points. The new method is applied both to simulated data and to the Mayo Clinic trial on primary biliary cirrhosis. The algorithm implementation is seen to work well and to be of practical relevance.

Cite this paper

Bouaziz, O. and Nuel, G. (2017) L0 Regularization for the Estimation of Piecewise Constant Hazard Rates in Survival Analysis. Applied Mathematics, 8, 377-394. doi: 10.4236/am.2017.83031.

References

[1] Antoniou, A., Pharoa, P., Smith, P. and Easton, D. (2004) The Boadicea Model of Genetic Susceptibility to Breast and Ovarian Cancer. British Journal of Cancer, 91, 1580-1590.
https://doi.org/10.1038/sj.bjc.6602175
[2] Clayton, D. and Hills, M. (1993) Statistical Models in Epidemiology. OUP, Oxford.
[3] Odd, A., Ornulf, B. and Hakon, G. (2008) Survival and Event History Analysis. Springer, New York.
[4] Kessing, L.V., Thomsen, A.F., Mogensen, U.B. and Andersen, P.K. (2010) Treatment with Antipsychotics and the Risk of Diabetes in Clinical Practice. The British Journal of Psychiatry, 197, 266-271.
https://doi.org/10.1192/bjp.bp.109.076935
[5] Jensen, H.M., Gron, R., Lidegaard, O., Pedersen, L.H., Andersen, P.K. and Kessing, L.V. (2013) The Effects of Maternal Depression and Use of Antidepressants during Pregnancy on Risk of a Child Small for Gestational Age. Psychopharmacology, 228, 199-205.
https://doi.org/10.1007/s00213-013-3029-5
[6] Hviid, A. and Svanstrom, H. (2009) Antibiotic Use and Type 1 Diabetes in Childhood. American Journal of Epidemiology, 169, 1079-1084.
https://doi.org/10.1093/aje/kwp038
[7] Gron, R., Gerds, T.A. and Andersen, P.K. (2015) Misspecified Poisson Regression Models for Large-Scale Registry Data: Inference for “Large n and Small p”. Statistics in Medicine, 35, 1117-1129.
[8] Cox, D.R. (1972) Regression Models and Life Tables (with Discussion). Journal of the Royal Statistical Society, 34, 187-220.
[9] Clayton, D.G. (1978) A Model for Association in Bivariate Life Tables and Its Application in Epidemiological Studies of Familial Tendency in Chronic Disease Incidence. Biometrika, 65, 141-151.
https://doi.org/10.1093/biomet/65.1.141
[10] Hougaard, P. (1995) Frailty Models for Survival Data. Lifetime Data Analysis, 1, 255-273.
https://doi.org/10.1007/BF00985760
[11] Therneau, T.M. and Grambsch, P.M. (2000) Modeling Survival Data: Extending the Cox Model. Springer Science and Business Media, Berlin/Heidelberg.
https://doi.org/10.1007/978-1-4757-3294-8
[12] Ripatti, S. and Palmgren, J. (2002) Estimation of Multivariate Frailty Models Using Penalized Partial Likelihood. Biometrics, 56, 1016-1022.
https://doi.org/10.1111/j.0006-341X.2000.01016.x
[13] Klein, J.P., Moeschberger, M., Li, Y., Wang, S. and Flournoy, N. (1992) Estimating Random Effects in the Framingham Heart Study. In: Klein, J.P. and Goel, P.K., Eds., Survival Analysis: State of the Art, Springer, Berlin/Heidelberg, 99-120.
https://doi.org/10.1007/978-94-015-7983-4_7
[14] Andersen, P.K., Klein, J.P., Knudsen, K.M. and Tabanera y Palacios, R. (1997) Estimation of Variance in Cox’s Regression Model with Shared Gamma Frailties. Biometrics, 53, 1475-1484.
https://doi.org/10.2307/2533513
[15] Tsiatis, A.A. and Davidian, M. (2004) Joint Modeling of Longitudinal and Time-to-Event Data: An Overview. Statistica Sinica, 14, 809-834.
[16] Rizopoulos, D. (2012) Joint Models for Longitudinal and Time-to-Event Data: With Applications in R. CRC Press, Boca Raton.
https://doi.org/10.1201/b12208
[17] Rizopoulos, D., et al. (2010) JM: An R Package for the Joint Modelling of Longitudinal and Time-to-Event Data. Journal of Statistical Software, 35, 1-33.
https://doi.org/10.18637/jss.v035.i09
[18] Rondeau, V., Mazroui, Y. and Gonzalez, J.R. (2012) Frailty Pack: An R Package for the Analysis of Correlated Survival Data with Frailty Models Using Penalized Likelihood Estimation or Parametrical Estimation. Journal of Statistical Software, 47, 1-28.
https://doi.org/10.18637/jss.v047.i04
[19] Rondeau, V., Filleul, L. and Joly, P. (2006) Nested Frailty Models Using Maximum Penalized Likelihood Estimation. Statistics in Medicine, 25, 4036-4052.
https://doi.org/10.1002/sim.2510
[20] Rondeau, V., Mathoulin-Pelissier, S., Jacqmin-Gadda, H., Brouste, V. and Soubeyran, P. (2007) Joint Frailty Models for Recurring Events and Death Using Maximum Penalized Likelihood Estimation: Application on Cancer Events. Biostatistics, 8, 708-721.
https://doi.org/10.1093/biostatistics/kxl043
[21] Farewell, V.T. (1982) The Use of Mixture Models for the Analysis of Survival Data with Long-Term Survivors. Biometrics, 38, 1041-1046.
https://doi.org/10.2307/2529885
[22] Sy, J.P. and Taylor, J.M. (2000) Estimation in a Cox Proportional Hazards Cure Model. Biometrics, 56, 227-236.
https://doi.org/10.1111/j.0006-341X.2000.00227.x
[23] Sun, J. (2007) The Statistical Analysis of Interval-Censored Failure Time Data. Springer Science and Business Media, Berlin/Heidelberg.
[24] Groeneboom, P. and Wellner, J.A. (1992) Information Bounds and Nonparametric Maximum Likelihood Estimation. Vol. 19, Springer Science and Business Media, Berlin/Heidelberg.
[25] Groeneboom, P. (1996) Lectures on Inverse Problems. In: Dobrushin, R., Groeneboom, P. and Ledoux, M., Eds., Lectures on Probability Theory and Statistics, Springer, Berlin, 67-164.
https://doi.org/10.1007/BFb0095675
[26] Frommlet, F. and Nuel, G. (2016) An Adaptive Ridge Procedure for 10 Regularization. PLoS ONE, 11, e0148620.
https://doi.org/10.1371/journal.pone.0148620
[27] Andersen, P.K., Borgan, O., Gill, R.D. and Keiding, N. (1993) Statistical Models Based on Counting Processes. Springer-Verlag, New York.
[28] Van Houwelingen, H.C., Bruinsma, T., Hart, A.A.M., Van’t Veer, L.J. and Wessels, L.F.A. (2006) Cross-Validated Cox Regression on Microarray Gene Expression Data. Statistics in Medicine, 25, 3201-3216.
https://doi.org/10.1002/sim.2353
[29] Simon, N., Friedman, J., Hastie, T. and Tibshirani, R. (2011) Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. Journal of Statistical Software, 39, 1-13.
https://doi.org/10.18637/jss.v039.i05
[30] Fleming, T.R. and Harrington, D.P. (1991) Counting Processes and Survival Analysis. Wiley Series in Probability and Mathematical Statistics: Applied Probability and Statistics, John Wiley and Sons Inc., New York.

  
comments powered by Disqus

Copyright © 2017 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.