Open Journal of Statistics

Volume 4, Issue 1 (February 2014)

ISSN Print: 2161-718X   ISSN Online: 2161-7198

Google-based Impact Factor: 0.53  Citations  

Automatic Variable Selection for High-Dimensional Linear Models with Longitudinal Data

HTML  Download Download as PDF (Size: 249KB)  PP. 38-48  
DOI: 10.4236/ojs.2014.41005    2,961 Downloads   5,002 Views  
Author(s)

ABSTRACT

High-dimensional longitudinal data arise frequently in biomedical and genomic research. It is important to select relevant covariates when the dimension of the parameters diverges as the sample size increases. We consider the problem of variable selection in high-dimensional linear models with longitudinal data. A new variable selection procedure is proposed using the smooth-threshold generalized estimating equation and quadratic inference functions (SGEE-QIF) to incorporate correlation information. The proposed procedure automatically eliminates inactive predictors by setting the corresponding parameters to be zero, and simultaneously estimates the nonzero regression coefficients by solving the SGEE-QIF. The proposed procedure avoids the convex optimization problem and is flexible and easy to implement. We establish the asymptotic properties in a high-dimensional framework where the number of covariates increases as the number of cluster increases. Extensive Monte Carlo simulation studies are conducted to examine the finite sample performance of the proposed variable selection procedure.

Share and Cite:

R. Tian and L. Xue, "Automatic Variable Selection for High-Dimensional Linear Models with Longitudinal Data," Open Journal of Statistics, Vol. 4 No. 1, 2014, pp. 38-48. doi: 10.4236/ojs.2014.41005.

Cited by

No relevant information.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.