Improved Empirical Likelihood Inference for Multiplicative Regression with Independent and Longitudinal Data ()
1. Introduction
In this paper, we consider the multiplicative regression models:
(1)
where Y is the response variable taking positive values,
is a p-dimensional vector of covariates including the intercept as its first component,
is the corresponding p-dimensional vector of unknown regression coefficients, and
is the positive error. When taking logarithm transformation on both sides of (1), the model becomes the known linear regression model relating
to
, and the least square method or the least absolute deviation method may be applied. A common feature of both methods mentioned above is that the absolute error criterion is employed. However, as argued by [1], in many practical applications, the relative error is more of interest, and some earlier discussions and applications based on the relative error criterion can be found in the references of [1]. Based on a random sample
from the model (1), [1] proposed the novel least absolute relative error criterion to estimate the regression parameters in model (1) by minimizing the following objective function
(2)
which is conceptually meaningful and has some appealing merits such as the scale-free and unit-free characters. Under some regularity conditions, they also established the resulting estimators are consistent and asymptotical normality. However, due to the computational difficulty mainly caused by the non-smoothness of the loss function
, [2] further investigated the least product relative error criterion, by which the loss function is defined as
(3)
They also showed the estimators enjoy some asymptotical properties and that the simple plug-in variance estimation is reasonable. Following their works, some authors continued to extend the model (1) to some more flexible semiparametric models, such as the partially linear MM ([3]-[6]), single-index MM ([7]-[9]), varying coefficient MM ([10]) and partially linear varying index MM ([11]).
However, it should be noted that LARE estimators’ asymptotic covariance matrixes involve the unknown density function of the error term, which makes the construction of the confidence region intervals of the regression parameters not straightforward. One way is to directly estimate the covariance via some nonparametric methods such as the kernel smooth method, but the efficiency may be unsatisfactory when the sample size is not large enough. As done in the papers mentioned above, one can resort to a resampling approach, also named the random weighting method, which is computationally intensive and time-consuming, especially when the dimensionality of the covariates is more than one. What’s worse, both alternatives perform poorly when the sample size is small, as presented in the results of [12]-[14].
To overcome those drawbacks mentioned above, in this paper, we follow the line of [15] to extend the AEL approach to the multiplicative model with independent data and longitudinal data, combining with the LARE and LPRE criteria, respectively. It is shown that the adjusted empirical log-likelihood ratio function at the true value of regression parameters converges weakly to the standard Chi-squared distribution. As will be seen in the simulation studies later, the improved coverage probability is achieved when the sample size is small, and becomes comparable with classical EL method when the sample size is moderate or large. This phenomenal turn-up for the analysis of both independent data and longitudinal data.
The paper is organized as follows. In Section 2, we introduce the AEL method for the multiplicative model with independent data based on the LARE criterion, and with longitudinal data based on the LPRE criterion, respectively. In addition, some asymptotic properties are provided, which are used to construct the confidence regions of regression parameters and make hypothesis tests. Extensive simulation studies are conducted to demonstrate the usefulness of the proposal and results are presented in Section 3. Finally, several conclusions and remarks are given in Section 4, and proofs are deferred to the Appendix.
2. Main Results
2.1. AEL for LARE Estimation with Independent Data
Suppose that
is a random sample from the model (1). Then the LARE estimator can be rewritten as solution to the following estimating equation
(4)
where
and
denotes the sign function. Write
,
. Under the later condition (A4), it holds that
. Inspired by the EL approach, for a given
, the EL ratio of
is defined as
According to the findings in [15],
is well defined only if 0 is inside of the convex hull of
. To avoid the computational constraints, Chen (2006) proposed the adjusted EL approach. Explicitly, define
and
, where
. Then the AEL ratio function is
(5)
For each given
, by the Lagrange multiplier method, (5) achieves its maximum at
where
is the Lagrange multipliers and satisfies
Furthermore, the proposed adjusted empirical log-likelihood ratio is given by
To establish the asymptotic distribution of the proposed statistic, some regular conditions are required as follows.
(A1) The random error
has a density function f, which is continuous in a neighborhood of 1.
(A2)
.
(A3) X is bounded and does not concentrate on any hyperplane of
dimension.
(A4)
.
(A5)
.
Conditions (A1)-(A3) are common requirements in the study of multiplicative regression models. (A4) is an identification condition for the LARE estimation, which is similar to the zero mean condition in the classical linear mean regression. (A5) is required for proof, which is also used in [12].
Theorem 1 Suppose conditions (A1)-(A5) hold and
. As
,
converges in distribution to a standard chi-squared random variables with p degrees of freedom.
For fixed
, a natural asymptotic
confidence region for
based on the empirical likelihood ratio is given by
where
is the upper c-quantile of the chi-squared distribution in Theorem 1.
2.2. AEL for LPRE Estimation with Longitudinal Data
Consider a longitudinal study consisting of n subjects with the j-th subject having
observations. For each
and
, as notations presented in model (1),
is a random sample from the model
(6)
where
are known bounded design vectors, errors
across subjects are independent but may be dependent within the same subject. As proposed in [13], the LPRE estimator can be defined as solution to the following estimating equation
(7)
where
. For notation simplicity, we write
,
.
and
, where
. Under the later condition (B4), it holds that
. Following the idea of the block EL approach in [16], for a given
, the AEL ratio function of
is defined as
(8)
For each given
, by the Lagrange multiplier method, (8) achieves its maximum at
where
is the Lagrange multipliers and satisfies
Furthermore, the proposed adjusted empirical log-likelihood ratio is given by
To establish the asymptotic distribution of the proposed statistic, some regular conditions are required as follows.
(B1) The random errors
have a common density function f, and
.
(B2)
.
(B3)There exists a constant
such that
.
(B4)
, and
converges to a positive definite matrix D.
Conditions (B1)-(B4) are also used in [13].
Theorem 2 Suppose conditions (B1)-(B4) hold and
. As
,
converges in distribution to a standard chi-squared random variables with p degrees of freedom.
For fixed
, a natural asymptotic
confidence region for
based on the empirical likelihood ratio is given by
where
is the upper c-quantile of the chi-squared distribution in Theorem 1.
3. Simulation Studies
In this section, we investigate the finite sample performance of the proposed adjusted empirical likelihood method (AEL) with the ordinary empirical likelihood method (EL) for analyzing independent and longitudinal data, respectively. For comparison, we compute the coverage probabilities of the simultaneous confidence regions for all regression coefficients under various settings. Besides, we also consider the power behavior of the methods mentioned above.
3.1. Comparisons for Independent Data
Similar to the model used in Section 3 of [12], the studies in this part are based on the following model,
(9)
where
and
are independent standard normal random variables and
take a value (1, 1, 1). The random error
is assumed to be independent of
, and its logarithm
comes from two different distributions: one is the standard normal distribution
, the other is the uniform distribution
, which are denoted by
and
, respectively. As is proposed in Section 3, the confidence region at nominal level
for AEL and EL are constructed by
and
, which are defined as
respectively. Here and after,
is chosen to 0.05 or 0.1, and
is the
-level quantile of a Chi-squared distribution
.
Furthermore, based on the results in Theorem 1, we evaluate the power of the proposed methods by testing the hypothesis
against
, where we consider four combinations of
, namely, (0.5, 0.5, 0.5), (0.8, 0.8, 0.8), (0.9, 0.9,0.9) and (0.8, 0.5, 0.8). For each specific setting, 1000 independent replications are simulated based on the sample sizes 25, 50, 100, 200, 300, 500, 1000, respectively. In contrast to the simulation design in [12] where only the moderate sample sizes are studied, we also consider the small sample size cases.
Simulation results are summarized in Table 1, Table 2. It can be seen from Table 1 that both AEL and EL methods provide accurate coverage probabilities of confidence regions when the sample size is moderate (≥200). Especially, the coverage probabilities for the two methods are close to the nominal level and usually comparable as the sample size increases. But when the sample size is small (≤100), the coverage probabilities via AEL method are much better than that of the EL method under all settings, although there still exists a certain gap with the nominal levels. As presented in Table 2, when the null hypothesis H0 is far away from the true value of the regression coefficients, such as the cases
or (0.8, 0.8,0.8), both tests are powerful and comparable, even when the sample size is small, or there exists only one component of
is far away from the true value of the regression parameters, such as the case
. Meanwhile, when H0 lies in the nearby of the true value of the regression coefficients, the AEL test is slightly less powerful than the EL test for small sample sizes and the difference between them becomes less and negligible with the increase in the sample size. This result is not strange and similar performance still occurs in Table 3 of [12], which shows that the confidence region based on the AEL method contains the confidence region based on the EL method and the cost is not large and deserved compared to the improvement of the coverage probability for samples with small sample sizes.
Table 1. Coverage probabilities of the confidence region for
with independent data.
|
|
|
0.9 |
|
0.95 |
|
0.9 |
|
0.95 |
|
n |
AEL |
EL |
AEL |
EL |
AEL |
EL |
AEL |
EL |
25 |
0.850 |
0.773 |
0.908 |
0.849 |
0.853 |
0.798 |
0.915 |
0.856 |
50 |
0.860 |
0.826 |
0.904 |
0.881 |
0.895 |
0.864 |
0.924 |
0.894 |
100 |
0.870 |
0.855 |
0.931 |
0.922 |
0.871 |
0.852 |
0.951 |
0.944 |
200 |
0.874 |
0.868 |
0.942 |
0.933 |
0.908 |
0.898 |
0.945 |
0.939 |
300 |
0.883 |
0.879 |
0.940 |
0.933 |
0.897 |
0.893 |
0.961 |
0.959 |
500 |
0.885 |
0.879 |
0.948 |
0.947 |
0.918 |
0.913 |
0.953 |
0.951 |
1000 |
0.901 |
0.899 |
0.944 |
0.943 |
0.912 |
0.911 |
0.941 |
0.939 |
Table 2. Powers of tests at nominal levels 0.1 and 0.05 for
with independent data.
|
|
|
0.1 |
|
0.05 |
|
0.1 |
|
0.05 |
|
|
AEL |
EL |
AEL |
EL |
AEL |
EL |
AEL |
EL |
|
|
|
|
|
|
|
|
|
n = 50 |
0.998 |
0.999 |
0.999 |
0.999 |
1 |
1 |
0.998 |
0.998 |
n = 100 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
n = 200 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
|
|
|
|
|
|
|
|
|
n = 50 |
0.638 |
0.682 |
0.493 |
0.544 |
0.579 |
0.623 |
0.425 |
0.485 |
n = 100 |
0.885 |
0.895 |
0.823 |
0.841 |
0.849 |
0.865 |
0.773 |
0.799 |
n = 200 |
0.986 |
0.987 |
0.988 |
0.988 |
0.988 |
0.990 |
0.983 |
0.986 |
|
|
|
|
|
|
|
|
|
n = 50 |
0.285 |
0.326 |
0.201 |
0.240 |
0.240 |
0.265 |
0.142 |
0.173 |
n = 100 |
0.396 |
0.426 |
0.302 |
0.322 |
0.349 |
0.374 |
0.250 |
0.275 |
n = 200 |
0.637 |
0.651 |
0.493 |
0.508 |
0.551 |
0.567 |
0.459 |
0.480 |
|
|
|
|
|
|
|
|
|
n = 50 |
0.994 |
0.995 |
0.987 |
0.990 |
0.986 |
0.993 |
0.969 |
0.977 |
n = 100 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
n = 200 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
1 |
Table 3. Coverage probabilities of the confidence region for
with longitudinal data.
|
S1 |
S2 |
0.9 |
|
0.95 |
|
0.9 |
|
0.95 |
|
n |
AEL |
EL |
AEL |
EL |
AEL |
EL |
AEL |
EL |
|
|
|
|
|
|
|
|
|
25 |
0.850 |
0.811 |
0.931 |
0.886 |
0.842 |
0.797 |
0.920 |
0.872 |
50 |
0.882 |
0.850 |
0.920 |
0.898 |
0.856 |
0.823 |
0.916 |
0.898 |
100 |
0.877 |
0.867 |
0.934 |
0.925 |
0.888 |
0.875 |
0.925 |
0.917 |
200 |
0.902 |
0.893 |
0.945 |
0.938 |
0.901 |
0.897 |
0.938 |
0.938 |
300 |
0.889 |
0.884 |
0.946 |
0.944 |
0.896 |
0.890 |
0.948 |
0.944 |
500 |
0.887 |
0.885 |
0.950 |
0.948 |
0.896 |
0.892 |
0.950 |
0.946 |
|
|
|
|
|
|
|
|
|
25 |
0.895 |
0.848 |
0.949 |
0.922 |
0.878 |
0.833 |
0.929 |
0.882 |
50 |
0.898 |
0.879 |
0.953 |
0.943 |
0.872 |
0.854 |
0.939 |
0.920 |
100 |
0.889 |
0.880 |
0.958 |
0.949 |
0.900 |
0.888 |
0.956 |
0.952 |
200 |
0.906 |
0.901 |
0.947 |
0.940 |
0.895 |
0.889 |
0.940 |
0.935 |
300 |
0.908 |
0.905 |
0.943 |
0.943 |
0.888 |
0.883 |
0.963 |
0.960 |
500 |
0.917 |
0.914 |
0.949 |
0.948 |
0.913 |
0.910 |
0.954 |
0.952 |
3.2. Comparisons for Longitudinal Data
Similar to the simulation design in [13], we consider the following model,
(10)
where
and
are independent standard normal random variables and
. For each i, the random error
is assumed to be independent of
, and its logarithm
comes from two different distributions, which are denoted by
and
, respectively. Furthermore, the covariance matrix of the logarithm vector
has the form
, where
or
. Denote these two mattresses by S1 and S2, respectively.
Furthermore, based on the results in Theorem 2, we evaluate the power of the proposed methods by testing the hypothesis
against
. For each specific setting, 1000 independent replications are simulated based on the sample sizes 25, 50, 100, 200, 300, 500, respectively. In contrast to the simulation design in [13], where only the moderate sample sizes are studied, we also consider the small sample size cases.
Table 3 presents the coverage probabilities of the confidence regions at normal levels of 0.9 and 0.95, respectively. Again, we find the fact that both AEL and EL methods provide accurate coverage probabilities of confidence regions when the sample size is moderate (≥200). Especially, the coverage probabilities for the two methods are close to the nominal level and usually comparable as the sample size increases. But when the sample size is small (≤300), the coverage probabilities via AEL method are much better than that of the EL method under all settings, although there still exists a certain gap with the nominal levels.
4. Conclusion
This paper introduces an efficient and easy-to-implemented adjusted empirical likelihood-based method for constructing the confidence region of the regression parameters in the multiplicative regression model with independent and longitudinal data, respectively. The proposed procedures can avoid estimating the nuisance parameters and perform well even if when the size of sample is relatively small. In the future, extensions can be made to handle an increasing number of covariates as done in [14], and the cases where the response variables are missing or censored.
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix A. Proofs of Theorems 1-2
Proof of Theorem 1. Under conditions (A1)-(A5), according to the lemma 1 in [12], we have
,
and
, where
denotes the Euclidean norm.
Let
and
, which implies
. Note that
Multiplying
to both sides of (3), we get
Some algebraic calculations yield that
. From the inequality
, it hold that
Thus
Combining the results above, we have
and
Conditions (A3) and (A5) imply that S is positive definite. Using the fact that
,
where
denotes the smallest eigenvalue of S, it holds that
Together with
and
, it follows that
. Recall that
we have
. Then it follows that
. Furthermore,
As a result, The Theorem 1 is proved.
Proof of Theorem 2. The proof is similar to the line of Theorem 1 except some modifications as done in [13]. Hence it is omitted here.