Minimum Quadratic Distance Methods Using Grouped Data for Parametric Families of Copulas

Minimum quadratic distance (MQD) methods are used to construct chi-square test statistics for simple and composite hypotheses for parametric families of copulas. The methods are aimed at grouped data which form a contingency table, but by defining a rule to group the data using Quasi-Monte Carlo numbers and two marginal empirical quantiles, the methods can be extended to handle complete data. The rule implicitly defines points on the nonnegative quadrant used to form quadratic distances, and the similarities between the rule and the use of random cells in classical minimum chi-square methods are indicated. The methods are relatively simple to implement and might be useful for applied work in various fields such as actuarial science.


Introduction
In actuarial science or biostatistics we often encounter bivariate data which are already grouped into cells forming a contingency table; see Partrat [1] (p 225) and Gibbons and Chakraborti [2] (p 511-512) for examples. The primary focus is on the study of dependency, and we would only like to make inference on the association parameters of the parametric survival copula used to model the dependency between the two components of the bivariate observations. For complete data, in actuarial science or biostatistics we usually assume a sample of nonnegative bivariate observations $(X_i, Y_i)$, $i = 1, \dots, n$, which are independent and identically distributed (iid) as $(X, Y)$, with the bivariate observations subsequently put into cells of a contingency table. By making use of the multinomial distributions induced by a contingency table, chi-square tests can be proposed. In practice, if data are grouped into a contingency table without being transformed, then the test procedures are no longer applicable. They also note that chi-square test statistics can have good power along some directions of the alternatives while being simple to apply, and might be of interest to practitioners.
We also know that chi-square test statistics in one dimension might not be consistent in all directions of the alternatives. Yet they are simple to apply: as there is a unique asymptotic chi-square distribution across the composite hypothesis, one can control the size of the test. Depending on the alternatives, and by carefully choosing the intervals that partition the real line, chi-square tests can still have good power against some directions of the alternatives in practice.
Often we are primarily concerned with some types of alternatives instead of all alternatives. Because of these advantages, chi-square tests are still used even though there are more powerful tests such as the Cramér-Von Mises tests; see Greenwood and Nikulin [8] (p 124-126) for power under contaminated mixture distribution alternatives and Lehmann [9] (p 326-329) for discussions on the power of chi-square tests in relation to the way intervals are created to group the data in one dimension.
Therefore, if we can retain the advantage of chi-square tests in two dimensions of having a unique chi-square distribution across the null composite hypothesis, and reduce the arbitrariness of the grouping rule, the inference procedures might still be attractive for practitioners, as implementing other test procedures might need extensive simulations to approximate a null distribution which depends on $\theta_0 \in \Omega$.
In this paper, we develop minimum quadratic distance (MQD) procedures for grouped data. The procedures can be extended to the situation where complete data are available and must be grouped, by specifying a rule which makes use of the Halton sequence of Quasi-Monte Carlo (QMC) numbers and two empirical quantiles from the two marginal distributions or marginal survival functions. Tests for copula models can be performed using chi-square test statistics with data already grouped, and if complete data are available they can be grouped according to a more clearly defined rule. As mentioned earlier, the rule to select cells to group the data is a rule to select points on the nonnegative quadrant to construct quadratic distances. If complete data are available, then the rule is established using QMC methods and is based on the idea of selecting points in the nonnegative quadrant so that Cramér-Von Mises distances can be approximated by quadratic distances. The methods can also be applied to copula models with a singular component when $u = v$, provided that the copula function is differentiable with respect to the parameters given by $\theta$. An example of such a copula is the one-parameter Marshall-Olkin (MO) copula; for discussions on MO copulas, see Dobrowolski and Kumar [10] and Marshall and Olkin [11]. We briefly list some copula models often encountered in practice. Most of , with the standard normal univariate quantile function denoted by $\Phi^{-1}(\cdot)$, and the integrand of the above integral is a bivariate normal density function with standard normal marginals and parameter $\rho$.
Copulas are often used to create bivariate distributions and for inference procedures for these distributions for actuarial science, see Klugman and Parsa [3], Klugman et al. [14], Frees and Valdez [15] for examples.
Before giving further details and properties of MQD methods, we shall give the logic behind the MQD procedures.
Let the bivariate empirical survival function be defined as $S_n(x, y) = \frac{1}{n} \sum_{i=1}^{n} I[X_i > x, Y_i > y]$. For the time being assume that the $M$ points given by $(x_l, y_l)'$, $l = 1, \dots, M$, are already chosen; then we can define the vector of empirical components, $z_n = (S_n(x_1, y_1), \dots, S_n(x_M, y_M))'$. For MQD procedures with univariate observations, see Luong and Thompson [16].
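As a minimal sketch of how the vector of empirical components could be computed, the following Python fragment evaluates a bivariate empirical survival function at a set of chosen points. The function names, the toy data, and the use of strict inequalities are assumptions made for illustration and are not taken from the paper.

```python
def empirical_survival(sample, x, y):
    """Proportion of observations (xi, yi) with xi > x and yi > y.

    Strict inequalities are an assumption; with continuous marginals
    the choice does not matter asymptotically.
    """
    count = sum(1 for xi, yi in sample if xi > x and yi > y)
    return count / len(sample)


def empirical_vector(sample, points):
    """Evaluate the empirical survival function at each chosen point."""
    return [empirical_survival(sample, x, y) for x, y in points]


# Hypothetical toy data: four bivariate observations and M = 3 points.
sample = [(1.0, 2.0), (2.0, 1.0), (3.0, 3.0), (0.5, 0.5)]
points = [(0.0, 0.0), (1.5, 0.8), (2.5, 2.5)]
z_n = empirical_vector(sample, points)
```

The model counterpart would be obtained in the same way by evaluating the parametric survival function at the same points.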
The paper is organized as follows.
In Section 3, MQD methods will be developed using predetermined grouped data such as data presented using a contingency

Contingency Tables
Contingency table data can be viewed as a special form of two-dimensional grouped data. We will give some more details about this form of grouped data.
Assume that we have a sample ( ) , Let the nonnegative axis X be partitioned into disjoints interval = ∞ and similarly, the axis Y be partitioned into disjoints interval The nonnegative quadrant can be partitioned into nonoverlapping cells of the form.

$F(x)$ and $G(y)$ are assumed to be absolutely continuous.
The sample proportion or empirical probability for one observation which falls into cell ij C can be obtained using ( ) It is not difficult to see that there is redundant information displayed by a contingency table, one way to see that there is duplication is to note and similarly,  ( ) ( ) , 0, , 0, 1, , Therefore, the set points given by ( ) ( ) { } , , , , 1, , , 1, ,  can be discarded without affecting the information provided by the contingency table.Consequently, we can view a contingency table implicitly define a grid on the nonnegative quadrant with only ( )( ) It is also clear that if we want a rule to choose cells, the same rule will allow us to choose points on the nonnegative quadrant.
The objective function of the proposed quadratic form will be given below.It is a natural extension of the objective function used in the univariate case.Define a vector with empirical components so that we only need one subscript by collapsing the points of the contingency table given by into a vector by putting the first row of the matrix as the first batch of elements of the vector and the second row being the second batch of elements so forth so on, i.e., let , , , , , 1 1 and its counterpart which makes use of the copula model is The number of components of n z is M with the assumption M m > .
A class of quadratic distances can be defined as quadratic forms in the difference between $z_n$ and its model counterpart, with $W$ being a symmetric and positive definite weight matrix. In this class, we focus on two choices of $W$. Letting $W = I$, we obtain the unweighted quadratic distance; this choice is not optimum, but it produces consistent estimators which can be used as preliminary estimates for $\theta$ to start the numerical procedures for finding more efficient estimators. The matrix $W$ is defined up to a positive constant, as minimizing the objective function multiplied by a positive constant still gives the same estimators, and $\hat{W}$, a consistent estimate of $W$, can be used to replace $W$ without affecting the asymptotic theory for estimation or the asymptotic distribution of test statistics. Using quadratic distance theory or generalized method of moments (GMM) theory, it is not difficult to see that an optimum choice for $W$ is to let $W_0 = \Omega_0^{-1}$, where $\Omega_0$ is an asymptotic covariance matrix; see Luong and Thompson [16] (p 245).
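The quadratic form described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `quadratic_distance`, `z_emp` and `z_model` are hypothetical, and the numbers are toy values rather than results from the paper.

```python
def quadratic_distance(z_emp, z_model, W):
    """Return d' W d, where d = z_emp - z_model and W is a weight matrix."""
    d = [a - b for a, b in zip(z_emp, z_model)]
    M = len(d)
    return sum(d[i] * W[i][j] * d[j] for i in range(M) for j in range(M))


def identity(M):
    """Identity weight matrix, giving the unweighted quadratic distance."""
    return [[1.0 if i == j else 0.0 for j in range(M)] for i in range(M)]


# Toy values (assumed): empirical and model survival probabilities at M = 2 points.
z_emp = [0.50, 0.25]
z_model = [0.40, 0.25]
q_unweighted = quadratic_distance(z_emp, z_model, identity(2))
q_weighted = quadratic_distance(z_emp, z_model, [[2.0, 0.0], [0.0, 1.0]])
```

In an estimation routine, `z_model` would be recomputed for each candidate parameter value and the distance minimized numerically.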
Clearly, $\Omega_0$ depends on $\theta_0$. We shall obtain the expression for $\Omega_0$ and show that $\Omega_0$ can be estimated by $\hat{\Omega}_0$ in the next section, as we can obtain a preliminary consistent estimate for $\theta_0$ by using the unweighted quadratic distance or other quick methods; see the method of moments using Spearman's rho in Section 5.2 for example. Consequently, by quadratic distance we mean the efficient version of the objective function, which uses the optimum weight matrix. The version with $W = I$ will be called the unweighted quadratic distance. In the next section we shall use the influence function representation to obtain $\Omega_0$, and we shall also propose $\hat{\Omega}_0$, a consistent estimate for $\Omega_0$.

Optimum Matrix W0
The matrix $\Omega_0$, which is the asymptotic covariance matrix of the vector of empirical components after normalization, plays an important role for MQD methods, as we can obtain estimators with good efficiencies using $\Omega_0$ or a consistent estimate of $\Omega_0$, and we also have chi-square test statistics. Although $\Omega_0$ is unknown, its elements are not complicated and, moreover, it can be replaced by a consistent estimate without affecting the asymptotic properties of the procedures. We shall give more details about this matrix and construct $\hat{\Omega}_0$, a consistent estimate of $\Omega_0$.
Using influence representation for the vector of functions of ( ) is the covariance matrix of the vector ( ) , , , Cov Y Y is not symmetric, the matrix has 9 elements, see technical Appendix (TA2) in the Appendices for more details.The elements can be expressed as The elements ij c can be estimated empirically by replacing , , S F G in the expressions of ij c by , , Therefore, we can form  ( ) Cov Y Y which estimates ( ) Cov Y Y .Similarly, by replacing 0 θ by a consistent preliminary estimate ( ) 0 0 θ which can be obtained using the unweighted quadratic distance for example and replacing , F G by , Ω an estimate for 0 Ω will have the elements given by and define  1 0 0 ˆ− = W Ω . 0 W will be used as an optimum matrix for constructing quadratic distance as the asymptotic property remain unchanged.We can replace the unknown matrix

Estimation
The MQD estimators can be seen as given by the vector $\hat{\theta}$ which minimizes the quadratic distance $Q_n(\theta)$. Consistency for quadratic distance estimators, whether using predetermined grouped data or complete data that must be grouped according to a rule, can be treated in a unified way using the following Theorem 1, which is essentially Theorem 3.1 of Pakes and Pollard [18] (p 1038); the proof has been given by the authors. In fact, their Theorems 3.1 and 3.3 are also useful for Section 4, where we have complete data and have choices for grouping the data into cells or, equivalently, for forming the artificial sample points on the nonnegative quadrant used to form the quadratic distances.

Theorem 1 (Consistency)
Under the following conditions θ converges in probability to 0 bounded in probability and occurs at the values of the vector values of the MQD estimators, so the conditions 1) and 2) are satisfied for both versions.Implicitly, we make the assumption that the parameter space Ω is compact.Also, for both versions ( ) the number of components of ( ) n G θ is greater than the number of parameters of the model, i.e., M m > .
For 0 ≠ θ θ we have ( ) for some 0 B > since survival func- tions evaluated at points are components of ( ) n G θ and these functions are bounded.This implies that there exist real numbers u and v with 0 u v < < < ∞ such that ( ) Therefore, the minimum quadratic distance (MQD) estimators are consistent, i.e., 0 ˆp  → θ θ .The Theorem 3.1 given by Pakes and Pollard [18] (p 1038-1039) is an elegant theorem using the norm concept of functional analysis.Now we turn our attention to the question of asymptotic normality for the quadratic distance estimators and it is possible to have unified approach using their Theorem Note that ( ) ( ) The points ( ) ( ) By using ( ) Note that ( ) Q θ is differentiable and a quadratic function of θ , the vector * θ which minimizes ( ) and since  W is assumed to be a positive define matrix; we have ( ) Clearly set up fits into the scopes of their Theorem 3.3 where we shall rearrange the results to make them more suitable for MQD methods and verify that we can satisfy the regularity conditions of Theorem 3.3.We shall state Theorem 2 and Corollary 1 below which are essentially their Theorem (3.3) and the proofs have been given by Pakes and Pollard [18].Note that the condition 4) is slightly more stringent but simpler to check than the condition 3) in their Theorem.
Theorem 2 Let θ be a vector of consistent estimators for 0 θ , the unique vector which satisfies ( ) Under the following conditions: 1) The parameter space Ω is compact, θ is an interior point of Ω. 2) for every sequence { } n δ of positive numbers which converge to zero.5) Then, we have the following representation which will give the asymptotic distribution of θ in Corollary 1, i.e., ( ) or equivalently, using equality in distribution, ( ) ( ) ( ) or equivalently, ( ) The proofs of these results follow the results used to prove Theorem 3.3 given by Pakes and Pollard [18] (p 1040-1043).For expression (22) or expression (23) to hold, in general only condition 5) of Theorem 2 is needed and there is no need to assume that ( ) 0 n G θ has an asymptotic distribution.From the results of Theorem 2, it is easy to see that we can obtain the main result of the following Corollary 1 which gives the asymptotic covariance matrix for the quadratic distance estimators for both versions.
The matrices T and V depend on 0 θ , we also adopt the notations ( ) ( ) We observe that when applying condition 4) of Theorem 2 to MQD methods in general involves technicalities.Note that to verify the condition 4, it is equivalent to verify a regularity condition for the approximation is of the right order which implies the condition 3 given by their Theorem 3.3, which might be the most difficult to check.The rest of the conditions for Theorem 2 are satisfied in general. Let and define )) n g θ can also be expressed as Since the elements of ( ) is bounded in probability and continuous in probability with ( ) ( ) Therefore, results given in section of Luong et al. [19] (p 218) can be used to justify the sequence of functions.( ) Γ as given by expression ( 19) can be estimated once the parameters are esti- mated.

Simple Hypothesis
In this section, the quadratic distance ( ) n Q θ will be used to construct good- ness of fit test statistics for the simple hypothesis H 0 : data coming from a specified distribution with distribution 0 F θ , 0 θ is specified.The chi-square test statistic with its chi-square asymptotic distribution and its degree of freedom  are given below, i.e., ( ) ( ) It is not difficult to see that indeed we have the above asymptotic chi-square distribution as ( ) ( ) , using standard results for distribution of quadratic forms, see Luong and Thompson [16] (p 247) for example.

Composite Hypothesis
The quadratic distances $Q_n(\theta)$ can also be used for construction of the test statistics for the composite hypothesis $H_0$: the data come from a parametric model $\{S_\theta\}$. The chi-square test statistic and its asymptotic distribution are given similarly in this case by ( ) with $M > m$. To justify the asymptotic chi-square distribution given above, note that we have the equality in probability, ( ) ( ) ; the rank of the matrix $B$ is also equal to its trace, using the techniques given by Luong and Thompson [16] (p 248-249). We need a few preliminary notions and tools: we define sample quantiles, and statistics can then be viewed as functionals of the sample distribution; the notion of influence function is also introduced, and this useful tool will be used to find the asymptotic variance of a functional.

Estimation and Model Testing Using Complete Data
We shall define the $p$th sample quantile of a distribution, as we shall need two sample quantiles from the marginal distributions together with QMC numbers to construct an approximation of an integral. Our quadratic distance based on selected points can be viewed as an approximation of a continuous version given by an integral, as in expression (33).
From a bivariate distribution we have two marginal distributions ( ) ( ) The sample survival function is defined as The sample quantile functions T H is a valuable tool to study the asymptotic properties of the statistical functional and will be introduced below.Let H be the true distribution and n H is the usual empirical distribution which estimates H; also let x δ be the degenerate distribution at x, i.e., ( ) ( ) 0 x u δ = , oth- erwise; the influence function of T viewed as a function of x, ( ) , T H

IC
x is de- fined as a functional directional derivative at H in the direction of ( ) Alternatively, it is easy to see that ( ) and this gives a convenient way to compute the influence function.It can be shown that the influence function of quantile ( ) , with h being the density function of the distribution H which is assumed to be absolutely continuous, see Huber [20] (p 56), Hogg et al. [

T H T H T H H o n
, see Hogg et al. [21] (p 593).Consequently, in general we have for bounded influence functional with the use of means of central limit theorems (CLT) the following convergence in distribution The influence functions for ( ) , We also use . The sequence of points belong to the unit square ( ) ( ) 0,1 0,1 × can be obtained as follows.
For $b_1 = 2$, we divide the interval ( ) have packages to generate the sequences; see Glasserman [22] (p 293-297) for the related pseudocode, and also the seminal paper by Halton [23]; for the general principles of QMC methods, see Glasserman [22] (p 281-292). The Halton sequence, together with two chosen sample quantiles from the two marginal distributions, will allow us to choose points at which to match the bivariate empirical survival function with its model counterpart, as we shall have an artificial sample with values on the nonnegative quadrant. These points can be viewed as sample points from an artificial sample and, since they depend on quantiles which are robust, the artificial sample can be viewed as free of outliers and the methods which make use of them will be robust.
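For readers without access to a QMC package, the Halton construction with bases 2 and 3 can be sketched with the standard radical inverse function. The function names below are illustrative choices, not taken from the references cited above.

```python
def radical_inverse(k, base):
    """Reflect the base-`base` digits of the integer k about the radix point."""
    result, f = 0.0, 1.0 / base
    while k > 0:
        k, digit = divmod(k, base)
        result += digit * f
        f /= base
    return result


def halton_2d(M, b1=2, b2=3):
    """First M terms of the two-dimensional Halton sequence (index starts at 1)."""
    return [(radical_inverse(l, b1), radical_inverse(l, b2))
            for l in range(1, M + 1)]


halton_points = halton_2d(3)
# Base 2 gives 1/2, 1/4, 3/4; base 3 gives 1/3, 2/3, 1/9.
```

Starting the index at 1 avoids the point (0, 0); whether the paper does the same is an assumption.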
Note that the Halton sequence of numbers is deterministic and useful for approximating an integral. Suppose we would like to compute numerically an integral of the form $A = \int_0^1 \int_0^1 \psi(u, v)\, du\, dv$, with $\psi$ being a bivariate function. Using the $M$ terms $(u_l, v_l)$, $l = 1, \dots, M$, of the Halton sequence and QMC principles, it can be approximated as $A \approx \frac{1}{M} \sum_{l=1}^{M} \psi(u_l, v_l)$; if we are used to integration by simulation, we might want to think of the $M$ terms as representing a quasi-random sample of size $M$ from a bivariate uniform distribution, which is useful for approximating $A$.
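The averaging idea can be illustrated with a short, self-contained sketch. The integrand $u v$ (whose exact integral over the unit square is 1/4) and the choice $M = 1000$ are assumptions made only for this illustration.

```python
def radical_inverse(k, base):
    """Reflect the base-`base` digits of the integer k about the radix point."""
    result, f = 0.0, 1.0 / base
    while k > 0:
        k, digit = divmod(k, base)
        result += digit * f
        f /= base
    return result


def qmc_integral(func, M):
    """Average func over the first M Halton points (bases 2 and 3) on the unit square."""
    total = 0.0
    for l in range(1, M + 1):
        total += func(radical_inverse(l, 2), radical_inverse(l, 3))
    return total / M


# Illustrative integrand: the exact value of the integral of u*v is 0.25.
estimate = qmc_integral(lambda u, v: u * v, 1000)
```

Because the points are deterministic, repeated runs give the same approximation, unlike ordinary Monte Carlo integration.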
The observations are given by $(X_i, Y_i)$, $i = 1, \dots, n$, iid with common bivariate survival function $S(x, y)$. Let the two marginal survival functions be denoted by $F(x)$ and $G(y)$; they are absolutely continuous by assumption. Also define the bivariate empirical distribution function, which is similar to the bivariate empirical survival function, and the two empirical marginal survival functions. We might want to think that we would like to approximate a Cramér-Von Mises distance expressed as an integral, which is similar to the univariate Cramér-Von Mises (CVM) distance; minimizing the distance with respect to $\theta$ will give the CVM estimator for $\theta$, see Luong and Blier-Wong [24] for CVM estimation for example.
In the next section we shall give details on how to form a type of quasi sample or artificial sample of size $M$ using the $M$ terms of the Halton sequence and the two sample quantiles of the marginal distributions $F$ and $G$ or, equivalently, using the corresponding empirical quantile functions as discussed earlier. This will allow us to define the sequence of points so that the above integral can be approximated by a finite sum of the type of an average of $M$ terms. We can see that expression (34) is an unweighted quadratic distance using the identity matrix $I$ as weight matrix instead of $\hat{W}_0$. The unweighted quadratic distance still produces consistent estimators, but possibly less efficient ones than estimators using the quadratic distance with $\hat{W}_0$ for large samples; for finite samples the estimators obtained using $I$ might still have reasonable performance while being simple to obtain.
The set of points $(s_l, t_l)$, $l = 1, \dots, M$, is proposed for forming optimum quadratic distances when complete data are available. We shall see that the points depend on two quantiles chosen from the two marginal distributions and are consequently random. We might want to think that we end up working with random cells.
As for minimum chi-square methods, if random cells stabilize into fixed cells, the methods in general have the same efficiency as methods based on the stabilized fixed cells; see Pollard [25] (p 324-326) and Moore and Spruill [26] for the notion of random cells. Quadratic distance methods share the same properties. The chosen points are random, but it will be shown that they do stabilize; therefore these random points can be viewed as fixed at the stabilized points, and their randomness does not affect the efficiencies of the estimators or the asymptotic distributions of the goodness-of-fit test statistics which make use of them. These properties will be discussed and studied in more detail in the next section, along with the introduction of an artificial sample of size $M$ given by points on the nonnegative quadrant, which gives us a guideline on how to choose points if complete data are available.

Halton Sequences and an Artificial Sample
From the M terms of the Halton sequences, we have ( ) , we can form the artificial sample with elements given by ( ) , . Note that we have the following relationships between empirical quantile based on distributions and survival functions with ( ) ( ) ( ) ( ) We can view ( ) Since ( ) ( )
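A rough sketch of how such an artificial sample might be formed is given below: each Halton coordinate on the unit square is multiplied by a chosen sample quantile of the corresponding marginal. The quantile definition (smallest order statistic whose empirical distribution value is at least $p$), the default $p = 0.99$, and all function names are assumptions for illustration.

```python
import math


def radical_inverse(k, base):
    """Reflect the base-`base` digits of the integer k about the radix point."""
    result, f = 0.0, 1.0 / base
    while k > 0:
        k, digit = divmod(k, base)
        result += digit * f
        f /= base
    return result


def sample_quantile(values, p):
    """Smallest order statistic x with empirical distribution F_n(x) >= p (assumed definition)."""
    xs = sorted(values)
    k = max(1, math.ceil(p * len(xs)))
    return xs[k - 1]


def artificial_sample(x_obs, y_obs, M, p=0.99):
    """Scale the first M Halton points by the p-th sample quantiles of each marginal."""
    qx = sample_quantile(x_obs, p)
    qy = sample_quantile(y_obs, p)
    return [(radical_inverse(l, 2) * qx, radical_inverse(l, 3) * qy)
            for l in range(1, M + 1)]


# Hypothetical toy observations.
x_obs = [1.0, 2.0, 3.0, 4.0]
y_obs = [10.0, 20.0, 30.0, 40.0]
pts = artificial_sample(x_obs, y_obs, M=3)
```

The resulting points lie in the rectangle bounded by the two sample quantiles, so extreme observations have limited influence on where the distance is evaluated.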
It turns out that quadratic distances for both versions constructed with the random points are asymptotically equivalent to quadratic distances using the fixed points to which they stabilize, so that the asymptotic theory developed with the points considered to be fixed continues to be valid; we shall show that this is indeed the case. Similar conclusions have been established for minimum chi-square methods with the use of random cells, provided that these cells stabilize to fixed cells; see Theorem 2 given by Pollard [25] (p 324-326). We shall define some notation to make the arguments easier to follow.
Define ( ) s t s t l M = =  and similarly let We work with the quadratic distance defined using ( ) { } , s t which leads to consider quadratic of the form ( ) , we also use respectively the notations and define .
It suffices to verify that results of Theorem 1, Theorem 2 and its corollary in Section 3 continue to hold.
Observe that we have which is a contaminated bivariate survival function and , 0 1.
Similarly for the marginals, ( , 0 1. Now, we consider ( ) the jth element of ( ) , but we can use the influence function representation of ( ) , , , a technique proposed by Reid [29] (p 80-81) but in this case it will need three influence functions which are given by ( ) ( ) and the expression is reduced to by noting the first two terms of the the RHS of the above expression cancel each other since we have ( ) If we compare with the corresponding jth term of ( ) has the same influence functions as the functional ( ) . It is not difficult to see that we have the equalities ( ) ( ) .
Therefore, all the asymptotic results of Section 3 remain valid, and all these influence functions are bounded, so that inference methods making use of these functionals are robust in general. Furthermore, we can consider the inference procedures based on quadratic distances as we have non-random points

Numerical Issues
In this section we consider the numerical problem of not being able to obtain the matrix $\hat{W}_0$, as $\hat{\Omega}_0$ might be nearly singular, so that we need to replace $\hat{W}_0$ by a near-optimum matrix $\tilde{W}_0$ obtained from $\hat{\Omega}_0$. Techniques for regularizing a matrix have been introduced by Carrasco and Florens [28] (p 809-810) for GMM estimation with a continuum of moment conditions; MQD methods can be viewed as similar to GMM with a finite number of moment conditions, and clearly the techniques can also be applied to MQD methods. We use the spectral decomposition of $\hat{\Omega}_0$ to obtain its eigenvalues and eigenvectors; see Hogg et al. [21] (p 179) for the spectral decomposition of a symmetric positive definite matrix, which allows us to express $\hat{\Omega}_0 = \sum_{i=1}^{M} \lambda_i v_i v_i'$, where the $\lambda_i$'s are positive eigenvalues with corresponding eigenvectors given by the $v_i$'s of the matrix $\hat{\Omega}_0$. Now, observe that $\hat{W}_0 = \hat{\Omega}_0^{-1} = \sum_{i=1}^{M} \lambda_i^{-1} v_i v_i'$ might not be obtainable numerically. This is due to eigenvalues which are not stable. The regularization of $\hat{\Omega}_0$ leads to a matrix which hopefully is obtainable and approximates $\hat{W}_0$: it consists of perturbing the $\lambda_i$'s by a small positive number $a$ to define the approximate optimum matrix $\tilde{W}_0$. Carrasco and Florens [28] (p 809-810), for GMM estimation with a continuum of moment conditions, have shown that the asymptotic theory remains unchanged if $a \to 0$ at a suitable rate as $n \to \infty$. This condition is difficult to verify in practice. However, we might want to continue to use the asymptotic theory in an approximate sense, i.e., we can replace $\hat{W}_0$ by $\tilde{W}_0$ and take the view that such a replacement does not modify the asymptotic theory in practice.
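One possible reading of this regularization can be sketched with NumPy. The specific form used here, replacing each $1/\lambda_i$ by $1/(\lambda_i + a)$, is an assumption about how the eigenvalues are perturbed, and the function name and test matrices are hypothetical.

```python
import numpy as np


def regularized_inverse(omega, a):
    """Approximate inverse of a symmetric matrix via its spectral decomposition.

    Each eigenvalue lambda_i is shifted by a small positive number a, so the
    result is sum_i v_i v_i' / (lambda_i + a). This particular perturbation
    is an assumed form, chosen for illustration.
    """
    eigvals, eigvecs = np.linalg.eigh(omega)
    # Dividing column j of eigvecs by (lambda_j + a), then multiplying by eigvecs'.
    return (eigvecs / (eigvals + a)) @ eigvecs.T


# With a = 0 and a well-conditioned matrix this is the ordinary inverse.
omega = np.array([[2.0, 1.0], [1.0, 2.0]])
w_exact = regularized_inverse(omega, a=0.0)

# A singular matrix becomes invertible after the perturbation.
w_reg = regularized_inverse(np.diag([2.0, 0.0]), a=0.5)
```

Under this assumed form the result coincides with the inverse of $\hat{\Omega}_0 + aI$, so it could also be computed by a linear solve when an eigendecomposition is not needed for other purposes.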
A more rigorous approach to justify the chi-square distribution for goodness-of-fit tests is to divide the procedure into 2 steps: first, use $\tilde{W}_0$ to construct the distance for estimation, letting $\hat{\theta}$ be the vector which minimizes ( ) 2) given by Luong and Thompson [16] (p 248). The matrices $\hat{\Sigma}$ and $\hat{\Gamma}$ are respectively consistent estimates of $\Sigma$ and $\Gamma$.
It suffices to find the Moore-Penrose generalized inverse $\hat{\Sigma}^{-}$ of $\hat{\Sigma}$ and construct the test statistic as the corresponding quadratic form. The asymptotic distribution of the test statistic will again be chi-square with $M - m$ degrees of freedom, using distribution theory for quadratic forms; see Luong and Thompson [16] (p 247) for example, and for generalized inverses see Harville [29] (p 493-514).
Note that if $\hat{W}_0$ can be used for estimation, then there is no need to use two quadratic distances separately.

A Limited Simulation Study
For the study, we fix the number of points at $M = 25$. The two samples quantiles θ . The efficient MQD estimator is denoted by $\hat{\theta}$. In the simulation study, since there are so many marginal survival functions which could be used, we decide to draw observations directly from the copula models.
This is not what happens in real-life situations, but we want to test the procedures.
We do not have the computing resources for a large-scale study or to try various marginal survival functions. More work needs to be done, but we want to illustrate the procedures.
We use sample size $n = 2000$ and the number of samples used is $N = 100$.
For comparison of the MQD estimator $\hat{\theta}$ versus the method of moments (MM) estimator $\tilde{\theta}$, we use the ratio of relative efficiency ( ) where the mean square error of an estimator $\hat{\pi}$ for $\pi_0$ is defined as $MSE(\hat{\pi}) = E(\hat{\pi} - \pi_0)^2$; ( ) can similarly be used for comparison, and it can be estimated using simulated samples.
The range of the parameter considered is $\theta = 0.1, 0.2, \dots, 0.9$; the results are summarized in the first table of Table 1, where we find that the MM estimator and the two quadratic distance estimators have practically equal efficiency, up to 4 or 5 decimal places.
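The comparison by simulated mean square errors can be sketched as follows. The direction of the ratio (first estimator over second) and the toy estimates are assumptions for illustration; they are not the values reported in Table 1.

```python
def mse(estimates, true_value):
    """Simulated mean square error: average squared deviation from the true value."""
    return sum((e - true_value) ** 2 for e in estimates) / len(estimates)


def relative_efficiency(estimates_a, estimates_b, true_value):
    """Ratio MSE(a) / MSE(b); values below 1 favour estimator a (assumed convention)."""
    return mse(estimates_a, true_value) / mse(estimates_b, true_value)


# Hypothetical estimates from N = 2 simulated samples with true parameter 0.5.
mqd_estimates = [0.4, 0.6]
mm_estimates = [0.3, 0.7]
ratio = relative_efficiency(mqd_estimates, mm_estimates, 0.5)
```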
To study the size of the chi-square tests and the power of the tests let H 0 : The MO copula model MO  C with ( ) , C u v θ as given by expression (37) and 1 2 θ = .With Procedures to simulate from Gaussian and MO copulas are given in chapter 6 by Ross [13]  Power study using M = 25 points, n = 3000 and the alternative hypothesis specified as the contaminated model ( ) each run takes around three minutes to complete.As most of the time we are drawing observations using an alternative model but for testing we must estimate the parameter θ of the MO model, the algorithm tends to take time to converge.The study is very limited as the number of simulated samples is small with 30 N = and only a few copula models are considered but it seems to point to the potential uses of MQD chi-square tests.The tests especially with 35 M = seem to have power especially along some directions which can be represented as a mixture type of models as shown by the means and standard deviations of the chi-square statistics as displayed in the second and third table of Table 1.
More simulation work is needed to assess the power of the MQD tests using various copula models. There are not many statistical procedures for copula models using data that have already been grouped. MQD methods might be useful in this type of situation.

Conclusions
Minimum Quadratic Distance (MQD) methods offer a unified approach to estimation and model testing using grouped data in the form of a contingency table for parametric copula models, without having to assume parametric models for the marginal distributions. Like minimum chi-square methods, they have a unique asymptotic distribution across the composite hypothesis for testing, which makes the implementation relatively simple and does not require extensive simulations to approximate the null asymptotic distribution. It is shown in this paper that if complete data are available, a rule to define points based on QMC numbers can be proposed to alleviate the arbitrariness in the choice of points used to construct quadratic distances. The rule will also make quadratic distances close to Cramér-Von Mises distances. It is well known that in one dimension, chi-square tests cannot be consistent against all alternatives, but if the intervals are chosen properly the tests can still have good power against some forms of alternatives considered to be useful for applications.
MQD test statistics with the rule for choosing points might preserve the same properties and, being relatively simple to implement, they can be useful for applied work. More numerical and simulation work is needed to further study the power of the MQD tests.
The three influence functions are given respectively by The covariance matrix ( ) Cov Y Y is defined as ( ) Therefore the elements of the matrix ( ) Cov Y Y are given by [ ] Cov Y Y can be reexpressed as the equalities as given by expression (9) in Section 3.2.
vector which makes use of the copula model, the classical Euclidean norm.QD inferences procedures developed subsequently are based on ( ) n Q θ which are similar to the univariate case.
the contingency table are recorded or equivalently the sample proportions which fall into these cells are recorded. Contingency tables are often encountered in actuarial science and biostatistics; see Partrat [1] (p 225) and Gibbons and Chakraborti [2] (p 511-512), and we shall give a brief description below.

$W_0$ by its consistent estimate, which is $\hat{W}_0$, without affecting asymptotic theory for estimation and tests.

3.3; see Pakes and Pollard [18] (p 1040-1043). We shall restate their theorem as Theorem 2 and Corollary 1, given after the following discussion of the ideas behind their theorem, which allows us to obtain asymptotic normality results for estimators obtained as extrema of a smooth or nonsmooth objective function.

.
Using results of Corollary 1, we have asymptotic normality for the MQD estimators which is given by

L
θ as given by expression. Therefore we also have the following equalities in distribution, (31) and note that 0 Σ = W B and the trace of the matrix ( )

4.1. Preliminaries
In Section 4.1 and Section 4.2, we shall define a rule for selecting the points when complete data are available. Selecting points is equivalent to defining the cells used to group the data, and we shall see that random cells can be used, as the points ( ) built from QMC numbers on the unit square multiplied by two chosen sample quantiles from the two marginal distributions will be used. They are random and can be viewed as sample points on the nonnegative quadrant forming an artificial sample. For minimum chi-square methods it appears to be difficult to have a rule to choose cells to group the data, see discussions by Greenwood and Nikulin [8] (p 194-208). ... $F(x)$ and $G(y)$. The univariate sample $p$th quantile of the distribution $F(x)$, assumed to be continuous, is based on the sample distribution function ... The influence function representation of a functional which depends only on one function such as $H_n$ is the equivalent of a Taylor expansion of a univariate function, and the influence function representation of a functional which depends on many functions is the equivalent of a Taylor expansion of a multivariate function with domain in a Euclidean space and range in the real line. Since we work with marginal survival functions, we define the $p$th sample quantiles of the marginal survival functions as

can be derived using the definitions of influence functions or obtained from the influence functions of ( ). Subsequently, we shall introduce the Halton sequences with the bases $b_1 = 2$ and $b_2 = 3$, and the first $M$ terms are denoted by
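As an illustration of the construction, a minimal sketch of the first M terms of the two-dimensional Halton sequence with bases 2 and 3, scaled by two marginal sample quantiles, is given below. The function names, the default M = 25 and the 0.99 quantile level are assumptions made for this sketch (the simulation study mentions M = 25 and 0.99 quantiles); the indexing of the sequence from 1 is also an assumption.

```python
import numpy as np

def radical_inverse(index, base):
    """Van der Corput radical inverse of a positive integer in a given base."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        index, digit = divmod(index, base)
        result += digit * fraction
        fraction /= base
    return result

def halton_points(m, bases=(2, 3)):
    """First m terms (indices 1..m) of the Halton sequence on the unit square."""
    return np.array([[radical_inverse(i, b) for b in bases]
                     for i in range(1, m + 1)])

def scaled_points(x, y, m=25, p=0.99):
    """Halton points multiplied by the two marginal sample p-th quantiles.

    Hypothetical helper: the level p and the use of np.quantile are
    assumptions for illustration, not the paper's exact definition.
    """
    scale = np.array([np.quantile(x, p), np.quantile(y, p)])
    return halton_points(m) * scale

# Illustrative nonnegative data only (not from the paper).
rng = np.random.default_rng(0)
x = rng.exponential(size=1000)
y = rng.exponential(size=1000)
points = scaled_points(x, y)  # 25 points on the nonnegative quadrant
```

The resulting points depend on the sample only through the two quantiles, which is what makes them random in the sense described in the text.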


are 0.99 quantiles or 0.01 survival function quantiles if marginal empirical survival functions are used instead of distribution functions for estimation without construction of goodness-of-fit tests. The points used are constructed using the procedures given in Section 4.2. We consider the one parameter MO copula and Kumar [10] (p 5). The sample Spearman rho $\hat{\rho}_{SP}$ is simply the Pearson correlation coefficient computed using the ranks of the observations from the two empirical marginal distributions, see Conover [30] (p 314-318). If complete data are available, equating $\hat{\rho}_{SP}$ to its model counterpart gives the moment estimator, and one might expect that the moment estimator has reasonable efficiency, as we only have one parameter in this model and the estimate is based on ranks. The moment estimate can be used to compute $\hat{W}_0 = \Omega_0^{-1}$, which is needed for chi-square tests and for estimation using quadratic distances. We use $M = 25$ and there is no problem in inverting the matrix $\Omega_0$. Clearly, if data are already grouped we can use the unweighted quadratic distance to provide a consistent preliminary estimate for 0 can be estimated using M samples each of size n. The unweighted QD estimator is denoted by $\hat{\theta}_I$, as the identity matrix $I$ is used for the unweighted quadratic distance. The corresponding  Observations are drawn from the model specified by by a H which specifies the model is a contaminated one given by
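The rank-based computation of the sample Spearman rho can be sketched as follows. The inversion step is an assumption of this sketch: it takes the one parameter MO copula to be $C(u,v) = \min(u,v)^{\theta}(uv)^{1-\theta}$, for which Spearman's rho equals $3\theta/(4-\theta)$; if the paper uses a different parameterization, the last function would need to change. All function names are hypothetical.

```python
import numpy as np

def ranks(values):
    """Ranks 1..n of the observations, assuming no ties (continuous data)."""
    values = np.asarray(values)
    order = np.argsort(values)
    r = np.empty(len(values), dtype=float)
    r[order] = np.arange(1, len(values) + 1)
    return r

def spearman_rho(x, y):
    """Pearson correlation coefficient computed on the marginal ranks."""
    return float(np.corrcoef(ranks(x), ranks(y))[0, 1])

def mo_theta_from_rho(rho):
    """Moment estimate of theta obtained by solving rho = 3*theta / (4 - theta).

    ASSUMPTION: one parameter MO copula C(u,v) = min(u,v)^theta * (uv)^(1-theta).
    """
    return 4.0 * rho / (3.0 + rho)

# Illustrative data only (not from the paper).
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
rho_increasing = spearman_rho(x, x ** 3)   # monotone increasing relation
rho_decreasing = spearman_rho(x, -x)       # monotone decreasing relation
```

Because only ranks enter the computation, the estimate does not depend on the marginal distributions, which is consistent with leaving the marginals unspecified.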

have the influence representation for the l-th element of y ′ are iid we have the equality in distribution asymptotically, vector notations we have the following equality in distribution, a result which is needed in Section 3.2.
Technical Appendix 2 (TA2)
In this technical appendix, we shall justify the validity of expression (9) of Section 3.2.
table. The efficient quadratic distance is derived and can be used for estimation and model testing. Asymptotic theory is established for MQD estimators, and chi-square tests using quadratic distances can be constructed for testing copula models. In Section 4, by viewing grouped data as defining a set of points on the nonnegative quadrant, a rule to select points is proposed based on Quasi-Monte Carlo numbers and two sample quantiles if complete data are available, so that the methods can be extended to the situation where complete data are available. The methods can be seen as similar to minimum chi-square methods with random cells but with a rule to define these cells. The choice of random cells for minimum chi-square methods is less well defined. Section 5 illustrates the implementation of MQD methods using a limited simulation study comparing the method of moments (MM) estimator based on the sample Spearman rho, which requires complete data, with the MQD estimator, which uses grouped data, for the one parameter Marshall-Olkin model. It appears from the study that the chi-square tests have some power to detect alternatives which can be represented as a mixture or contaminated copula model, such as the mixture of the one parameter Marshall-Olkin copula model and the Gaussian copula model. The findings appear to be in line with chi-square tests in one dimension, which also display similar properties if intervals are chosen properly.
A. Luong, DOI: 10.4236/ojs.2018.83028, Open Journal of Statistics

Table 1. Asymptotic relative efficiency comparisons for MQD estimators versus the MM estimator using N = 1000 samples of size n = 1000 for the one parameter MO copula