^{1}

^{*}

^{1}

^{1}

The purpose of this paper is to propose a new model of asymmetry for square contingency tables with ordered categories. The new model may be appropriate for a square contingency table if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances having any degrees of freedom. As the degrees of freedom becomes larger, the proposed model approaches the extended linear diagonals-parameter symmetry model, which may be appropriate for a square table if it is reasonable to assume an underlying bivariate normal distribution. The simulation study based on bivariate t-distribution is given. An example is given.

Consider an R × R square contingency table with the same row and column ordinal classifications. Let p i j denote the probability that an observation will fall in the ith row and jth column of the table ( i = 1 , ⋯ , R ; j = 1 , ⋯ , R ). The symmetry (S) model is defined by

p i j = p j i ( i < j ) ; (1)

see Bowker [

p i j = θ j − i p j i ( i < j ) . (2)

This indicates that the probability that an observation will fall in the ( i , j ) th cell, i < j , is θ j − i times higher than the probability that it falls in the ( j , i ) th cell. A special case of the LDPS model obtained by putting θ = 1 is the S model. Furthermore, Tomizawa [

p i j = θ 1 j − i θ 2 j 2 − i 2 p j i ( i < j ) . (3)

This indicates that the probability that an observation will fall in the ( i , j ) th cell, i < j , is θ 1 j − i θ 2 j 2 − i 2 times higher than the probability that it falls in the ( j , i ) th cell.

Consider random variables X and Y having a joint bivariate normal distribution with means E ( X ) = μ 1 , E ( Y ) = μ 2 , variances Var ( X ) = σ 1 2 , Var ( Y ) = σ 2 2 and covariance Cov ( X , Y ) = ρ σ 1 σ 2 , where ρ is the correlation coefficient C o r r ( X , Y ) . The joint bivariate normal density f ( x , y ) satisfies

f ( x , y ) f ( y , x ) = δ 1 y − x δ 2 y 2 − x 2 , (4)

where

δ 1 = exp [ 1 1 − ρ 2 { ( μ 2 σ 2 2 − μ 1 σ 1 2 ) + ρ ( μ 2 − μ 1 ) σ 1 σ 2 } ] ,

δ 2 = e x p [ 1 2 ( 1 − ρ 2 ) ( 1 σ 1 2 − 1 σ 2 2 ) ] .

When σ 1 2 = σ 2 2 ( = σ 2 ) , f ( x , y ) satisfies

f ( x , y ) f ( y , x ) = δ y − x , (5)

where

δ = e x p [ 1 1 − ρ ( μ 2 − μ 1 σ 2 ) ] .

Agresti [

Consider a bivariate t-distribution with m degree of freedom. The limit of this joint probability density function as m → ∞ is bivariate normal. Therefore, the LDPS model may be appropriate if it is reasonable to assume an underlying bivariate t-distribution with equal marginal variances such that m degree of freedom is very large. Also, the ELDPS model may be appropriate if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances such that m degree of freedom is very large.

Consider the R × R square contingency table with ordered categories. For any fixed constant m ( m > 2 ) , Iki et al. [

p i j − 2 m + 2 − p j i − 2 m + 2 = η m ( j − i ) ( i < j ) . (6)

A special case of this model obtained by putting η m = 0 is the S model. The TS(m) model indicates that the difference between the symmetric two probabilities raised to the power [ = − 2 / ( m + 2 ) ] is proportional to the distance from the main diagonal of the R × R table. The TS(m) model may be appropriate if it is reasonable to assume an underlying bivariate t-distribution with equal marginal variances having m degrees of freedom (see Iki et al. [

Now, we are interested in considering a new model, which is appropriate if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances.

The purpose of present paper is to introduce an extended TS(m) model. Section 2 proposes an extended TS(m) model and describes the properties of the new model. Section 3 describes the relationships between the extended TS(m) model and t-distribution by the simulation study. Section 4 illustrates the use of our new model with father’s and son’s occupational structure data. Section 5 provides some concluding remarks.

Consider random variables X and Y having a joint bivariate t-distribution with m ( m > 2 ) degrees of freedom, and means E ( X ) = μ 1 , E ( Y ) = μ 2 , variances Var ( X ) = m σ 1 2 / ( m − 2 ) , Var ( Y ) = m σ 2 2 / ( m − 2 ) , and correlation coefficient C o r r ( X , Y ) = ρ . The probability density function h ( x , y ) is

h ( x , y ) = 1 2 π σ 1 σ 2 1 − ρ 2 ( 1 + Q ( x , y ) m ) − m + 2 2 , (7)

where

Q ( x , y ) = 1 1 − ρ 2 [ ( x − μ 1 σ 1 ) 2 − 2 ρ ( x − μ 1 σ 1 ) ( y − μ 2 σ 2 ) + ( y − μ 2 σ 2 ) 2 ] .

See, e.g., Muirhead [

h ( x , y ) = c [ 1 + 1 m ( a 1 x + b 1 y + a 2 x 2 + b 2 y 2 + d ( x , y ) ) ] − m + 2 2 , (8)

where

c = 1 2 π σ 1 σ 2 1 − ρ 2 , a 1 = 2 σ 1 ( 1 − ρ 2 ) ( ρ μ 2 σ 2 − μ 1 σ 1 ) , b 1 = 2 σ 2 ( 1 − ρ 2 ) ( ρ μ 1 σ 1 − μ 2 σ 2 ) ,

a 2 = 1 σ 1 2 ( 1 − ρ 2 ) , b 2 = 1 σ 2 2 ( 1 − ρ 2 ) , d ( x , y ) = 1 1 − ρ 2 ( − 2 ρ σ 1 σ 2 x y + μ 1 2 σ 1 2 + μ 2 2 σ 2 2 − 2 ρ μ 1 μ 2 σ 1 σ 2 ) .

We note that d ( x , y ) = d ( y , x ) . Also, the probability density function h ( x , y ) satisfies

( h ( x , y ) ) − 2 m + 2 − ( h ( y , x ) ) − 2 m + 2 = k m ( y 2 − x 2 ) + l m ( y − x ) ( x < y ) , (9)

where

k m = σ 1 2 − σ 2 2 m σ 1 2 σ 2 2 ( 1 − ρ 2 ) ( 2 π σ 1 σ 2 1 − ρ 2 ) 2 m + 2 ,

l m = 2 { σ 2 2 μ 1 − σ 1 2 μ 2 + ρ σ 1 σ 2 ( μ 1 − μ 2 ) } m σ 1 2 σ 2 2 ( 1 − ρ 2 ) ( 2 π σ 1 σ 2 1 − ρ 2 ) 2 m + 2 .

For continuous bivariate data, when we make the R × R square contingency table formed using ( R − 1 ) ’s cut points for each of row and column variables, we are interested in the structure of asymmetry of bivariate discrete probabilities { p i j } . Consider the R × R square contingency table with ordered categories. For any fixed constant m, we propose a model defined by

p i j = γ [ 1 + 1 m ( α 1 i + β 1 j + α 2 i 2 + β 2 j 2 + ψ ( i , j ) ) ] − m + 2 2 , (10)

where ψ ( i , j ) = ψ ( j , i ) . We shall refer to this model as an extended t-distribution type symmetry (ETS(m)) model. The ETS(m) model may be appropriate if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances having m degrees of freedom. Under the ETS(m) model, setting τ i j = α 1 i + β 1 j + α 2 i 2 + β 2 j 2 + ψ ( i , j ) , we see that

l i m m → ∞ p i j p j i = l i m m → ∞ ( 1 + τ i j m ) − m + 2 2 ( 1 + τ j i m ) − m + 2 2 = lim m → ∞ { ( 1 + τ i j m ) m τ i j } − τ i j 2 ( 1 + 2 m ) { ( 1 + τ j i m ) m τ j i } − τ j i 2 ( 1 + 2 m ) = exp [ − τ i j 2 ] exp [ − τ j i 2 ] = exp [ 1 2 ( α 1 − β 1 ) ( j − i ) + 1 2 ( α 2 − β 2 ) ( j 2 − i 2 ) ] = θ 1 j − i θ 2 j 2 − i 2 , (11)

where

θ 1 = e x p [ 1 2 ( α 1 − β 1 ) ] , θ 2 = e x p [ 1 2 ( α 2 − β 2 ) ] .

Namely, the ETS(m) model approaches the ELDPS model as m becomes larger, although the TS(m) model approaches the LDPS model as m becomes larger (see Appendix 1).

The ETS(m) model is also expressed as

p i j − 2 m + 2 − p j i − 2 m + 2 = γ m ( j 2 − i 2 ) + η m ( j − i ) ( i < j ) , (12)

where

γ m = γ − 2 m + 2 ( β 2 − α 2 ) m , η m = γ − 2 m + 2 ( β 1 − α 1 ) m .

A special case of this model obtained by putting γ m = 0 is the TS(m) model.

The maximum likelihood estimates of expected frequencies under the ETS(m) model could be obtained using the Newton-Raphson method in the log-likelihood equation (see Appendix 2). For the ETS(m) model, { p i j } are determined by R ( R − 1 ) / 2 of { ψ ( i , j ) , i < j } , R of { ψ ( i , i ) } , 1 of γ m and 1 of η m , thus a total of ( R 2 + R + 4 ) / 2 . Therefore, the number of degrees of freedom (df) for the ETS(m) is R 2 − ( R 2 + R + 4 ) / 2 = ( R 2 − R − 4 ) / 2 , which is one less than that for the TS(m) model, and equal to that for the ELDPS model.

As described in Section 2, the ETS(m) model may be appropriate for a square ordinal table if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances having m degrees of freedom. We shall consider the simulation study based on bivariate t-distribution. Consider random variables X and Y having a bivariate t-distribution with m ( m > 2 ) degrees of freedom, and means E ( X ) = μ 1 , E ( Y ) = μ 2 ( = μ 1 + 0.2 ) , variances Var ( X ) = m σ 1 2 / ( m − 2 ) , Var ( Y ) = m σ 2 2 / ( m − 2 ) and correlation coefficient C o r r ( X , Y ) = ρ ( = 0.2 ) . (Note that it is possible to take various values of ρ.) Such random numbers are obtained by using normal random number and chi-square random number with m degree of freedom. Suppose that there is an underlying bivariate t-distribution with some conditions, namely, σ 2 2 / σ 1 2 = 1.2 , 1.5 and 1.7, and a 4 × 4 table of sample size 1000 is formed using cut points for each variable at μ 1 , μ 1 ± 0.6 σ 1 .

Then, we shall count the frequencies of acceptance (at the 0.05 significance level) based on the likelihood ratio chi-squared statistic for testing the hypothesis that the ETS(m) model or the TS(m) model with the corresponding m degrees of freedom of underlying t-distribution holds per 10000 times for 4 × 4 tables on each conditions. From

(a) For the ETS(m) model | ||
---|---|---|

σ 2 2 / σ 1 2 | m | Frequencies |

1.2 1.2 1.2 1.2 1.5 1.5 1.5 1.5 1.7 1.7 1.7 1.7 | 5 20 50 100 5 20 50 100 5 20 50 100 | 9323 9442 9418 9459 9374 9413 9451 9412 9407 9430 9424 9421 |

(b) For the TS(m) model | ||

σ 2 2 / σ 1 2 | m | Frequencies |

1.2 1.2 1.2 1.2 1.5 1.5 1.5 1.5 1.7 1.7 1.7 1.7 | 5 20 50 100 5 20 50 100 5 20 50 100 | 1486 1725 1558 1555 0 0 0 0 0 0 0 0 |

The data in ^{2} for the ETS(m) model approach the value of G^{2} for the ELDPS model as

Father’s | Son’s status | Total | |||
---|---|---|---|---|---|

status | (1) | (2) | (3) | (4) | |

(1) | 80 | 72 | 37 | 19 | 208 |

(80.00) | (72.50) | (33.76) | (24.96) | ||

(2) | 44 | 155 | 61 | 31 | 291 |

(43.42) | (155.00) | (54.65) | (30.35) | ||

(3) | 26 | 73 | 218 | 45 | 362 |

(29.38) | (78.72) | (218.00) | (43.14) | ||

(4) | 69 | 156 | 166 | 614 | 1005 |

(64.45) | (156.41) | (167.26) | (614.00) | ||

Total | 219 | 456 | 482 | 709 | 1866 |

Note: (1) is upper non-manual, (2) lower non-manual, (3) manual, and (4) agriculture.

Applied models | df | G^{2} |
---|---|---|

TS (5) | 5 | 80.64* |

TS (20) | 5 | 91.58* |

TS (50) | 5 | 94.76* |

TS (100) | 5 | 95.92* |

ETS (5) | 4 | 3.82 |

ETS (20) | 4 | 2.68 |

ETS (50) | 4 | 2.43 |

ETS (100) | 4 | 2.35 |

ELDPS | 4 | 2.27 |

*means significant at the 0.05 level.

the value of m increases. Under the ETS(5) model, the maximum likelihood estimates of γ 5 and η 5 are γ ^ 5 = 0.335 and η ^ 5 = − 1.405 , respectively.

Therefore, the difference between the probability raised to the power − 2 m + 2

[=−0.286] that the occupational status category of the father in pair is i and that of his son is j ( i < j ) and the probability raised to the power that the occupational status category of the father in pair is j and that of his son is i, is γ ^ 5 ( j 2 − i 2 ) + η ^ 5 ( j − i ) [ = 0.335 × ( j 2 − i 2 ) − 1.405 × ( j − i ) ] . For example, the difference between the probability raised to the power that the occupational status category of the father in pair is (1) “upper non-manual” and that of his son is (2) “lower non-manual” and the probability raised to the power that the occupational status category of the father in pair is (2) “lower non-manual” and that of his son is (1) “upper non-manual”, is estimated to be -0.400 [ = 0.335 × 3 − 1.405 × 1 ]. Namely, the probability that the occupational status category of the father in pair is (1) “upper non-manual” and that of his son is (2) “lower non-manual” is estimated to be greater than the probability that the occupational status category of the father in pair is (2) “lower non-manual” and that of his son is (1) “upper non-manual”. Also, the probability that the occupational status category of the father in pair is (1) and that of his son is (3) is estimated to be greater than the probability that the occupational status category of the father in pair is (3) and that of his son is (1). However, for the other symmetric two probabilities with respect to the main diagonal, the probability that the occupational status category of the father in pair is j and that of his son is i ( i < j ) is estimated to be greater than the probability that the occupational status category of the father in pair is i and that of his son is j.

From the result of simulation studies, the ETS(m) model would be appropriate for a square contingency table if it is reasonable to assume an underlying bivariate t-distribution with different marginal variances, although, the TS(m) model may be appropriate for a square contingency table if it is reasonable to assume an underlying bivariate t-distribution with equal marginal variances having m degrees of freedom.

The authors would like to thank referees for helpful comments.

Iki, K., Okada, M. and Tomizawa, S. (2018) An Extended Bivariate T-Distribution Type Symmetry Model for Square Contingency Tables. Open Journal of Statistics, 8, 249-257. https://doi.org/10.4236/ojs.2018.82015

The TS(m) model is also expressed as

p i j = γ [ 1 + 1 m ( α i + β j + ϕ ( i , j ) ) ] − m + 2 2 , (A.1)

where ϕ ( i , j ) = ϕ ( j , i ) . Under the TS(m) model, we see that

l i m m → ∞ p i j p j i = θ j − i , (A.2)

where

θ = e x p [ 1 2 ( α − β ) ] .

Namely, the TS(m) model approaches the LDPS model as m becomes larger.

Let n i j denote the observed frequency in the ith row and jth column of the table ( i = 1 , ⋯ , R ; j = 1 , ⋯ , R ), with n = ∑ ∑ n i j . Assume that a multinomial distribution applies to the R × R table. We consider the maximum likelihood estimates of expected frequencies { m i j } under the ETS(m) model. We must maximize the Lgrangian

L = ∑ i = 1 R ∑ j = 1 R n i j l o g p i j − λ ( ∑ i = 1 R ∑ j = 1 R p i j − 1 ) − ∑ ∑ i < j ϕ i j { p i j − 2 m + 2 − p j i − 2 m + 2 − γ m ( j 2 − i 2 ) − η m ( j − i ) } , (A.3)

with respect to { p i j } , λ , { ϕ i j } , γ m and η m . Setting the partial derivations of L equal to zero, we obtain the equations:

p k l − 2 m + 2 − p l k − 2 m + 2 = γ m ( l 2 − k 2 ) + η m ( l − k ) ( k < l ) , (A.7)

∑ ∑ i < j ϕ i j ( j 2 − i 2 ) = 0 , (A.8)

∑ ∑ i < j ϕ i j ( j − i ) = 0. (A.9)

Using the Newton-Raphson method, we can solve Equations (A.4) to (A.9) with respect to { p i j } , γ m and η m . Noting that { m i j = n p i j } , we obtain the maximum likelihood estimates of { m i j } and the parameters γ m and η m under the ETS(m) model.