The purpose of this paper is to obtain the expression of the variance of the sample mean difference for the Student's distributive model. In 2007, after some decades, the study of the variance of the mean difference was resumed by Campobasso [1]. Using Nair's [2] and Lomnicki's [3] general results, he obtained the variance of the sample mean difference for several distributive models (Laplace, triangular, power, logit, Pareto and Gumbel). He also extended the results already known for other distributive models (normal, rectangular and exponential).

Let $X$ be a continuous random variable with density function $f(x)$ and distribution function $F(x)$, and let $X_1, X_2, \ldots, X_n$ be a simple random sample from this population. The sample mean difference is

$$\bar{\Delta} = \frac{\sum_{i=1}^{n}\sum_{j=1}^{n} |X_i - X_j|}{n(n-1)} \tag{1}$$
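As a small illustration of definition (1), the sketch below computes the sample mean difference of a list of numbers. The function name `sample_mean_difference` is ours and not part of the paper; the code only assumes that the double sum runs over all ordered pairs, so each unordered pair is counted twice and the terms with $i = j$ contribute zero.

```python
from itertools import combinations


def sample_mean_difference(xs):
    """Sample mean difference of equation (1) for a sequence of numbers."""
    n = len(xs)
    if n < 2:
        raise ValueError("at least two observations are required")
    # Each unordered pair {i, j} appears twice in the double sum of (1);
    # the diagonal terms i == j are zero and can be skipped.
    total = 2.0 * sum(abs(a - b) for a, b in combinations(xs, 2))
    return total / (n * (n - 1))


# Pairwise absolute differences of (1, 2, 4) are 1, 3 and 2.
print(sample_mean_difference([1.0, 2.0, 4.0]))
```

For the three observations above the double sum is $2(1 + 3 + 2) = 12$ and $n(n-1) = 6$, so the result is 2.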

The mean value of $\bar{\Delta}$ is equal to the mean difference of the population

$$\Delta = \int_{-\infty}^{+\infty}\int_{-\infty}^{+\infty} |x - y|\, f(x) f(y)\, dx\, dy \tag{2}$$

In 1952 Lomnicki [3] showed that the variance of the sample mean difference is

$$\operatorname{var}(\bar{\Delta}) = \frac{1}{n(n-1)}\left[4(n-1)\sigma^2 + 16(n-2)I - 2(2n-3)\Delta^2\right] \tag{3}$$

in which $\sigma^2$ and $\Delta$ are the variance and the mean difference of the considered distributive model respectively, whereas

$$I = \int_{-\infty}^{+\infty} G(x) H(x) f(x)\, dx \tag{4}$$

in which

$$G(x) = \int_{-\infty}^{x} (x - y) f(y)\, dy \tag{5}$$

and

$$H(x) = G(x) + \mu - x, \tag{6}$$

in which $\mu$ is the mean value of the distributive model.

The mean value μ and the variance are known for almost all the distributive models. Concerning the mean difference Δ , the known results are collected in Girone’s and Mazzitelli’s paper [

So, to determine the expression of the variance of the sample mean difference, only the calculation of $I$ is needed.
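Formula (3) is simple enough to evaluate directly once $\sigma^2$, $\Delta$ and $I$ are known. The following is a minimal sketch; the function name `lomnicki_variance` and its argument names are our own choices, not the paper's.

```python
def lomnicki_variance(n, sigma2, i_term, delta):
    """Variance of the sample mean difference from formula (3).

    n       sample size (n >= 2)
    sigma2  variance of the distributive model
    i_term  the integral I of equation (4)
    delta   mean difference of the population
    """
    if n < 2:
        raise ValueError("the sample size must be at least 2")
    bracket = (4 * (n - 1) * sigma2
               + 16 * (n - 2) * i_term
               - 2 * (2 * n - 3) * delta ** 2)
    return bracket / (n * (n - 1))


# For n = 2 the term in I vanishes, whatever value is passed for it.
print(lomnicki_variance(2, sigma2=3.0, i_term=0.0, delta=1.5))
```

A useful sanity check is the case $n = 2$: the sample mean difference is then $|X_1 - X_2|$, whose variance is $2\sigma^2 - \Delta^2$, and formula (3) reduces to exactly that expression because the coefficient of $I$ is zero.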

The density function of the Student's distributive model is

$$f(x) = \frac{\left(1 + x^2/g\right)^{-(g+1)/2}}{\sqrt{g}\, B(g/2, 1/2)}, \qquad -\infty < x < +\infty \tag{7}$$

in which the parameter $g$ is the number of degrees of freedom.

Using Mathematica software for this model with $g = 3, 5, \ldots, 19$, the values of $I_g$ obtained are shown in the following table.

| $g$ | Value of $I_g$ |
|---|---|
| 3 | $15/(2\pi^{2}) - 1/2$ |
| 5 | $1925/(432\pi^{2}) - 5/18$ |
| 7 | $17{,}017/(4500\pi^{2}) - 7/30$ |
| 9 | $9{,}561{,}123/(2{,}744{,}000\pi^{2}) - 3/14$ |
| 11 | $2{,}369{,}851/(714{,}420\pi^{2}) - 11/54$ |
| 13 | $2{,}170{,}568{,}075/(676{,}190{,}592\pi^{2}) - 13/66$ |
| 15 | $3{,}920{,}876{,}125/(1{,}250{,}497{,}248\pi^{2}) - 5/26$ |
| 17 | $1{,}077{,}676{,}328{,}213/(349{,}825{,}132{,}800\pi^{2}) - 17/90$ |
| 19 | $135{,}999{,}445{,}173{,}949/(44{,}757{,}574{,}933{,}500\pi^{2}) - 19/102$ |

Each value is the sum of two terms. The second term is easily represented by the following formula

$$-\frac{g}{6(g-2)}. \tag{8}$$

The expression of the first term is harder to determine. After several attempts at comparing each first term with the previous one, we found the recurrence formula:

$$A_g = A_{g-2}\, \frac{g(g-4)(g-3)^2(3g-4)(3g-8)}{(g-2)^4(3g-7)(3g-11)}, \qquad \text{for } g = 5, 7, \ldots \tag{9}$$

with the initial value $A_3 = 15/(2\pi^2)$.
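The recurrence (9) can be replayed with exact rational arithmetic. The sketch below tracks only the rational coefficient that multiplies $1/\pi^2$ in $A_g$, starting from $15/2$ for $g = 3$; the helper names `recurrence_ratio` and `first_term_coefficients` are ours.

```python
from fractions import Fraction


def recurrence_ratio(g):
    """Factor A_g / A_{g-2} given by the recurrence formula (9)."""
    numerator = g * (g - 4) * (g - 3) ** 2 * (3 * g - 4) * (3 * g - 8)
    denominator = (g - 2) ** 4 * (3 * g - 7) * (3 * g - 11)
    return Fraction(numerator, denominator)


def first_term_coefficients(g_start, a_start, g_max):
    """Apply the recurrence in steps of 2 from g_start up to g_max."""
    coefficients = {g_start: a_start}
    g = g_start
    while g + 2 <= g_max:
        g += 2
        coefficients[g] = coefficients[g - 2] * recurrence_ratio(g)
    return coefficients


# Odd case: A_3 = (15/2) / pi^2, so the tracked coefficient starts at 15/2.
odd_coefficients = first_term_coefficients(3, Fraction(15, 2), 19)
for g, coefficient in odd_coefficients.items():
    print(g, coefficient)
```

The fractions printed for $g = 5, 7, 9$ are $1925/432$, $17017/4500$ and $9561123/2744000$, the same coefficients that appear in the first terms of the odd-$g$ values of $I_g$.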

Considering the previous relation, we came to the following expression of the first term of $I_g$ for odd $g$ values greater than 3:

$$A_g = \frac{\sqrt{3}\, g\, \Gamma[(g-1)/2]^2\, \Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{2\pi (g-2)\, \Gamma(g/2)^2\, \Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)}, \tag{10}$$

and then the expression of $I_g$

$$I_g = \frac{\sqrt{3}\, g\, \Gamma[(g-1)/2]^2\, \Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{2\pi (g-2)\, \Gamma(g/2)^2\, \Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)} - \frac{g}{6(g-2)} \tag{11}$$
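The closed form (11) can be compared numerically with the tabulated values of $I_g$. The sketch below is only a check under one stated assumption: the leading constant of the first term is read as $\sqrt{3}$, which is the reading that reproduces $A_3 = 15/(2\pi^2)$. The function names are ours.

```python
import math
from fractions import Fraction


def first_term(g):
    """First term A_g of I_g, equation (10), with leading constant sqrt(3)."""
    numerator = (math.sqrt(3.0) * g * math.gamma((g - 1) / 2) ** 2
                 * math.gamma(g / 2 - 1 / 3) * math.gamma(g / 2 + 1 / 3))
    denominator = (2 * math.pi * (g - 2) * math.gamma(g / 2) ** 2
                   * math.gamma(g / 2 - 1 / 6) * math.gamma(g / 2 - 5 / 6))
    return numerator / denominator


def i_g(g):
    """I_g of equation (11): first term minus g / (6 (g - 2))."""
    return first_term(g) - g / (6 * (g - 2))


# First rows of the odd-g table: (coefficient of 1/pi^2, second term).
table_rows = {
    3: (Fraction(15, 2), Fraction(1, 2)),
    5: (Fraction(1925, 432), Fraction(5, 18)),
    7: (Fraction(17017, 4500), Fraction(7, 30)),
    9: (Fraction(9561123, 2744000), Fraction(3, 14)),
}
for g, (coefficient, second) in table_rows.items():
    tabulated = float(coefficient) / math.pi ** 2 - float(second)
    print(g, i_g(g), tabulated)
```

The two printed columns should agree to floating-point accuracy for each $g$.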

Using Mathematica again for $g = 4, 6, \ldots, 20$, the values of $I_g$ obtained are shown in the following table.

| $g$ | Value of $I_g$ |
|---|---|
| 4 | $8/15 - 1/3$ |
| 6 | $9/22 - 1/4$ |
| 8 | $8000/21{,}879 - 2/9$ |
| 10 | $30{,}625/89{,}148 - 5/24$ |
| 12 | $1{,}778{,}112/5{,}386{,}025 - 1/5$ |
| 14 | $166{,}012/516{,}925 - 7/36$ |
| 16 | $345{,}506{,}304/1{,}097{,}845{,}315 - 4/21$ |
| 18 | $6{,}832{,}522{,}125/22{,}049{,}643{,}544 - 3/16$ |
| 20 | $8{,}450{,}958{,}230{,}000/27{,}608{,}909{,}922{,}531 - 5/27$ |

The second term of $I_g$ is again given by the simple formula

$$-\frac{g}{6(g-2)}. \tag{12}$$

After several attempts at comparing each first term with the previous one, we found the recurrence formula:

$$A_g = A_{g-2}\, \frac{g(g-4)(g-3)^2(3g-4)(3g-8)}{(g-2)^4(3g-7)(3g-11)}, \qquad \text{for } g = 6, 8, \ldots \tag{13}$$

with the initial value $A_4 = 8/15$. Note that the recurrence formula is the same as in the odd case.

Considering the previous relation, we came to the following expression of the first term of $I_g$ for even $g$ values greater than 4:

$$A_g = \frac{2^{3-2g} \sqrt{3}\, g (g-2)\, \Gamma(g-2)^2\, \Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{\Gamma(g/2)^4\, \Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)} \tag{14}$$

and then the expression of $I_g$

$$I_g = \frac{2^{3-2g} \sqrt{3}\, g (g-2)\, \Gamma(g-2)^2\, \Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{\Gamma(g/2)^4\, \Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)} - \frac{g}{6(g-2)} \tag{15}$$

Through some algebraic steps (the duplication formula for the gamma function) it is easily verified that the two formulas of $I_g$ for the odd case and the even case coincide. A single, more compact expression is the following:

$$I_g = \frac{2^{3-2g} \sqrt{3}\, g\, \Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{(g-1)^2 (g-2)\, B(g/2, g/2)^2\, \Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)} - \frac{g}{6(g-2)} \tag{16}$$
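The claim that the odd-case, even-case and compact expressions coincide can be checked numerically for every integer $g$ from 3 to 20. The sketch below rests on two readings that we state as assumptions: the leading constant is $\sqrt{3}$, and the beta function in the compact form enters squared, which is what the identity $\Gamma(g) = (g-1)(g-2)\Gamma(g-2)$ gives when the even-case form is rewritten. All function names are ours.

```python
import math

SQRT3 = math.sqrt(3.0)


def gamma_ratio(g):
    """Ratio of gamma functions common to the three forms."""
    return (math.gamma(g / 2 - 1 / 3) * math.gamma(g / 2 + 1 / 3)
            / (math.gamma(g / 2 - 1 / 6) * math.gamma(g / 2 - 5 / 6)))


def first_term_odd_form(g):
    """First term as written for the odd case, equation (10)."""
    return (SQRT3 * g * math.gamma((g - 1) / 2) ** 2 * gamma_ratio(g)
            / (2 * math.pi * (g - 2) * math.gamma(g / 2) ** 2))


def first_term_even_form(g):
    """First term as written for the even case, equation (14)."""
    return (2.0 ** (3 - 2 * g) * SQRT3 * g * (g - 2)
            * math.gamma(g - 2) ** 2 * gamma_ratio(g)
            / math.gamma(g / 2) ** 4)


def first_term_compact_form(g):
    """First term of the compact expression (16), with B(g/2, g/2) squared."""
    beta = math.gamma(g / 2) ** 2 / math.gamma(g)  # B(g/2, g/2)
    return (2.0 ** (3 - 2 * g) * SQRT3 * g * gamma_ratio(g)
            / ((g - 1) ** 2 * (g - 2) * beta ** 2))


for g in range(3, 21):
    a = first_term_odd_form(g)
    b = first_term_even_form(g)
    c = first_term_compact_form(g)
    assert math.isclose(a, b, rel_tol=1e-9)
    assert math.isclose(a, c, rel_tol=1e-9)
    print(g, a)
```

For $g = 4$ and $g = 6$ all three forms return $8/15$ and $9/22$, the first terms tabulated for the even case.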

Recall that for the Student's distributive model the expressions of the mean value ($\mu$), the variance ($\sigma^2$) and the mean difference ($\Delta$) are the following:

$$\mu = 0, \tag{17}$$

$$\sigma^2 = \frac{g}{g-2}, \tag{18}$$

$$\Delta = \frac{\sqrt{g}\, B[(g-1)/2, (g+1)/2]}{(2g-1)\, B(g, g)\, B(g/2, g/2)\, 2^{2g-3}}. \tag{19}$$
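The expression (19) for the mean difference can be evaluated on a logarithmic scale, which avoids overflow in $2^{2g-3}$ and underflow in $B(g, g)$ for large $g$. The sketch below assumes the reading of (19) with $\sqrt{g}$ in the numerator and the three remaining factors in the denominator; `log_beta` and `student_mean_difference` are our own helper names.

```python
import math


def log_beta(a, b):
    """Natural logarithm of the beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def student_mean_difference(g):
    """Mean difference of the Student model, equation (19), for g > 1."""
    log_delta = (0.5 * math.log(g)
                 + log_beta((g - 1) / 2, (g + 1) / 2)
                 - math.log(2 * g - 1)
                 - log_beta(g, g)
                 - log_beta(g / 2, g / 2)
                 - (2 * g - 3) * math.log(2.0))
    return math.exp(log_delta)


for g in (3, 4, 10, 100, 10_000):
    print(g, student_mean_difference(g))
print("normal model:", 2 / math.sqrt(math.pi))
```

As $g$ grows the printed values decrease towards $2/\sqrt{\pi}$, the mean difference of the standard normal model, in line with the Student model tending to the normal one.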

Using Lomnicki's formula (3), we obtain the following expression of the variance of the sample mean difference for the Student's distributive model:

$$\operatorname{var}(\bar{\Delta}) = \frac{4g(n+1)}{3(g-2)\, n(n-1)} + \frac{2^{5-2g}\, g\, \Gamma(g - 1/2)^2\, \Gamma[(g-1)/2]^2\, (3-2n)}{\Gamma(g/2)^6\, n(n-1)} + \frac{8\sqrt{3}\, g\, \Gamma[(g-1)/2]^2\, h(g)\, (n-2)}{\pi (g-2)\, \Gamma(g/2)^2\, n(n-1)} \tag{20}$$

in which

$$h(g) = \frac{\Gamma(g/2 - 1/3)\, \Gamma(g/2 + 1/3)}{\Gamma(g/2 - 1/6)\, \Gamma(g/2 - 5/6)} \tag{21}$$

It is easily checked that, as $g$ diverges,

$$\lim_{g \to \infty} \operatorname{var}(\bar{\Delta}) = \frac{72 - 48\sqrt{3} + 4\pi - (48 - 24\sqrt{3} - 4\pi)\, n}{3\pi\, n(n-1)}, \tag{22}$$

which is the variance of the sample mean difference for the normal model. It is also easily verified that, as $n$ diverges, this variance approaches zero, which means that $\bar{\Delta}$ is a consistent estimator.
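The limiting statements can be illustrated numerically. The sketch below evaluates (20) with log-gamma functions so that large $g$ causes no overflow, and compares it with the normal-model limit obtained by letting $g \to \infty$ term by term: $\sigma^2 \to 1$, $\Delta^2 \to 4/\pi$ and $h(g)\,\Gamma[(g-1)/2]^2/\Gamma(g/2)^2 \to 1$, which gives the coefficient $-(48 - 24\sqrt{3} - 4\pi)$ for $n$. The function names are ours and the code is a check, not the authors' Mathematica computation.

```python
import math


def student_mean_diff_variance(g, n):
    """Variance of the sample mean difference, equation (20); g > 2, n >= 2."""
    lg = math.lgamma
    # Gamma((g-1)/2)^2 / Gamma(g/2)^2
    ratio = math.exp(2 * (lg((g - 1) / 2) - lg(g / 2)))
    # 2^(5-2g) g Gamma(g-1/2)^2 Gamma((g-1)/2)^2 / Gamma(g/2)^6, i.e. 2 Delta^2
    two_delta_sq = math.exp((5 - 2 * g) * math.log(2.0) + math.log(g)
                            + 2 * lg(g - 0.5) + 2 * lg((g - 1) / 2)
                            - 6 * lg(g / 2))
    # h(g) of equation (21)
    h = math.exp(lg(g / 2 - 1 / 3) + lg(g / 2 + 1 / 3)
                 - lg(g / 2 - 1 / 6) - lg(g / 2 - 5 / 6))
    term1 = 4 * g * (n + 1) / (3 * (g - 2))
    term2 = two_delta_sq * (3 - 2 * n)
    term3 = 8 * math.sqrt(3.0) * g * ratio * h * (n - 2) / (math.pi * (g - 2))
    return (term1 + term2 + term3) / (n * (n - 1))


def normal_mean_diff_variance(n):
    """Limit of (20) as g diverges: the normal-model variance."""
    s3 = math.sqrt(3.0)
    numerator = (72 - 48 * s3 + 4 * math.pi
                 - (48 - 24 * s3 - 4 * math.pi) * n)
    return numerator / (3 * math.pi * n * (n - 1))


for n in (2, 10, 100, 1000):
    print(n, student_mean_diff_variance(5, n),
          student_mean_diff_variance(1e5, n), normal_mean_diff_variance(n))
```

For each $n$ the value for $g = 10^5$ is close to the normal-model value, and for fixed $g$ the variance shrinks roughly like $1/n$, which is the consistency property stated above.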

The sample mean difference $\bar{\Delta}$ is an unbiased estimator of the population mean difference for every distributive model. To verify whether it is also consistent, its variance must be calculated. In this paper we have obtained the formal expression of the variance of $\bar{\Delta}$ for the Student's distributive model in terms of the parameter $g$ (degrees of freedom) and of the sample size $n$.

Because this variance approaches zero as the sample size $n$ diverges, $\bar{\Delta}$ is consistent for the Student's distributive model as well. As $g$ diverges, the Student's distributive model tends to the normal one, and the expression we found for the variance of $\bar{\Delta}$ accordingly approaches the variance of $\bar{\Delta}$ for the normal distributive model.

The helpful and constructive comments of a referee, which led to an improvement of the presentation of the paper, and the support of the editorial staff of Open Journal of Statistics in processing the paper are gratefully acknowledged.

The authors declare no conflicts of interest regarding the publication of this paper.

Manca, F. and Marin, C. (2020) On the Mean Difference Variance in Random Samples of Student’s Variables. Open Journal of Statistics, 10, 659-663. https://doi.org/10.4236/ojs.2020.104040