<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">JAMP</journal-id><journal-title-group><journal-title>Journal of Applied Mathematics and Physics</journal-title></journal-title-group><issn pub-type="epub">2327-4352</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/jamp.2019.78122</article-id><article-id pub-id-type="publisher-id">JAMP-94421</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Physics&amp;Mathematics</subject></subj-group></article-categories><title-group><article-title>
 
 
  Chi-Square Distribution: New Derivations and Environmental Application
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Semkow</surname><given-names>Thomas M.</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref><xref ref-type="corresp" rid="cor1"><sup>*</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Freeman</surname><given-names>Nicole</given-names></name><xref ref-type="aff" rid="aff2"><sup>2</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Syed</surname><given-names>Umme-Farzana</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Haines</surname><given-names>Douglas K.</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Bari</surname><given-names>Abdul</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Khan</surname><given-names>Abdul J.</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Nishikawa</surname><given-names>Kimi</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Khan</surname><given-names>Adil</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Burn</surname><given-names>Adam G.</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Li</surname><given-names>Xin</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Chu</surname><given-names>Liang T.</given-names></name><xref ref-type="aff" rid="aff3"><sup>3</sup></xref></contrib></contrib-group><aff id="aff2"><addr-line>Averill Park Central School District, Averill Park, NY, USA</addr-line></aff><aff id="aff3"><addr-line>Department of Environmental Health Sciences, University at Albany, State University of New York, Rensselaer, NY, USA</addr-line></aff><aff id="aff1"><addr-line>Wadsworth Center, New York State Department of Health, Albany, NY, USA</addr-line></aff><pub-date pub-type="epub"><day>12</day><month>08</month><year>2019</year></pub-date><volume>07</volume><issue>08</issue><fpage>1786</fpage><lpage>1799</lpage><history><date date-type="received"><day>19</day><month>July</month><year>2019</year></date><date date-type="rev-recd"><day>16</day><month>August</month><year>2019</year></date><date date-type="accepted"><day>19</day><month>August</month><year>2019</year></date></history><permissions><copyright-statement>&#169; Copyright 2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  We describe two new derivations of the chi-square distribution. The first uses mathematical induction and requires the calculation of only a single integral. The second uses the Laplace transform and requires minimal assumptions. The new derivations are compared with established derivations, such as those by convolution, moment generating function, and Bayesian inference. Chi-square testing has seen many applications in physics and other fields. We describe a unique version of the chi-square test in which both the variance and location are tested, and apply it to environmental data. The chi-square test is used to judge whether a laboratory method is capable of detecting gross alpha and beta radioactivity in drinking water for regulatory monitoring to protect public health. A case of failure of the chi-square test and its amelioration are described. The chi-square test is compared to and supplemented by the 
  <em>t</em>-test.
 
</p></abstract><kwd-group><kwd>Mathematical Induction</kwd><kwd>Laplace Transform</kwd><kwd>Gamma Distribution</kwd><kwd>Chi-Square Test</kwd><kwd>Gross Alpha-Beta</kwd><kwd>Drinking Water</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>The chi-square distribution (CSD) has been one of the most frequently used distributions in science. It is a special case of the gamma distribution (see Section 2). The latter has been an important distribution in fundamental physics, for example as the kinetic energy distribution of particles in an ideal gas (Maxwell-Boltzmann) [<xref ref-type="bibr" rid="scirp.94421-ref1">1</xref>] or the kinetic energy distribution of particles emitted from excited nuclei in nuclear reactions [<xref ref-type="bibr" rid="scirp.94421-ref2">2</xref>]. A historical context for the development of the CSD is described in References [<xref ref-type="bibr" rid="scirp.94421-ref3">3</xref>] and [<xref ref-type="bibr" rid="scirp.94421-ref4">4</xref>]. Its first derivation is attributed to Bienaym&#233; [<xref ref-type="bibr" rid="scirp.94421-ref5">5</xref>], who used multiple integrals over normal variables and substitutions. Abbe [<xref ref-type="bibr" rid="scirp.94421-ref6">6</xref>] used a method of integration in the complex plane to solve multiple integrals. The most general derivation is attributed to Helmert, who proposed a classic transformation to derive the CSD, including calculation of the Jacobian determinant of the transformation [<xref ref-type="bibr" rid="scirp.94421-ref7">7</xref>]. This transformation can be worked out in polar variables, as described in statistical textbooks [<xref ref-type="bibr" rid="scirp.94421-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.94421-ref8">8</xref>].</p><p>The established fundamental derivations of the CSD described above involve complicated handling of multiple integrals. 
In contrast, the simplified derivations use the fact that the CSD is a special case of the gamma distribution. Owing to the integrable and recursive properties of the gamma distribution, as well as its moment generating function (Mgf), simplified derivations of the CSD are described in textbooks [<xref ref-type="bibr" rid="scirp.94421-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.94421-ref10">10</xref>]. Another simplified derivation uses Bayesian inference [<xref ref-type="bibr" rid="scirp.94421-ref11">11</xref>]. In Section 2, we refer to these methods for comparison.</p><p>In this work, we present two new methods of derivation of the CSD. Both fall within the simplified category. One of them is mathematical induction. The original derivation was done by Helmert [<xref ref-type="bibr" rid="scirp.94421-ref12">12</xref>] using a 2-step forward mathematical induction. We have elaborated on that and observed that the CSD has a certain recursive property, which enables its derivation using a single-step induction plus the well-known theorem relating the beta and gamma functions. The other derivation method we describe is by the Laplace transform. This method has some similarity to the Mgf and characteristic function methods, owing to the presence of exponentiation. It uses complex-variable integration and is free from many assumptions of the other methods. The two new derivations of the CSD by mathematical induction and Laplace transform are described in Section 2.</p><p>Chi-square testing (CST) is closely related to and based upon the CSD. It has its origins in the discovery of the goodness-of-fit test by Pearson [<xref ref-type="bibr" rid="scirp.94421-ref13">13</xref>]. In the goodness-of-fit test, one calculates the test statistic as</p><p><disp-formula><tex-math>\chi_\nu^2 = \sum_{i=1}^{m} \frac{(O_i - E_i)^2}{E_i} ,</tex-math></disp-formula> (1)</p><p>where <inline-formula><tex-math>O_i</tex-math></inline-formula> is the observed frequency and <inline-formula><tex-math>E_i</tex-math></inline-formula> is the expected frequency, based on an assumed model distribution, for category <inline-formula><tex-math>i</tex-math></inline-formula>, and <inline-formula><tex-math>m</tex-math></inline-formula> is the number of categories. 
Both <inline-formula><tex-math>O_i</tex-math></inline-formula> and <inline-formula><tex-math>E_i</tex-math></inline-formula> are unitless. The number of degrees of freedom is <inline-formula><tex-math>\nu = m - 1 - p</tex-math></inline-formula>, where <inline-formula><tex-math>p</tex-math></inline-formula> is the number of parameters of the model distribution calculated from the data. For any model distribution, Equation (1) leads asymptotically to the CSD when the number of observations is large, which has been proved for the multinomial distribution by Pearson [<xref ref-type="bibr" rid="scirp.94421-ref13">13</xref>]. The goodness-of-fit CST has been extensively used in statistics and widely applied to many fields [<xref ref-type="bibr" rid="scirp.94421-ref3">3</xref>] [<xref ref-type="bibr" rid="scirp.94421-ref14">14</xref>]. It is worth noting that the interpretation of the degrees of freedom was provided by Fisher [<xref ref-type="bibr" rid="scirp.94421-ref15">15</xref>]. As an example in physics, the goodness-of-fit CST has been used to verify Poisson fluctuations of a radioactivity counter [<xref ref-type="bibr" rid="scirp.94421-ref14">14</xref>] [<xref ref-type="bibr" rid="scirp.94421-ref16">16</xref>].</p><p>Another form of the chi-square variable from Equation (1) is written in the general form as</p><p><disp-formula><tex-math>\chi_\nu^2 = \sum_{i=1}^{n} \left( \frac{x_i - \mu_i}{\sigma_i} \right)^2 ,</tex-math></disp-formula> (2)</p><p>where <inline-formula><tex-math>n</tex-math></inline-formula> is the number of observations, <inline-formula><tex-math>x_i</tex-math></inline-formula> is the observed variable, <inline-formula><tex-math>\mu_i</tex-math></inline-formula> is the expected value, <inline-formula><tex-math>\sigma_i</tex-math></inline-formula> is the standard deviation, and <inline-formula><tex-math>\nu \le n</tex-math></inline-formula>. The variables in Equation (2) can be expressed in physical units. In the limit of a large number of observations, the variable and parameters of Equation (2) are approximated by those of normal variates, and <inline-formula><tex-math>\chi_\nu^2</tex-math></inline-formula> is distributed as the CSD. In this work, we generalize this CST to a combined test for variance and location and verify it with the t-test [<xref ref-type="bibr" rid="scirp.94421-ref17">17</xref>]. The test statistics studied are described in Section 3.</p><p>Within the context of this work, we present a unique application of the CST to the detection of radioactive contaminants in drinking water required by the Safe Drinking Water Act (SDWA) in the US. 
The bulk of natural alpha and beta/gamma (photon) radioactivity in drinking water originates from the possible presence of <sup>238</sup>U and <sup>232</sup>Th natural radioactive-series progeny, <sup>226,228</sup>Ra and their progeny, as well as <sup>40</sup>K radionuclides [<xref ref-type="bibr" rid="scirp.94421-ref18">18</xref>]. The SDWA regulations [<xref ref-type="bibr" rid="scirp.94421-ref19">19</xref>] establish a Maximum Contaminant Level (MCL) of 15 pCi/L (555 mBq/L) for gross alpha (GA) radioactivity, excluding U and Rn. For gross beta (GB) radioactivity, the MCL is limited by the total body or any organ radiation dose of 4 mrem/y (40 μSv/y). For both GA and GB, the Maximum Contaminant Level Goal (MCLG) is zero. Furthermore, the SDWA requires Detection Limits (DL) of 3 pCi/L (111 mBq/L) and 4 pCi/L (148 mBq/L) for GA and GB radioactivity, respectively. These DLs must be met by all public health laboratories accredited for monitoring of GA and GB radioactivity in drinking water in the US. In Section 4, we detail a CST procedure to verify whether the required DLs are met [<xref ref-type="bibr" rid="scirp.94421-ref20">20</xref>]. We investigate the reasons for and consequences of a failed CST and ameliorate such cases.</p></sec><sec id="s2"><title>2. Chi-Square Distribution</title><p>The probability density function (Pdf) of the CSD is given by</p><p><disp-formula><tex-math>\mathrm{Pdf}(\chi_\nu^2 \mid \nu) = \frac{(\chi_\nu^2)^{\nu/2-1}\, e^{-\chi_\nu^2/2}}{2^{\nu/2}\,\Gamma(\nu/2)} ,</tex-math></disp-formula> (3)</p><p>where <inline-formula><tex-math>\Gamma</tex-math></inline-formula> is the gamma function. The expectation value of the CSD is <inline-formula><tex-math>E[\chi^2] = \nu</tex-math></inline-formula>, and the variance is <inline-formula><tex-math>\mathrm{Var}[\chi^2] = 2\nu</tex-math></inline-formula> [<xref ref-type="bibr" rid="scirp.94421-ref21">21</xref>]. The CSD is a special case of the gamma distribution, abbreviated as <inline-formula><tex-math>\mathrm{gamma}(\chi_\nu^2 \mid a, b)</tex-math></inline-formula>, with the parameters <inline-formula><tex-math>a = \nu/2</tex-math></inline-formula> and <inline-formula><tex-math>b = 2</tex-math></inline-formula> [<xref ref-type="bibr" rid="scirp.94421-ref21">21</xref>].</p><p>To derive Equation (3), we start with the general definition of the <inline-formula><tex-math>\chi_\nu^2</tex-math></inline-formula> statistic given by Equation (2), assuming normal variates. 
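For a single normal variable <inline-formula><tex-math>x_1</tex-math></inline-formula> with <inline-formula><tex-math>\mathrm{Pdf}(x_1)</tex-math></inline-formula>, the probability of <inline-formula><tex-math>x_1 \in [x_1, x_1 + dx_1]</tex-math></inline-formula> is given by</p><p><disp-formula><tex-math>\mathrm{Pdf}(x_1)\, dx_1 = \frac{1}{\sqrt{2\pi}\,\sigma_1}\, e^{-\left(\frac{x_1-\mu_1}{\sigma_1}\right)^2/2}\, dx_1 .</tex-math></disp-formula> (4)</p><p>By substituting <inline-formula><tex-math>\chi_1^2 = ((x_1-\mu_1)/\sigma_1)^2</tex-math></inline-formula>, we obtain from Equation (4)</p><p><disp-formula><tex-math>\mathrm{Pdf}(\chi_1^2 \mid 1)\, d\chi_1^2 = \frac{2}{\sqrt{2\pi}\,\sigma_1}\, e^{-\chi_1^2/2} \left| \frac{dx_1}{d\chi_1^2} \right| d\chi_1^2 = \frac{(\chi_1^2)^{1/2-1}\, e^{-\chi_1^2/2}}{2^{1/2}\,\Gamma(1/2)}\, d\chi_1^2 = \mathrm{gamma}(\chi_1^2 \mid 1/2, 2)\, d\chi_1^2 ,</tex-math></disp-formula> (5)</p><p>which has the Pdf given by Equation (3) for <inline-formula><tex-math>\nu = 1</tex-math></inline-formula>. In deriving Equation (5), we also used <inline-formula><tex-math>\Gamma(1/2) = \sqrt{\pi}</tex-math></inline-formula>, whereas the factor of 2 originates from the fact that the <inline-formula><tex-math>x_1</tex-math></inline-formula> variable, ranging from minus infinity to plus infinity, has been substituted with the <inline-formula><tex-math>\chi_1^2</tex-math></inline-formula> variable, ranging from zero to plus infinity.</p><p>Let us assume that the <inline-formula><tex-math>(n+1)</tex-math></inline-formula>th term with the normal variable <inline-formula><tex-math>x_{n+1}</tex-math></inline-formula> is added to Equation (2), and that this addition raises the number of degrees of freedom to <inline-formula><tex-math>\nu + 1</tex-math></inline-formula>. Then,</p><p><disp-formula><tex-math>\chi_{\nu+1}^2 = \chi_\nu^2 + \left( \frac{x_{n+1} - \mu_{n+1}}{\sigma_{n+1}} \right)^2 .</tex-math></disp-formula> (6)</p><p>Using the calculus for probability density functions [<xref ref-type="bibr" rid="scirp.94421-ref21">21</xref>],</p><p><disp-formula><tex-math>\mathrm{Pdf}(\chi_{\nu+1}^2 \mid \nu+1)\, d\chi_{\nu+1}^2 = \int_{-\infty}^{+\infty} \mathrm{Pdf}(\chi_\nu^2 \mid \nu)\, d\chi_\nu^2\, \mathrm{Pdf}(x_{n+1})\, dx_{n+1} .</tex-math></disp-formula> (7)</p><p>Let us define a new variable <inline-formula><tex-math>z</tex-math></inline-formula> such that</p><p><disp-formula><tex-math>\left( \frac{x_{n+1} - \mu_{n+1}}{\sigma_{n+1}} \right)^2 = \chi_{\nu+1}^2\, (1 - z) .</tex-math></disp-formula> (8)</p><p>By realizing that <inline-formula><tex-math>d\chi_{\nu+1}^2 = d\chi_\nu^2</tex-math></inline-formula>, and performing all substitutions, the right side of Equation (7) can be rewritten as</p><p><disp-formula><tex-math>2 \int_0^1 \mathrm{Pdf}(\chi_\nu^2 \mid \nu)\, \mathrm{Pdf}(x_{n+1}) \left| \frac{dx_{n+1}}{dz} \right| dz\, d\chi_{\nu+1}^2 = \frac{(\chi_{\nu+1}^2)^{(\nu+1)/2-1}\, e^{-\chi_{\nu+1}^2/2}}{2^{(\nu+1)/2}\,\Gamma(\nu/2)\,\Gamma(1/2)}\, d\chi_{\nu+1}^2 \int_0^1 z^{\nu/2-1} (1-z)^{1/2-1}\, dz .</tex-math></disp-formula> 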
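(9)</p><p>However, the integral on the right side of Equation (9) is the beta function, <inline-formula><tex-math>B(\nu/2, 1/2)</tex-math></inline-formula>, which is related to the gamma function by [<xref ref-type="bibr" rid="scirp.94421-ref22">22</xref>]</p><p><disp-formula><tex-math>B(\nu/2, 1/2) = \frac{\Gamma(\nu/2)\,\Gamma(1/2)}{\Gamma((\nu+1)/2)} .</tex-math></disp-formula> (10)</p><p>By inserting Equation (10) into Equation (9), simplifying, and comparing with the left side of Equation (7), one obtains</p><p><disp-formula><tex-math>\mathrm{Pdf}(\chi_{\nu+1}^2 \mid \nu+1) = \frac{(\chi_{\nu+1}^2)^{(\nu+1)/2-1}\, e^{-\chi_{\nu+1}^2/2}}{2^{(\nu+1)/2}\,\Gamma((\nu+1)/2)} ,</tex-math></disp-formula> (11)</p><p>which is the Pdf given by Equation (3) for <inline-formula><tex-math>\nu + 1</tex-math></inline-formula> degrees of freedom, and this proves Equation (3) by induction.</p><p>By substituting <inline-formula><tex-math>\varphi_i^2 = ((x_i-\mu_i)/\sigma_i)^2</tex-math></inline-formula>, Equation (2) becomes</p><p><disp-formula><tex-math>\chi_\nu^2 = \sum_{i=1}^{n} \varphi_i^2 .</tex-math></disp-formula> (12)</p><p>The distribution of a sum of independent random variables <inline-formula><tex-math>\varphi_i^2</tex-math></inline-formula> is the convolution of their distributions, so the distribution function for <inline-formula><tex-math>\chi_\nu^2</tex-math></inline-formula> can be obtained by calculating an n-dimensional convolution integral. Exploiting the properties of this convolution leads to simplifications, which have been used in the literature. By convolving two gamma distributions <inline-formula><tex-math>\chi_1^2 \equiv \varphi_i^2</tex-math></inline-formula> from Equation (5) and using the theorem that the convolution of two gammas is also a gamma, one obtains <inline-formula><tex-math>\mathrm{gamma}(\chi_2^2 \mid 2/2, 2)</tex-math></inline-formula> [<xref ref-type="bibr" rid="scirp.94421-ref9">9</xref>]. By continuing this process of convolving with <inline-formula><tex-math>\chi_1^2</tex-math></inline-formula>, it is easy to infer that the full convolution is equal to <inline-formula><tex-math>\mathrm{gamma}(\chi_\nu^2 \mid \nu/2, 2)</tex-math></inline-formula>, where <inline-formula><tex-math>\nu = n</tex-math></inline-formula>, which is the CSD given by Equation (3). This provides a simplified derivation of the CSD using convolution.</p><p>Another simplified derivation of the CSD uses the theorem that the Mgf of a convolution is the product of the individual Mgfs [<xref ref-type="bibr" rid="scirp.94421-ref10">10</xref>]. Thus, by calculating the Mgf of <inline-formula><tex-math>\chi_1^2</tex-math></inline-formula> from Equation (5) and taking it to the nth power, one obtains the Mgf for <inline-formula><tex-math>\chi_\nu^2</tex-math></inline-formula>, where <inline-formula><tex-math>\nu = n</tex-math></inline-formula>. 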
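As a brief worked sketch of this step (the symbol <inline-formula><tex-math>M(t)</tex-math></inline-formula> for the Mgf and its argument <inline-formula><tex-math>t</tex-math></inline-formula> are notation introduced here for illustration only and do not appear in the original derivation), integrating Equation (5) against <inline-formula><tex-math>e^{t\chi_1^2}</tex-math></inline-formula> and raising the result to the nth power gives <disp-formula><tex-math>M_{\chi_1^2}(t) = \int_0^\infty e^{t\chi_1^2}\, \mathrm{Pdf}(\chi_1^2 \mid 1)\, d\chi_1^2 = (1-2t)^{-1/2}, \qquad M_{\chi_n^2}(t) = \left[ (1-2t)^{-1/2} \right]^{n} = (1-2t)^{-n/2}, \qquad t &lt; 1/2 .</tex-math></disp-formula> 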
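One can also calculate the Mgf of the gamma distribution and infer from a comparison that the CSD in Equation (3) is a special case of the gamma distribution [<xref ref-type="bibr" rid="scirp.94421-ref10">10</xref>].</p><p>In this work, we provide yet another simplified derivation of the CSD, using the Laplace transform [<xref ref-type="bibr" rid="scirp.94421-ref23">23</xref>]. The Laplace transform of Equation (5) is equal to</p><p><disp-formula><tex-math>\int_0^\infty \frac{(\chi_1^2)^{1/2-1}\, e^{-\chi_1^2/2}}{2^{1/2}\,\Gamma(1/2)}\, e^{-s\chi_1^2}\, d\chi_1^2 = \left( \frac{1/2}{s+1/2} \right)^{1/2} .</tex-math></disp-formula> (13)</p><p>Subsequently, we use the theorem that the Laplace transform of an n-fold convolution is the product of the individual transforms, i.e. <inline-formula><tex-math>\left( \frac{1/2}{s+1/2} \right)^{n/2}</tex-math></inline-formula>. By abbreviating <inline-formula><tex-math>u = \chi_n^2</tex-math></inline-formula>, the inverse Laplace transform results in the Pdf of <inline-formula><tex-math>u</tex-math></inline-formula>,</p><p><disp-formula><tex-math>\mathrm{Pdf}(u \mid n) = \frac{1}{2\pi i} \oint \left( \frac{1/2}{s+1/2} \right)^{n/2} e^{su}\, ds = \frac{1}{2^{n/2}}\, \frac{1}{2\pi i} \oint \frac{e^{su}}{(s+1/2)^{n/2}}\, ds .</tex-math></disp-formula> (14)</p><p>To calculate the contour integral in Equation (14), we start with the Cauchy integral formula for an analytic function <inline-formula><tex-math>f(s)</tex-math></inline-formula> of a complex variable <inline-formula><tex-math>s</tex-math></inline-formula>, where the integrand has a simple pole at <inline-formula><tex-math>s_0</tex-math></inline-formula> [<xref ref-type="bibr" rid="scirp.94421-ref24">24</xref>]:</p><p><disp-formula><tex-math>f(s_0) = \frac{1}{2\pi i} \oint \frac{f(s)}{s - s_0}\, ds .</tex-math></disp-formula> (15)</p><p>Differentiating Equation (15) <inline-formula><tex-math>k-1</tex-math></inline-formula> times, where the differentiation can be of integer or fractional order [<xref ref-type="bibr" rid="scirp.94421-ref25">25</xref>], results in:</p><p><disp-formula><tex-math>f^{(k-1)}(s_0) = \frac{\Gamma(k)}{2\pi i} \oint \frac{f(s)}{(s - s_0)^{k}}\, ds .</tex-math></disp-formula> (16)</p><p>By comparing Equation (14) to Equation (16), we infer that <inline-formula><tex-math>f(s) = e^{su}</tex-math></inline-formula>, <inline-formula><tex-math>s_0 = -1/2</tex-math></inline-formula>, and <inline-formula><tex-math>k = n/2</tex-math></inline-formula>. 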
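By inserting these quantities into Equation (16) and substituting the result into Equation (14), we obtain:</p><p><disp-formula><tex-math>\mathrm{Pdf}(u \mid n) = \frac{1}{2^{n/2}\,\Gamma(n/2)} \left( \frac{d^{\,n/2-1}}{ds^{\,n/2-1}}\, e^{su} \right)_{s=-1/2} = \frac{u^{n/2-1}\, e^{-u/2}}{2^{n/2}\,\Gamma(n/2)} ,</tex-math></disp-formula> (17)</p><p>which is the CSD given by Equation (3) for <inline-formula><tex-math>\nu = n</tex-math></inline-formula> and <inline-formula><tex-math>\chi_n^2 = u</tex-math></inline-formula>.</p><p>Another simplified derivation of the CSD uses Bayesian inference and is not related to the convolutions described above [<xref ref-type="bibr" rid="scirp.94421-ref11">11</xref>]. It uses a normal likelihood function for multiple samples. It also uses the transformational prior distributions: <inline-formula><tex-math>\propto 1/\sigma</tex-math></inline-formula> for the scale parameter <inline-formula><tex-math>\sigma</tex-math></inline-formula> and a constant for the translation parameter <inline-formula><tex-math>\mu</tex-math></inline-formula> [<xref ref-type="bibr" rid="scirp.94421-ref26">26</xref>]. Marginalizing the joint distribution of <inline-formula><tex-math>(\mu, \sigma)</tex-math></inline-formula> over <inline-formula><tex-math>\mu</tex-math></inline-formula> results in the CSD, whereas marginalizing over <inline-formula><tex-math>\sigma</tex-math></inline-formula> results in the t-distribution [<xref ref-type="bibr" rid="scirp.94421-ref27">27</xref>].</p><p>In Section 5, we summarize the advantages and disadvantages of the simplified derivation methods of the CSD described in this section.</p></sec><sec id="s3"><title>3. Test Statistics</title><p>Several models for the CST statistic can be derived from the general Equation (2). For the expected value, we can use either the sample mean <inline-formula><tex-math>\bar{x}</tex-math></inline-formula> or the population mean <inline-formula><tex-math>\mu</tex-math></inline-formula>, whereas for the standard deviation we can use either the individual standard deviations <inline-formula><tex-math>\sigma_i</tex-math></inline-formula> or the sample standard deviation <inline-formula><tex-math>\sigma_x</tex-math></inline-formula>. We do not know the population standard deviation for the data described in Section 4. The model test statistic <inline-formula><tex-math>\sum ((x_i - \bar{x})/\sigma_x)^2</tex-math></inline-formula> is always equal to <inline-formula><tex-math>n - 1</tex-math></inline-formula> and is thus not useful. However, the model test statistic <inline-formula><tex-math>\sum ((x_i - \bar{x})/\sigma_i)^2</tex-math></inline-formula> can be used to test the variance. 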
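The constant value <inline-formula><tex-math>n - 1</tex-math></inline-formula> quoted above for the statistic built from the sample mean and the sample standard deviation follows in one line, assuming the usual unbiased definition of the sample standard deviation (an assumption made here for illustration, since that definition is not written out in the text): <disp-formula><tex-math>\sigma_x^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2 \quad \Longrightarrow \quad \sum_{i=1}^{n} \left( \frac{x_i - \bar{x}}{\sigma_x} \right)^2 = \frac{(n-1)\, \sigma_x^2}{\sigma_x^2} = n - 1 .</tex-math></disp-formula> 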
Other possibilities are to test for both the variance and location by employing the model test statistics <inline-formula><tex-math>\sum ((x_i - \mu)/\sigma_i)^2</tex-math></inline-formula> or <inline-formula><tex-math>\sum ((x_i - \mu)/\sigma_x)^2</tex-math></inline-formula>, if the population mean is known, which is the case for the data in Section 4.</p><p>For the t-test, we perform a standard one-sample test, where we calculate the t variable as <inline-formula><tex-math>t = (\bar{x} - \mu)/(\sigma_x/\sqrt{n})</tex-math></inline-formula>. The t-test is a location test. The results of all these test models using radioactivity data are presented in Section 4.</p></sec><sec id="s4"><title>4. Chi-Square- and t-Test for Radioactivity Detection in Drinking Water</title><p>The most convenient method of measuring GA and GB radioactivity in drinking water is by gas proportional counting [<xref ref-type="bibr" rid="scirp.94421-ref28">28</xref>]. In this method, a given quantity of water is evaporated with nitric acid onto a stainless-steel planchet and dried, leaving a residue containing any radioactivity. The planchet is then counted on a gas proportional detector. Alpha and beta particles are counted simultaneously, and they are differentiated by the much larger ionization caused by the former.</p><p>As stated in Section 1, this method must be able to determine GA and GB at the DL, to be verified by the CST [<xref ref-type="bibr" rid="scirp.94421-ref20">20</xref>] using a minimum of seven samples. The EPA recommends a right-tail (RT) CST at the 99% Confidence Level (CL), or 0.01 significance. To accomplish this, <inline-formula><tex-math>n = 9</tex-math></inline-formula> samples of community drinking water were spiked with <sup>230</sup>Th and <sup>90</sup>Sr/<sup>90</sup>Y radionuclides providing alpha and beta radioactivity, respectively. The spiking activities (i.e. the expected <inline-formula><tex-math>\mu</tex-math></inline-formula>) were 2.9888 &#177; 0.0402 pCi/L for alpha and 4.1860 &#177; 0.0549 pCi/L for beta, close to the required DL values. The values of the spiking activities and their uncertainties were obtained from standards traceable to the National Institute of Standards and Technology (NIST). 
Then the experimental procedure was followed, and the measured GA and GB activities <inline-formula><tex-math>x_i</tex-math></inline-formula> are depicted as points in <xref ref-type="fig" rid="fig1">Figure 1</xref> and <xref ref-type="fig" rid="fig2">Figure 2</xref>, respectively.</p><p>Also shown in <xref ref-type="fig" rid="fig1">Figure 1</xref> and <xref ref-type="fig" rid="fig2">Figure 2</xref> are the individual standard deviations <inline-formula><tex-math>\sigma_i</tex-math></inline-formula>, depicted as vertical lines. These standard uncertainties are propagated, including the Poisson statistics of radioactivity counting and background subtraction, uncertainties of the detector efficiency, cross-talk between alpha and beta particles, as well as solution-pipetting uncertainties. Therefore, they are slightly different for different samples.</p><p>The GA results are described first. The sample average for GA is <inline-formula><tex-math>\bar{x} = 3.0951</tex-math></inline-formula> pCi/L (red horizontal thick line), which is close to the expected <inline-formula><tex-math>\mu</tex-math></inline-formula> (green horizontal thick line), as seen in <xref ref-type="fig" rid="fig1">Figure 1</xref>. The sample standard deviation is <inline-formula><tex-math>\sigma_x = 0.7000</tex-math></inline-formula> pCi/L. The results of the variance test, as defined in Section 3, are given in column 3 of <xref ref-type="table" rid="table1">Table 1</xref>. The number of degrees of freedom is <inline-formula><tex-math>\nu = 8</tex-math></inline-formula> because calculating the mean imposes one constraint. The observed <inline-formula><tex-math>\chi^2</tex-math></inline-formula> statistic is equal to 14.0 for gross alpha. The right-tail (RT) and left-tail (LT) <inline-formula><tex-math>\chi^2</tex-math></inline-formula> values are calculated from the CSD at 0.01 significance each. Since <inline-formula><tex-math>1.6 &lt; 14.0 &lt; 20.1</tex-math></inline-formula>, each tail test passes at 0.01 significance and the two-tail (2T) test passes at 0.02 significance. Then, the results of the two combined variance/location tests, as defined in Section 3, are given in columns 4 and 5 using <inline-formula><tex-math>\sigma_i</tex-math></inline-formula> and <inline-formula><tex-math>\sigma_x</tex-math></inline-formula>, respectively. In these cases <inline-formula><tex-math>\nu = n = 9</tex-math></inline-formula>, because there are no constraints. They both pass for GA.</p><p>The t-test statistic is calculated as described in Section 3, resulting in 0.45 for GA, as given in column 6 in <xref ref-type="table" rid="table1">Table 1</xref>. 
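As an illustrative check of the arithmetic (a sketch that assumes <inline-formula><tex-math>n = 9</tex-math></inline-formula>, the unbiased sample standard deviation, and the rounded GA values quoted above), the t variable and the combined statistic based on <inline-formula><tex-math>\sigma_x</tex-math></inline-formula> are linked through the decomposition <inline-formula><tex-math>\sum (x_i - \mu)^2 = \sum (x_i - \bar{x})^2 + n(\bar{x} - \mu)^2</tex-math></inline-formula>: <disp-formula><tex-math>t = \frac{3.0951 - 2.9888}{0.7000/\sqrt{9}} \approx 0.456, \qquad \sum_{i=1}^{n} \left( \frac{x_i - \mu}{\sigma_x} \right)^2 = (n-1) + t^2 \approx 8 + 0.21 = 8.21 ,</tex-math></disp-formula> which is consistent, to within rounding, with the tabulated GA values of 0.45 and 8.2 in columns 6 and 5 of <xref ref-type="table" rid="table1">Table 1</xref>. 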
The RT probability of 0.33 and 2T probability of 0.66 are larger than 0.01 and 0.02, respectively, ensuring the passage of the location t-test.</p><p>The gross beta activities plotted in <xref ref-type="fig" rid="fig2">Figure 2</xref>, with the mean <inline-formula><tex-math>\bar{x} = 5.1274</tex-math></inline-formula> pCi/L (red horizontal thick line) and <inline-formula><tex-math>\sigma_x = 0.3050</tex-math></inline-formula> pCi/L, differ significantly from the expected <inline-formula><tex-math>\mu</tex-math></inline-formula> (green horizontal thick line), beyond the observed uncertainties. That fact did not affect the variance test, which passed for GB (column 3 in <xref ref-type="table" rid="table1">Table 1</xref>). However, the observed <inline-formula><tex-math>\chi^2</tex-math></inline-formula> values of 43.1 and 93.7 exceed the calculated RT <inline-formula><tex-math>\chi^2</tex-math></inline-formula> of 21.7 (columns 4 and 5 in <xref ref-type="table" rid="table1">Table 1</xref>); therefore, the combined variance/location tests failed. This failure is supported by the t-test, where the high <inline-formula><tex-math>t = 9.26</tex-math></inline-formula> (column 6) resulted in very low values of the RT and 2T probabilities (columns 7 and 8) and failure of the test for GB.</p><p>To elucidate the reasons for the failure of the GB CST and t-test, fifteen non-spiked Method Blank (MB) community water samples were prepared and measured. The average GA activity was below detection; however, the average GB was 0.8121 &#177; 0.2801 pCi/L. This MB was then subtracted from the spiked GB results, and the corrected GB activities are plotted in <xref ref-type="fig" rid="fig3">Figure 3</xref>. The mean of the corrected GB is <inline-formula><tex-math>\bar{x} = 4.3153</tex-math></inline-formula> pCi/L (<inline-formula><tex-math>\sigma_x = 0.3050</tex-math></inline-formula> pCi/L), very close to the value for the spiked radioactivity. The corrected observed <inline-formula><tex-math>\chi^2</tex-math></inline-formula> values are now 2.7, 3.2 and 9.6 (columns 3, 4, and 5 in <xref ref-type="table" rid="table1">Table 1</xref>), ensuring the passage of the three CSTs. This is also supported by the passage of the t-test (columns 6, 7, and 8).</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> The results of χ<sup>2</sup>- and t-tests. Abbreviations: RT right-tail, LT left-tail, 2T two-tail. 
Significance is 0.01 for each tail</title></caption><table><thead><tr><th align="center" valign="middle" >1</th><th align="center" valign="middle" >2</th><th align="center" valign="middle" >3</th><th align="center" valign="middle" >4</th><th align="center" valign="middle" >5</th><th align="center" valign="middle" >6</th><th align="center" valign="middle" >7</th><th align="center" valign="middle" >8</th></tr></thead><tbody><tr><td align="center" valign="middle"  rowspan="5"  >Experiment, reference</td><td align="center" valign="middle"  colspan="4"  >χ<sup>2</sup>-test</td><td align="center" valign="middle"  colspan="3"  >t-test</td></tr><tr><td align="center" valign="middle" >Parameter</td><td align="center" valign="middle" >Variance, σ<sub>i</sub></td><td align="center" valign="middle" >Variance and location, σ<sub>i</sub></td><td align="center" valign="middle" >Variance and location, σ<sub>x</sub></td><td align="center" valign="middle"  colspan="3"  >Location</td></tr><tr><td align="center" valign="middle" >Degrees of freedom</td><td align="center" valign="middle" >8</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >9</td><td align="center" valign="middle" >8</td><td align="center" valign="middle"  colspan="2"  ></td></tr><tr><td align="center" valign="middle" >Calculated RT</td><td align="center" valign="middle" >20.1</td><td align="center" valign="middle" >21.7</td><td align="center" valign="middle" >21.7</td><td align="center" valign="middle"  colspan="3"  ></td></tr><tr><td align="center" valign="middle" >Calculated LT</td><td align="center" valign="middle" >1.6</td><td align="center" valign="middle" >2.1</td><td align="center" valign="middle" >2.1</td><td align="center" valign="middle" >t</td><td align="center" valign="middle" >RT prob</td><td align="center" valign="middle" >2T prob</td></tr><tr><td align="center" valign="middle"  rowspan="2"  >Gross Alpha, <xref ref-type="fig" rid="fig1">Figure 1</xref></td><td align="center" valign="middle" >Observed</td><td align="center" 
valign="middle" >14.0</td><td align="center" valign="middle" >13.4</td><td align="center" valign="middle" >8.2</td><td align="center" valign="middle" >0.45</td><td align="center" valign="middle" >0.33</td><td align="center" valign="middle" >0.66</td></tr><tr><td align="center" valign="middle" >Test result</td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Passed</td></tr><tr><td align="center" valign="middle"  rowspan="2"  >Gross Beta, <xref ref-type="fig" rid="fig2">Figure 2</xref></td><td align="center" valign="middle" >Observed</td><td align="center" valign="middle" >3.8</td><td align="center" valign="middle" >43.1</td><td align="center" valign="middle" >93.7</td><td align="center" valign="middle" >9.26</td><td align="center" valign="middle" >7.5E−06</td><td align="center" valign="middle" >1.5E−05</td></tr><tr><td align="center" valign="middle" >Test result</td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Failed</td><td align="center" valign="middle" >Failed</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Failed</td><td align="center" valign="middle" >Failed</td></tr><tr><td align="center" valign="middle"  rowspan="2"  >Gross Beta-MB subtracted, <xref ref-type="fig" rid="fig3">Figure 3</xref></td><td align="center" valign="middle" >Observed</td><td align="center" valign="middle" >2.7</td><td align="center" valign="middle" >3.2</td><td align="center" valign="middle" >9.6</td><td align="center" valign="middle" >1.27</td><td align="center" valign="middle" >0.12</td><td align="center" valign="middle" >0.24</td></tr><tr><td align="center" valign="middle" >Test result</td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Passed</td><td align="center" 
valign="middle" >Passed</td><td align="center" valign="middle" ></td><td align="center" valign="middle" >Passed</td><td align="center" valign="middle" >Passed</td></tr></tbody></table></table-wrap><p>The reasons for the elevated GB in the MB of community drinking water were investigated. Ten liters of water were evaporated to 50 mL and measured using precise gamma-ray spectrometry [<xref ref-type="bibr" rid="scirp.94421-ref29">29</xref>]. It was determined that the concentration of the beta/gamma emitter <sup>40</sup>K was 0.6926 &#177; 0.0790 pCi/L. It was also possible to identify several beta/gamma progeny of the <sup>238</sup>U series: <sup>234</sup>Th, <sup>214</sup>Pb, <sup>214</sup>Bi, and <sup>210</sup>Pb, as well as those from the <sup>232</sup>Th series: <sup>228</sup>Ac, <sup>212</sup>Pb, and <sup>208</sup>Tl. The combined activity of the beta/gamma progeny was 0.1513 &#177; 0.0672 pCi/L. Therefore, the sum of <sup>40</sup>K and beta/gamma progeny was 0.8440 &#177; 0.1037 pCi/L. The latter is consistent with the GB activity of 0.8121 &#177; 0.2801 pCi/L from the MB measurement to within the measured uncertainties. Also associated with the decay of <sup>238</sup>U and <sup>232</sup>Th is their alpha activity plus alpha progeny of similar activity to that of the beta/gamma progeny. This alpha activity could not have been detected by gamma spectrometry and was below the detection limit for GA in the MB measurement. However, the fact that the measured GA of 3.0951 pCi/L is slightly higher than the expected 2.9888 pCi/L is an indication of its presence. Unlike in the case of beta activity, the small alpha progeny activity did not affect the CST or t-test. It should be noted that this level of naturally present radioactivity in the community water is far below the MCL, and thus poses little risk to the population.</p></sec><sec id="s5"><title>5. Summary and Conclusions</title><p>We have described five simplified methods of deriving the chi-square distribution. 
Three of them (by convolution, moment generating function, and Bayesian inference) are described in the literature and have been outlined here for comparison. The simplest of them seems to be the convolution method. It uses only the substitution from a normal variable to a chi-square variable and requires the calculation of a single convolution integral of the resulting distribution. From this result, it infers the form of the multiple convolution of gamma distributions that leads to the chi-square distribution. The moment generating function method of derivation is more advanced, as it requires knowledge of the moment generating function and the gamma distribution. The Bayesian inference method requires knowledge of the likelihood function and prior probabilities but does not require knowledge of the gamma distribution.</p><p>In this work, we have proposed two new methods for derivation of the chi-square distribution: by induction and by Laplace transform. The method of induction uses operational calculus with only a single integral leading to the beta function. The proposed derivation applies modern formalism and seems to be simpler than the original derivation published by Helmert as early as 1876. A disadvantage of the induction method is that it requires prior knowledge of the chi-square distribution to perform induction on it. There is a significant advantage, however. All other methods require either no constraints on the data, i.e., the number of degrees of freedom must be equal to the number of observations, or one constraint in the case of Bayesian inference. The induction method leaves any constraints intact by adding one induction step to the existing number of degrees of freedom. The proposed derivation method by Laplace transform is more advanced because it uses integration in the complex plane. 
A significant advantage of the Laplace transform and Bayesian inference methods is that they do not require prior knowledge of the gamma distribution.</p><p>We have also described a unique application of the chi-square test to environmental science. In chi-square testing, it is important to separate systematic effects from random uncertainties. In this work, a systematic natural contamination of the laboratory method blank caused the chi-square test for combined variance/location to fail; however, it did not affect the chi-square test for variance alone. After subtracting the systematic method blank, the chi-square variance/location test passed. This was confirmed by the location t-test. It is also imperative to perform an analysis of uncertainty. In this work, using either individual or sample standard deviations did not affect the variance/location chi-square test. While the chi-square test verifies whether a laboratory test method is adequate to monitor gross alpha and gross beta radioactivity in drinking water, the test statistic combining variance and location is more useful than the one based on variance alone because it can identify systematic bias.</p></sec><sec id="s6"><title>Acknowledgements</title><p>N. F. acknowledges partial support by the Questar III STEM Research Institute for Teachers of Science, Engineering, Mathematics, and Technology. K. N. acknowledges partial support by the US Food and Drug Administration under Grant 5U18FD005514-04. Thanks are due to J. Witmer for his valuable comments.</p></sec><sec id="s7"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s8"><title>Cite this paper</title><p>Semkow, T.M., Freeman, N., Syed, U.-F., Haines, D.K., Bari, A., Khan, A.J., Nishikawa, K., Khan, A., Burn, A.G., Li, X. and Chu, L.T. (2019) Chi-Square Distribution: New Derivations and Environmental Application. 
Journal of Applied Mathematics and Physics, 7, 1786-1799. https://doi.org/10.4236/jamp.2019.78122</p></sec><sec id="s9"><title>Appendix</title>A.1. Glossary<p>CL: Confidence Level</p><p>CSD: Chi-Square Distribution</p><p>CST: Chi-Square Test</p><p>DL: Detection Limit for radionuclides</p><p>EPA: U.S. Environmental Protection Agency</p><p>GA: Gross Alpha Radioactivity</p><p>GB: Gross Beta Radioactivity</p><p>L: Liter</p><p>LT: Left Tail</p><p>MB: Method Blank</p><p>mBq: milli-Becquerel</p><p>MCL: Maximum Contaminant Level</p><p>MCLG: Maximum Contaminant Level Goal</p><p>Mgf: Moment generating function</p><p>mL: milli-Liter</p><p>mrem: milli-rem</p><p>NIST: National Institute of Standards and Technology</p><p>pCi: pico-Curie</p><p>Pdf: Probability density function</p><p>RT: Right Tail</p><p>SDWA: Safe Drinking Water Act</p><p>STEM: Science, Technology, Engineering and Mathematics</p><p>y: year</p><p>μSv: micro-Sievert</p><p>2T: Two Tail</p>A.2. Variables<p>a, b: parameters of the gamma distribution</p><p>B: beta function</p><p>E: expectation value</p><p>E<sub>i</sub>: expected frequency</p><p>f(s): analytic function</p><p>gamma: gamma distribution</p><p>i, k: indices</p><p>m: number of categories</p><p>n: number of observations</p><p>O<sub>i</sub>: observed frequency</p><p>p: number of parameters for model distribution</p><p>s: complex variable</p><p>s<sub>0</sub>: pole</p><p>t: t-test variable</p><p>Var: variance</p><p>x<sub>i</sub>: normal random variable</p><p>x&#x304;: sample mean</p><p>u, z: substituted variables</p><p>Γ: gamma function</p><p>μ, μ<sub>i</sub>: expected variable: population, individual</p><p>ν: number of degrees of freedom</p><p>σ, σ<sub>i</sub>, σ<sub>x</sub>: standard deviation, individual, sample</p><p>φ<sub>i</sub><sup>2</sup>: individual chi-square</p><p>χ<sup>2</sup>, χ<sub>i</sub><sup>2</sup>, χ<sub>n</sub><sup>2</sup>, χ<sub>ν</sub><sup>2</sup>: chi-square, for i, n observations, ν degrees of freedom</p></sec></body><back><ref-list><title>References</title><ref id="scirp.94421-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Hill, T.L. 
(1986) An Introduction to Statistical Thermodynamics. Dover Publications, New York, 122.</mixed-citation></ref><ref id="scirp.94421-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Satchler, G.R. (1990) Introduction to Nuclear Reactions. Oxford U. P., New York, 248. https://doi.org/10.1007/978-1-349-20531-8</mixed-citation></ref><ref id="scirp.94421-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Lancaster, H.O. (1969) The Chi-Squared Distribution. J. Wiley &amp; Sons, New York, Chap. 1.</mixed-citation></ref><ref id="scirp.94421-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Gorroochurn, P. (2016) Classic Topics on the History of Modern Mathematical Statistics: From Laplace to More Recent Times. J. Wiley &amp; Sons, Hoboken, Chap. 3.  
https://doi.org/10.1002/9781119127963</mixed-citation></ref><ref id="scirp.94421-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Bienaymé, I.-J. (1852) Sur la Probabilité des Erreurs d’Après la Méthode des Moindres Carrés. Liouville’s Journal de Mathématiques Pures et Appliquées, Séries 1, 17, 33-78.</mixed-citation></ref><ref id="scirp.94421-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Abbe, D.E. (1863) Ueber die Gesetzm&#228;ssigkeit in der Vertheilung der Fehler bei Beobachtungsreihen. Dissertation, Jena.</mixed-citation></ref><ref id="scirp.94421-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Helmert, F.R. (1876) Die Genauigkeit der Formel von Peters zur Berechnung des wahrscheinlichen Beobachtungsfehlers directer Beobachtungen gleicher Genauigkeit. Astronomische Nachrichten, 88, 113-132.  
https://doi.org/10.1002/asna.18760880802</mixed-citation></ref><ref id="scirp.94421-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Stuart, A. and Ord, K. (1994) Kendall’s Advanced Theory of Statistics, Vol. 1, Distribution Theory. Arnold Hodder Headline Group, London, Chap. 11.</mixed-citation></ref><ref id="scirp.94421-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Ross, S. (2006) A First Course in Probability. Pearson Prentice Hall, Upper Saddle River, Sec. 6.3.</mixed-citation></ref><ref id="scirp.94421-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Berry, D.A. and Lindgren, B.W. (1996) Statistics: Theory and Methods. Wadsworth Publishing, Belmont, Sec. 5.12, 6.4.</mixed-citation></ref><ref id="scirp.94421-ref11"><label>11</label><mixed-citation publication-type="book" xlink:type="simple">Gull, S.F. (1988) Bayesian Inductive Inference and Maximum Entropy. In: Erickson, G.J. and Smith, C.R., Eds., Maximum-Entropy and Bayesian Methods in Science and Engineering, Kluwer Academic, Dordrecht, 53-74.  
https://doi.org/10.1007/978-94-009-3049-0_4</mixed-citation></ref><ref id="scirp.94421-ref12"><label>12</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Helmert</surname><given-names>F.R.</given-names></name> (<year>1876</year>) <article-title>Ueber die Wahrscheinlichkeit der Potenzsummen der Beobachtungsfehler und über einige damit im Zusammenhange stehende Fragen</article-title>. <source>Zeitschrift für Mathematik und Physik</source>, <volume>21</volume>, <fpage>192</fpage>-<lpage>218</lpage>.</mixed-citation></ref><ref id="scirp.94421-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Pearson, K. (1900) On the Criterion that a Given System of Deviations from the Probable in the Case of Correlated System of Variables Is Such That It Can Be Reasonably Supposed to Have Arisen from Random Sampling. Philosophical Magazine Series 5, 50, 157-175. https://doi.org/10.1080/14786440009463897</mixed-citation></ref><ref id="scirp.94421-ref14"><label>14</label><mixed-citation publication-type="other" xlink:type="simple">Greenwood, P.E. and Nikulin, M.S. (1996) A Guide to Chi-Squared Testing. J. Wiley &amp; Sons, New York, Sec. 3.18.</mixed-citation></ref><ref id="scirp.94421-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Fisher, R.A. (1922) On the Interpretation of χ<sup>2</sup> from Contingency Tables and the Calculation of P. Journal of the Royal Statistical Society A, 85, 87-94.  
https://doi.org/10.2307/2340521</mixed-citation></ref><ref id="scirp.94421-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Evans, R.D. (1985) The Atomic Nucleus. Krieger Publishing, Malabar, Chap. 27.</mixed-citation></ref><ref id="scirp.94421-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Johnson, N.L., Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions, Vol. 2. J. Wiley &amp; Sons, New York, Chap. 28.</mixed-citation></ref><ref id="scirp.94421-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Eisenbud, M. and Gesell, T. (1997) Environmental Radioactivity from Natural, Industrial, and Military Sources. Academic Press, San Diego, Chap. 6.  
https://doi.org/10.1016/B978-012235154-9/50010-4</mixed-citation></ref><ref id="scirp.94421-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Environmental Protection Agency (2000) 40 CFR Parts 9, 141, and 142 National Primary Drinking Water Regulations; Radionuclides; Final Rule. Federal Register, 65, 76708-76752.</mixed-citation></ref><ref id="scirp.94421-ref20"><label>20</label><mixed-citation publication-type="other" xlink:type="simple">EPA (2017) Procedure for Safe Drinking Water Act Program Detection Limits for Radionuclides. Report EPA 815-B-17-003, Cincinnati.</mixed-citation></ref><ref id="scirp.94421-ref21"><label>21</label><mixed-citation publication-type="other" xlink:type="simple">Johnson, N.L., Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, Vol. 1. J. Wiley &amp; Sons, New York, Chap. 12, 17, 18.</mixed-citation></ref><ref id="scirp.94421-ref22"><label>22</label><mixed-citation publication-type="other" xlink:type="simple">Johnson, N.L., Kemp, A.W. and Kotz, S. (2005) Univariate Discrete Distributions. J. Wiley &amp; Sons, Hoboken, Chap. 1. https://doi.org/10.1002/0471715816</mixed-citation></ref><ref id="scirp.94421-ref23"><label>23</label><mixed-citation publication-type="other" xlink:type="simple">Margenau, H. and Murphy, G.M. (1976) The Mathematics of Physics and Chemistry. Krieger Publishing, Huntington, Sec. 8.5.</mixed-citation></ref><ref id="scirp.94421-ref24"><label>24</label><mixed-citation publication-type="other" xlink:type="simple">Dettman, J.W. (1965) Applied Complex Variables. Dover Publications, New York, Sec. 3.6.</mixed-citation></ref><ref id="scirp.94421-ref25"><label>25</label><mixed-citation publication-type="other" xlink:type="simple">Oldham, K.B. and Spanier, J. (1974) The Fractional Calculus: Theory and Applications of Differentiation and Integration to Arbitrary Order. Dover Publications, Mineola, Sec. 
3.4.</mixed-citation></ref><ref id="scirp.94421-ref26"><label>26</label><mixed-citation publication-type="other" xlink:type="simple">Jaynes, E.T. (2004) Probability Theory: The Logic of Science. Cambridge U. P., Cambridge, Chap. 12. https://doi.org/10.1017/CBO9780511790423</mixed-citation></ref><ref id="scirp.94421-ref27"><label>27</label><mixed-citation publication-type="other" xlink:type="simple">Student (1908) The Probable Error of a Mean. Biometrika, 6, 1-25.  
https://doi.org/10.1093/biomet/6.1.1</mixed-citation></ref><ref id="scirp.94421-ref28"><label>28</label><mixed-citation publication-type="other" xlink:type="simple">Semkow, T.M. and Parekh, P.P. (2001) Principles of Gross Alpha and Beta Radioactivity Detection in Water. Health Physics, 81, 567-574.  
https://doi.org/10.1097/00004032-200111000-00011</mixed-citation></ref><ref id="scirp.94421-ref29"><label>29</label><mixed-citation publication-type="other" xlink:type="simple">Khan, A.J., Semkow, T.M., Beach, S.E., Haines, D.K., Bradt, C.J., Bari, A., Syed, U.-F., Torres, M., Marrantino, J., Kitto, M.E., Menia, T. and Fielman, E. (2014) Application of Low-Background Gamma-Ray Spectrometry to Monitor Radioactivity in the Environment and Food. Applied Radiation and Isotopes, 90, 251-257.  
https://doi.org/10.1016/j.apradiso.2014.04.011</mixed-citation></ref></ref-list></back></article>