Generalized chi-squared distribution

In probability theory and statistics, the generalized chi-squared distribution (or generalized chi-square distribution) is the distribution of a quadratic form of a multivariate normal variable (normal vector), or equivalently of a linear combination of normal variables and squares of normal variables. It is also the distribution of a linear combination of independent noncentral chi-square variables and a normal variable. The same term is sometimes used for several other generalizations; some of them, such as the gamma distribution, are special cases of the family discussed here.

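Probability density function: [plot of the generalized chi-square probability density function]
Cumulative distribution function: [plot of the generalized chi-square cumulative distribution function]
Notation: $\tilde{\chi}(\boldsymbol{w}, \boldsymbol{k}, \boldsymbol{\lambda}, s, m)$
Parameters: $\boldsymbol{w}$, vector of weights of noncentral chi-square components
$\boldsymbol{k}$, vector of degrees of freedom of noncentral chi-square components
$\boldsymbol{\lambda}$, vector of non-centrality parameters of chi-square components
$s$, scale of normal term
$m$, offset
Support: $x \in [m, +\infty)$ if all $w_j \geq 0$ and $s = 0$; $x \in (-\infty, m]$ if all $w_j \leq 0$ and $s = 0$; $x \in \mathbb{R}$ otherwise
Mean: $\sum_j w_j (k_j + \lambda_j) + m$
Variance: $2 \sum_j w_j^2 (k_j + 2\lambda_j) + s^2$
CF: $\dfrac{\exp\left[ it\left( m + \sum_j \frac{w_j \lambda_j}{1 - 2 i w_j t} \right) - \frac{s^2 t^2}{2} \right]}{\prod_j (1 - 2 i w_j t)^{k_j/2}}$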

Definition

The generalized chi-squared variable may be described in multiple ways. One is to write it as a weighted sum of independent noncentral chi-square variables $\chi'^2$ and a standard normal variable $z$:[1][2]

$\tilde{\chi}(\boldsymbol{w}, \boldsymbol{k}, \boldsymbol{\lambda}, s, m) = \sum_i w_i \chi'^2(k_i, \lambda_i) + s z + m.$

Here the parameters are the weights $w_i$, the degrees of freedom $k_i$ and non-centralities $\lambda_i$ of the constituent noncentral chi-squares, and the coefficients $s$ and $m$ of the normal. Some important special cases of this have all weights $w_i$ of the same sign, or have central chi-squared components, or omit the normal term.

Since a noncentral chi-squared variable is a sum of squares of independent unit-variance normal variables with possibly different means, the generalized chi-square variable can also be defined as a weighted sum of squares of independent normal variables, plus an independent normal variable: that is, a quadratic in normal variables.
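As a minimal sketch of the weighted-sum definition, the following Python code draws samples by building each noncentral chi-square from squared normals, and compares the sample mean with the mean implied by the standard moments of the noncentral chi-square (mean $k + \lambda$, variance $2(k + 2\lambda)$). The function names are illustrative only, the degrees of freedom are assumed to be positive integers, and the parameter values are arbitrary.

```python
import math
import random

def gen_chi2_mean_var(w, k, lam, s, m):
    """Mean and variance of sum_i w_i*chi'^2(k_i, lam_i) + s*z + m."""
    mean = sum(wi * (ki + li) for wi, ki, li in zip(w, k, lam)) + m
    var = 2 * sum(wi**2 * (ki + 2 * li) for wi, ki, li in zip(w, k, lam)) + s**2
    return mean, var

def noncentral_chi2(k, lam, rng):
    """Draw chi'^2(k, lam) for integer k >= 1 as a sum of k squared unit-variance normals."""
    # Put the whole noncentrality on the first term: its mean is sqrt(lam).
    total = rng.gauss(math.sqrt(lam), 1.0) ** 2
    for _ in range(k - 1):
        total += rng.gauss(0.0, 1.0) ** 2
    return total

def sample_gen_chi2(w, k, lam, s, m, n, seed=0):
    """Draw n samples of the weighted sum plus the normal term s*z + m."""
    rng = random.Random(seed)
    return [
        sum(wi * noncentral_chi2(ki, li, rng) for wi, ki, li in zip(w, k, lam))
        + s * rng.gauss(0.0, 1.0) + m
        for _ in range(n)
    ]

# Arbitrary example parameters (mixed-sign weights, with a normal term).
w, k, lam, s, m = [1.0, -2.0], [2, 1], [0.5, 1.0], 0.5, 1.0
mean, var = gen_chi2_mean_var(w, k, lam, s, m)
draws = sample_gen_chi2(w, k, lam, s, m, n=20000)
sample_mean = sum(draws) / len(draws)
print(mean, var, sample_mean)
```

The sample mean should agree with the theoretical mean only up to Monte Carlo error, which shrinks as the number of draws grows.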

Another equivalent way is to formulate it as a quadratic form of a normal vector $\boldsymbol{x}$:[3][4]

$q = \boldsymbol{x}' \mathbf{Q_2} \boldsymbol{x} + \boldsymbol{q_1}' \boldsymbol{x} + q_0.$

Here $\mathbf{Q_2}$ is a matrix, $\boldsymbol{q_1}$ is a vector, and $q_0$ is a scalar. These, together with the mean $\boldsymbol{\mu}$ and covariance matrix $\mathbf{\Sigma}$ of the normal vector $\boldsymbol{x}$, parameterize the distribution. The parameters of the former expression (in terms of noncentral chi-squares, a normal and a constant) can be calculated in terms of the parameters of the latter expression (quadratic form of a normal vector).[4] If $\mathbf{Q_2}$ in this formulation is positive-definite (or negative-definite), then all the $w_i$ in the first formulation will have the same sign.
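One way to see how the two parameterizations relate is the following derivation sketch. It assumes that $\mathbf{Q_2}$ is symmetric (otherwise it can be replaced by $(\mathbf{Q_2} + \mathbf{Q_2}')/2$) and that $\mathbf{S}$ is the symmetric square root of $\mathbf{\Sigma}$; the symbols $\mathbf{S}$, $\mathbf{R}$, $\mathbf{D}$, $\boldsymbol{b}$, $\boldsymbol{y}$ and the tilde quantities are introduced here for illustration and are not taken from the cited sources.

```latex
% Standardize: x = S z + \mu, with z a standard normal vector.
q = \boldsymbol{z}' \tilde{\mathbf{Q}}_2 \boldsymbol{z}
    + \tilde{\boldsymbol{q}}_1' \boldsymbol{z} + \tilde{q}_0,
\qquad
\tilde{\mathbf{Q}}_2 = \mathbf{S} \mathbf{Q_2} \mathbf{S}, \quad
\tilde{\boldsymbol{q}}_1 = \mathbf{S} (2 \mathbf{Q_2} \boldsymbol{\mu} + \boldsymbol{q_1}), \quad
\tilde{q}_0 = \boldsymbol{\mu}' \mathbf{Q_2} \boldsymbol{\mu}
              + \boldsymbol{q_1}' \boldsymbol{\mu} + q_0 .

% Diagonalize: \tilde{Q}_2 = R D R', y = R' z (still standard normal), b = R' \tilde{q}_1.
q = \sum_j \left( d_j y_j^2 + b_j y_j \right) + \tilde{q}_0
  = \sum_{j : d_j \neq 0} d_j \left( y_j + \frac{b_j}{2 d_j} \right)^2
    + \sum_{j : d_j = 0} b_j y_j
    + \tilde{q}_0 - \sum_{j : d_j \neq 0} \frac{b_j^2}{4 d_j} .

% Read off the parameters of the weighted-sum form:
% w_i : the distinct non-zero eigenvalues d_j
% k_i : the multiplicity of w_i
\lambda_i = \sum_{j : d_j = w_i} \left( \frac{b_j}{2 d_j} \right)^2 , \qquad
s = \sqrt{ \sum_{j : d_j = 0} b_j^2 } , \qquad
m = \tilde{q}_0 - \sum_i w_i \lambda_i .
```

Each completed square is a weighted noncentral chi-square term with one degree of freedom, terms with equal weights merge into a single term with more degrees of freedom, and the directions with zero eigenvalue contribute only to the normal term.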

For the most general case, a reduction towards a common standard form can be made by using a representation of the following form:[5]

$X = (z + a)^\mathrm{T} A (z + a) + c^\mathrm{T} z = (x + b)^\mathrm{T} D (x + b) + d^\mathrm{T} x + e,$

where $D$ is a diagonal matrix and $x$ is a vector of uncorrelated standard normal random variables.

Computing the pdf/cdf/inverse cdf/random numbers

The probability density, cumulative distribution, and inverse cumulative distribution functions of a generalized chi-squared variable do not have simple closed-form expressions. However, numerical algorithms[5][2][6][4] and computer code (Fortran and C, Matlab, R, Python, Julia) have been published to evaluate some of these functions and to generate random samples.
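To illustrate the general idea of evaluating the cumulative distribution function numerically, the sketch below inverts the characteristic function of the weighted-sum form with the Gil-Pelaez formula, $F(x) = \tfrac{1}{2} - \tfrac{1}{\pi} \int_0^\infty \operatorname{Im}[e^{-itx} \varphi(t)] / t \, dt$, using a plain midpoint rule. This is a naive illustration and not a reimplementation of any of the published algorithms, which choose the step size and truncation point with error control. The function names and the defaults `t_max` and `n_steps` are assumptions made for this example, and the fixed truncation can be inaccurate when the characteristic function decays slowly.

```python
import cmath
import math

def gen_chi2_cf(t, w, k, lam, s, m):
    """Characteristic function of sum_j w_j*chi'^2(k_j, lam_j) + s*z + m at real t."""
    log_phi = 1j * t * m - 0.5 * (s * t) ** 2
    for wj, kj, lj in zip(w, k, lam):
        denom = 1 - 2j * wj * t  # real part is 1, so the principal log is continuous in t
        log_phi += 1j * wj * lj * t / denom - 0.5 * kj * cmath.log(denom)
    return cmath.exp(log_phi)

def gen_chi2_cdf(x, w, k, lam, s, m, t_max=200.0, n_steps=40000):
    """Approximate P(X <= x) by Gil-Pelaez inversion with a midpoint rule on (0, t_max)."""
    h = t_max / n_steps
    total = 0.0
    for i in range(n_steps):
        t = (i + 0.5) * h  # midpoints avoid the removable singularity at t = 0
        total += (cmath.exp(-1j * t * x) * gen_chi2_cf(t, w, k, lam, s, m)).imag / t
    return 0.5 - total * h / math.pi

# Sanity check against a case with a closed form: a central chi-square with 4
# degrees of freedom has cdf 1 - exp(-x/2) * (1 + x/2).
approx = gen_chi2_cdf(4.0, [1.0], [4], [0.0], 0.0, 0.0)
exact = 1 - math.exp(-2.0) * 3.0
print(approx, exact)
```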

Applications

The generalized chi-squared is the distribution of statistical estimates in cases where the usual statistical theory does not hold, as in the examples below.

In model fitting and selection

If a predictive model is fitted by least squares, but the residuals have either autocorrelation or heteroscedasticity, then alternative models can be compared (in model selection) by relating changes in the sum of squares to an asymptotically valid generalized chi-squared distribution.[3]

Classifying normal vectors using Gaussian discriminant analysis

If $\boldsymbol{x}$ is a normal vector, its log likelihood is a quadratic form of $\boldsymbol{x}$, and is hence distributed as a generalized chi-squared. The log likelihood ratio that $\boldsymbol{x}$ arises from one normal distribution versus another is also a quadratic form, so distributed as a generalized chi-squared.[4]

In Gaussian discriminant analysis, samples from multinormal distributions are optimally separated by using a quadratic classifier, a boundary that is a quadratic function (e.g. the curve defined by setting the likelihood ratio between two Gaussians to 1). The classification error rates of different types (false positives and false negatives) are integrals of the normal distributions within the quadratic regions defined by this classifier. Since this is mathematically equivalent to integrating the distribution of a quadratic form of a normal vector, the result is an integral of the density of a generalized chi-squared variable.[4]
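The quadratic structure of the log likelihood ratio can be checked directly in one dimension. The sketch below, with illustrative function names and arbitrary parameter values, computes the coefficients of the quadratic $q_2 x^2 + q_1 x + q_0$ for two univariate normals $a$ and $b$ and compares it with the difference of their log densities. In $d$ dimensions the same expansion gives $\mathbf{Q_2} = \tfrac{1}{2}(\mathbf{\Sigma}_b^{-1} - \mathbf{\Sigma}_a^{-1})$, and the scalar case shown here is the special case $d = 1$.

```python
import math

def normal_logpdf(x, mu, sd):
    """Log density of a univariate normal with mean mu and standard deviation sd."""
    return -0.5 * ((x - mu) / sd) ** 2 - math.log(sd) - 0.5 * math.log(2 * math.pi)

def llr_quadratic_coeffs(mu_a, sd_a, mu_b, sd_b):
    """Coefficients (q2, q1, q0) with log[p_a(x) / p_b(x)] = q2*x**2 + q1*x + q0."""
    va, vb = sd_a**2, sd_b**2
    q2 = 0.5 * (1 / vb - 1 / va)
    q1 = mu_a / va - mu_b / vb
    q0 = 0.5 * (mu_b**2 / vb - mu_a**2 / va) + math.log(sd_b / sd_a)
    return q2, q1, q0

# Arbitrary example: class a ~ N(0, 1), class b ~ N(2, 1.5**2).
q2, q1, q0 = llr_quadratic_coeffs(0.0, 1.0, 2.0, 1.5)
for x in (-1.0, 0.3, 2.5):
    quadratic = q2 * x**2 + q1 * x + q0
    direct = normal_logpdf(x, 0.0, 1.0) - normal_logpdf(x, 2.0, 1.5)
    print(x, quadratic, direct)
```

The decision boundary of the classifier is the set where this quadratic equals zero (likelihood ratio 1), so each error rate is the probability that the quadratic falls on the wrong side of zero under one of the two normals.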

In signal processing

The following application arises in the context of Fourier analysis in signal processing, renewal theory in probability theory, and multi-antenna systems in wireless communication. What these areas have in common is that the sum of exponentially distributed variables is of importance (or, equivalently, the sum of squared magnitudes of circularly-symmetric centered complex Gaussian variables).

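If $z_i$ are $k$ independent, circularly-symmetric complex Gaussian random variables with mean 0 and variance $\sigma_i^2$, then the random variable

$\tilde{Q} = \sum_{i=1}^k |z_i|^2$

has a generalized chi-squared distribution of a particular form. The difference from the standard chi-squared distribution is that the $z_i$ are complex and can have different variances, and the difference from the more general generalized chi-squared distribution is that the relevant scaling matrix $A$ is diagonal. If $\sigma_i^2 = \mu$ for all $i$, then $\tilde{Q}$, scaled down by $\mu/2$ (i.e. multiplied by $2/\mu$), has a chi-squared distribution, $\chi^2(2k)$, also known as an Erlang distribution. If the $\sigma_i^2$ have distinct values for all $i$, then $\tilde{Q}$ has the pdf[7]

$f(x; k, \sigma_1^2, \ldots, \sigma_k^2) = \sum_{i=1}^k \frac{e^{-x/\sigma_i^2}}{\sigma_i^2 \prod_{j=1, j \neq i}^k \left(1 - \frac{\sigma_j^2}{\sigma_i^2}\right)} \quad \text{for } x \geq 0.$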
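For pairwise distinct variances, the density of the sum of squared magnitudes is $f(x) = \sum_i e^{-x/\sigma_i^2} / \big(\sigma_i^2 \prod_{j \neq i} (1 - \sigma_j^2/\sigma_i^2)\big)$ for $x \geq 0$, which is the density of a sum of independent exponential variables with means $\sigma_i^2$. A minimal sketch of evaluating it follows; the function name and the example variances are illustrative only.

```python
import math

def pdf_distinct(x, variances):
    """Density at x of sum_i |z_i|^2 when all variances are pairwise distinct."""
    if x < 0:
        return 0.0
    total = 0.0
    for i, vi in enumerate(variances):
        denom = vi
        for j, vj in enumerate(variances):
            if j != i:
                denom *= 1 - vj / vi  # zero if two variances coincide
        total += math.exp(-x / vi) / denom
    return total

# With variances 1 and 2 the sum collapses to exp(-x/2) - exp(-x).
value = pdf_distinct(1.0, [1.0, 2.0])
reference = math.exp(-0.5) - math.exp(-1.0)
print(value, reference)
```

The formula divides by differences of the variances, so it breaks down (and becomes numerically unstable) when two variances are equal or nearly equal; the repeated-variance case needs a separate expression.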

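If there are sets of repeated variances among the $\sigma_i^2$, assume that they are divided into $M$ sets, each representing a certain variance value. Denote $\mathbf{r} = (r_1, r_2, \dots, r_M)$ to be the number of repetitions in each group. That is, the $m$th set contains $r_m$ variables that have variance $\sigma_m^2$. It represents an arbitrary linear combination of independent $\chi^2$-distributed random variables with different degrees of freedom:

$\tilde{Q} = \sum_{m=1}^M \frac{\sigma_m^2}{2} Q_m, \quad Q_m \sim \chi^2(2 r_m).$

The pdf of $\tilde{Q}$ is[8]

$f(x; \mathbf{r}, \sigma_1^2, \dots, \sigma_M^2) = \prod_{m=1}^M \frac{1}{\sigma_m^{2 r_m}} \sum_{k=1}^M \sum_{l=1}^{r_k} \frac{\Psi_{k,l,\mathbf{r}}}{(r_k - l)!} (-x)^{r_k - l} e^{-x/\sigma_k^2}, \quad \text{for } x \geq 0,$

where

$\Psi_{k,l,\mathbf{r}} = (-1)^{r_k - 1} \sum_{\mathbf{i} \in \Omega_{k,l}} \prod_{j \neq k} \binom{i_j + r_j - 1}{i_j} \left( \frac{1}{\sigma_j^2} - \frac{1}{\sigma_k^2} \right)^{-(r_j + i_j)},$

with $\mathbf{i} = [i_1, \ldots, i_M]^T$ from the set $\Omega_{k,l}$ of all partitions of $l - 1$ (with $i_k = 0$) defined as

$\Omega_{k,l} = \left\{ [i_1, \ldots, i_M] \in \mathbb{Z}^M ; \; \sum_{j=1}^M i_j = l - 1, \; i_k = 0, \; i_j \geq 0 \text{ for all } j \right\}.$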
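The repeated-variance density can be evaluated by enumerating the index set $\Omega_{k,l}$ directly. The sketch below follows the formula $f(x) = \prod_m \sigma_m^{-2 r_m} \sum_k \sum_l \Psi_{k,l,\mathbf{r}} (-x)^{r_k - l} e^{-x/\sigma_k^2} / (r_k - l)!$ with zero-based group indices; the function names are illustrative, and the brute-force enumeration is only practical for small repetition counts.

```python
import math
from itertools import product

def omega(k, l, M):
    """Non-negative integer vectors of length M that sum to l - 1 and have entry k equal to 0."""
    ranges = [range(1) if j == k else range(l) for j in range(M)]
    return [i for i in product(*ranges) if sum(i) == l - 1]

def psi(k, l, r, var):
    """Coefficient Psi_{k,l,r} for group k (zero-based) and l = 1..r[k]."""
    total = 0.0
    for i in omega(k, l, len(r)):
        term = 1.0
        for j in range(len(r)):
            if j != k:
                diff = 1 / var[j] - 1 / var[k]
                term *= math.comb(i[j] + r[j] - 1, i[j]) * diff ** (-(r[j] + i[j]))
        total += term
    return (-1) ** (r[k] - 1) * total

def pdf_repeated(x, r, var):
    """Density at x when group m holds r[m] variables of variance var[m] (distinct across groups)."""
    if x < 0:
        return 0.0
    prefactor = math.prod(v ** (-rm) for rm, v in zip(r, var))
    total = 0.0
    for k in range(len(r)):
        for l in range(1, r[k] + 1):
            total += (psi(k, l, r, var) / math.factorial(r[k] - l)
                      * (-x) ** (r[k] - l) * math.exp(-x / var[k]))
    return prefactor * total

# Two variables of variance 1 and one of variance 2. A partial-fraction
# expansion gives the density -x*exp(-x) - 2*exp(-x) + 2*exp(-x/2).
x0 = 1.5
value = pdf_repeated(x0, [2, 1], [1.0, 2.0])
reference = -x0 * math.exp(-x0) - 2 * math.exp(-x0) + 2 * math.exp(-x0 / 2)
print(value, reference)
```

With a single group the expression reduces to the Erlang density, and with all repetition counts equal to 1 it reduces to the distinct-variance formula.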

References

  1. Davies, R. B. (1973). "Numerical inversion of a characteristic function". Biometrika. 60 (2): 415–417.
  2. Davies, R. B. (1980). "Algorithm AS155: The distribution of a linear combination of χ² random variables". Applied Statistics. 29: 323–333.
  3. Jones, D. A. (1983). "Statistical analysis of empirical models fitted by optimisation". Biometrika. 70 (1): 67–88.
  4. Das, Abhranil; Geisler, Wilson S. (2020). "Methods to integrate multinormals and compute classification measures". arXiv:2012.14331 [stat.ML].
  5. Sheil, J.; O'Muircheartaigh, I. (1977). "Algorithm AS106: The distribution of non-negative quadratic forms in normal variables". Applied Statistics. 26: 92–98.
  6. Imhof, J. P. (1961). "Computing the Distribution of Quadratic Forms in Normal Variables". Biometrika. 48 (3/4): 419–426. doi:10.2307/2332763. JSTOR 2332763.
  7. Hammarwall, D.; Bengtsson, M.; Ottersten, B. (2008). "Acquiring Partial CSI for Spatially Selective Transmission by Instantaneous Channel Norm Feedback". IEEE Transactions on Signal Processing. 56: 1188–1204.
  8. Björnson, E.; Hammarwall, D.; Ottersten, B. (2009). "Exploiting Quantized Channel Norm Feedback through Conditional Statistics in Arbitrarily Correlated MIMO Systems". IEEE Transactions on Signal Processing. 57: 4027–4041.
