In probability and statistics, an elliptical distribution is any member of a broad family of probability distributions that generalize the multivariate normal distribution. Intuitively, in the simplified two and three dimensional case, the joint distribution forms an ellipse and an ellipsoid, respectively, in iso-density plots.
In statistics, the normal distribution is used in classical multivariate analysis, while elliptical distributions are used in generalized multivariate analysis, for the study of symmetric distributions with tails that are heavy, like the multivariate t-distribution, or light (in comparison with the normal distribution). Some statistical methods that were originally motivated by the study of the normal distribution have good performance for general elliptical distributions (with finite variance), particularly for spherical distributions (which are defined below). Elliptical distributions are also used in robust statistics to evaluate proposed multivariate-statistical procedures.
Elliptical distributions are defined in terms of the characteristic function of probability theory. A random vector on a Euclidean space has an elliptical distribution if its characteristic function satisfies the following functional equation (for every column-vector )
for some location parameter , some nonnegative-definite matrix and some scalar function . The definition of elliptical distributions for real random-vectors has been extended to accommodate random vectors in Euclidean spaces over the field of complex numbers, so facilitating applications in time-series analysis. Computational methods are available for generating pseudo-random vectors from elliptical distributions, for use in Monte Carlo simulations for example.
Some elliptical distributions are alternatively defined in terms of their density functions. An elliptical distribution with a density function f has the form:
where is the normalizing constant, is an -dimensional random vector with median vector (which is also the mean vector if the latter exists), and is a positive definite matrix which is proportional to the covariance matrix if the latter exists.
Examples include the following multivariate probability distributions:
In the 2-dimensional case, if the density exists, each iso-density locus (the set of x1,x2 pairs all giving a particular value of ) is an ellipse or a union of ellipses (hence the name elliptical distribution). More generally, for arbitrary n, the iso-density loci are unions of ellipsoids. All these ellipsoids or ellipses have the common center μ and are scaled copies (homothets) of each other.
The multivariate normal distribution is the special case in which . While the multivariate normal is unbounded (each element of can take on arbitrarily large positive or negative values with non-zero probability, because for all non-negative ), in general elliptical distributions can be bounded or unbounded—such a distribution is bounded if for all greater than some value.
There exist elliptical distributions that have undefined mean, such as the Cauchy distribution (even in the univariate case). Because the variable x enters the density function quadratically, all elliptical distributions are symmetric about
If two subsets of a jointly elliptical random vector are uncorrelated, then if their means exist they are mean independent of each other (the mean of each subvector conditional on the value of the other subvector equals the unconditional mean).:p. 748
If random vector X is elliptically distributed, then so is DX for any matrix D with full row rank. Thus any linear combination of the components of X is elliptical (though not necessarily with the same elliptical distribution), and any subset of X is elliptical.:p. 748
Elliptical distributions are used in statistics and in economics.
Statistics: Generalized multivariate analysisEdit
In statistics, the multivariate normal distribution (of Gauss) is used in classical multivariate analysis, in which most methods for estimation and hypothesis-testing are motivated for the normal distribution. In contrast to classical multivariate analysis, generalized multivariate analysis refers to research on elliptical distributions without the restriction of normality.
For suitable elliptical distributions, some classical methods continue to have good properties. Under finite-variance assumptions, an extension of Cochran's theorem (on the distribution of quadratic forms) holds.
An elliptical distribution with a zero mean and variance in the form where is the identity-matrix is called a spherical distribution. For spherical distributions, classical results on parameter-estimation and hypothesis-testing hold have been extended. Similar results hold for linear models, and indeed also for complicated models ( especially for the growth curve model). The analysis of multivariate models uses multilinear algebra (particularly Kronecker products and vectorization) and matrix calculus.
Robust statistics: AsymptoticsEdit
Another use of elliptical distributions is in robust statistics, in which researchers examine how statistical procedures perform on the class of elliptical distributions, to gain insight into the procedures' performance on even more general problems, for example by using the limiting theory of statistics ("asymptotics").
Economics and financeEdit
Elliptical distributions are important in portfolio theory because, if the returns on all assets available for portfolio formation are jointly elliptically distributed, then all portfolios can be characterized completely by their location and scale – that is, any two portfolios with identical location and scale of portfolio return have identical distributions of portfolio return. Various features of portfolio analysis, including mutual fund separation theorems and the Capital Asset Pricing Model, hold for all elliptical distributions.:p. 748
- Cambanis, Huang & Simons (1981, p. 368)
- Fang, Kotz & Ng (1990, Chapter 2.9 "Complex elliptically symmetric distributions", pp. 64-66)
- Johnson (1987, Chapter 6, "Elliptically contoured distributions, pp. 106-124): Johnson, Mark E. (1987). Multivariate statistical simulation: A guide to selecting and generating continuous multivariate distributions. John Wiley and Sons., "an admirably lucid discussion" according to Fang, Kotz & Ng (1990, p. 27).
- Frahm, G., Junker, M., & Szimayer, A. (2003). Elliptical copulas: Applicability and limitations. Statistics & Probability Letters, 63(3), 275–286.
- Nolan, John (September 29, 2014). "Multivariate stable densities and distribution functions: general and elliptical case". ResearchGate. Retrieved 2017-05-26.
- Pascal, F.; et al. "Parameter Estimation For Multivariate Generalized Gaussian Distributions" (PDF). Retrieved 2017-05-26.
- Schmidt, Rafael (2012). "Credit Risk Modeling and Estimation via Elliptical Copulae". In Bol, George; et al. (eds.). Credit Risk: Measurement, Evaluation and Management. Springer. p. 274. ISBN 9783642593659.
- Owen & Rabinovitch (1983)
- (Gupta, Varga & Bodnar 2013)
- (Chamberlain 1983; Owen and Rabinovitch 1983)
- Anderson (2004, The final section of the text (before "Problems") that are always entitled "Elliptically contoured distributions", of the following chapters: Chapters 3 ("Estimation of the mean vector and the covariance matrix", Section 3.6, pp. 101-108), 4 ("The distributions and uses of sample correlation coefficients", Section 4.5, pp. 158-163), 5 ("The generalized T2-statistic", Section 5.7, pp. 199-201), 7 ("The distribution of the sample covariance matrix and the sample generalized variance", Section 7.9, pp. 242-248), 8 ("Testing the general linear hypothesis; multivariate analysis of variance", Section 8.11, pp. 370-374), 9 ("Testing independence of sets of variates", Section 9.11, pp. 404-408), 10 ("Testing hypotheses of equality of covariance matrices and equality of mean vectors and covariance vectors", Section 10.11, pp. 449-454), 11 ("Principal components", Section 11.8, pp. 482-483), 13 ("The distribution of characteristic roots and vectors", Section 13.8, pp. 563-567))
- Fang & Zhang (1990)
- Fang & Zhang (1990, Chapter 2.8 "Distribution of quadratic forms and Cochran's theorem", pp. 74-81)
- Fang & Zhang (1990, Chapter 2.5 "Spherical distributions", pp. 53-64)
- Fang & Zhang (1990, Chapter IV "Estimation of parameters", pp. 127-153)
- Fang & Zhang (1990, Chapter V "Testing hypotheses", pp. 154-187)
- Fang & Zhang (1990, Chapter VII "Linear models", pp. 188-211)
- Pan & Fang (2007, p. ii)
- Kollo & von Rosen (2005, p. xiii)
- Kariya, Takeaki; Sinha, Bimal K. (1989). Robustness of statistical tests. Academic Press. ISBN 0123982308.
- Kollo & von Rosen (2005, p. 221)
- Chamberlain (1983)
- Anderson, T. W. (2004). An introduction to multivariate statistical analysis (3rd ed.). New York: John Wiley and Sons. ISBN 9789812530967.
- Cambanis, Stamatis; Huang, Steel; Simons, Gordon (1981). "On the theory of elliptically contoured distributions". Journal of Multivariate Analysis. 11: 368–385. doi:10.1016/0047-259x(81)90082-8.
- Chamberlain, G. (1983). "A characterization of the distributions that imply mean-variance utility functions", Journal of Economic Theory 29, 185–201. doi:10.1016/0022-0531(83)90129-1
- Fang, Kai-Tai; Zhang, Yao-Ting (1990). Generalized multivariate analysis. Science Press (Beijing) and Springer-Verlag (Berlin). ISBN 3540176519. OCLC 622932253.
- Fang, Kai-Tai; Kotz, Samuel; Ng, Kai Wang ("Kai-Wang" on front cover) (1990). Symmetric multivariate and related distributions. Monographs on statistics and applied probability. 36. London: Chapman and Hall. ISBN 0 412 314 304. OCLC 123206055.
- Gupta, Arjun K.; Varga, Tamas; Bodnar, Taras (2013). Elliptically contoured models in statistics and portfolio theory (2nd ed.). New York: Springer-Verlag. doi:10.1007/978-1-4614-8154-6. ISBN 978-1-4614-8153-9.
- Kollo, Tõnu; von Rosen, Dietrich (2005). Advanced multivariate statistics with matrices. Dordrecht: Springer. ISBN 978-1-4020-3418-3.
- Owen, J., and Rabinovitch, R. (1983). "On the class of elliptical distributions and their applications to the theory of portfolio choice", Journal of Finance 38, 745–752. JSTOR 2328079
- Pan, Jianxin; Fang, Kaitai (2007). Growth curve models and statistical diagnostics. Springer series in statistics. Science Press (Beijing) and Springer-Verlag (New York). doi:10.1007/978-0-387-21812-0. ISBN 9780387950532. OCLC 44162563.