Open main menu
A square matrix of order 4. The entries form the main diagonal of a square matrix. For instance, the main diagonal of the 4-by-4 matrix above contains the elements a11 = 9, a22 = 11, a33 = 4, a44 = 10.

In mathematics, a square matrix is a matrix with the same number of rows and columns. An n-by-n matrix is known as a square matrix of order . Any two square matrices of the same order can be added and multiplied.

Square matrices are often used to represent simple linear transformations, such as shearing or rotation. For example, if is a square matrix representing a rotation (rotation matrix) and is a column vector describing the position of a point in space, the product yields another column vector describing the position of that point after that rotation. If is a row vector, the same transformation can be obtained using , where is the transpose of .

Main diagonalEdit

The entries   (i = 1, ..., n) form the main diagonal of a square matrix. They lie on the imaginary line which runs from the top left corner to the bottom right corner of the matrix. For instance, the main diagonal of the 4-by-4 matrix above contains the elements a11 = 9, a22 = 11, a33 = 4, a44 = 10.

The diagonal of a square matrix from the top right to the bottom left corner is called antidiagonal or counterdiagonal.

Special kindsEdit

Name Example with n = 3
Diagonal matrix  
Lower triangular matrix  
Upper triangular matrix  

Diagonal or triangular matrixEdit

If all entries outside the main diagonal are zero,   is called a diagonal matrix. If only all entries above (or below) the main diagonal are zero,  ' is called a lower (or upper) triangular matrix.

Identity matrixEdit

The identity matrix   of size   is the   matrix in which all the elements on the main diagonal are equal to 1 and all other elements are equal to 0, e.g.

 

It is a square matrix of order  , and also a special kind of diagonal matrix. It is called identity matrix because multiplication with it leaves a matrix unchanged:

AIn = ImA = A for any m-by-n matrix  .

Symmetric or skew-symmetric matrixEdit

A square matrix A that is equal to its transpose, i.e.,  , is a symmetric matrix. If instead, A was equal to the negative of its transpose, i.e., A = −AT, then A is a skew-symmetric matrix. In complex matrices, symmetry is often replaced by the concept of Hermitian matrices, which satisfy  , where   denotes the conjugate transpose of the matrix, i.e., the transpose of the complex conjugate of  .

By the spectral theorem, real symmetric (or complex Hermitian) matrices have an orthogonal (or unitary) eigenbasis; i.e., every vector is expressible as a linear combination of eigenvectors. In both cases, all eigenvalues are real.[1] This theorem can be generalized to infinite-dimensional situations related to matrices with infinitely many rows and columns, see below.

Invertible matrix and its inverseEdit

A square matrix   is called invertible or non-singular if there exists a matrix   such that

 .[2][3]

If   exists, it is unique and is called the inverse matrix of  , denoted  .

Normal matrixEdit

A square matrix A is called normal if  , i.e. if it commutes with its transpose.

Definite matrixEdit

Positive definite Indefinite
   
Q(x,y) = 1/4 x2 + 1/4y2 Q(x,y) = 1/4 x2 − 1/4 y2
 
Points such that Q(x,y) = 1
(Ellipse).
 
Points such that Q(x,y) = 1
(Hyperbola).

A symmetric n×n-matrix is called positive-definite (respectively negative-definite; indefinite), if for all nonzero vectors   the associated quadratic form given by

Q(x) = xTAx

takes only positive values (respectively only negative values; both some negative and some positive values).[4] If the quadratic form takes only non-negative (respectively only non-positive) values, the symmetric matrix is called positive-semidefinite (respectively negative-semidefinite); hence the matrix is indefinite precisely when it is neither positive-semidefinite nor negative-semidefinite.

A symmetric matrix is positive-definite if and only if all its eigenvalues are positive.[5] The table at the right shows two possibilities for 2-by-2 matrices.

Allowing as input two different vectors instead yields the bilinear form associated to A:

BA (x, y) = xTAy.[6]

Orthogonal matrixEdit

An orthogonal matrix is a square matrix with real entries whose columns and rows are orthogonal unit vectors (i.e., orthonormal vectors). Equivalently, a matrix A is orthogonal if its transpose is equal to its inverse:

 

which entails

 

where I is the identity matrix.

An orthogonal matrix A is necessarily invertible (with inverse A−1 = AT), unitary (A−1 = A*), and normal (A*A = AA*). The determinant of any orthogonal matrix is either +1 or −1. A special orthogonal matrix is an orthogonal matrix with determinant +1. As a linear transformation, every orthogonal matrix with determinant +1 is a pure rotation, while every orthogonal matrix with determinant −1 is either a pure reflection, or a composition of reflection and rotation.

The complex analogue of an orthogonal matrix is a unitary matrix.

OperationsEdit

TraceEdit

The trace, tr(A) of a square matrix A is the sum of its diagonal entries. While matrix multiplication is not commutative, the trace of the product of two matrices is independent of the order of the factors:

 

This is immediate from the definition of matrix multiplication:

 

Also, the trace of a matrix is equal to that of its transpose, i.e.,

 .

DeterminantEdit

 
A linear transformation on   given by the indicated matrix. The determinant of this matrix is −1, as the area of the green parallelogram at the right is 1, but the map reverses the orientation, since it turns the counterclockwise orientation of the vectors to a clockwise one.

The determinant   or   of a square matrix   is a number encoding certain properties of the matrix. A matrix is invertible if and only if its determinant is nonzero. Its absolute value equals the area (in  ) or volume (in  ) of the image of the unit square (or cube), while its sign corresponds to the orientation of the corresponding linear map: the determinant is positive if and only if the orientation is preserved.

The determinant of 2-by-2 matrices is given by

 

The determinant of 3-by-3 matrices involves 6 terms (rule of Sarrus). The more lengthy Leibniz formula generalises these two formulae to all dimensions.[7]

The determinant of a product of square matrices equals the product of their determinants:[8]

 

Adding a multiple of any row to another row, or a multiple of any column to another column, does not change the determinant. Interchanging two rows or two columns affects the determinant by multiplying it by −1.[9] Using these operations, any matrix can be transformed to a lower (or upper) triangular matrix, and for such matrices the determinant equals the product of the entries on the main diagonal; this provides a method to calculate the determinant of any matrix. Finally, the Laplace expansion expresses the determinant in terms of minors, i.e., determinants of smaller matrices.[10] This expansion can be used for a recursive definition of determinants (taking as starting case the determinant of a 1-by-1 matrix, which is its unique entry, or even the determinant of a 0-by-0 matrix, which is 1), that can be seen to be equivalent to the Leibniz formula. Determinants can be used to solve linear systems using Cramer's rule, where the division of the determinants of two related square matrices equates to the value of each of the system's variables.[11]

Eigenvalues and eigenvectorsEdit

A number λ and a non-zero vector   satisfying

 

are called an eigenvalue and an eigenvector of  , respectively.[12][13] The number λ is an eigenvalue of an n×n-matrix A if and only if A−λIn is not invertible, which is equivalent to

 [14]

The polynomial pA in an indeterminate X given by evaluation the determinant det(XInA) is called the characteristic polynomial of A. It is a monic polynomial of degree n. Therefore the polynomial equation pA(λ) = 0 has at most n different solutions, i.e., eigenvalues of the matrix.[15] They may be complex even if the entries of A are real. According to the Cayley–Hamilton theorem, pA(A) = 0, that is, the result of substituting the matrix itself into its own characteristic polynomial yields the zero matrix.

See alsoEdit

NotesEdit

  1. ^ Horn & Johnson 1985, Theorem 2.5.6
  2. ^ Brown 1991, Definition I.2.28
  3. ^ Brown 1991, Definition I.5.13
  4. ^ Horn & Johnson 1985, Chapter 7
  5. ^ Horn & Johnson 1985, Theorem 7.2.1
  6. ^ Horn & Johnson 1985, Example 4.0.6, p. 169
  7. ^ Brown 1991, Definition III.2.1
  8. ^ Brown 1991, Theorem III.2.12
  9. ^ Brown 1991, Corollary III.2.16
  10. ^ Mirsky 1990, Theorem 1.4.1
  11. ^ Brown 1991, Theorem III.3.18
  12. ^ Eigen means "own" in German and in Dutch.
  13. ^ Brown 1991, Definition III.4.1
  14. ^ Brown 1991, Definition III.4.9
  15. ^ Brown 1991, Corollary III.4.10

ReferencesEdit

  • Brown, William C. (1991), Matrices and vector spaces, New York, NY: Marcel Dekker, ISBN 978-0-8247-8419-5
  • Horn, Roger A.; Johnson, Charles R. (1985), Matrix Analysis, Cambridge University Press, ISBN 978-0-521-38632-6
  • Mirsky, Leonid (1990), An Introduction to Linear Algebra, Courier Dover Publications, ISBN 978-0-486-66434-7

External linksEdit