Computing the permanent

In linear algebra, the computation of the permanent of a matrix is a problem that is thought to be more difficult than the computation of the determinant of a matrix despite the apparent similarity of the definitions.

The permanent is defined similarly to the determinant, as a sum of products of sets of matrix entries that lie in distinct rows and columns. However, where the determinant weights each of these products with a ±1 sign based on the parity of the corresponding permutation, the permanent weights them all with a +1 sign.

While the determinant can be computed in polynomial time by Gaussian elimination, it is generally believed that the permanent cannot be computed in polynomial time. In computational complexity theory, a theorem of Valiant states that computing permanents is #P-hard, and even #P-complete for matrices in which all entries are 0 or 1 (Valiant 1979). This puts the computation of the permanent in a class of problems believed to be even more difficult to compute than NP. It is known that the permanent cannot be computed by logspace-uniform ACC⁰ circuits (Allender & Gore 1994).

The development of both exact and approximate algorithms for computing the permanent of a matrix is an active area of research.

Definition and naive algorithm

The permanent of an n-by-n matrix A = (a_{i,j}) is defined as

\operatorname {perm} (A)=\sum _{\sigma \in S_{n}}\prod _{i=1}^{n}a_{i,\sigma (i)}.

The sum here extends over all elements σ of the symmetric group S_n, i.e. over all permutations of the numbers 1, 2, ..., n. This formula differs from the corresponding formula for the determinant only in that, in the determinant, each product is multiplied by the sign of the permutation σ while in this formula each product is unsigned. The formula may be directly translated into an algorithm that naively expands the formula, summing over all permutations and within the sum multiplying out each matrix entry. This requires n! n arithmetic operations.
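As a minimal sketch of the naive expansion described above (the function name `permanent_naive` and the list-of-rows matrix representation are illustrative choices, not part of the source), the definition can be transcribed directly:

```python
from itertools import permutations
from math import prod


def permanent_naive(a):
    """Permanent of a square matrix (a list of rows), straight from the definition."""
    n = len(a)
    # One product of n entries for each of the n! permutations sigma.
    return sum(
        prod(a[i][sigma[i]] for i in range(n))
        for sigma in permutations(range(n))
    )


print(permanent_naive([[1, 2], [3, 4]]))  # 1*4 + 2*3 = 10
```

This is only practical for very small n, since the number of terms grows as n!.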

Ryser formula

The best known^[1] general exact algorithm is due to H. J. Ryser (1963). Ryser's method is based on an inclusion–exclusion formula that can be given^[2] as follows: Let $A_{k}$ be obtained from A by deleting k columns, let $P(A_{k})$ be the product of the row-sums of $A_{k}$ , and let $\Sigma _{k}$ be the sum of the values of $P(A_{k})$ over all possible $A_{k}$ . Then

\operatorname {perm} (A)=\sum _{k=0}^{n-1}(-1)^{k}\Sigma _{k}.

It may be rewritten in terms of the matrix entries as follows^[3]

\operatorname {perm} (A)=(-1)^{n}\sum _{S\subseteq \{1,\dots ,n\}}(-1)^{|S|}\prod _{i=1}^{n}\sum _{j\in S}a_{ij}.

Ryser's formula can be evaluated using $O(2^{n-1}n^{2})$ arithmetic operations, or $O(2^{n-1}n)$ by processing the sets $S$ in Gray code order.^[4]
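The Gray code evaluation can be sketched as follows. This is an illustrative implementation under the assumption of exact (for example integer) entries; the name `permanent_ryser` and the bookkeeping variables are hypothetical. Consecutive subsets S differ in a single column, so the row sums are updated in O(n) per step rather than recomputed.

```python
from math import prod


def permanent_ryser(a):
    """Permanent via Ryser's formula, visiting column subsets S in Gray code order."""
    n = len(a)
    if n == 0:
        return 1
    row_sums = [0] * n      # row_sums[i] = sum of a[i][j] over columns j in S
    in_s = [False] * n      # membership of each column in the current subset S
    size = 0                # |S|
    total = 0
    # The empty set contributes a product of zeros, so it is skipped.
    for k in range(1, 1 << n):
        # Going from Gray code k-1 to Gray code k flips the bit whose index
        # is the number of trailing zeros of k.
        j = (k & -k).bit_length() - 1
        step = -1 if in_s[j] else 1
        for i in range(n):
            row_sums[i] += step * a[i][j]
        in_s[j] = not in_s[j]
        size += step
        sign = -1 if size % 2 else 1
        total += sign * prod(row_sums)
    return (-1) ** n * total


print(permanent_ryser([[1, 2], [3, 4]]))  # 10
```

Each of the 2^n − 1 steps costs O(n) for the update and O(n) for the product, which matches the stated order of growth up to a constant factor.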

Balasubramanian–Bax–Franklin–Glynn formula

Another formula that appears to be as fast as Ryser's (or perhaps even twice as fast) is found in two Ph.D. theses (Balasubramanian 1980; Bax 1998); see also Bax & Franklin (1996). The two derivations are quite different: one is related to the combinatorics of the Muir algebra, the other to finite difference theory. A third derivation, connected with invariant theory, uses the polarization identity for a symmetric tensor (Glynn 2010). The formula generalizes to infinitely many others, as found by all these authors, although it is not clear whether any of them is faster than the basic one (Glynn 2013).

The simplest known formula of this type (when the characteristic of the field is not two) is

\operatorname {perm} (A)={\frac {1}{2^{n-1}}}\left[\sum _{\delta }\left(\prod _{k=1}^{n}\delta _{k}\right)\prod _{j=1}^{n}\sum _{i=1}^{n}\delta _{i}a_{ij}\right],

where the outer sum is over all $2^{n-1}$ vectors $\delta =(\delta _{1}=1,\delta _{2},\dots ,\delta _{n})\in \{\pm 1\}^{n}$ .
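A direct transcription of this formula might look as follows. It is a sketch that assumes integer entries (so that the final division by 2^(n−1) is exact) and does not use Gray code ordering, so it costs O(2^(n−1) n^2) operations; the name `permanent_glynn` is illustrative.

```python
from itertools import product
from math import prod


def permanent_glynn(a):
    """Permanent of an integer matrix via the sum over sign vectors delta."""
    n = len(a)
    if n == 0:
        return 1
    total = 0
    # delta_1 is fixed to +1; the remaining n-1 signs range over {+1, -1}.
    for tail in product((1, -1), repeat=n - 1):
        delta = (1,) + tail
        sign = prod(delta)
        # Product over columns j of the signed column sums sum_i delta_i * a[i][j].
        columns = prod(
            sum(delta[i] * a[i][j] for i in range(n)) for j in range(n)
        )
        total += sign * columns
    # For integer entries the total is an exact multiple of 2^(n-1).
    return total // 2 ** (n - 1)


print(permanent_glynn([[1, 2], [3, 4]]))  # (24 - 4) / 2 = 10
```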

Special cases

Planar and K_3,3-free

The number of perfect matchings in a bipartite graph is counted by the permanent of the graph's biadjacency matrix, and the permanent of any 0-1 matrix can be interpreted in this way as the number of perfect matchings in a graph. For planar graphs (regardless of bipartiteness), the FKT algorithm computes the number of perfect matchings in polynomial time by changing the signs of a carefully chosen subset of the entries in the Tutte matrix of the graph, so that the Pfaffian of the resulting skew-symmetric matrix (the square root of its determinant) is the number of perfect matchings. This technique can be generalized to graphs that contain no subgraph homeomorphic to the complete bipartite graph K_3,3.^[5]
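To illustrate the correspondence between perfect matchings and the permanent of the biadjacency matrix, here is a small brute-force sketch. It is not the FKT algorithm; the helper names and the edge-list encoding (left vertex, right vertex), both numbered from 0, are assumptions made for the example.

```python
from itertools import permutations


def biadjacency(n, edges):
    """0-1 biadjacency matrix of a bipartite graph with n vertices on each side."""
    m = [[0] * n for _ in range(n)]
    for u, v in edges:
        m[u][v] = 1
    return m


def count_perfect_matchings(n, edges):
    """Permanent of the biadjacency matrix, expanded over all permutations."""
    a = biadjacency(n, edges)
    # A permutation s contributes 1 exactly when every pair (i, s[i]) is an edge.
    return sum(
        all(a[i][s[i]] for i in range(n)) for s in permutations(range(n))
    )


# The 6-cycle, viewed as a bipartite graph, has exactly two perfect matchings.
hexagon = [(0, 0), (0, 1), (1, 1), (1, 2), (2, 2), (2, 0)]
print(count_perfect_matchings(3, hexagon))  # 2
```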

George Pólya asked^[6] when it is possible to change the signs of some of the entries of a 0-1 matrix A so that the determinant of the new matrix is the permanent of A. Not all 0-1 matrices are "convertible" in this manner; in fact it is known (Marcus & Minc 1961) that for $n\geq 3$ there is no linear map $T$ such that $\operatorname {per} T(A)=\det A$ for all $n\times n$ matrices $A$ . Convertible matrices were characterized by Little (1975), who showed that they are precisely the biadjacency matrices of bipartite graphs that have a Pfaffian orientation: an orientation of the edges such that for every even cycle $C$ for which $G\setminus C$ has a perfect matching, there is an odd number of edges directed along C (and thus an odd number with the opposite orientation). It was also shown that these graphs are exactly those that do not contain a subgraph homeomorphic to $K_{3,3}$ , as above.

Computation modulo a number

Modulo 2, the permanent is the same as the determinant, as $(-1)\equiv 1{\pmod {2}}.$ It can also be computed modulo $2^{k}$ in time $O(n^{4k-3})$ for $k\geq 2$ . However, it is UP-hard to compute the permanent modulo any number that is not a power of 2 (Valiant 1979).
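The modulo 2 case can be sketched as Gaussian elimination over GF(2). The function name `permanent_mod2` is illustrative and the input is assumed to be a square integer matrix; row swaps change the determinant's sign, which is invisible modulo 2.

```python
def permanent_mod2(a):
    """Permanent mod 2 of an integer matrix, computed as the determinant mod 2."""
    n = len(a)
    rows = [[x % 2 for x in row] for row in a]
    for col in range(n):
        # Find a row at or below the diagonal with a 1 in this column.
        pivot = next((r for r in range(col, n) if rows[r][col]), None)
        if pivot is None:
            return 0  # singular over GF(2)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        # Clear the column below the pivot; addition in GF(2) is XOR.
        for r in range(col + 1, n):
            if rows[r][col]:
                rows[r] = [x ^ y for x, y in zip(rows[r], rows[col])]
    return 1


print(permanent_mod2([[2, 1], [1, 1]]))  # the permanent is 3, so this prints 1
```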

There are various formulae given by Glynn (2010) for the computation modulo a prime $p$ . First, there is one using symbolic calculations with partial derivatives.

Second, for $p$ = 3 there is the following formula for an n×n-matrix $A$ , involving the matrix's principal minors (Kogan (1996)):

\operatorname {per} (A)=(-1)^{n}\sum _{J\subseteq \{1,\dots ,n\}}\det(A_{J})\det(A_{\bar {J}}),

where $A_{J}$ is the submatrix of $A$ induced by the rows and columns of $A$ indexed by $J$ , and ${\bar {J}}$ is the complement of $J$ in $\{1,\dots ,n\}$ , while the determinant of the empty submatrix is defined to be 1.
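The following sketch checks this formula numerically on a small integer matrix reduced modulo 3. The helper names (`det`, `per`, `principal`, `per_mod3_via_minors`) are hypothetical, and the determinants are computed by brute force, so the code illustrates the identity rather than providing an efficient algorithm.

```python
from itertools import combinations, permutations
from math import prod


def det(a):
    """Integer determinant by the Leibniz formula (only sensible for tiny matrices)."""
    n = len(a)
    total = 0
    for s in permutations(range(n)):
        inversions = sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n))
        total += (-1) ** inversions * prod(a[i][s[i]] for i in range(n))
    return total


def per(a):
    """Permanent by the defining sum over permutations."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))


def principal(a, idx):
    """Principal submatrix on the rows and columns listed in idx."""
    return [[a[i][j] for j in idx] for i in idx]


def per_mod3_via_minors(a):
    """(-1)^n times the sum over J of det(A_J) det(A_complement), reduced mod 3."""
    n = len(a)
    total = 0
    for k in range(n + 1):
        for subset in combinations(range(n), k):
            rest = [i for i in range(n) if i not in subset]
            # The determinant of the empty submatrix evaluates to 1 here.
            total += det(principal(a, subset)) * det(principal(a, rest))
    return ((-1) ** n * total) % 3


a = [[1, 1, 1], [1, 2, 1], [1, 1, 2]]
print(per(a) % 3, per_mod3_via_minors(a))  # both values agree
```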

The expansion above can be generalized to an arbitrary prime characteristic p as the following pair of dual identities:

{\begin{aligned}\operatorname {per} (A)&=(-1)^{n}\sum _{{J_{1}},\ldots ,{J_{p-1}}}\det(A_{J_{1}})\dotsm \det(A_{J_{p-1}})\\\det(A)&=(-1)^{n}\sum _{{J_{1}},\ldots ,{J_{p-1}}}\operatorname {per} (A_{J_{1}})\dotsm \operatorname {per} (A_{J_{p-1}})\end{aligned}}

where in both formulas the sum is taken over all the (p − 1)-tuples ${J_{1}},\ldots ,{J_{p-1}}$ that are partitions of the set $\{1,\dots ,n\}$ into p − 1 subsets, some of them possibly empty.
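As a hedged numerical check of the two dual identities, the sketch below enumerates the ordered partitions by assigning each index to one of p − 1 labeled blocks. The helper names are illustrative, and the brute-force enumeration of (p − 1)^n tuples is only meant for tiny examples.

```python
from itertools import permutations, product
from math import prod


def det(a):
    """Integer determinant by the Leibniz formula (tiny matrices only)."""
    n = len(a)
    total = 0
    for s in permutations(range(n)):
        inversions = sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n))
        total += (-1) ** inversions * prod(a[i][s[i]] for i in range(n))
    return total


def per(a):
    """Permanent by the defining sum over permutations."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))


def principal(a, idx):
    """Principal submatrix on the rows and columns listed in idx."""
    return [[a[i][j] for j in idx] for i in idx]


def dual_sums(a, p):
    """Right-hand sides of the two identities, reduced modulo the prime p."""
    n = len(a)
    sum_det = sum_per = 0
    # labels[i] = b places index i in block J_b; blocks may be empty.
    for labels in product(range(p - 1), repeat=n):
        blocks = [[i for i in range(n) if labels[i] == b] for b in range(p - 1)]
        sum_det += prod(det(principal(a, block)) for block in blocks)
        sum_per += prod(per(principal(a, block)) for block in blocks)
    sign = (-1) ** n
    return (sign * sum_det) % p, (sign * sum_per) % p


a = [[1, 2, 3], [4, 5, 6], [7, 8, 10]]
print(dual_sums(a, 5), (per(a) % 5, det(a) % 5))  # the two pairs agree
```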

The former formula possesses an analog for the hafnian of a symmetric $A$ and an odd p:

\operatorname {haf} ^{2}(A)=(-1)^{n}\sum _{{J_{1}},\ldots ,{J_{p-1}}}\det(A_{J_{1}})\dotsm \det(A_{J_{p-1}})(-1)^{|J_{1}|+\dots +|J_{(p-1)/2}|}

with the sum taken over the same set of indexes. Moreover, in characteristic zero a similar convolution sum expression involving both the permanent and the determinant yields the Hamiltonian cycle polynomial (defined as ${\textstyle \operatorname {ham} (A)=\sum _{\sigma \in H_{n}}\prod _{i=1}^{n}a_{i,\sigma (i)}}$ where $H_{n}$ is the set of n-permutations having only one cycle):

\operatorname {ham} (A)=\sum _{J\subseteq \{2,\dots ,n\}}\det(A_{J})\operatorname {per} (A_{\bar {J}})(-1)^{|J|}.
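The convolution formula can be checked against the definition on a small integer matrix. In this sketch indices are 0-based, so the subsets J range over {1, ..., n − 1}; all function names are illustrative and n ≥ 1 is assumed.

```python
from itertools import combinations, permutations
from math import prod


def det(a):
    """Integer determinant by the Leibniz formula (tiny matrices only)."""
    n = len(a)
    total = 0
    for s in permutations(range(n)):
        inversions = sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n))
        total += (-1) ** inversions * prod(a[i][s[i]] for i in range(n))
    return total


def per(a):
    """Permanent by the defining sum over permutations."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))


def principal(a, idx):
    """Principal submatrix on the rows and columns listed in idx."""
    return [[a[i][j] for j in idx] for i in idx]


def ham_bruteforce(a):
    """Sum of the products over permutations consisting of a single n-cycle."""
    n = len(a)
    total = 0
    for s in permutations(range(n)):
        # Follow the cycle through index 0 and measure its length.
        length, i = 1, s[0]
        while i != 0:
            i = s[i]
            length += 1
        if length == n:
            total += prod(a[i][s[i]] for i in range(n))
    return total


def ham_via_convolution(a):
    """Sum over J (not containing the first index) of (-1)^|J| det(A_J) per(A_rest)."""
    n = len(a)
    total = 0
    for k in range(n):
        for subset in combinations(range(1, n), k):
            rest = [i for i in range(n) if i not in subset]
            total += (-1) ** k * det(principal(a, subset)) * per(principal(a, rest))
    return total


a = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
print(ham_bruteforce(a), ham_via_convolution(a))  # 180 180
```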

In characteristic 2 the latter equality becomes $\operatorname {ham} (A)=\sum _{J\subseteq \{2,\dots ,n\}}\det(A_{J})\operatorname {det} (A_{\bar {J}})$ , which makes it possible to compute the Hamiltonian cycle polynomial of any unitary $U$ (i.e. such that $U^{\textsf {T}}U=I$ , where $I$ is the identity n×n-matrix) in polynomial time, because each minor of such a matrix coincides with its algebraic complement: $\operatorname {ham} (U)=\operatorname {det} ^{2}(U+I_{/1})$ , where $I_{/1}$ is the identity n×n-matrix with the entry at position (1,1) replaced by 0. This can in turn be generalized for a unitary n×n-matrix $U$ as $\operatorname {ham_{K}} (U)=\operatorname {det} ^{2}(U+I_{/K})$ , where $K$ is a subset of {1, ..., n}, $I_{/K}$ is the identity n×n-matrix with the (k,k) entries replaced by 0 for all k in $K$ , and ${\textstyle \operatorname {ham_{K}} (A)=\sum _{\sigma \in H_{n}(K)}\prod _{i=1}^{n}a_{i,\sigma (i)}}$ , where $H_{n}(K)$ is the set of n-permutations each of whose cycles contains at least one element of $K$ .

This formula also implies the following identities over fields of characteristic 3:

for any invertible $A$

\operatorname {per} (A^{-1})\operatorname {det} ^{2}(A)=\operatorname {per} (A);

for any unitary $U$ , that is, a square matrix $U$ such that $U^{\textsf {T}}U=I$ where $I$ is the identity matrix of the corresponding size,

\operatorname {per} ^{2}(U)=\det(U+V)\det(-U)

where $V$ is the matrix whose entries are the cubes of the corresponding entries of $U$ .
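The first of these identities, for an invertible matrix, can be illustrated on a 3×3 example over the integers modulo 3. This is a sketch with hypothetical helper names; the inverse is built from cofactors and relies on Python's three-argument `pow` with exponent −1 (available from Python 3.8) for the modular inverse of the determinant.

```python
from itertools import permutations
from math import prod


def det(a):
    """Integer determinant by the Leibniz formula (tiny matrices only)."""
    n = len(a)
    total = 0
    for s in permutations(range(n)):
        inversions = sum(s[i] > s[j] for i in range(n) for j in range(i + 1, n))
        total += (-1) ** inversions * prod(a[i][s[i]] for i in range(n))
    return total


def per(a):
    """Permanent by the defining sum over permutations."""
    n = len(a)
    return sum(prod(a[i][s[i]] for i in range(n)) for s in permutations(range(n)))


def inverse_mod(a, p):
    """Inverse of a modulo the prime p via the adjugate matrix."""
    n = len(a)
    d_inv = pow(det(a) % p, -1, p)  # raises ValueError if a is singular mod p

    def minor(i, j):
        return [[a[r][c] for c in range(n) if c != j] for r in range(n) if r != i]

    # Entry (j, i) of the inverse is cofactor(i, j) divided by the determinant.
    return [
        [((-1) ** (i + j) * det(minor(i, j)) * d_inv) % p for i in range(n)]
        for j in range(n)
    ]


a = [[1, 1, 1], [1, 2, 1], [1, 1, 2]]
lhs = (per(inverse_mod(a, 3)) * det(a) ** 2) % 3
rhs = per(a) % 3
print(lhs, rhs)  # both sides agree modulo 3
```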

Kogan (1996) also showed that, if a square matrix $A$ is called k-semi-unitary when $\operatorname {rank} (A^{\textsf {T}}A-I)=k$ , then the permanent of a 1-semi-unitary matrix is computable in polynomial time over fields of characteristic 3, while for $k$ > 1 the problem becomes #3-P-complete. (A parallel theory concerns the Hamiltonian cycle polynomial in characteristic 2: while computing it on the unitary matrices is polynomial-time feasible, the problem is #2-P-complete for the k-semi-unitary ones for any k > 0.) This result was substantially extended by Knezevic & Cohen (2017), who proved that in characteristic 3 there is a simple formula relating the permanents of a square matrix and its partial inverse (for $A_{11}$ and $A_{22}$ square and $A_{11}$ invertible):

\operatorname {per} {\begin{pmatrix}A_{11}&A_{12}\\A_{21}&A_{22}\end{pmatrix}}=\operatorname {det} ^{2}(A_{11})\operatorname {per} {\begin{pmatrix}A_{11}^{-1}&A_{11}^{-1}A_{12}\\A_{21}A_{11}^{-1}&A_{22}-A_{21}A_{11}^{-1}A_{12}\end{pmatrix}}

This formula reduces, in polynomial time, the computation of the permanent of an n×n-matrix in which a subset of k or k − 1 rows is expressible as linear combinations of another (disjoint) subset of k rows to the computation of the permanent of an (n − k)×(n − k)- or (n − k + 1)×(n − k + 1)-matrix, respectively. It thereby introduces a compression operator (analogous to the Gaussian modification applied when calculating the determinant) that "preserves" the permanent in characteristic 3. (Similarly, the Hamiltonian cycle polynomial in characteristic 2 has its own invariant matrix compressions, since ham(A) = 0 for any n×n-matrix A having three equal rows or, if n > 2, a pair of indexes i,j such that its i-th and j-th rows are identical and its i-th and j-th columns are identical too.) The closure of that operator, defined as the limit of its sequential application together with the transpose transformation (used each time the operator leaves the matrix intact), is also an operator that, when applied to classes of matrices, maps one class to another.

While the compression operator maps the class of 1-semi-unitary matrices to itself and to the classes of unitary and 2-semi-unitary ones, the compression-closure of the 1-semi-unitary class is still unknown. The same holds for the class of matrices obtained from unitary ones by replacing one row with an arbitrary row vector; the permanent of such a matrix is, via the Laplace expansion, a sum of permanents of 1-semi-unitary matrices and is accordingly polynomial-time computable. This question is closely related to the general problem of the permanent's computational complexity in characteristic 3 and to the question of P versus NP: as shown by Knezevic & Cohen (2017), if such a compression-closure is the set of all square matrices over a field of characteristic 3 or, at least, contains a matrix class on which computing the permanent is #3-P-complete (like the class of 2-semi-unitary matrices), then the permanent is computable in polynomial time in this characteristic.

Knezevic & Cohen (2017) also posed the problem of finding and classifying analogs, in other prime characteristics, of the permanent-preserving compressions that exist in characteristic 3, and gave the following identity for an n×n matrix $A$ and two n-vectors $\alpha$ and $\beta$ (having all their entries from the set {0, ..., p − 1}) such that ${\textstyle {\sum _{i=1}^{n}\alpha _{i}=\sum _{j=1}^{n}\beta _{j}}}$ , valid in an arbitrary prime characteristic p:

\operatorname {per} (A^{(\alpha ,\beta )})=\det ^{p-1}(A)\operatorname {per} (A^{-1})^{((p-1){\vec {1}}_{n}-\beta ,(p-1){\vec {1}}_{n}-\alpha )}\left(\prod _{i=1}^{n}\alpha _{i}!\right)\left(\prod _{j=1}^{n}\beta _{j}!\right)(-1)^{n+\sum _{i=1}^{n}\alpha _{i}}

where, for an n×m-matrix $M$ , an n-vector $x$ and an m-vector $y$ , both vectors having all their entries from the set {0, ..., p − 1}, $M^{(x,y)}$ denotes the matrix obtained from $M$ by repeating its i-th row $x_{i}$ times for i = 1, ..., n and its j-th column $y_{j}$ times for j = 1, ..., m (a multiplicity of zero means that the row or column is removed, so this notion generalizes that of a submatrix), and ${\vec {1}}_{n}$ denotes the n-vector all of whose entries equal 1. This identity is an exact analog of the classical formula expressing a matrix's minor through a minor of its inverse, and hence demonstrates once more a kind of duality between the determinant and the permanent as relative immanants. (The analogous identity for the hafnian of a symmetric $A$ and an odd prime p is ${\textstyle \operatorname {haf} ^{2}(A^{(\alpha ,\alpha )})=\det ^{p-1}(A)\operatorname {haf} ^{2}(A^{-1})^{((p-1){\vec {1}}_{n}-\alpha ,(p-1){\vec {1}}_{n}-\alpha )}\left(\prod _{i=1}^{n}\alpha _{i}!\right)^{2}(-1)^{n(p-1)/2+n+\sum _{i=1}^{n}\alpha _{i}}}$ .)

As an even wider generalization to the partial inverse case in a prime characteristic p, for $A_{11}$ and $A_{22}$ square, $A_{11}$ invertible and of size ${n_{1}}\times {n_{1}}$ , and ${\textstyle {\sum _{i=1}^{n}\alpha _{i}=\sum _{j=1}^{n}\beta _{j}}}$ , the following identity also holds:

\operatorname {per} {\begin{pmatrix}A_{11}&A_{12}\\A_{21}&A_{22}\end{pmatrix}}^{(\alpha ,\beta )}={\det }^{p-1}(A_{11})\operatorname {per} {\begin{pmatrix}A_{11}^{-1}&A_{11}^{-1}A_{12}\\A_{21}A_{11}^{-1}&A_{22}-A_{21}A_{11}^{-1}A_{12}\end{pmatrix}}^{((p-1){\vec {1}}_{n}-\beta ,(p-1){\vec {1}}_{n}-\alpha )}\left(\prod _{i=1}^{n}\alpha _{1,i}!\right)\left(\prod _{j=1}^{n}\beta _{1,j}!\right)(-1)^{n_{1}+\sum _{i=1}^{n}\alpha _{1,i}}

where the common row/column multiplicity vectors $\alpha$ and $\beta$ for the matrix $A$ generate the corresponding row/column multiplicity vectors $\alpha _{s}$ and $\beta _{t}$ , s,t = 1,2, for its blocks (and likewise for the partial inverse of $A$ on the right-hand side of the equality).

Approximate computation

When the entries of A are nonnegative, the permanent can be computed approximately in probabilistic polynomial time, up to an error of εM, where M is the value of the permanent and ε > 0 is arbitrary. In other words, there exists a fully polynomial-time randomized approximation scheme (FPRAS) (Jerrum, Sinclair & Vigoda 2001).

The most difficult step in the computation is the construction of an algorithm to sample almost uniformly from the set of all perfect matchings in a given bipartite graph: in other words, a fully polynomial almost uniform sampler (FPAUS). This can be done using a Markov chain Monte Carlo algorithm that uses a Metropolis rule to define and run a Markov chain whose distribution is close to uniform, and whose mixing time is polynomial.

It is possible to approximately count the number of perfect matchings in a graph via the self-reducibility of the permanent, by using the FPAUS in combination with a well-known reduction from sampling to counting due to Jerrum, Valiant & Vazirani (1986). Let $M(G)$ denote the number of perfect matchings in $G$ . Roughly, for any particular edge $e$ in $G$ , by sampling many matchings in $G$ and counting how many of them are matchings in $G\setminus e$ (that is, avoid $e$ ), one obtains an estimate of the fraction $M(G\setminus e)/M(G)$ , which is the reciprocal of the ratio ${\textstyle \rho ={\frac {M(G)}{M(G\setminus e)}}}$ . The number $M(G)$ is then $\rho M(G\setminus e)$ , where $M(G\setminus e)$ can be approximated by applying the same method recursively.
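One step of this reduction can be sketched as follows. The sampler here is a stand-in: it enumerates all perfect matchings exactly (in exponential time) and draws uniformly from them, whereas the actual scheme would use the Markov chain sampler. The function names, the sample count, and the fixed seed are all assumptions made for the example.

```python
import random
from itertools import permutations


def perfect_matchings(n, edges):
    """All perfect matchings of a bipartite graph with parts {0..n-1} and {0..n-1}."""
    edge_set = set(edges)
    return [
        tuple((i, s[i]) for i in range(n))
        for s in permutations(range(n))
        if all((i, s[i]) in edge_set for i in range(n))
    ]


def estimate_avoiding_fraction(n, edges, e, samples=20000, seed=0):
    """Estimate M(G minus e) / M(G) from uniform samples of perfect matchings of G."""
    rng = random.Random(seed)
    pool = perfect_matchings(n, edges)  # stand-in for the almost uniform sampler
    hits = sum(e not in rng.choice(pool) for _ in range(samples))
    return hits / samples


# K_{3,3} has 6 perfect matchings, 4 of which avoid the edge (0, 0).
edges = [(i, j) for i in range(3) for j in range(3)]
e = (0, 0)
fraction = estimate_avoiding_fraction(3, edges, e)
m_without_e = len(perfect_matchings(3, [x for x in edges if x != e]))
print(m_without_e / fraction)  # an estimate of M(G); the exact value is 6
```

In the full scheme, the exact count `m_without_e` would itself be replaced by a recursive estimate on the smaller graph.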

Another class of matrices for which the permanent is of particular interest is the positive-semidefinite (PSD) matrices.^[7] Using a technique of Stockmeyer counting, their permanents can be computed within the class ${\textsf {BPP}}^{\textsf {NP}}$ , but this is generally not considered a feasible class. It is NP-hard to approximate permanents of PSD matrices within a subexponential factor, and it is conjectured to be ${\textsf {BPP}}^{\textsf {NP}}$ -hard.^[8] If further constraints on the spectrum are imposed, more efficient algorithms are known. One randomized algorithm is based on the model of boson sampling and uses tools from quantum optics to represent the permanent of positive-semidefinite matrices as the expected value of a specific random variable, which is then approximated by its sample mean.^[9] For a certain set of positive-semidefinite matrices, this algorithm approximates the permanent in polynomial time up to an additive error, which is more reliable than that of the standard classical polynomial-time algorithm by Gurvits.^[10]

Notes

^[1] As of 2008, see Rempała & Wesolowski (2008)
^[2] van Lint & Wilson (2001) p. 99
^[3] CRC Concise Encyclopedia of Mathematics
^[4] Nijenhuis & Wilf (1978)
^[5] Little (1974), Vazirani (1988)
^[6] Pólya (1913), Reich (1971)
^[7] See open problem (4) at Shtetl Optimized: Introducing some British people to P vs. NP, 22 July 2015
^[8] Meiburg, Alexander (2023), "Inapproximability of Positive Semidefinite Permanents and Quantum State Tomography", Algorithmica, 85 (12): 3828–3854, arXiv:2111.03142, doi:10.1007/s00453-023-01169-1
^[9] Chakhmakhchyan, Levon; Cerf, Nicolas; Garcia-Patron, Raul (2017), "A quantum-inspired algorithm for estimating the permanent of positive semidefinite matrices", Phys. Rev. A, 96 (2): 022329, arXiv:1609.02416, Bibcode:2017PhRvA..96b2329C, doi:10.1103/PhysRevA.96.022329, S2CID 54194194
^[10] Gurvits, Leonid (2005), "On the complexity of mixed discriminants and related problems", Mathematical Foundations of Computer Science 2005, Lecture Notes in Computer Science, vol. 3618, pp. 447–458, doi:10.1007/11549345_39, ISBN 978-3-540-28702-5

References

Allender, Eric; Gore, Vivek (1994), "A uniform circuit lower bound for the permanent", SIAM Journal on Computing, 23 (5): 1026–1049, CiteSeerX 10.1.1.51.3546, doi:10.1137/s0097539792233907
Balasubramanian, K. (1980), Combinatorics and Diagonals of Matrices (PDF), Ph.D. Thesis, Department of Statistics, Loyola College, Madras, India, vol. T073, Indian Statistical Institute, Calcutta
Bax, Eric (1998), Finite-difference Algorithms for Counting Problems, Ph.D. Dissertation, vol. 223, California Institute of Technology
Bax, Eric; Franklin, J. (1996), A finite-difference sieve to compute the permanent, Caltech-CS-TR-96-04, California Institute of Technology
Glynn, David G. (2010), "The permanent of a square matrix", European Journal of Combinatorics, 31 (7): 1887–1891, doi:10.1016/j.ejc.2010.01.010
Glynn, David G. (2013), "Permanent formulae from the Veronesean", Designs, Codes and Cryptography, 68 (1–3): 39–47, doi:10.1007/s10623-012-9618-1, S2CID 36911503
Jerrum, M.; Sinclair, A.; Vigoda, E. (2001), "A polynomial-time approximation algorithm for the permanent of a matrix with non-negative entries", Proc. 33rd Symposium on Theory of Computing, pp. 712–721, doi:10.1145/380752.380877, ISBN 978-1581133493, S2CID 8368245, ECCC TR00-079
Jerrum, Mark; Valiant, Leslie; Vazirani, Vijay (1986), "Random generation of combinatorial structures from a uniform distribution", Theoretical Computer Science, 43: 169–188, doi:10.1016/0304-3975(86)90174-X
Kogan, Grigoriy (1996), "Computing permanents over fields of characteristic 3: Where and why it becomes difficult", Proceedings of 37th Conference on Foundations of Computer Science, pp. 108–114, doi:10.1109/SFCS.1996.548469, ISBN 0-8186-7594-2, S2CID 39024286
Knezevic, Anna; Cohen, Greg (2017), Some facts on Permanents in Finite Characteristics, arXiv:1710.01783, Bibcode:2017arXiv171001783K
van Lint, Jacobus Hendricus; Wilson, Richard Michael (2001), A Course in Combinatorics, Cambridge University Press, ISBN 978-0-521-00601-9
Little, C. H. C. (1974), "An extension of Kasteleyn's method of enumerating the 1-factors of planar graphs", in Holton, D. (ed.), Proc. 2nd Australian Conf. Combinatorial Mathematics, Lecture Notes in Mathematics, vol. 403, Springer-Verlag, pp. 63–72
Little, C. H. C. (1975), "A characterization of convertible (0, 1)-matrices", Journal of Combinatorial Theory, Series B, 18 (3): 187–208, doi:10.1016/0095-8956(75)90048-9
Marcus, M.; Minc, H. (1961), "On the relation between the determinant and the permanent", Illinois Journal of Mathematics, 5 (3): 376–381, doi:10.1215/ijm/1255630882
Nijenhuis, Albert; Wilf, Herbert S. (1978), Combinatorial Algorithms, Academic Press
Pólya, G. (1913), "Aufgabe 424", Arch. Math. Phys., 20 (3): 27
Reich, Simeon (1971), "Another solution of an old problem of Pólya", American Mathematical Monthly, 78 (6): 649–650, doi:10.2307/2316574, JSTOR 2316574
Rempała, Grzegorz A.; Wesolowski, Jacek (2008), Symmetric Functionals on Random Matrices and Random Matchings Problems, Springer, p. 4, ISBN 978-0-387-75145-0
Ryser, Herbert John (1963), Combinatorial Mathematics, The Carus Mathematical Monographs, Vol. 14, Mathematical Association of America, ISBN 978-1-61444-014-7
Vazirani, Vijay V. (1988), "NC algorithms for computing the number of perfect matchings in K_3,3-free graphs and related problems", Proc. 1st Scandinavian Workshop on Algorithm Theory (SWAT '88), Lecture Notes in Computer Science, vol. 318, Springer-Verlag, pp. 233–242, doi:10.1007/3-540-19487-8_27, hdl:1813/6700, ISBN 978-3-540-19487-3
Valiant, Leslie G. (1979), "The complexity of computing the permanent", Theoretical Computer Science, 8 (2), Elsevier: 189–201, doi:10.1016/0304-3975(79)90044-6, S2CID 1637832
"Permanent", CRC Concise Encyclopedia of Mathematics, Chapman & Hall/CRC, 2002
