Preconditioned Crank–Nicolson algorithm

In computational statistics, the preconditioned Crank–Nicolson algorithm (pCN) is a Markov chain Monte Carlo (MCMC) method for obtaining random samples – sequences of random observations – from a target probability distribution for which direct sampling is difficult.

The most significant feature of the pCN algorithm is its dimension robustness, which makes it well-suited for high-dimensional sampling problems. The pCN algorithm is well-defined, with non-degenerate acceptance probability, even for target distributions on infinite-dimensional Hilbert spaces. As a consequence, when pCN is implemented on a computer in large but finite dimension N, i.e. on an N-dimensional subspace of the original Hilbert space, the convergence properties (such as ergodicity) of the algorithm are independent of N. This is in strong contrast to schemes such as Gaussian random walk Metropolis–Hastings and the Metropolis-adjusted Langevin algorithm, whose acceptance probability, for a fixed step size, degenerates to zero as N tends to infinity.

The algorithm as named was highlighted in 2013 by Cotter, Roberts, Stuart and White,[1] and its ergodicity properties were proved a year later by Hairer, Stuart and Vollmer.[2] In the specific context of sampling diffusion bridges, the method was introduced in 2008.[3]

Description of the algorithm

Overview

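The pCN algorithm generates a Markov chain $(X_n)_{n \in \mathbb{N}}$ on a Hilbert space $\mathcal{H}$ whose invariant measure is a probability measure $\mu$ of the form

$$\mu(E) = \frac{1}{Z} \int_E \exp(-\Phi(x)) \, \mu_0(\mathrm{d}x)$$

for each measurable set $E \subseteq \mathcal{H}$, with normalising constant $Z$ given by

$$Z = \int_{\mathcal{H}} \exp(-\Phi(x)) \, \mu_0(\mathrm{d}x),$$

where $\mu_0 = \mathcal{N}(0, C_0)$ is a Gaussian measure on $\mathcal{H}$ with covariance operator $C_0$ and $\Phi \colon \mathcal{H} \to \mathbb{R}$ is some function. Thus, the pCN method applies to target probability measures that are re-weightings of a reference Gaussian measure.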

The Metropolis–Hastings algorithm is a general class of methods that try to produce such Markov chains $(X_n)_{n \in \mathbb{N}}$, and do so by a two-step procedure of first proposing a new state $X'_{n+1}$ given the current state $X_n$ and then accepting or rejecting this proposal, according to a particular acceptance probability, to define the next state $X_{n+1}$. The idea of the pCN algorithm is that a clever choice of (non-symmetric) proposal for a new state $X'_{n+1}$ given $X_n$ might have an associated acceptance probability function with very desirable properties.
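The two-step propose-then-accept/reject procedure described above can be sketched generically as follows. This is a minimal illustration, not code from the cited papers: the names `mh_step`, `propose` and `accept_prob` are hypothetical, and the one-dimensional standard normal target with a symmetric random walk proposal is an assumed toy example.

```python
import numpy as np

def mh_step(x, propose, accept_prob, rng):
    """One Metropolis-Hastings step: propose, then accept or reject.

    Returns the next state and a flag saying whether the proposal was accepted.
    """
    x_prop = propose(x, rng)
    if rng.uniform() <= accept_prob(x, x_prop):
        return x_prop, True
    return x, False

# Assumed toy example: target N(0, 1) on the real line, symmetric random walk proposal.
def propose(x, rng):
    return x + rng.standard_normal()

def accept_prob(x, x_prop):
    # For a symmetric proposal the acceptance probability is the ratio of target densities.
    return min(1.0, float(np.exp(0.5 * x**2 - 0.5 * x_prop**2)))

rng = np.random.default_rng(0)
x, samples = 0.0, []
for _ in range(5000):
    x, _ = mh_step(x, propose, accept_prob, rng)
    samples.append(x)
samples = np.array(samples)
```

The pCN method keeps this outer loop unchanged and only swaps in a particular proposal and the acceptance probability that goes with it.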

The pCN proposal

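The special form of this pCN proposal is to take

$$X'_{n+1} = \sqrt{1 - \beta^2} \, X_n + \beta \, \Xi_{n+1},$$
$$\Xi_{n+1} \sim \mu_0 \text{ i.i.d.}$$

or, equivalently,

$$X'_{n+1} \mid X_n \sim \mathcal{N}\left(\sqrt{1 - \beta^2} \, X_n, \beta^2 C_0\right).$$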

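The parameter $0 < \beta < 1$ is a step size that can be chosen freely (and even optimised for statistical efficiency). One then generates $U_{n+1} \sim \mathrm{Unif}([0, 1])$ and, writing $\alpha$ for the acceptance probability, sets

$$X_{n+1} = \begin{cases} X'_{n+1}, & \text{if } U_{n+1} \le \alpha(X_n, X'_{n+1}), \\ X_n, & \text{if } U_{n+1} > \alpha(X_n, X'_{n+1}). \end{cases}$$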

The acceptance probability takes the simple form

$$\alpha(x, x') = \min\left\{1, \exp\left(\Phi(x) - \Phi(x')\right)\right\}.$$

It can be shown[2] that this method not only defines a Markov chain that satisfies detailed balance with respect to the target distribution $\mu$, and hence has $\mu$ as an invariant measure, but also possesses a spectral gap that is independent of the dimension of $\mathcal{H}$, and so the law of $X_n$ converges to $\mu$ as $n \to \infty$. Thus, although one may still have to tune the step size parameter $\beta$ to achieve a desired level of statistical efficiency, the performance of the pCN method is robust to the dimension of the sampling problem being considered.
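As an illustration of the proposal and accept/reject rule in this section, here is a minimal NumPy sketch of pCN on an N-dimensional truncation. Several things are assumptions made for the example and are not taken from the source: the covariance operator is taken to be diagonal with eigenvalues k^-2, the potential `phi` is a toy quadratic in the first five coordinates, and the function name `pcn_sample` is hypothetical.

```python
import numpy as np

def pcn_sample(phi, sqrt_c0_diag, beta, n_steps, x0, rng):
    """Run n_steps of pCN for the target exp(-phi(x)) N(0, C0)(dx).

    sqrt_c0_diag holds the square roots of the eigenvalues of a diagonal C0
    (an assumption of this sketch). Returns the chain and the acceptance rate.
    """
    x = np.asarray(x0, dtype=float).copy()
    phi_x = phi(x)
    chain = np.empty((n_steps, x.size))
    n_accept = 0
    for n in range(n_steps):
        xi = sqrt_c0_diag * rng.standard_normal(x.size)   # xi ~ N(0, C0)
        x_prop = np.sqrt(1.0 - beta**2) * x + beta * xi    # pCN proposal
        phi_prop = phi(x_prop)
        # Accept with probability min(1, exp(phi(x) - phi(x'))), tested on the log scale.
        if np.log(rng.uniform()) <= phi_x - phi_prop:
            x, phi_x = x_prop, phi_prop
            n_accept += 1
        chain[n] = x
    return chain, n_accept / n_steps

# Assumed toy problem: N = 200, C0 = diag(k^-2), potential on the first five coordinates.
rng = np.random.default_rng(0)
N = 200
sqrt_c0 = 1.0 / np.arange(1, N + 1)
phi = lambda x: 0.5 * np.sum((x[:5] - 1.0) ** 2)
chain, rate = pcn_sample(phi, sqrt_c0, beta=0.3, n_steps=2000, x0=np.zeros(N), rng=rng)
```

Note that the acceptance test involves only the potential, never the Gaussian reference density. In particular, if `phi` is constant then every proposal is accepted and the chain samples the reference measure directly.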

Contrast with symmetric proposals

This behaviour of pCN is in stark contrast to the Gaussian random walk proposal

$$X'_{n+1} \mid X_n \sim \mathcal{N}(X_n, \beta \Gamma)$$

with any choice of proposal covariance $\Gamma$, or indeed any symmetric proposal mechanism. It can be shown using the Cameron–Martin theorem that for infinite-dimensional $\mathcal{H}$ this proposal has acceptance probability zero for $\mu$-almost all $X'_{n+1}$ and $X_n$. In practice, when one implements the Gaussian random walk proposal in dimension $N$, this phenomenon can be seen in the way that

  • for fixed $\beta$, the acceptance probability tends to zero as $N \to \infty$, and
  • for a fixed desired positive acceptance probability, $\beta \to 0$ as $N \to \infty$.
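The first of these effects can be observed numerically. The sketch below compares empirical acceptance rates of pCN and of a Gaussian random walk proposal at the same fixed step size as the dimension grows. It is an illustration under stated assumptions, none of which come from the source: the covariance is diagonal with eigenvalues k^-2, the random walk covariance is chosen proportional to that same covariance, the potential is a toy function of the first coordinate only, and `acceptance_rate` is a hypothetical helper.

```python
import numpy as np

def acceptance_rate(kind, n_dim, beta, n_steps, rng):
    """Empirical acceptance rate of 'pcn' or 'rw' (random walk) in dimension n_dim."""
    sqrt_c0 = 1.0 / np.arange(1, n_dim + 1)        # assumed C0 = diag(k^-2)
    phi = lambda x: 0.5 * x[0] ** 2                # assumed toy potential
    x = sqrt_c0 * rng.standard_normal(n_dim)       # start from a draw of N(0, C0)
    accepted = 0
    for _ in range(n_steps):
        xi = sqrt_c0 * rng.standard_normal(n_dim)  # xi ~ N(0, C0)
        if kind == "pcn":
            prop = np.sqrt(1.0 - beta**2) * x + beta * xi
            log_alpha = phi(x) - phi(prop)
        else:
            # Random walk with covariance beta * C0 (assumed choice of proposal covariance).
            prop = x + np.sqrt(beta) * xi
            # Symmetric proposal: the ratio of Lebesgue densities includes the Gaussian term.
            log_alpha = (phi(x) - phi(prop)
                         + 0.5 * np.sum((x / sqrt_c0) ** 2)
                         - 0.5 * np.sum((prop / sqrt_c0) ** 2))
        if np.log(rng.uniform()) <= log_alpha:
            x = prop
            accepted += 1
    return accepted / n_steps

rng = np.random.default_rng(1)
dims = [10, 100, 1000]
pcn_rates = [acceptance_rate("pcn", n, 0.2, 2000, rng) for n in dims]
rw_rates = [acceptance_rate("rw", n, 0.2, 2000, rng) for n in dims]
for n, p, r in zip(dims, pcn_rates, rw_rates):
    print(f"N={n:5d}  pCN: {p:.3f}  random walk: {r:.3f}")
```

In this setup the pCN rate should stay roughly constant across dimensions, while the random walk rate should fall towards zero as the dimension increases. The exact numbers depend on the seed and on the assumed covariance and potential.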

References

  1. Cotter, S. L.; Roberts, G. O.; Stuart, A. M.; White, D. (2013). "MCMC methods for functions: modifying old algorithms to make them faster". Statist. Sci. 28 (3): 424–446. arXiv:1202.0709. doi:10.1214/13-STS421. ISSN 0883-4237. S2CID 36562755.
  2. Hairer, M.; Stuart, A. M.; Vollmer, S. J. (2014). "Spectral gaps for a Metropolis–Hastings algorithm in infinite dimensions". Ann. Appl. Probab. 24 (6): 2455–2490. arXiv:1112.1392. doi:10.1214/13-AAP982. ISSN 1050-5164. S2CID 73662504.
  3. Beskos, A.; Roberts, G. O.; Stuart, A. M.; Voss, J. (2008). "MCMC Methods for Diffusion Bridges". Stochastics and Dynamics. 8 (3): 319–350. doi:10.1142/S0219493708002378.