Skip to article frontmatterSkip to article content
Tutorial

Stochastic Matrices

Abstract

The conditions for a matrix to be stochastic.

Keywords:Matrices.

A stochastic matrix, AA, is a square matrices with positive real values as elements, AMnn(R+)A \in \mathbb{M}_{nn}(\mathbb{R^{+}}). Each element represents the probability of some transformation taking place.

They are applied to vectors that contain probabilities of a system being in a given state. For example,

p=(p1,p2,,pn)  where  pi0  i ,inpi=1,\bm{p} = (p_{1}, p_2, \ldots, p_{n}) ~ ~ \textrm{where} ~ ~ p_{i}\geq0~\forall~i ~, \sum_{i}^{n} p_{i} = 1,

and pip_{i} is the probability of they system being in the iith state.

If

A=(a11a12a1na21a22a2nan1an2ann),A = \begin{pmatrix} a_{11} & a_{12} & \ldots & a_{1n} \\ a_{21} & a_{22} & \ldots & a_{2n} \\ \vdots & \vdots & \vdots & \vdots \\ a_{n1} & a_{n2} & \ldots & a_{nn} \\ \end{pmatrix},

then a11a_{11} is the probability of initially being in the 1 state and ending up in the 1 state, a12a_{12} is the probability of initially being in the 1 state and ending up in the 2 state etc..

The total probability that a state starts in a given state and ends up in another must be one, hence

j=1naij=1,\sum_{j=1}^{n}a_{ij} = 1,

and AA is left stochastic.

Fixed Points

Let AA be a left stochastic matrix and p\bm{p} be a probability vector such that

pA=p,\bm{p}A = \bm{p},

then p\bm{p} is the stationary probability vector, or fixed point, of the transformation AA.

All doubly stochastic matrices have the maximally mixed probability vector as there fixed point, such that

(1/n,1/n,,1/n)A=(1/n,1/n,,1/n),(1/n, 1/n, \ldots, 1/n) A = (1/n, 1/n, \ldots, 1/n),

if AA is doubly stochastic.

If there exists a stocastic matrix AA such that pA=p\bm{p}A = \bm{p'}, then p\bm{p'} can be considered to be more mixed then p\bm{p}.

Birkhoff-Von Neumann Theorem

AMnn(R+)A \in \mathbb{M}_{nn}(\mathbb{R^{+}}) is a doubly stochastic matrix if and only if there exists a set of numbers {pi}\{p_{i}\} where pi0  ip_i \geq 0~\forall~i and kpk=1\sum_k p_{k} = 1 such that

A=ipiVi,A = \sum_{i} p_{i} V_{i},

where ViMnn(R+)V_{i} \in \mathbb{M}_{nn}(\mathbb{R^{+}}) are the n×nn \times n permutation matrices. Therefore, all doubly stochastic matrices are convex combinations of permutation matrices.