Full Text
UDC 519.217
MATHEMATICS
Corresponding Member of the Academy of Sciences of the USSR N. V. SMIRNOV,
O. V. SARMANOV, V. K. ZAKHAROV
A LOCAL LIMIT THEOREM FOR THE NUMBERS OF TRANSITIONS IN A MARKOV CHAIN AND ITS APPLICATIONS
A simple homogeneous Markov chain with \(s+1\) states \(E_i,\ i=1,2,\ldots,s+1\), and a positive matrix of transition probabilities \(\{p_{ij}\}\), \(p_{ij}>0,\ i,j=1,2,\ldots,s+1\), is considered. By \(p_i^{(1)}\) we denote the initial probabilities of \(E_i\) and suppose that \(p_i^{(1)}>0\). According to \((1)\), the matrix \(\{p_{ij}\}\) will be simple and regular, and the chain will be ergodic; in this case there exists a unique set of positive final probabilities \(p_i,\ i=1,2,\ldots,s+1\), independent of the initial probabilities \(p_i^{(1)}\);
\[ \sum_{i=1}^{s+1} p_i^{(1)}=\sum_{i=1}^{s+1} p_i=1. \]
Let \(m_i\) denote the number of occurrences of \(E_i\) in \(n\) trials, and let \(m_{ij}\) denote the number of transitions from state \(E_i\) to state \(E_j\) in the same \(n\) trials.
- Let us compute the number \(K\) of distinct chains of length \(n\), composed of \(s+1\) states, having a prescribed set of numbers of transitions \(m_{ij}\) from \(E_i\) to \(E_j\), beginning with state \(E_{i_0}\) and ending with state \(E_{j_0}\).
The numbers \(m_i\) and \(m_{ij}\) satisfy the conditions
\[ \sum_{i=1}^{s+1} m_i=n,\qquad \sum_{i,j=1}^{s+1} m_{ij}=n-1 \]
and are connected by the relations
\[ \sum_{j=1}^{s+1} m_{ij}=m_i \quad \text{for } i\ne j_0;\qquad \sum_{j=1}^{s+1} m_{j_0j}=m_{j_0}-1; \tag{1} \]
\[ \sum_{i=1}^{s+1} m_{ij}=m_j \quad \text{for } j\ne i_0;\qquad \sum_{i=1}^{s+1} m_{ii_0}=m_{i_0}-1, \]
in consequence of which, among the \((s+1)(s+2)\) variables \(m_i\) and \(m_{ij}\), only \(s^2+s+1\) will be independent; as independent variables we shall choose \(m_i,\ i=1,2,\ldots,s+1\), and \(m_{ij},\ i,j=1,2,\ldots,s\).
We shall carry out the computation of \(K\) by induction on the number of states of the chain. Let
\[ K_{i_0j_0}^{(s)}(m_1,m_2,\ldots,m_s;k_{ij};\ i,j=1,2,\ldots,s-1) \]
be the number, known to us, of chains from \(s\) states with fixed numbers \(m_i\) of occurrences of all \(E_i,\ i=1,2,\ldots,s\), and numbers of transitions \(k_{ij},\ i,j=1,2,\ldots,s-1\) (\(k_{is}\) and \(k_{sj}\) are uniquely expressed in terms of only the indicated parameters). From these chains we construct new chains into which the state \(E_{s+1}\) will enter \(m_{s+1}\) times and in which a fixed set of transitions \(m_{ij},\ i,j=1,2,\ldots,s\), from state \(E_i\) to state \(E_j\), will be obtained.
Consider an arbitrary chain composed of \(s\) states. Among its elements we must in a certain way place series of states \(E_{s+1}\). Among all pairs of adjacent states \(E_iE_j,\ i,j=1,2,\ldots,s\), and the number of such pairs is \(k_{ij}\), choose \(\gamma_{ij}\) pairs which we shall separate by a series of states \(E_{s+1}\); the number of such choices is equal to \(C_{k_{ij}}^{\gamma_{ij}}\). Thus, \(m_{s+1}\) occurrences of the state \(E_{s+1}\) must be divided into
\[ \gamma=\sum_{i,j=1}^{s}\gamma_{ij} \]
series,
which can be done in a number of ways equal to \(C_{m_{s+1}-1}^{\gamma-1}\). Note that when \(\gamma_{ij}\) series of states \(E_{s+1}\) are placed between the components of the pairs \(E_i'E_j\), the number of these pairs will decrease by \(\gamma_{ij}\), and precisely this new number of remaining pairs \(k_{ij}-\gamma_{ij}\) must be equal to \(m_{ij}\) in the new chain; hence
\[ k_{ij}=m_{ij}+\gamma_{ij},\qquad i,j=1,2,\ldots,s. \tag{2} \]
Since
\[ \sum_{i,j=1}^{s} k_{ij}=\sum_{i=1}^{s} m_i-1, \]
we have
\[ \gamma=\sum_{i=1}^{s} m_i-1-\sum_{i,j=1}^{s} m_{ij}=\mathrm{const}. \tag{3} \]
For fixed \(\gamma_{ij}\), the number of new chains obtained from the old ones by adding \(m_{s+1}\) states \(E_{s+1}\) will be equal to
\[ K_{i_0j_0}^{(s)} (m_1,m_2,\ldots,m_s;\,m_{ij}+\gamma_{ij};\, i,j=1,2,\ldots,s-1) C_{m_{s+1}-1}^{\gamma-1} \prod_{i,j=1}^{s} C_{m_{ij}+\gamma_{ij}}^{\gamma_{ij}} . \]
Summing over all \(\gamma_{ij}\), we obtain the recurrence formula
\[ K_{i_0j_0}^{(s+1)}(m_1,m_2,\ldots,m_{s+1};\,m_{ij};\,i,j=1,2,\ldots,s)= \]
\[ = C_{m_{s+1}-1}^{\gamma-1} \sum_{\substack{\gamma_{ij}\\ i,j=1,2,\ldots,s}} K_{i_0j_0}^{(s)}(m_1,m_2,\ldots,m_s;\,m_{ij}+\gamma_{ij};\,i,j=1,2,\ldots,s-1) \times \]
\[ \times \prod_{i,j=1}^{s} C_{m_{ij}+\gamma_{ij}}^{\gamma_{ij}} . \tag{4} \]
Let us also show how \(K_{i_0j_0}^{(2)}\) is found for two states \(E_{i_0},E_{j_0}\) (with \(i_0\ne j_0\)).
We divide the \(m_{i_0}\) states \(E_{i_0}\) and the \(m_{j_0}\) states \(E_{j_0}\) into the same number \(m_{i_0}-m_{i_0i_0}\) of ordered series (with such a number of series there will be exactly \(m_{i_0i_0}\) transitions from \(E_{i_0}\) to \(E_{i_0}\), while the numbers of transitions \(m_{i_0j_0}, m_{j_0i_0}\), and \(m_{j_0j_0}\) are fixed automatically). The ordered series, without disturbing their order, are placed into one chain, putting the series from \(E_{i_0}\) in the odd positions and the series from \(E_{j_0}\) in the even positions; then the chains will begin with \(E_{i_0}\) and end with \(E_{j_0}\).
Thus the number of all such chains is
\[ K_{i_0j_0}^{(2)} = C_{m_{i_0}-1}^{m_{i_0}-m_{i_0i_0}-1} C_{m_{j_0}-1}^{m_{i_0}-m_{i_0i_0}-1}. \tag{5} \]
The quantity \(K_{i_0i_0}^{(2)}\) is found analogously.
By direct substitution one can verify that the following expression satisfies the recurrence formula (4) (for \(i_0\ne j_0\)):
\[ K= \frac{(m_{j_0}-1)\prod_{i=1}^{s+1} m_i!} {m_{j_0}m_{s+1}\prod_{i,j=1}^{s+1} m_{ij}!} \left| \begin{array}{ccccc} 1-\dfrac{m_{11}}{m_1} & \cdots & -\dfrac{m_{1i_0}}{m_1} & \cdots & -\dfrac{m_{1j_0}}{m_1} & \cdots & -\dfrac{m_{1s}}{m_1} \\ \cdots & \cdots & \cdots & \cdots & \cdots & \cdots & \cdots \\ -\dfrac{m_{i_0 1}}{m_{i_0}} & \cdots & 1-\dfrac{m_{i_0i_0}}{m_{i_0}} & \cdots & -\dfrac{m_{i_0j_0}}{m_{i_0}} & \cdots & -\dfrac{m_{i_0s}}{m_{i_0}} \\ \cdots & \cdots & \cdots & \cdots & \cdots & \cdots & \cdots \\ -\dfrac{m_{j_0 1}}{m_{j_0}-1} & \cdots & -\dfrac{m_{j_0i_0}+1}{m_{j_0}-1} & \cdots & 1-\dfrac{m_{j_0j_0}-1}{m_{j_0}-1} & \cdots & -\dfrac{m_{j_0s}}{m_{j_0}-1} \\ \cdots & \cdots & \cdots & \cdots & \cdots & \cdots & \cdots \\ -\dfrac{m_{s1}}{m_s} & \cdots & -\dfrac{m_{si_0}}{m_s} & \cdots & -\dfrac{m_{sj_0}}{m_s} & \cdots & 1-\dfrac{m_{ss}}{m_s} \end{array} \right|. \tag{6} \]
For \(i_0=j_0\), the row of the determinant with this number takes the form
\[ -\frac{m_{j_0 1}}{m_{j_0}-1},\ldots, 1-\frac{m_{j_0}}{m_{j_0}-1},\ldots, -\frac{m_{j_0s}}{m_{j_0}-1}, \]
and the remaining rows remain unchanged.
Remark. An expression equivalent to (6) was obtained by elementary methods in (2); the formula for \(K\) given in (3) is erroneous.
2. Let \(P_{i_0 j_0}^{(n)}(\{m_{ij}\})\) be the probability of observing numbers of transitions forming the matrix \(\{m_{ij}\}\), \(i,j=1,2,\ldots,s+1\), in \(n\) trials beginning with the occurrence of \(E_{i_0}\) and ending with the occurrence of \(E_{j_0}\). Then
\[ P_{i_0 j_0}^{(n)}(\{m_{ij}\})=Kp_{i_0}^{(1)}\prod_{i,j=1}^{s+1}p_{ij}^{m_{ij}}, \tag{7} \]
where \(K\) is determined by formula (6). Putting
\[ \begin{aligned} m_{ij}&=np_i p_{ij}+\xi_{ij}\sqrt{np_i p_{ij}}, \qquad &&i,j=1,2,\ldots,s+1,\\ m_i&=np_i+z_i\sqrt{np_i}, \qquad &&i\ne j_0,\\ m_{j_0}-1&=np_{j_0}+z_{j_0}\sqrt{np_{j_0}},\\ \Delta \xi_{ij}&=1/\sqrt{np_i p_{ij}}, \qquad &&i,j=1,2\ldots,s,\\ \Delta z_i&=1/\sqrt{np_i}, \qquad &&i=1,2,\ldots,s. \end{aligned} \tag{8} \]
we find, as a consequence of (1),
\[ \sum_{j=1}^{s+1}\xi_{ij}\sqrt{p_{ij}}=z_i,\qquad i=1,2,\ldots,s+1. \tag{9} \]
Moreover, the following limiting relations (obtained as \(n\to\infty\)) are valid:
\[ \sum_{i=1}^{s+1}\xi_{ij}\sqrt{p_i p_{ij}}=z_j\sqrt{p_j},\qquad j=1,2,\ldots,s+1, \]
\[ \sum_{i=1}^{s+1} z_i\sqrt{p_i}=0. \tag{9'} \]
Regarding the variables \(\xi_{ij}\) and \(z_i\) as varying within finite limits, with the aid of Stirling’s formula and taking (8) and (9) into account, we bring (7) to the form
\[ P_{i_0 j_0}^{(n)}(\{m_{ij}\}) = p_{i_0}^{(1)}p_{j_0}Ce^{-Q/2} \prod_{i,j=1}^{s}\Delta\xi_{ij} \prod_{i=1}^{s}\Delta z_i (1+O(n^{-1/2})), \tag{10} \]
where
\[ C= \frac{1}{(2\pi)^{(s^2+s)/2}} \left( \frac{\displaystyle\prod_{i=1}^{s}p_i} {\displaystyle p_{s+1}^{\,s+2}\prod_{i=1}^{s}p_{i,s+1}\prod_{j=1}^{s+1}p_{s+1,j}} \right)^{1/2} \left| \begin{array}{ccccc} 1-p_{11} & -p_{12} & \ldots & -p_{1s}\\ -p_{21} & 1-p_{22} & \ldots & -p_{2s}\\ \cdot & \cdot & \cdot & \cdot & \cdot\\ -p_{s1} & -p_{s2} & \ldots & 1-p_{ss} \end{array} \right|, \tag{11} \]
\[ Q= \sum_{i,j=1}^{s+1}\xi_{ij}^{2} - \sum_{i=1}^{s+1}z_i^{2} = \sum_{i,j=1}^{s+1}\left(\xi_{ij}-z_i\sqrt{p_{ij}}\right)^2{}^*. \tag{12} \]
Summing (10) over \(i_0\) and \(j_0\) from 1 to \(s+1\), we obtain the probability of observing the transitions \(\{m_{ij}\}\) in an arbitrary segment of \(n\) observations:
\[ P^{(n)}(\{m_{ij}\}) = Ce^{-Q/2} \prod_{i,j=1}^{s}\Delta\xi_{ij} \prod_{i=}^{s}\Delta z_i (1+O(n^{-1/2})). \tag{13} \]
Thus, we have proved the
Local limit theorem. The numbers of transitions \(\{m_{ij}\}\), under the normalization (8), have a joint asymptotically normal dis-
* In (3), in the expression for \(Q\), \(p_i\) is printed instead of \(p_{ij}\).
distribution with density \(C e^{-Q/2}\) with \(s^2+s\) normally correlated variables \(\xi_{ij}\), \(i,j=1,2,\ldots,s\), \(z_i\), \(i=1,2,\ldots,s\).
Remark. Integrating \(C e^{-Q/2}\) with respect to the variables \(\xi_{ij}\), \(i,j=1,2,\ldots,s\), over the limits from \(-\infty\) to \(\infty\), we obtain the local limit theorem for the numbers of states of a Markov chain, previously proved in (4). On the other hand, considering pairs of states as states of a new chain having \((s+1)^2\) states, one can obtain the same local theorem by using the result of A. N. Kolmogorov (4).
- The frequencies \(m_i/n\) and \(m_{ij}/m_i\) are naturally to be regarded as statistical estimates, respectively, of the final probabilities \(p_i\) and of the transition probabilities \(p_{ij}\), obtained by the maximum-likelihood method.
Let us note that the random variable (12) has a \(\chi^2\) distribution with \(s^2+s\) degrees of freedom.
From (12) and (8) it follows that
\[ Q=\sum_{i,j=1}^{s+1}\frac{(m_{ij}-m_i p_{ij})^2}{n p_i p_j} =\sum_{i,j=1}^{s+1}\left(\frac{m_{ij}}{m_i}-p_{ij}\right)^2 \frac{n p_i}{p_{ij}}+\eta_n =Q_1+\eta_n, \tag{14} \]
where \(\eta_n\), as \(n\to\infty\), tends to zero with probability one.
Considering an inequality of the form \(Q_1<L\) as a criterion for testing, from observations, the simple hypothesis \(H_0\) fixing the probabilities \(\{p_{ij}\}\), we conclude on the basis of (14) that, for large \(n\), this criterion is asymptotically distributed according to the \(\chi^2\) law with \(s^2+s\) degrees of freedom. As always, the critical region is the region of large values of \(Q_1\).
If, along with the hypothesis \(H_0\), we allow the possibility of an alternative hypothesis \(H_1\), close to \(H_0\), under which the probabilities \(p_{ij}\) receive a certain displacement \(c_{ij}/\sqrt{n}\)
\[
\left(\sum_{j=1}^{s+1} c_{ij}=0\right),
\]
then, under the hypothesis \(H_1\), the criterion \(Q_1\) asymptotically follows a noncentral \(\chi^2\)-distribution with \(s^2+s\) degrees of freedom and noncentrality parameter \(\sum c_{ij}^{\,2}\). This circumstance can be used to estimate the power of the criterion and to estimate a sufficient number of observations for distinguishing the hypotheses.
Remark. One could also have refrained from assuming the positivity of all \(p_{ij}\). The main conclusions of the paper remain valid if the matrix \(\{p_{ij}\}\) is indecomposable and regular (1), since then there exists a unique set of final probabilities. The equality to zero of individual \(p_{ij}\) leads only to a corresponding reduction in the number of degrees of freedom.
V. A. Steklov Mathematical Institute
Academy of Sciences of the USSR
Received
8 XII 1965
REFERENCES
- S. N. Bernstein, Collected Works, 4, Moscow, 1964, p. 455.
- P. Whittle, J. Roy. Stat. Soc., ser. B, 17, 235 (1955).
- N. V. Smirnov, Vestn. LGU, No. 11, 47 (1955).
- A. N. Kolmogorov, Izv. AN SSSR, ser. matem., 13, 281 (1949).