UDC 519.28
MATHEMATICS
Submitted 1968-01-01 | RussiaRxiv: ru-196801.36771 | Translated from Russian

Full Text

UDC 519.28

MATHEMATICS

I. S. DOBROKHOTOV

ON TESTS OF A GENERAL LINEAR HYPOTHESIS WITH UNKNOWN WEIGHTS OF OBSERVATIONS

(Presented by Academician Yu. V. Linnik on 30 III 1967)

In the book of Yu. V. Linnik \((^1)\), properties are indicated that characterize similar Scheffé-type tests as Neyman structures. Such tests have hitherto been known only in the Behrens—Fisher problem. The conditions found are satisfied, for example, by all tests based on the \(t\)-ratio, in particular by the well-known test of E. Barankin \((^3)\) for one problem on linear regression. In \((^1)\) the question is posed of extending the results obtained to the case of a general linear hypothesis with unknown weights of observations. The present note serves this purpose.

Let \(X_1,\ldots,X_N\), where \(X_i \in N(a_i,\sigma_i^2)\), be a repeated sample, and suppose that

\[ a_i=\sum_1^s \alpha_{ij}\beta_j \]

(\(\beta_j\) are new parameters, and the matrix \(\alpha=\|\alpha_{ij}\|\) is assumed given). Suppose that the \(\sigma_i^2\), generally speaking, are different. The general linear hypothesis \(H_0\) is the assertion

\[ \beta_1=B_1,\ldots,\beta_r=B_r, \]

where \(B_i\) \((i=1,\ldots,r)\) are given constants.

Denote by \(\mathfrak X\) the space generated by stochastically independent linear forms \(l_1,\ldots,l_\mu\) and \(L_1,\ldots,L_\nu\) such that

\[ E\{l_i\mid H_0\}=E\{L_j\}=0^* \quad (i=1,\ldots,\mu;\ j=1,\ldots,\nu), \]
\[ D\{l_i\}/D\{l_1\}=C_i \quad (i=1,\ldots,\mu), \]

and \(C_i>0\) does not depend on \(\sigma=(\sigma_1,\ldots,\sigma_N)\), whereas the forms \(L_j\) may have different variances.

Consider two continuous homogeneous functions \(T_1\) and \(T_2\) of degree \(n>0\) with the following properties:

\[ T_1(l_1,\ldots,l_\mu)>0,\quad \text{if }(l_1,\ldots,l_\mu)\ne(0,\ldots,0), \]
\[ T_2(L_1,\ldots,L_\nu)>0,\quad \text{if }(L_1,\ldots,L_\nu)=(0,\ldots,0), \]

and

\[ T_2>C>0, \tag{1} \]

if at least one of its arguments is equal to one. Then the following is valid.

Theorem. If a similar test \(\varphi\) (generally speaking, randomized) is defined on the space \(\mathfrak X\) and accepts the null hypothesis at least under the conditions

\[ T_1/T_2\leqslant \varepsilon \tag{2} \]

(\(\varepsilon>0\) sufficiently small) and

\[ \sqrt{Q_1}=\left[\sum_1^\mu \frac{l_i^2}{c_i}\right]^{1/2}<\delta \tag{3} \]

\[ {}^*\ \text{The mathematical expectations of the forms }L_j\text{ are equal to zero for any values }\beta=(\beta_1,\ldots,\beta_s). \]

(\(\delta>0\) is any fixed number), then the variances of the linear forms \(L_1,\ldots,L_\nu\) satisfy the relations

\[ D\{L_j\}/D\{l_1\}=d_j \qquad (j=1,\ldots,\nu), \tag{4} \]

and \(d_j>0\) do not depend on \(\sigma\).

It follows immediately from this that \(\varphi\) is a Neyman structure for the exponential family generated by the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\). For \(\mu=1\) and in the case of the Behrens—Fisher problem, we again obtain the result of Yu. V. Linnik \((^1)\).

As in \((^1)\), the proof is by contradiction and is based on consideration of the test-similarity condition

\[ E_\sigma\{\varphi\mid H_0\}=\alpha \quad \text{uniformly in } \sigma . \tag{5} \]

Let us briefly explain the proof. Suppose that (4) holds for \(j=1,\ldots,p<\nu\), and transform (5) as described in \((^1)\). As a result, \(\sigma\) are replaced by new parameters \(\theta\), which we regard as complex. Then we consider a parametric point that is singular for both sides of the transformed relation (5). In a neighborhood of this point the stated relation has the form

\[ \int_{\mathfrak x}\cdots\int \varphi\, \frac{dl_1\cdots dl_\mu\,dL_1\cdots dL_\nu}{Q^{\tau+(\mu+\nu)/2}} \]

\[ = A_1G_1(\tau)\xi^{-\tau}(i\xi)^{-\tau-(\nu-p)/2} \prod_{p+1}^{\nu}\left[d_{jN-1}i\xi(d_{jN}-d_{jN-1})\right]^{1/2}, \tag{6} \]

where

\[ Q=Q_1+\sum_{1}^{p}\frac{L_j^2}{d_j} +\sum_{p+1}^{\nu}L_j^2 \frac{i\xi}{d_{jN-1}i\xi+(d_{jN}-d_{jN-1})} +i\xi^2; \]

\(A_1\) is a positive constant; \(G_1(\tau)\) is a function regular in \(\operatorname{Re}(\tau)>0\), and \(\xi>0\) is a number which we henceforth make arbitrarily small. We are now interested in the behavior of the moduli of both sides of (6) as \(\xi\downarrow 0\). A lower estimate for the modulus of the right-hand side has the form

\[ B\xi^{-2\tau-(\nu-p)/2}, \]

where the symbol \(B\) denotes a bounded quantity. To obtain an upper estimate for the modulus of the left-hand side of (6), we divide the space \(\mathfrak x\) into “layers”:

I. \(\;2^{m-1}\xi\le \sqrt{Q_1}<2^m\xi,\)

II. \(\;\dfrac{1}{2^m}\xi\le \sqrt{Q_1}<\dfrac{1}{2^{m-1}}\xi\) inside (3),

III. \(\;2^{m-1}\delta\le \sqrt{Q_1}<2^m\delta\) in the remaining part of the space \(\mathfrak x\).

The estimate is carried out separately in each “layer.” In doing so, the properties of the functions \(T_1\) and \(T_2\) and the inequalities

\[ |Q|\ge |\operatorname{Re}Q|\ge Q_1,\qquad |Q|\ge |\operatorname{Im}Q|\ge B\xi^2 \]

are used for \(|L_j|<\xi\). In the end we obtain the desired estimate

\[ B\xi^{-2\tau}. \]

To avoid a contradiction in the behavior of both sides of (6), we must have \(\nu-p=0\).

Since the exponential family generated by the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\) is one-parametric under \(H_0\), it follows from the Lehmann—Scheffé theory that \(\varphi\) is a Neyman structure with respect to the statistic

\[ \sum_{1}^{\mu}\frac{l_i^2}{\sigma_i}+\sum_{1}^{\nu}\frac{L_j^2}{d_j}. \]

The forms \(l_i\) are naturally to be chosen in such a way that the indicated statistic is not sufficient under the specified alternative \(H_1\); then the test \(\varphi\) is nontrivial.

In many practical cases the sample consists of \(m\) independent subsamples of sizes \(n_k\) \((k=1,\ldots,m)\), drawn respectively from

\[ N(a_k,\sigma_k^2). \]

Then \(N=\sum_1^m n_k\), and \(\mu,\nu\) must satisfy the condition

\[ \mu+\nu \leq \min n_k . \]

The optimal choice of the forms \(l_1,\ldots,l_\mu,L_1,\ldots,L_\nu\) depends on the particular form of the functions \(T_1\) and \(T_2\) and can be carried out, for example, on the basis of the same considerations as in (2) or (3), if tests based on the \(F\)-distribution are constructed.

The result obtained admits a multivariate generalization. In this case \(T_1\) and \(T_2\) are replaced respectively by the matrices \(Q_1\) and \(Q_2\). Instead of inequality (2), acceptance of the null hypothesis is postulated under the condition

\[ \operatorname{sp} Q_1 Q_2^{-1} \leq \varepsilon . \]

Condition (1) takes the form

\[ |Q_2| > c > 0, \]

if all entries of the matrix \(Q_2\) are bounded.

Received
28 III 1967

REFERENCES

\(^{1}\) Yu. V. Linnik, Statistical Problems with Nuisance Parameters, “Nauka,” 1966.
\(^{2}\) H. Scheffe, Ann. Math. Statistics, 14, No. 1, 35 (1943).
\(^{3}\) E. W. Barankin, Proc. Berkeley Symp. on Math. Stat. and Prob., 1949.

Submission history

UDC 519.28