V. A. MEDVEDEV
Let it be required to find the eigenvalues of the equation
Submitted 1963-01-01 | RussiaRxiv: ru-196301.16064 | Translated from Russian

Abstract

Full Text

V. A. MEDVEDEV

APPLICATION OF THE METHOD OF LEAST SQUARES TO EIGENVALUE PROBLEMS

(Presented by Academician G. I. Petrov on 25 I 1963)

Let it be required to find the eigenvalues of the equation

\[ A(\lambda)\varphi=0, \tag{1} \]

i.e., the values \(\lambda\) for which equation (1) has a nontrivial solution; \(A(\lambda)\) is a linear operator depending on the complex parameter \(\lambda\), with domain of definition \(D_A\) in a Hilbert space \(H_1\) and range in a Hilbert space \(H_2\). We shall assume that for each eigenvalue there exists a neighborhood containing no other eigenvalues, and that, if \(\lambda\) is not an eigenvalue, then it is regular, i.e., the operator \(A^{-1}(\lambda)\), inverse to the operator \(A(\lambda)\), is defined on all of \(H_2\) and is bounded. Denote

\[ \varkappa(\lambda)=\inf_{\varphi\in D_A} \frac{\|A(\lambda)\varphi\|_{H_2}^2}{\|\varphi\|_{H_1}^2}. \]

Obviously, \(\varkappa(\lambda)\geq 0\). If \(\lambda=\lambda_i\) is an eigenvalue, then \(\varkappa(\lambda_i)=0\). If \(\lambda\) is a regular value, then

\[ \varkappa(\lambda)=\frac{1}{\|A^{-1}(\lambda)\|^2}>0, \tag{2} \]

where \(\|A^{-1}(\lambda)\|\) is the norm of the operator \(A^{-1}(\lambda)\). Consequently, the characteristic equation for the eigenvalues will be \(\varkappa(\lambda)=0\).

Take a system of linearly independent vectors \(\psi_k\in D_A\) \((k=1,2,\ldots)\), complete in the following sense: for arbitrary fixed \(\lambda\), \(\varphi\in D_A\), and \(\varepsilon>0\) there exists a linear combination \(\sum_{k=1}^{m} a_k\psi_k\) such that

\[ \left\|\varphi-\sum_{k=1}^{m}a_k\psi_k\right\|_{H_1}^{2} + \left\|A(\lambda)\left(\varphi-\sum_{k=1}^{m}a_k\psi_k\right)\right\|_{H_2}^{2} <\varepsilon. \]

A system complete in the indicated sense will henceforth be called \(A\)-complete. Denote

\[ \varkappa_n(\lambda)= \min_{\{a_k\}} \frac{ \left\|A(\lambda)\left(\sum_{k=1}^{n}a_k\psi_k\right)\right\|_{H_2}^{2} }{ \left\|\sum_{k=1}^{n}a_k\psi_k\right\|_{H_1}^{2} }, \quad (n=1,2,\ldots). \]

Obviously, \(\varkappa_n(\lambda)\geq \varkappa(\lambda)\). By virtue of the \(A\)-completeness of the system \(\{\psi_k\}\),

\[ \lim_{n\to\infty}\varkappa_n(\lambda)=\varkappa(\lambda). \]

In particular, \(\varkappa_n(\lambda_i)\to 0\) as \(n\to\infty\), if \(\lambda_i\) is some eigenvalue. Suppose now that the functions \(\varkappa(\lambda)\), \(\varkappa_n(\lambda)\) are continuous. Take a circle \(|\lambda-\lambda_i|=\rho\) such that in the disk \(|\lambda-\lambda_i|\leq\rho\) there are no eigenvalues other than \(\lambda=\lambda_i\). Since \(\lim_{n\to\infty}\varkappa_n(\lambda_i)=0\), it follows that, beginning with some number, the inequality

\[ \varkappa_n(\lambda_i)< \min_{|\lambda-\lambda_i|=\rho}\varkappa(\lambda), \]

will hold.

and, consequently, for such \(n\) the function \(\varkappa_n(\lambda)\) has a local minimum at some point \(\lambda_{in}\),

\[ |\lambda_{in}-\lambda_i|<\rho . \]

Since \(\rho\) is arbitrarily small, the following is true.

Theorem. For any eigenvalue \(\lambda_i\) of equation (1) there exists a sequence of points \(\lambda_{in}\) at which the corresponding functions \(\varkappa_n(\lambda)\) have a local minimum, \(\lim\limits_{n\to\infty}\lambda_{in}=\lambda_i\), and moreover \(\lim\limits_{n\to\infty}\varkappa_n(\lambda_{in})=0\).

Thus, as approximate eigenvalues one must take those values of \(\lambda\) at which the function \(\varkappa_n(\lambda)\) has a local minimum.

Let us note that if there exists a point \(\lambda_0\) at which the function \(\varkappa(\lambda)\) has a local minimum different from zero, then there exists a convergent sequence of points \(\lambda_{0n}\) at which the corresponding functions \(\varkappa_n(\lambda)\) have a local minimum, although \(\lambda_0=\lim\limits_{n\to\infty}\lambda_{0n}\) is not an eigenvalue. The points \(\lambda_{0n}\) are extraneous solutions, which must be rejected. To do this, it is necessary to verify that \(\lim\limits_{n\to\infty}\varkappa_n(\lambda_{0n})\ne 0\). In practical application of the method such a check may prove impossible; therefore it is important to determine whether the function \(\varkappa(\lambda)\) has local minima different from zero. Suppose that at regular points the operator \(A^{-1}(\lambda)\) is a holomorphic function of \(\lambda\), i.e., in some neighborhood of an arbitrary regular point \(\lambda_0\) it can be represented by the norm-convergent series

\[ A^{-1}(\lambda)=A^{-1}(\lambda_0)+(\lambda-\lambda_0)A_1+(\lambda-\lambda_0)^2A_2+(\lambda-\lambda_0)^3A_3+\cdots . \]

Then the mean-value theorem holds,

\[ A^{-1}(\lambda_0)=\frac{1}{2\pi\rho}\int_C A^{-1}(\lambda)\,ds, \]

where \(C\) is a circle of sufficiently small radius \(\rho\) with center at the point \(\lambda_0\). Hence the inequality follows

\[ \|A^{-1}(\lambda_0)\|\leq \frac{1}{2\pi\rho}\int_C \|A^{-1}(\lambda)\|\,ds, \]

and, consequently, \(\|A^{-1}(\lambda)\|\) cannot have a local maximum at the regular point \(\lambda_0\). From (2) it follows that \(\varkappa(\lambda)\) cannot have a local minimum different from zero.

The equation satisfied by \(\varkappa_n(\lambda)\) has the form

\[ |a_{ij}-\varkappa_n\beta_{ij}|=0, \tag{3} \]

where \(\varkappa_n(\lambda)\) is the smallest root of equation (3), with

\[ a_{ij}=(A(\lambda)\psi_j,A(\lambda)\psi_i)_{H_2}, \qquad \beta_{ij}=(\psi_j,\psi_i)_{H_1}\quad (i,j=1,2,3,\ldots,n). \]

If \(\{\psi_k\}\) is an orthonormal system in \(H_1\), then equation (3) is simplified, and \(\varkappa_n(\lambda)\) is the smallest characteristic number of a Hermitian matrix.

Let us dwell on the choice of an \(A\)-complete system. Suppose the operator \(A(\lambda)\) can be represented in the form

\[ A(\lambda)=T(\lambda)A_0, \]

where \(A_0\) is a linear operator with domain coinciding with \(D_A\), with values in \(H_2\), and having a bounded inverse operator \(A_0^{-1}\); the operator \(T(\lambda)\), defined in \(H_2\), is bounded. Take a system \(\{f_k\}\), complete in \(H_2\). Then the system \(\psi_k=A_0^{-1}f_k\) \((k=1,2,\ldots)\) will be \(A\)-complete. Indeed, let \(\varphi\in D_A\). Then \(f=A_0\varphi\in H_2\), and for arbitrary \(\varepsilon>0\) there will be

such a linear combination \(\sum_{k=1}^{m} a_k f_k\) that

\[ \left\| f-\sum_{k=1}^{m} a_k f_k \right\|_{H_2}^{2}<\varepsilon . \]

Further,

\[ \left\| \varphi-\sum_{k=1}^{m} a_k\psi_k \right\|_{H_1}^{2} = \left\| A_0^{-1}\left(f-\sum_{k=1}^{m} a_k f_k\right) \right\|_{H_1}^{2} < \left\| A_0^{-1}\right\|^{2}\varepsilon, \]

\[ \left\| A(\lambda)\left(\varphi-\sum_{k=1}^{m} a_k\psi_k\right) \right\|_{H_2}^{2} = \left\| T(\lambda)\left(f-\sum_{k=1}^{m} a_k\psi_k\right) \right\|_{H_2}^{2} < \left\| T(\lambda)\right\|^{2}\varepsilon . \]

The last two inequalities prove, by virtue of the arbitrariness of \(\varepsilon\), that the system \(\{\psi_k\}\) is \(A\)-complete. If \(H_1=H_2\), and the system of eigenvectors of the equation

\[ A_0\psi-\lambda\psi=0 \]

is complete in \(H_1=H_2\), then as an \(A\)-complete system one may take the system of eigenvectors of this equation. In some problems such a choice of the system may substantially facilitate the computation of the elements of the determinant in equation (3).

Let us consider in more detail the case when \(H_1=H_2\) and equation (1) has the form

\[ A\varphi-\lambda\varphi=0, \tag{4} \]

where \(A\) is a self-adjoint linear operator having a complete system of eigenvectors \(\{\varphi_k\}\) in \(H_1=H_2\), with the corresponding system of eigenvalues \(\{\lambda_k\}\). In this case, as is known, all \(\lambda_k\) are real, and the system \(\{\varphi_k\}\) may be regarded as orthonormal. For arbitrary \(\varphi\in D_A\) and real \(\lambda\) we obtain, using the closure equation,

\[ \frac{\|A\varphi-\lambda\varphi\|^{2}}{\|\varphi\|^{2}} = \frac{\sum_{k=1}^{\infty} |(A\varphi-\lambda\varphi,\varphi_k)|^{2}}{\|\varphi\|^{2}} = \frac{\sum_{k=1}^{\infty} |(\varphi,A\varphi_k-\lambda\varphi_k)|^{2}}{\|\varphi\|^{2}} = \]

\[ = \frac{\sum_{k=1}^{\infty}(\lambda_k-\lambda)^2 |(\varphi,\varphi_k)|^{2}}{\|\varphi\|^{2}} \ge (\lambda_i-\lambda)^2 \frac{\sum_{k=1}^{\infty}|(\varphi,\varphi_k)|^{2}}{\|\varphi\|^{2}} = (\lambda_i-\lambda)^2, \]

where \(\lambda_i\) is the eigenvalue nearest to \(\lambda\). Putting \(\varphi=\varphi_i\), we obtain

\[ \frac{\|A\varphi_i-\lambda\varphi_i\|^{2}}{\|\varphi\|^{2}} = (\lambda_i-\lambda)^2, \]

and, consequently,

\[ \varkappa(\lambda)=(\lambda_i-\lambda)^2 . \tag{5} \]

The above reasoning repeats the reasoning from work \((^2)\) for a second-order differential equation. In contrast to work \((^2)\), we do not fix \(\lambda\). Since \(\varkappa_n(\lambda)\ge \varkappa(\lambda)\), from (5) we obtain

\[ (\lambda_i-\lambda)^2 \le \varkappa_n(\lambda) \quad\text{or}\quad |\lambda_i-\lambda|\le \sqrt{\varkappa_n(\lambda)} . \]

In particular,

\[ |\lambda_i-\lambda_{in}|\le \sqrt{\varkappa_n(\lambda_{in})}, \tag{6} \]

where \(\lambda_{in}\) is the point at which \(\varkappa_n(\lambda)\) has a local minimum. Since

\[ \lim_{n\to\infty}\varkappa_n(\lambda_{in})=0, \]

simultaneously with an approximate eigenvalue we obtain from formula (6) an estimate of the accuracy.

Research Institute of Mechanics
Moscow State University
named after M. V. Lomonosov

Received
24 I 1963

CITED LITERATURE

  1. S. G. Mikhlin, Direct Methods in Mathematical Physics, Moscow—Leningrad, 1950.
  2. N. M. Krylov, N. N. Bogolyubov, Izv. AN SSSR, OMEN, No. , 471 (1929).

Submission history

V. A. MEDVEDEV