Abstract
Full Text
MATHEMATICS
V. A. YAKUBOVICH
SOLUTION OF AN ALGEBRAIC PROBLEM ENCOUNTERED IN CONTROL THEORY
(Presented by Academician V. I. Smirnov on 17 XII 1969)
1°. Suppose that a system of differential equations is given
[
dx/dt=Ax+b\xi
\tag{1}
]
and a Hermitian form (\mathfrak F(x,\xi)). In (1), (A, b) are complex or real constant matrices of orders (n\times n), (n\times m), and the vectors (x,\xi) have orders (n,m), respectively. It is assumed that the system (1) is controllable, i.e., that the rank of the (n\times mn) matrix (|b, Ab,\ldots,A^{n-1}b|) is equal to (n). Consider the problem of representing the form (\mathfrak F(x,\xi)) in the form*
[
\mathfrak F(x,\xi)=\frac{\partial}{\partial t}(x^Hx)+(\xi-h^x)^\Gamma(\xi-h^x),\quad \forall x,\xi,
\tag{2}
]
where (H=H^, h, \Gamma=\Gamma^) are certain constant matrices of orders respectively (n\times n), (n\times m), (m\times m), and the derivative is taken by virtue of system (1), i.e.
[
\frac{d}{dt}(x^Hx)=2\operatorname{Re}[x^H(Ax+b\xi)].
]
It is obvious that the matrix (\Gamma) is determined at once from (2) by the relation (\mathfrak F(0,\xi)=\xi^\Gamma\xi), which we shall assume to be satisfied. The problem consists in determining conditions for the existence of the representation (2) and in finding a convenient procedure for determining matrices (H=H^, h) satisfying relation (2). The solution of this problem, for special (\mathfrak F(x,\xi)), reduces to a number of problems in the theory of absolute stability (see, for example, ((^1))) and in the theory of optimal control (synthesis of optimal control in problems of minimizing quadratic functionals for linear stationary systems, i.e., in problems considered first by A. M. Letov—see, for example, ((^{2-5}))). All these problems pertain either to the case (m=1), or to the case (m>1), (\Gamma\geqslant 0) (for (m>1), (\Gamma\leqslant 0)**).
There are many works in which, under various special assumptions, the indicated problem of the possibility of representation (2) is solved. The sign-definiteness property of the matrix (\Gamma), or the condition (m=1), is then used essentially in the proofs.*** A number of problems in the theory of differential games lead to the problem of representation (2) with (m>1), (\Gamma\not\geqslant 0). Below a solution is given for the formulated problem under certain nonrestrictive assumptions excluding, in the space of coefficients of the form (\mathfrak F(x,\xi)) and the system (1), a set of measure zero. The method of proof is substantially different from ((^1,{}^9)) and others, and makes it possible to consider the case (\Gamma\geqslant 0), which is not amenable to treatment by the known methods ((^1,{}^9)) and others. It is important to note that for (m>1) the procedure obtained below for determining the matrices
* Here and below the asterisk denotes Hermitian conjugation. By (I_n) below is denoted the identity (n\times n) matrix.
** The notation (\Gamma\geqslant 0) ((\Gamma>0)) means that (\Gamma=\Gamma^) and (\xi^\Gamma\xi\geqslant 0) for all (\xi) ((\xi^\Gamma\xi>0) for all (\xi\ne0)). The inequalities (\Gamma\leqslant 0), (\Gamma<0) are defined analogously. The notation (\Gamma\not\geqslant 0) means that the form (\xi^\Gamma\xi) assumes both positive and negative values.
*** For (m=1), solutions of related problems were first obtained in ((^{6-8})). For (m>1), (\Gamma\geqslant 0), a solution of this problem under certain special assumptions (not, however, restricting generality) was given without proof in ((^1)) (Theorem 3.4) and in the general case with proof in ((^9)).
(h, H) is substantially more convenient than those previously known. For example, it does not require factorization of an (m \times m)-matrix polynomial.
(2^\circ). Let (\lambda) be complex, (\omega) a real variable, and let (X(\lambda)) be some matrix or scalar polynomial (X(\lambda)=X_0\lambda^N+\cdots+X_N). Below, by (X(\lambda)^\nabla) we shall denote the polynomial (X(\lambda)^\nabla=[X(-\bar\lambda)]^=X_0^(-\lambda)^N+\cdots+X_N^). Obviously, (X(i\omega)^\nabla=X(i\omega)^), ([X(\lambda)Y(\lambda)]^\nabla=Y(\lambda)^\nabla X(\lambda)^\nabla), ([\det X(\lambda)]^\nabla=\det[X(\lambda)^\nabla]).
Introduce the following notation:
[
A_\lambda=\lambda I_n-A,\quad \delta(\lambda)=\det A_\lambda,\quad
Q(\lambda)=A_\lambda^{-1}\delta(\lambda),\quad
r(\lambda)=
\left|\begin{array}{cc}
Q(\lambda)& b\
\delta(\lambda)& I_m
\end{array}\right|,
\tag{3}
]
[
\mathfrak F(A_{i\omega}^{-1}b\xi,\xi)=\xi^*\Pi(i\omega)\xi,\quad
\Phi(i\omega)=|\delta(i\omega)|^2\Pi(i\omega).
]
Here (\Pi(i\omega)=\Pi(i\omega)^) is the matrix of order (m) of the Hermitian form (\mathfrak F(A_{i\omega}^{-1}b\xi,\xi)). Since (\mathfrak F(0,\xi)=\xi^\Gamma\xi), we have (\Pi(\infty)=\Gamma). From (3) we have
[
\xi^\Pi(i\omega)\xi=\mathfrak F(\delta^{-1}Qb\xi,\delta^{-1}\delta\xi)
=|\delta(i\omega)|^{-2}r(i\omega)^Fr(i\omega),
]
where (F=F^*) is the matrix of the form (\mathfrak F). Therefore
[
\Phi(\lambda)=r(\lambda)^\nabla F r(\lambda).
]
Thus, (\Phi(\lambda)) is a matrix polynomial with leading term ((-1)^n\lambda^{2n}\Gamma), and (\Phi(\lambda)^\nabla=\Phi(\lambda)). Below, for brevity, the argument (\lambda) is sometimes omitted.
Lemma 1. The following formulas hold:
[
\det\Phi=\delta^{m-1}(\delta^\nabla)^{m-1}\det\Gamma\cdot\varphi,\quad
\Phi^{-1}=\Omega/(\delta\delta^\nabla\varphi),
]
where (\varphi,\Omega) are scalar and matrix polynomials;
[
\varphi=\varphi^\nabla;
]
[
\Omega=\Omega^\nabla,\quad
\varphi=(-1)^n\lambda^{2n}[1+O(\lambda^{-1})],
]
[
\Omega=(-1)^n\Gamma^{-1}\lambda^{2n}[I_m+O(\lambda^{-1})]\quad \text{as }\lambda\to\infty.
]
Let (\alpha(\lambda), \beta(\lambda)) be polynomials. By (\operatorname{rem}(\beta/\alpha)) below is denoted the polynomial that is the remainder obtained upon division of (\beta(\lambda)) by (\alpha(\lambda)). If (B(\lambda)=|\beta_{jk}(\lambda)|) is a matrix polynomial, then
[
\operatorname{rem}(B/\alpha)=|\operatorname{rem}(\beta_{jk}/\alpha)|.
]
By (\langle \alpha,\beta\rangle) below is denoted the greatest common divisor of the polynomials (\alpha(\lambda)) and (\beta(\lambda)), and by (\langle\alpha,B\rangle), the greatest common divisor of the polynomials (\alpha(\lambda)) and all (\beta_{jk}(\lambda)).
Theorem. Suppose that the system (1) is controllable and that (\det\Gamma\ne0).
I. Let (2) hold. Then: a) the polynomial (\varphi(\lambda)) admits the factorization
[
\varphi(\lambda)=\psi(\lambda)\psi(\lambda)^\nabla,
]
where (\psi(\lambda)=\det(\lambda I_n-A-bh^)), and, consequently, either (\varphi(i\omega)=0,\ \forall\omega), or all purely imaginary zeros of the polynomial (\varphi(\lambda)) have even multiplicity; b) the following is fulfilled
[
h^q(\lambda)=\delta(\lambda)\Omega_0(\lambda),\quad \forall\lambda,\qquad
q(\lambda)=\operatorname{rem}(Qb\Omega/\psi\delta),\quad
\Omega_0(\lambda)=\operatorname{rem}(\Omega/\psi).
\tag{4}
]
II. Let
[
\langle\delta,\delta^\nabla\rangle=1,\quad
\langle\varphi,\delta\delta^\nabla\rangle=1,\quad
\varphi(i\omega)\ne0,\ \forall\omega.
]
Let (\psi(\lambda)) be an arbitrary polynomial satisfying the factorization relation
[
\varphi(\lambda)\equiv\psi(\lambda)\psi(\lambda)^\nabla,\quad \forall\lambda
]
and such that
[
\langle\psi,\psi^\nabla\rangle=1.
]
Let (h) be an arbitrary (n\times m)-matrix satisfying the identity (4) or, otherwise, such that
[
(I_m-h^A_\lambda^{-1}b)\Omega(\lambda)/\psi(\lambda)
]
is a polynomial. Let (G=G^) be the matrix of the Hermitian form (\mathfrak F(x,0)) and let (H=H^) be the matrix determined from the linear system
[
HA+A^H=G-h\Gamma h^.
]
Then the identity (2) holds.
Remark. Suppose that the conditions
[
\langle\delta,\delta^\nabla\rangle=1,\quad \langle\varphi,\delta\delta^\nabla\rangle=1
]
are not fulfilled. Make in (1) the substitution
[
\xi=\xi_1+a^x,
]
where (a) is some (n\times m)-matrix. Then we obtain a new system
[
dx/dt=A_1x+b\xi_1,
]
where
[
A_1=A+ba^,
]
and a new form
[
\mathfrak F_1(x,\xi_1)=\mathfrak F(x,\xi).
]
It can be shown that the new polynomial (\varphi(\lambda)) coincides with the old one. With a suitable choice of (a), the polynomial (\delta), as is known, can be made arbitrary (with leading term (\lambda^n)). Therefore, for the new system, with a suitable choice of (a), the conditions
[
\text{ It can be shown that } Qb\Omega=\delta q_1,
]
where (q_1) is a polynomial. Therefore
[
q(\lambda)=\operatorname{rem}(\delta q_1/\delta\psi)=\delta q_0,
]
where
[
q_0=\operatorname{rem}(q_1/\psi),
]
and the identity (4), from which the matrix (h) is determined, has the form
[
h^q_0(\lambda)=\Omega_0(\lambda),\quad \forall\lambda.
]
(\langle \delta,\delta^{\nabla}\rangle=1), (\langle \varphi,\delta\delta^{\nabla}\rangle=1) will be satisfied. Thus, if representation (2) holds and (\varphi(i\omega)\ne0), (\forall\omega), then it will be found by means of the indicated substitution and the use of the procedure indicated in part II of the theorem. We note that, after equating the coefficients of equal powers of (\lambda), the equation (h^{*}q(\lambda)=\delta(\lambda)\Omega_{0}(\lambda)) becomes a system of (2nm^{2}) linear equations with respect to (mn) elements of the matrix (h). It can be shown that, for (\varphi(i\omega)\ne0), (\forall\omega), this system always has a solution. Therefore, for the existence of matrices (h,H) satisfying relation (2), it is sufficient that (\varphi(i\omega)\ne0), (\forall\omega).
(3^\circ). Proofs.
Lemma 2. Let (\Sigma(\lambda)=c_{0}^{}Q(\lambda)b_{0}+\Gamma_{0}\delta(\lambda)), where (b_{0},c_{0},\Gamma_{0}) are arbitrary (n\times m)-, (n\times m)- and (m\times m)-matrices. Then (\det\Sigma(\lambda)=\delta(\lambda)^{m-1}\varphi_{0}(\lambda)), where (\varphi_{0}(\lambda)) is a polynomial of degree not exceeding (n).*
Put
[
U=
\left|
\begin{array}{cc}
I_n; & -A_\lambda^{-1}b_0\
c_0^{}; & \Gamma_0
\end{array}
\right|,
\qquad
V=
\left|
\begin{array}{cc}
I_n; & A_\lambda^{-1}b_0\
0; & I_m
\end{array}
\right|,
\qquad
W=
\left|
\begin{array}{cc}
A_\lambda+b_0c_0^{}; & -b_0(\Gamma_0-I_m)\
c_0^{*}; & \Gamma_0
\end{array}
\right|.
]
We have (UV=|R_{jh}|), (j,h=1,2), where (R_{11}=I_n), (R_{12}=0), (R_{22}=\delta\Sigma). Therefore
[
\det(\delta\Sigma)=\det(UV)=\det(VU).
]
Since (VU=\operatorname{diag}(A_\lambda^{-1},I_m)\cdot W), it follows that
[
\det(\delta\Sigma)=\det A_\lambda^{-1}\det W.
]
Obviously, (\varphi_{0}=\det W) is a polynomial of degree (\le n).
Corollary. For (\Gamma_{0}=I_m) we obtain the well-known formula
[
\det(A_\lambda+b_0c_0^{})
=
\det A_\lambda\cdot\det(I_m+c_0^{}A_\lambda^{-1}b_0).
]
Lemma 3. Every minor of order (k) ((k\le m)) of the matrix (\Sigma(\lambda)) defined in Lemma 2 is divisible by (\delta^{k-1}), and the quotient is a polynomial of degree not exceeding (n).
Indeed, the matrix (\Sigma') of the indicated minor has the form (\Sigma'=e_1^{}\Sigma e_2), where (e_1,e_2) are constant matrices of order (m\times k). Applying Lemma 2 to the matrix
[
\Sigma'=c_1^{}Q(\lambda)b_1+\Gamma_1\delta(\lambda),
]
we obtain the required assertion.
Lemma 4. Every minor of order (k) ((k\le m)) of the matrix (\Phi(\lambda)) is divisible by (\delta^{k-1}(\delta^{\nabla})^{k-1}), and the quotient is a polynomial of degree not exceeding (2n).
Let
[
1\le j_1<\cdots<j_k\le m;\qquad
1\le h_1<\cdots<h_k\le n;\qquad
j=(j_1,\ldots,j_k),\quad h=(h_1,\ldots,h_k);
]
(\Phi_{jh}) be the matrix of the corresponding minor of the matrix (\Phi). From the formula (\Phi=r^{\nabla}Fr) we have
[
\det\Phi_{jh}
=
\sum (\det r_{\mu j})^{\nabla}\det F_{\mu\nu}\cdot\det r_{\nu h},
]
where (\mu,\nu) are analogous sets of (k) indices, and (r_{\mu j},F_{\mu\nu}) are analogous (k\times k)-submatrices of the matrices (r,F). Obviously, (r_{\mu\nu}) has the form
[
r_{\mu\nu}=c_{\mu\nu}^{*}Q(\lambda)b_{\mu\nu}+\Gamma_{\mu\nu}\delta(\lambda),
]
where (b_{\mu\nu},c_{\mu\nu},\Gamma_{\mu\nu}) are certain constant (n\times k)-, (n\times k)- and (k\times k)-matrices. By Lemma 2,
[
\det r_{\mu\nu}=\delta^{k-1}\varphi_{\mu\nu},
]
where (\varphi_{\mu\nu}) is a polynomial of degree not exceeding (n). Hence the assertion of Lemma 4 follows.
Lemma 5. Let system (1) be controllable, (\langle\delta,\delta^{\nabla}\rangle=1), and let (h) be an arbitrary (m\times n)-matrix satisfying the factorization relation
[
\Phi(\lambda)=\Psi^{\nabla}\Gamma\Psi,
]
where
[
\Psi=\delta(\lambda)I_m-h^{}Q(\lambda)b.
]
Let the matrix (H=H^{}) be determined from the equation
[
HA+A^{}H=G-h\Gamma h^{},
]
where (G=G^{}) is the matrix of the form (\mathfrak{F}(x,0)). Then identity (2) is fulfilled.*
Denote the form on the right-hand side of (2) by (\mathfrak{H}(x,\xi)), put
[
\mathfrak{H}(x,\xi)-\mathfrak{F}(x,\xi)=\mathfrak{F}{1}(x,\xi),
]
and show that (\mathfrak{F}(x,\xi)\equiv0). We have
[
\mathfrak{F}{1}(x,0)\equiv0,\qquad
\mathfrak{F}(0,\xi)\equiv0.
]
Therefore
[
\mathfrak{F}{1}(x,\xi)=\operatorname{Re}(\xi^{}f^{}x),
]
where (f) is some (n\times m)-matrix. The identity
[
\Phi(i\omega)=\Psi(i\omega)^{}\Gamma\Psi(i\omega)
]
means that (\mathfrak{F}_{1}(x,\xi)\equiv0) for
[
Ax+b\xi=i\omega x.
]
Put
[
\chi(\lambda)=\xi^{}fAb\xi,}^{-1
]
where (\xi) is an arbitrary fixed complex vector. We have
[
\operatorname{Re}\chi(i\omega)=0,\qquad \forall\omega.
]
By the symmetry principle, the poles of (\chi(\lambda)) are located symmetrically with respect to the imaginary axis. On the other hand, they can only be eigenvalues of the matri-
matrices (A), among which there are none situated symmetrically with respect to the imaginary axis, by virtue of the condition (\langle\delta,\delta^\nabla\rangle=1). Therefore (\chi(\lambda)) has no poles. Consequently, (\chi(\lambda)=\mathrm{const}), and, since (\chi\to0) as (\lambda\to\infty), we have (\chi(\lambda)\equiv0). In view of the arbitrariness of (\xi), we have (f^*A_\lambda^{-1}b=0). From the controllability of (1) it follows that (f=0), (\mathfrak F_1(x,\xi)=0).
Proof of Lemma 1. From Lemma 4 it follows that (\varphi) is a polynomial. Since (\Phi=(-1)^n\lambda^{2n}\Gamma+\cdots), it follows that (\det\Phi=(-1)^{mn}\lambda^{2mn}\det\Gamma\times[1+O(\lambda^{-1})]). Hence (\varphi=(-1)^n\lambda^{2n}[1+O(\lambda^{-1})]).
Let (\Omega=\Phi^{-1}\delta\delta^\nabla\varphi). We have* (\Omega=\Phi_{\mathrm{pr}}/[\delta^{m-2}(\delta^\nabla)^{m-2}\det\Gamma]). By Lemma 4, (\Omega) is a polynomial. From (3), (\Omega=\Pi^{-1}\varphi). Therefore, as (\lambda\to\infty), we have (\Omega/\lambda^{2n}\to(-1)^n\Gamma), i.e. (\Omega=\lambda^{2n}(-1)^n\Gamma[1+O(\lambda^{-1})]). The relations (\varphi=\varphi^\nabla), (\Omega=\Omega^\nabla) follow from (\Phi=\Phi^\nabla).
Proof of the theorem. I. Denote (\Psi(\lambda)=\delta I_m-h^Qb). According to the corollary to Lemma 2,
(\det(\lambda I_n-A-bh^)=\delta\det(I_m-h^A^{-1}b)=\delta\det(\delta^{-1}\Psi)). Therefore (\det\Psi=\delta^{m-1}\psi), where (\psi=\det(\lambda I-A-bh^)). For (Ax+b\xi=i\omega x), from (2) we have
(\mathfrak F(A_{i\omega}^{-1}b\xi,\xi)=\xi^\Psi(i\omega)^\Gamma\Psi(i\omega)\xi/|\delta(i\omega)|^2). From (3),
(\mathfrak F(A_\omega^{-1}b\xi,\xi)=\xi^\Phi(i\omega)\xi/|\delta(i\omega)|^2). Therefore (\Phi(i\omega)=\Psi(i\omega)^\times\Gamma\Psi(i\omega)), (\Phi(\lambda)=\Psi(\lambda)^\nabla\Gamma\Psi(\lambda)), (\det\Phi=\delta^{m-1}(\delta^\nabla)^{m-1}\psi\psi^\nabla\det\Gamma). Comparing with the expression for (\det\Phi) in Lemma 1, we obtain (\varphi=\psi\psi^\nabla), whence assertion a) follows. By Lemma 3, (\Phi_{\mathrm{pr}}=\delta^{m-2}Z), where (Z) is a polynomial. From (\Phi=\Psi^\nabla\Gamma\Psi) we have (\Phi_{\mathrm{pr}}=\Psi_{\mathrm{pr}}\Gamma_{\mathrm{pr}}\Psi^\nabla), (\Psi\Phi^{-1}\det\Phi=\Gamma_{\mathrm{pr}}\Psi^\nabla\det\Psi). Substituting into the latter equality the values of (\Phi^{-1}), (\det\Phi) from Lemma 1, and also (\det\Psi=\delta^{m-1}\psi) and (\Psi^\nabla=\delta^{\nabla\,m-2}Z_{\mathrm{pr}}), we obtain (\Psi\Omega=\Gamma^{-1}Z^\nabla\delta\psi), (h^*Qb\Omega\equiv\delta\Omega\pmod{(\psi\delta)}). This implies (4), i.e. c).
II. Let (h) satisfy (4) and (\varphi=\psi\psi^\nabla). Put again (\Psi=\delta I_m-h^*Qb). From (4) we obtain (\Psi\Omega=Z_0\varphi\delta), where (Z_0) is a matrix (m\times m) polynomial. Denote (\langle\psi,Z_0^\nabla\rangle=\psi_0), (Z_0^\nabla=Z_1\psi_0), (\psi=\psi_0\psi_1), where (Z_1,\psi_1) are polynomials, (\langle\psi_1,Z_1\rangle=1). Since (\Omega=\Omega^{\nabla\nabla}) (Lemma 1), from (\Psi\Omega=Z_1^{\nabla\nabla}\psi_0^\nabla\psi\delta) we have (\Psi\Omega\Psi^\nabla=\Psi Z_1\psi_0\psi\,\delta^\nabla). Substituting the values of (\Psi\Omega) and (\psi=\psi_0\psi_1), we obtain (Z_1^\nabla\Psi^\nabla\psi_1\delta=\Psi Z_1\psi_1\delta^\nabla). By the condition (\langle\psi_1,\psi_1^\nabla\rangle=1), (\langle\psi_1,\delta^\nabla\rangle=1), (\langle\delta,\delta^\nabla\rangle=1). Therefore (\Psi Z_1=Y\psi_1\delta), where (Y) is a polynomial, and (Y^\nabla=Y). Consequently, (\Psi\Omega\Psi^\nabla=Y\psi\psi^\nabla\delta\delta^\nabla). Since (\deg\Omega=2n) (Lemma 1), (\deg\Psi=\deg\psi=\deg\delta=n), we have (\deg Y=0), i.e. (Y=\mathrm{const}). Since, as (\lambda\to\infty), (\Psi/\delta\to I_m), (\Omega/\varphi\to\Gamma^{-1}), it follows that (Y=\Gamma^{-1}), and (\Psi\Omega\Psi^\nabla=\Gamma^{-1}\varphi\delta\delta^\nabla). Passing in this relation to inverse matrices and substituting (\Omega^{-1}=\Phi/\delta\delta^\nabla\varphi), we obtain that (\Psi) satisfies the factorization relation (\Phi=\Psi^\Delta\Gamma\Psi). Applying Lemma 5, we obtain the assertion of the theorem.
Leningrad State University
named after A. A. Zhdanov
Received
24 XII 1969
CITED LITERATURE
- F. R. Gantmacher, V. A. Yakubovich, Proc. II All-Union Congress on Theoretical and Applied Mechanics, “Nauka,” 1965.
- A. M. Letov, Automation and Telemechanics, 21, No. 5 (1960).
- R. J. Kalman, Basic Eng. (Trans. ASME, part D), 1 (1964).
- M. Athans, P. Falb, Optimal Control, Moscow, 1968.
- N. N. Krasovskii, Theory of Motion Control, “Nauka,” 1968.
- V. A. Yakubovich, DAN, 143, No. 6 (1962).
- R. J. Kalman, Proc. Nat. Acad. Sci. U. S. A., 49, 201 (1963).
- K. R. Mayer, Proc. Nat. Acad. Sci. U. S. A., 53, 501 (1965).
- V. M. Popov, Hiperstabilitatea Sistemelor Automate, România, 1966.
* By (\Phi_{\mathrm{pr}}) here and below is denoted the adjugate matrix: (\Phi_{\mathrm{pr}}=\Phi^{-1}\det\Phi). By (\deg\Phi) below is denoted the degree of the polynomial (\Phi=\Phi(\lambda)).