UDC 62.505
CYBERNETICS AND CONTROL THEORY
Submitted 1968-01-01 | RussiaRxiv: ru-196801.98841 | Translated from Russian

Full Text

UDC 62.505

CYBERNETICS AND CONTROL THEORY

R. GABASOV

ON THE THEORY OF NECESSARY CONDITIONS FOR THE OPTIMALITY OF SINGULAR CONTROLS

(Presented by Academician L. S. Pontryagin on 13 III 1968)

1. Statement of the problem. Let the equations of motion be given

\[ \dot{x}_i=f_i(x,u,t), \qquad x_i(t_0)=x_{i0}, \]

where \(x=\{x_1,\ldots,x_n\}\), \(u=\{u_1,\ldots,u_r\}\); \(f_i(x,u,t)\) are defined and continuous on \(X\times U\times T\), together with \(\partial f_i(x,u,t)/\partial x_j\); \(X,U,T\) are the domains of variation of the quantities \(x,u,t\).

It is required, among the piecewise-continuous \(r\)-dimensional vector functions \(u(t)\), defined on \(T=[t_0,t_1]\) with values in the bounded set \(U\), to find such a \(u^0(t)\), \(t\in T\), that gives the functional

\[ I(u)=\varphi(x(t_1)) \]

(\(\varphi(x)\) differentiable on \(X\)) its minimal value.

It is known \((^1)\) that the optimal control \(u^0(t)\) and the corresponding optimal trajectory \(x^0(t)\) satisfy the maximum principle of L. S. Pontryagin

\[ H(x^0(t),\psi^0(t),u^0(t),t)=\sup_{u\in U} H(x^0(t),\psi^0(t),u,t), \]

\[ H(x,\psi,u,t)=\psi_i f_i(x,u,t), \]

\[ \dot{\psi}_i^0(t)=-\frac{\partial f_j(x^0(t),u^0(t),t)}{\partial x_i}\psi_j^0(t), \qquad \psi_i(t_1)=-\frac{\partial\varphi(x^0(t_1))}{\partial x_i}. \]

Let the control \(u^*(t)\), \(t\in T\), be such that the function \(H(x^*(t),\psi^*(t),u,t)\) for \(t\in T\) does not depend on \(u\):

\[ H(x^*(t),\psi^*(t),u,t)-H(x^*(t),\psi^*(t),u^*(t),t)\equiv 0, \qquad t\in T,\qquad u\in U. \]

Such a control \(u^*(t)\) will be called a singular control of the first order.

2. Necessary optimality conditions for a singular control of the first order. Following the scheme of \((^2,^3)\), we construct a formula for the increment of the functional, use a needle variation of the control concentrated on an interval of length \(\varepsilon\), and isolate in the increment of the functional the terms of order \(\varepsilon^2\). As a result we arrive at the following assertion.

Theorem 1. Let the functions \(f_i(x,u,t)\), \(\varphi(x)\) be defined and continuous on \(X\times U\times T\), together with \(\partial f_i(x,u,t)/\partial x_j\), \(\partial\varphi(x)/\partial x_i\), \(\partial^2 f_i(x,u,t)/\partial x_j\partial x_k\), \(\partial^2\varphi(x)/\partial x_i\partial x_j\). Then, for a singular optimal control of the first order \(u^0(t)\), \(t\in T\), the conditions

\[ \psi_j(t)\frac{\partial \Delta_{\nu} f_j(x^0(t),u^0(t),t)}{\partial x_i}\Delta_x f_i(x^0(t),u^0(t),t) + \]

\[ +\,[M_{ji}(t)+M_{ij}(t)]\Delta_x f_i(x^0(t),u^0(t),t)\Delta_x f_j(x^0(t),u^0(t),t)\leq 0, \qquad (1) \]

\[ t\in T,\qquad \nu\in U; \]

\[ \dot\psi_i(t)=-\frac{\partial f_j(x^0(t),u^0(t),t)}{\partial x_i}\psi_j(t),\qquad \psi_i(t_1)=-\frac{\partial\varphi(x^0(t_1))}{\partial x_i}; \]

\[ \begin{gathered} \dot M_{ij}(t)=-M_{ik}(t)\frac{\partial f_k(x^0(t),u^0(t),t)}{\partial x_j} -M_{kj}(t)\frac{\partial f_k(x^0(t),u^0(t),t)}{\partial x_i}\\ -\frac12\psi_k(t)\frac{\partial^2 f_k(x^0(t),u^0(t),t)}{\partial x_i\partial x_j},\qquad M_{ij}(t_1)=-\frac12\frac{\partial^2\varphi(x^0(t_1))}{\partial x_i\partial x_j};\\ \Delta_v f_i(x,u,t)=f_i(x,v,t)-f_i(x,u,t). \end{gathered} \]

3. Singular controls of second order. A control \(u^*(t)\), \(t\in T\), is called a singular control of second order if identically with respect to \(t\in T\), \(v\in U\),

\[ \psi_j(t)\Delta_v f_j(x^*(t),u^*(t),t)\equiv 0, \]

\[ \psi_j(t)\frac{\partial\Delta_v f_j(x^*(t),u^*(t),t)}{\partial x_i} +\bigl[M_{ij}(t)+M_{ji}(t)\bigr]\Delta_v f_j(x^*(t),u^*(t),t)\equiv 0. \]

To obtain necessary conditions for optimality of singular controls of second order in the increment of the functional caused by a needle variation of the control, we isolate the terms of order \(\varepsilon^3\).

Theorem 2. Let \(f_i(x,u,t)\), \(\varphi(x)\) be defined and continuous on \(X\times U\times T\), together with
\(\partial f_i(x,u,t)/\partial x_j\), \(\partial\varphi(x)/\partial x_i\),
\(\partial^2 f_i(x,u,t)/\partial x_j\partial x_k\),
\(\partial^2\varphi(x)/\partial x_i\partial x_j\),
\(\partial^3 f_i(x,u,t)/\partial x_j\partial x_k\partial x_l\),
\(\partial^3\varphi(x)/\partial x_i\partial x_j\partial x_k\).

If \(u^0(t)\), \(t\in T\), is a singular optimal control of second order, then for all \(t\in T\), \(v\in U\) the conditions

\[ \begin{aligned} &\frac12\psi_k(t)\frac{\partial^2\Delta_v f_k^0(t)}{\partial x_i\partial x_j} \,\Delta_v f_i^0(t)\Delta_v f_j^0(t)+\\ &\quad+\bigl[M_{ik}(t)\partial\Delta_v f_k^0(t)/\partial x_j +M_{kj}(t)\partial\Delta_v f_k^0(t)/\partial x_i\bigr]\Delta_v f_i^0(t)\Delta_v f_j^0(t)+\\ &\quad+\bigl[N_{ijk}(t)+N_{ikj}(t)+N_{kji}(t)\bigr]\Delta_v f_i^0(t)\Delta_v f_j^0(t)\Delta_v f_k^0(t)\leqslant 0; \end{aligned} \tag{2} \]

\[ \dot\psi_i(t)=-\frac{\partial f_j^0(t)}{\partial x_i}\psi_j(t),\qquad \psi_i(t_1)=-\frac{\partial\varphi(x^0(t_1))}{\partial x_i}; \]

\[ \dot M_{ij}(t)=-M_{ik}(t)\frac{\partial f_k^0(t)}{\partial x_j} -M_{kj}(t)\frac{\partial f_k^0(t)}{\partial x_i} -\frac12\psi_k(t)\frac{\partial^2 f_k^0(t)}{\partial x_i\partial x_j}; \]

\[ M_{ij}(t_1)=-\frac12\frac{\partial^2\varphi(x^0(t_1))}{\partial x_i\partial x_j}; \]

\[ \begin{aligned} \dot N_{ijk}(t)=&-N_{ijl}(t)\frac{\partial f_l^0(t)}{\partial x_k} -N_{ilk}(t)\frac{\partial f_l^0(t)}{\partial x_j} -N_{ljk}(t)\frac{\partial f_l^0(t)}{\partial x_i}\\ &-\frac12 M_{il}(t)\frac{\partial^2 f_l^0(t)}{\partial x_k\partial x_j} -\frac12 M_{lj}(t)\frac{\partial^2 f_l^0(t)}{\partial x_k\partial x_i} -\frac16\psi_l(t)\frac{\partial^3 f_l^0(t)}{\partial x_i\partial x_j\partial x_k}; \end{aligned} \]

\[ N_{ijk}(t_1)=-\frac16\frac{\partial^3\varphi(x^0(t_1))}{\partial x_i\partial x_j\partial x_k}. \]

Here \(f_i^0(t)=f_i(x^0(t),u^0(t),t)\).

4. The case of a convex control domain. Let the set \(U\) be convex, and let the functions \(f_i(x,u,t)\), \(\partial f_i(x,u,t)/\partial x_j\) be differentiable with respect to \(u\). Then in Theorem 1 assertion (1) may be replaced by the following:

\[ p_{kl}(t)[v_k-u_k^0(t)][v_l-u_l^0(t)]\leqslant 0,\qquad t\in T,\quad v\in U; \]

\[ p_{kl}(t)=\psi_j(t)\frac{\partial^2 f_j^0(t)}{\partial x_i\partial u_k}\frac{\partial f_i^0(t)}{\partial u_l} +\bigl[M_{li}(t)+M_{ji}(t)\bigr]\frac{\partial f_j^0(t)}{\partial u_k}\frac{\partial f_i^0(t)}{\partial u_l}. \tag{3} \]

If, in addition, \(\partial^2 f_i(x,u,t)/\partial x_j\partial x_k\) are differentiable with respect to \(u\), then inequality (2) of Theorem 2 takes the form

\[ q_{lms}(t)[v_l-u_l^0(t)][v_m-u_m^0(t)][v_s-u_s^0(t)]\leqslant 0,\qquad t\in T,\quad v\in U; \tag{4} \]

\[ \begin{aligned} q_{lms}(t)=&\frac12\psi_k(t)\frac{\partial^3 f_k^0(t)}{\partial x_i\partial x_j\partial u_l} \frac{\partial f_i^0(t)}{\partial u_m} \frac{\partial f_j^0(t)}{\partial u_s}\\ &+\left[M_{ik}(t)\frac{\partial^2 f_k^0(t)}{\partial x_j\partial u_l} +M_{kj}(t)\frac{\partial^2 f_k^0(t)}{\partial x_i\partial u_l}\right] \frac{\partial f_i^0(t)}{\partial u_m} \frac{\partial f_j^0(t)}{\partial u_s}\\ &+\bigl[N_{ijk}(t)+N_{ikj}(t)+N_{kji}(t)\bigr] \frac{\partial f_i^0(t)}{\partial u_l} \frac{\partial f_j^0(t)}{\partial u_m} \frac{\partial f_k^0(t)}{\partial u_s}. \end{aligned} \]

  1. The case of an open control domain. Let \(u^0(t)\) be an interior point of the set \(U\). Then, from property (3) of the optimal control, it follows that the quadratic form \(p_{kl}(t)u_k u_l\), \(t \in T\), is nonpositive; from (4), that the form \(q_{klm}(t)u_k u_l u_m\), \(t \in T\), is nonpositive.

Singular controls for an open set \(U\) were studied in \((^{4-7})\). The necessary optimality conditions obtained above differ from the known ones both in form and in the method of their derivation. The variations of controls used in \((^{4-7})\) essentially presuppose that \(U\) is an open set.

Ural Polytechnic Institute
named after S. M. Kirov

Received
10 III 1968

REFERENCES

\(^{1}\) L. S. Pontryagin, V. G. Boltyanskii et al., The Mathematical Theory of Optimal Processes, Moscow, 1961.
\(^{2}\) L. I. Rozonoer, Automation and Telemechanics, 20, Nos. 10–12 (1959).
\(^{3}\) R. Gabasov, Automation and Telemechanics, 28, No. 8 (1967).
\(^{4}\) G. Kelley, Rocket Technology and Astronautics, No. 8 (1964).
\(^{5}\) G. Robbins, Rocket Technology and Astronautics, No. 6 (1965).
\(^{6}\) R. Kopp, G. Moyer, Rocket Technology and Astronautics, No. 8 (1965).
\(^{7}\) I. B. Vapnyarskii, USSR Computational Mathematics and Mathematical Physics, 7, No. 2 (1967).

Submission history

UDC 62.505