MATHEMATICS

L. A. PETROSYAN

Submitted 1965-01-01 | RussiaRxiv: ru-196501.59488 | Translated from Russian

Full Text

MATHEMATICS

L. A. PETROSYAN

DIFFERENTIAL SURVIVAL GAMES WITH MANY PARTICIPANTS

(Presented by Academician Yu. V. Linnik, 8 X 1964)

The games considered are models of pursuit in a certain convex set $S$ in the plane with several participants: a group of pursuers $P=\{P_1,\ldots,P_n\}$ and pursued objects $E_1,\ldots,E_n$, which move in the set $S$ with constant linear speeds and have the ability at each moment of time to change the direction of their motion. A pursued object $E_j$ is considered caught as soon as there is an $i\in\{1,2,\ldots,n\}$ such that the points $P_i$ and $E_j$ coincide. Each of the pursued objects is interested in reaching the boundary of the set $S$ before being caught by one of the pursuers.

In what follows we shall use the definitions and notation of note $(^1)$.

1. The game $\Gamma(n,1)$. The game $\Gamma(n,1)$ is an antagonistic game of two players: a group of pursuers $P=\{P_1,\ldots,P_n\}$ and a pursued object $E$. In the plane there is given a certain closed convex set $S$. Points $\xi^1,\ldots,\xi^n,\eta\in S$, called the initial positions of the pursued $P_1,\ldots,P_n$ and of the pursued object $E$, and $n+1$ numbers $v_1,\ldots,v_n,u$, $v_i>u$ for all $i=1,2,\ldots,n$, are given.

The structure of the game is specified by the system of equations

\[ \dot{x}^{\,i}=\varphi^i[\psi(x^1,x^2,x^3,\ldots,x^n,t)], \]

\[ \dot{y}=\psi(x^1,x^2,\ldots,x^n,t),\quad i=1,2,\ldots,n, \]

with initial conditions $x^i(0)=\xi^i,\ y(0)=\eta,\ i=1,2,\ldots,n$.

The set $\mathfrak{E}$ of strategies of $E$ and the sets $\mathfrak{P}^i$ of strategies of $P_i$ satisfy conditions 1—4 $(^1)$. From these conditions it follows directly that to each situation $(\varphi^1,\ldots,\varphi^n,\psi)$ and initial conditions $\xi^1,\ldots,\xi^n,\eta$ there correspond uniquely $2(n+1)$ continuous functions $x_1^i(t), x_2^i(t), y_1(t), y_2(t)$, $i=1,2,\ldots,n$.

The pair $\{x_1^i(t),x_2^i(t)\}$ will be called the trajectory of $P_i$. Let $t_P=\min_i (t_P)^i$. The payoff function is defined in the same way as in the game $\Gamma$.

Theorem 1. The game $\Gamma(n,1)$ has an equilibrium situation in pure strategies. Moreover, the optimal strategy of the group $P=\{P_1,\ldots,P_n\}$ consists in each $P_i$ pursuing $E$ according to the $\Pi$-strategy.

Theorem 2. Suppose that in the game $\Gamma(n,1)$ there exists a circle $B$, belonging to $S$, of radius $a$, such that: a) $\xi^1,\ldots,\xi^n\in B$ at the initial moment; b) $\eta\in B$ at the initial moment; c) we radially project the points corresponding to the positions of the pursuers at the initial moment onto the circumference of $B$; denote these projections by $\xi_B^1,\xi_B^2,\ldots,\xi_B^n$, where the indices $1,2,\ldots,n$ increase in the clockwise direction; the distance between two neighboring projections must be less than $a/3$.

If conditions a), b), c) are fulfilled, then $E$ is caught in $S$ for any $v_i/u\ge 1,\ i=1,2,\ldots,n$.

The games $\Gamma'$, $\Gamma'_T$, $\bar{\Gamma}'_T$, $\Gamma'(1,2)$. The game $\Gamma'$ differs from $\Gamma$ only in the payoff function, which is defined as follows.

Let $y(t_P)$ be the point of capture in the situation $(\varphi,\psi)$. Suppose that it belongs to $S$. Denote by $\rho[y(t_P),S]$ the distance from the point of capture to the boundary of $S$. Let $f(\rho)$ be a certain function having the following properties: a) $f(\rho)$ is a strictly decreasing function of $\rho$; b) let $0<\varepsilon<1$, then $f(0)=\varepsilon$; c) let $\rho_0$ be the distance from $\eta$ to $S$; if capture occurs at the point $\eta$, then $f(\rho_0)=0$.

The payoff function in the game $\Gamma'$, $K'(\xi,\eta,\varphi,\psi)$, is defined through $f(\rho)$ as follows.

\[ K'(\xi,\eta,\varphi,\psi)= \begin{cases} -1, & \text{if } t_P>t_{S_E},\\ 1-f(\rho), & \text{if } t_P\leqslant t_{S_E},\\ 0, & \text{if } t_P=t_{S_E}=\infty. \end{cases} \]

Theorem 3. In the game $\Gamma'$ there exists an equilibrium situation in pure strategies. The player $P$ has, moreover, a unique optimal strategy—the $P$-strategy.

Consider a certain generalization $\Gamma'_T$ of the game $\Gamma'$, in which $P$ can begin pursuit only after some time $T$ has elapsed from the start of the game. The game $\Gamma'_T$ obtained in this way will be called the game with delay $T$, or simply the game with delay.

Lemma 1. In the game $\Gamma'_T$ the $P$-strategy is optimal for $P$.

Suppose further that in the time interval $[0,T]$ the player $E$ may use only some subclass $\mathfrak{C}_T$ of his strategies. Denote this game by $\bar{\Gamma}'_T$.

Lemma 2. In the game $\bar{\Gamma}'_T$ the $P$-strategy is optimal for $P$; for any $\varepsilon>0$, $E$ has an $\varepsilon$-optimal strategy.

We now turn to the solution of the game $\Gamma'(1,2)$. The game $\Gamma'(1,2)$ is a three-person game: the pursuer $P$ and the pursued $E_1$ and $E_2$. We shall assume that the players $E_1$ and $E_2$ are forbidden to form a coalition. The strategy spaces are the same as in the game $\Gamma'$. $v$ is the linear velocity of the pursuer $P$; $u_1$ is the linear velocity of the pursued $E_1$; $u_2$ is the linear velocity of the pursued $E_2$.

The inequality $v>u_i,\ i=1,2$, holds. As in the game $\Gamma'$, the strategies of the players are denoted by $\varphi,\psi^1,\psi^2$.

The payoff of $E_i,\ i=1,2$, is defined as the negative of the payoff of $P$ in the game $\Gamma'(P,E_i)$. Let $K_i$ be the payoff of $E_i,\ i=1,2$. Then the payoff of $P$ is given by the expression $K_P=-K_1-K_2$.

Thus the game $\Gamma'(1,2)$ is a zero-sum game. In what follows, we shall denote by $\psi^i(t)$ the optimal strategy of $E_i$ in the game $\Gamma'(P,E_i)$, beginning at time $t$, and by $\varphi^{i,3-i}$, $i=1,2$, the strategy of the pursuer $P$ consisting in the fact that $P$ successively applies the $P$-strategy first to $E_i$, and then to $E_{3-i}$.

Consider two situations $(\vec{\psi}^{\,1},\vec{\psi}^{\,2},\varphi^{1,2})$ and $(\vec{\psi}^{\,1},\vec{\psi}^{\,2},\varphi^{2,1})$. Let $K_P^{1,2}$ be the payoff in one situation, and $K_P^{2,1}$ in the other.

Theorem 4. If $K_P^{1,2}\geqslant K_P^{2,1}$, then an $\varepsilon$-Nash equilibrium situation in the game $\Gamma'(1,2)$ is constructed as follows:

a) the optimal strategy of $P$, $\varphi^{1,2}$, until the moment while the inequality

\[ K_P(\vec{\psi}^{\,1},\vec{\psi}^{\,2},\varphi^{1,2}) \geqslant K(\vec{\psi}^{\,1},\psi^2,\varphi^{2,1}), \tag{1} \]

holds, and $\varphi^{2,1}$ as soon as $E_2$ begins to apply the strategy $\psi^2$ that violates (1);

b) the optimal strategy of $E_1$, $\psi^1$;

c) an $\varepsilon$-optimal strategy $E_2$ is defined as an $\varepsilon$-optimal strategy $\psi^{\varepsilon 2}$ in the game $\Gamma'_p (E_2, P)$ under the restrictions on the class of admissible strategies

\[ K_P(\vec{\psi}^{\,1}, \psi^2, \varphi^{1,2}) \geqslant K_P(\vec{\psi}^{\,1}, \psi^2, \varphi^{2,1}), \tag{2} \]

\[ K_{E_2}(\psi^1, \psi^2, \varphi^{1,2}) \geqslant K_{E_2}(\vec{\psi}^{\,1}, \psi^2, \varphi^{2,1}). \]

The game $\Gamma(m,n)$. Under the preceding conditions there are $m$ pursuers $P_1,\ldots,P_m$ and $n$ evaders $E_1,\ldots,E_n$. We shall denote this game by $\Gamma(m,n)$. For solving this game, so far, using methods of integer linear programming, it has been possible only to estimate from below the value of the game $\operatorname{Val}\Gamma(m,n)$. The estimate of $\operatorname{Val}\Gamma(m,n)$ is obtained as follows. Suppose that the set of pursuers $P_1,\ldots,P_m$ is divided into $n$ pursuing squads $M_1,\ldots,M_n$ in such a way that, by the end of the game, each of the squads $M_1,\ldots,M_n$ can pursue only one of the evaders $E_i$ according to a $P$-strategy. For each fixed partition of the set of pursuers into squads $M_1,\ldots,M_n$, the greatest payoff of the set of pursuers $f(M_1,\ldots,M_n)$ is obtained as the solution of the following integer linear programming problem [2]:

\[ \max \sum_{i,j}^{m,n} \xi_{ij} a_{ij} \tag{3} \]

under the conditions

\[ \sum_{i=1}^{m} \xi_{ij} \leqslant 1,\quad \sum_{j=1}^{n} \xi_{ij} \leqslant 1 \]

for all $i,j$ (here $a_{ij}$ is the payoff of the $i$-th squad $M_i$ from pursuing $E_j$).

The evaders cannot enter into coalitions.
It is clear that

\[ \operatorname{Val}\Gamma(m,n) \geqslant \max_{\{M_1,\ldots,M_n\}} f(M_1,\ldots,M_n), \tag{4} \]

where $\{M_1,\ldots,M_n\}$ is the set of all possible partitions of $P_1,\ldots,P_m$ into $n$ groups.

Received
5 X 1964

CITED LITERATURE

$^{1}$ L. A. Petrosyan, DAN, 161, No. 1 (1965).
$^{2}$ D. Gale, The Theory of Linear Economic Models, N. Y., 1960.

Submission history

[v1] 1965-01-01

Full Text

DIFFERENTIAL SURVIVAL GAMES WITH MANY PARTICIPANTS

CITED LITERATURE

Submission history

Access Paper

Citation

Share

Related Papers

Feedback

MATHEMATICS