Full Text
UDC 518.9
MATHEMATICS
S. I. Zukhovitskii, R. A. Polyak, M. E. Primak
ON A CONCAVE \(n\)-PERSON GAME AND ONE MODEL OF PRODUCTION
(Presented by Academician L. V. Kantorovich, 22 IX 1969)
In the present paper a method is given for finding a normalized equilibrium point of a concave \(n\)-person game, and a connection is established between this problem and A. Wald’s classical model of production. A dynamic variant of this model is also considered, and a numerical method is constructed for finding its equilibrium state.
- Let there be given functions \(\varphi_i(x) \equiv \varphi_i(x_1,\ldots,x_i,\ldots,x_n)\) \((i=1,\ldots,n)\), concave in \(x_i \in E^{m_i}\) for fixed \(x_1,\ldots,x_{i-1},x_{i+1},\ldots,x_n\), and continuous in \(x=(x_1,\ldots,x_i,\ldots,x_n)\), and a convex compact set \(\Omega\) in the \(m=m_1+\cdots+m_n\)-dimensional space of vectors \(x=(x_1,\ldots,x_i,\ldots,x_n)\).
Denote
\[
\Phi(x,y)=\sum_{i=1}^{n}\varphi_i(x_1,\ldots,y_i,\ldots,x_n).
\]
A point \(x^*\in\Omega\) for which the condition
\[
\Phi(x^*,x^*)=\max\{\Phi(x^*,x)\mid x\in\Omega\},
\tag{1}
\]
is satisfied is called a normalized equilibrium point of the concave \(n\)-person game with payoff functions \(\varphi_i(x_1,\ldots,x_i,\ldots,x_n)\) under strategies \((x_1,\ldots,x_i,\ldots,x_n)\in\Omega\).
The assumption of strict increase of the vector function
\[
g(x)=\bigl(\partial\varphi_1(x)/\partial x_{11},\ldots,\partial\varphi_1(x)/\partial x_{1m_1},\ldots,\partial\varphi_n(x)/\partial x_{n1},\ldots
\]
\[
\ldots,\partial\varphi_n(x)/\partial x_{nm_n}\bigr),
\]
i.e., the requirement \((g(x)-g(y),\,y-x)>0,\ \forall(x,y)\in\Omega\times\Omega\ (x\ne y)\), which ensures the uniqueness of the vector \(x^*\), will be replaced by the conditions:
\[
\text{1) }\Omega\text{ is a strictly* convex compact set;}\quad
\text{2) }\|g(x)\|>0,\ \forall x\in\Omega.
\tag{2}
\]
Denote
\[
F(x)=\max\{(g(x),y-x)\mid y\in\Omega\}=(g(x),\theta(x)-x)=(g(x),\xi(x)).
\]
Since problem (1) is a convex programming problem, and
\[
g(x^*)=\nabla\Phi(x^*,y)\bigm|_{y=x^*},
\]
we have \(F(x^*)=0\) (see, for example, \((^2)\)).
Below, for finding a normalized equilibrium point, a numerical method is constructed which is a certain generalization of the conditional-gradient method \((^{2,3})\).
- As an initial approximation we choose an arbitrary vector \(x^0\in\Omega\). Suppose that \(k\) steps of the algorithm have already been carried out and approximations \(x^1,\ldots,x^k\) have been constructed. To construct the \((k+1)\)-st approximation, we solve the following convex programming problem:
\[ \max\{(g(x^k),x-x^k)\mid x\in\Omega\}=F(x^k)=(g(x^k),\theta(x^k)-x^k) \]
and compute the new approximation by the formula
\[ x^{k+1}=x^k+t_k(\theta(x^k)-x^k)=x^k+t_k\xi^k, \]
where \(t_k\) may be chosen in one of the following ways, corresponding to three algorithms A1, A2, A3:
* The set \(\Omega\) is strictly convex if from \(x,y\in\Omega\) it follows that \(\lambda x+(1-\lambda)y\in\operatorname{int}\Omega,\ \forall\,0<\lambda<1\).
$1^\circ$. As $t_k$ one takes the $k$-th term of an arbitrary sequence $\{t_k\}$ satisfying only the conditions: a) $t_k \to 0$, $0 < t_k \leqslant 1$; b)
\[
\sum_{k=0}^{\infty} t_k = \infty .
\]
$2^\circ$.
\[
t_k=\max\{t\mid F(x^k+t\xi^k)\leqslant F(x^k)(1-{}^{1}/_{2}t),\ 0<t\leqslant1\}.
\]
$3^\circ$.
\[
t_k=\max\{t=2^{-\nu}\mid F(x^k+2^{-\nu}\xi^k)\leqslant F(x^k)(1-2^{-(\nu+1)}),\ \nu\text{ is an integer}\}.
\]
3. Theorem 1. Suppose condition (2) is satisfied and $\nabla F(x)$ is a continuous vector function. Then the following assertions hold:
$1^\circ$. If for the Jacobian $H(x)$ of the vector function $g(x)$ there exists $\mu>0$ such that
\[
(H(x)\xi,\xi)\leqslant -\mu(\xi,\xi),\qquad \forall x\in\Omega,\quad \forall \xi\in E^m,
\tag{3}
\]
then the sequence $\{x^k\}$ generated by algorithm A1 contains a subsequence $\{x^{k_i}\}$ converging to $x^*$.
$2^\circ$. If $(H(x)\xi,\xi)\leqslant0$, $\forall x\in\Omega$ and $\forall \xi\in E^m$, then the sequence $\{x^k\}$ generated by algorithm A2 converges to $x^*$.
$3^\circ$. If condition (3) is satisfied and $\nabla F(x)$ satisfies the Lipschitz condition, then the entire sequence $\{x^k\}$ generated by algorithm A1 converges to $x^*$; but if it is generated by algorithm A3, then
\[
F(x^k)\leqslant F(x^0)q^k,\qquad \|x^k-x^*\|^2\leqslant \frac{F(x^0)}{\mu}q^k\quad (q<1).
\]
A subsequence converging to $x^*$, whose existence is guaranteed in $1^\circ$, can be constructed as follows:
\[
k_1=0,\qquad k_{i+1}=\min\{k\mid k>k_i,\ F(x^k)<F(x^{k_i})\}.
\]
In algorithm A3, to find the value of $t_k$ it may be necessary at each step to check the inequality many times:
\[
F(x^k+2^{-\nu}\xi^k)\leqslant F(x^k)(1-2^{-(\nu+1)}).
\]
Since computing the value $F(x)$ is associated with solving an extremal problem, such a method may turn out to be uneconomical, and it can be replaced by the following:
\[
t_k=\max\{t=t_{k-1}2^{-\nu}\mid F(x^k+t_{k-1}2^{-\nu}\xi^k)\leqslant
\]
\[
\leqslant F(x_k)(1-t_{k-1}2^{-(\nu+1)}),\ \nu\text{ is an integer}\}.
\]
Then, starting from some index $k$, the parameter $t_k$ will not change, and to determine it one will need to compute the function $F(x)$ only at a single point $x^k+t_{k-1}\xi^k$.
Let us now pass to the consideration of two production models which are, in essence, certain concave $n$-person games.
4. First we consider the classical production model of A. Wald (${}^4,{}^5$). Suppose that in some economy $n$ products are produced and $r$ resources are used, available in quantities respectively $a_1,\ldots,a_r$. The quantity $a_{ij}$ $(i=1,\ldots,n;\ j=1,\ldots,r)$ indicates the expenditure of the $j$-th resource necessary for producing one unit of the $i$-th product. However, unlike the classical problem of maximum profitability of enterprises (${}^6$), the prices of the products produced are not constant, but depend on the quantity of products produced. Let the function $f_i(x)\equiv f_i(x_1,\ldots,x_n)$ $(i=1,\ldots,n)$ indicate the price of a unit of the $i$-th product under the condition that the products are produced in quantities $x_1,\ldots,x_i,\ldots,x_n$. In (${}^4$) A. Wald, under certain assumptions concerning the functions $f_i(x)$ and under the assumption that
\[
\text{the set }\quad
\Omega_0=\left\{x\ \middle|\ \sum_{i=1}^{n}a_{ij}x_i\leqslant a_j\ (j=1,\ldots,r),\ x_i\geqslant0\ (i=1,\ldots,n)\right\}
\]
is a polyhedron, answered affirmatively the question of the existence of a nonnegative production vector $x^*=(x_1^*,\ldots,x_i^*,\ldots,x_n^*)$ and a nonnegative
the vector of resource valuations \(v^*=(v_1^*,\ldots,v_r^*)\) such that
\[ \sum_{i=1}^{n} a_{ij}x_i^* \leqslant a_j \ (j=1,\ldots,r);\qquad \sum_{j=1}^{r} a_{ij}v_j^* \geqslant f_i(x_1^*,\ldots,x_i^*,\ldots,x_n^*) \ (i=1,\ldots,n), \]
\[ \left(a_j-\sum_{i=1}^{r} a_{ij}x_i^*\right)v_j^*=0 \ (j=1,\ldots,r);\quad \left(\sum_{j=1}^{r} a_{ij}v_j^*-f_i(x^*)\right)x_i^*=0 \ (i=1,\ldots,n). \tag{4} \]
In [5] G. Kuhn, regarding the \(f_i(x)\) as merely nonnegative and continuous for all \(x\geqslant 0\), proved the existence of a production vector \(x^*\in\Omega_0\) that is a solution of the following linear programming problem:
\[ \max\left\{\sum_{i=1}^{n} f_i(x^*)x_i \mid x\in\Omega_0\right\} = \sum_{i=1}^{n} f_i(x^*)x_i^*. \tag{5} \]
The existence of vectors \(x^*\) and \(v^*\) satisfying conditions (4) is a simple consequence of equality (5) and the duality theorems of linear programming. It turns out that the vector \(x^*\) establishing equilibrium in Walras’s model is a normalized equilibrium point of the following concave \(n\)-person game.
Let
\[ \varphi_i(x_1,\ldots,x_i,\ldots,x_n) = \int_{0}^{x_i} f_i(x_1,\ldots,y,\ldots,x_n)\,dy \]
be the payoff of the \(i\)-th player, under the condition that the players have chosen, as their strategies, respectively, the values \(x_1,\ldots,x_i,\ldots,x_n\).
Consider the problem of finding a normalized equilibrium point, i.e., a point \(x^*=(x_1^*,\ldots,x_i^*,\ldots,x_n^*)\), for which
\[ \Phi(x^*,x^*)=\sum_{i=1}^{n}\varphi_i(x_i^*)= \]
\[ = \max\left\{\sum_{i=1}^{n}\varphi_i(x_1^*,\ldots,y,\ldots,x_n^*) \mid (y_1,\ldots,y_i,\ldots,y_n)\in\Omega_0\right\}. \tag{6} \]
Theorem 2. If each function \(f_i(x)\) \((i=1,\ldots,n)\) is jointly continuous in the variables and does not increase in the corresponding variable \(x_i\), and \(\Omega_0\) is a polyhedron, then, in order that the point \(x^*\) be a solution of problem (5), it is necessary and sufficient that this point be a solution of problem (6).
The established equivalence of A. Wald’s model to a concave \(n\)-person game makes it possible to apply, in finding an equilibrium state in this model, the numerical methods constructed in [7]. The algorithms constructed in Sec. 2 can also be applied to finding an equilibrium state in the dynamic variant of A. Wald’s model presented below.
- Consider an economy in which \(n\) products are produced and \(r\) types of resources are used. We shall assume that the rate at which the output of each product changes at time \(t\) depends linearly on the quantity of each type of product produced at that time and on the quantities of resources used at that time. Therefore one may assume that the economy under consideration is described by the system of differential equations
\[ dx/dt=A(t)x+B(t)u,\qquad x(0)=x_0, \tag{7} \]
where the elements \(a_{ij}(t)\) and \(b_{ik}(t)\) of the matrices \(A(t)\) and \(B(t)\) are piecewise-continuous functions, bounded on \([0,T]\).
The control vector \(u(t)=(u_1(t),\ldots,u_r(t))\) is chosen from some set \(U\subset E^r\). In what follows we shall assume that system (7)
and the set \(U\) satisfy the conditions under which, at time \(T\), the attainability set
\[ \Omega_T=\left\{x(T)\mid x(T)=Y(T)\left(x_0+\int_0^T Y^{-1}(t)B(t)u(t)\,dt\right),\ u(t)\in U\right\} \]
is bounded, closed, and strictly convex. Here \(Y(t)\) is the fundamental matrix of system (7).
Let, as in item 4, the functions \(f_i(x_1,\ldots,x_i,\ldots,x_n)\) \((i=1,\ldots,n)\) determine the prices of the products.
Consider the problem of finding a control \(u^*(t)=(u_1^*(t),\ldots,\ldots,u_r^*(t))\) such that the trajectory \(x^*(t)=(x_1^*(t),\ldots,x_n^*(t))\) of system (7) at time \(t=T\) determines a point realizing
\[ \max\left\{\sum_{i=1}^n f_i(x^*(T))x_i\mid x\in\Omega_T\right\} = \sum_{i=1}^n f_i(x^*(T))x_i^*(T). \]
In other words, one seeks a control of the economy \(u^*(t)\) that leads at time \(t=T\) to the production of such quantities \(x_i^*(t)\) \((i=1,\ldots,n)\) of products that, at prices \(f_i(x^*(T))\) \((i=1,\ldots,n)\), the vector \(x^*(T)\) maximizes the value of the output produced.
For the dynamic version of A. Wald’s model there is a statement analogous to theorem 2, from which, in particular, the existence of the vector \(x^*(T)\) follows.
As already noted above, the algorithms of item 2 may be used to find the vector \(x^*(T)\). In this case, at each step, the linear optimal-control problem (8) is solved, consisting in finding the maximum of a linear function on the set of endpoints of trajectories \(\Omega_T\).
Let us dwell in more detail on the first algorithm. As the initial approximation, choose a vector \(x^0(T)\in\Omega_T\). Suppose that the approximations \(x^1(T),\ldots,x^k(T)\) have already been constructed.
1) Find a solution \(\psi^k(t)\) of the system
\[ d\psi/dt=-A^*(t)\psi,\qquad \psi_j^k(T)=f_j(x^k(T))\quad (j=1,\ldots,n). \]
2) Find the vector \(u^k(t)\) from the condition that, at every time \(t\in[0,T]\), the equality
\[ (\psi^k(t),B(t)u^k(t))=\max(\psi^k(t),B(t)u) \]
hold; that is, the vector \(u^k(t)\) is found from L. S. Pontryagin’s maximum principle.
3) Find the new approximation \(x^{k+1}(T)\) by the formula
\[ x^{k+1}(T)=(1-\lambda_k)x^k(T)+\lambda_kY(T)\left(x_0+\int_0^T Y^{-1}(t)B(t)u^k(t)\,dt\right), \]
where: a) \(\lambda_k\to0,\ 0<\lambda_k\le1\); b)
\[ \sum_{k=0}^{\infty}\lambda_k=\infty. \]
Moscow Civil Engineering Institute
named after V. V. Kuibyshev
Ukrainian Branch of the Scientific Research
Institute of Planning and Norms
Received
10 IX 1969
REFERENCES
- J. B. Rosen, Econometrica, 33, No. 3 (1965).
- V. F. Dem’yanov, A. M. Rubinov, Approximate Methods for Solving Extremal Problems, L., 1968.
- M. Frank, P. Wolfe, Naval Res. Logistics Quarterly, 3 (1956).
- A. Wald, Zs. Nationalökonomie, 7, No. 5 (1936).
- G. U. Kun, “Linear inequalities and related questions,” Collection, IL, 1959.
- L. V. Kantorovich, Economic Calculation of the Best Use of Resources, Publishing House of the Academy of Sciences of the USSR, 1959.
- S. I. Zukhovitskii, R. A. Polyak, M. E. Primak, DAN, 185, No. 1 (1969).
- L. S. Pontryagin, V. G. Boltyanskii et al., The Mathematical Theory of Optimal Processes, M., 1961.