UDC 536.751:539.121.42:530.145.63
PHYSICS
Submitted 1970-01-01 | RussiaRxiv: ru-197001.08821 | Translated from Russian

Full Text

UDC 536.751:539.121.42:530.145.63

PHYSICS

V. L. LYUBOSHITS

EXTREMAL PROPERTIES OF ENTROPY IN THE MIXING OF POLARIZED GASES

(Presented by Academician B. M. Pontecorvo on 10 IV 1970)

1. Earlier, using the example of polarized gases consisting of identical atoms with spin \(1/2\), a new approach to the Gibbs paradox was developed \(\left({}^{8}\right)\). It turned out that the change in entropy upon mixing the two gases decreases monotonically to zero as the polarization vectors characterizing them approach one another. Under conditions in which, before mixing, both gases occupy equal volumes at equal pressures, the quantity \(\Delta S\) can assume a continuous series of values in the interval

\[ 0 \leq \Delta S \leq 2kN \ln 2, \tag{1} \]

where \(N\) is the number of atoms of each of the gases, and \(k\) is Boltzmann’s constant.

The aim of the present work is to investigate the extremal properties of the entropy for mixtures of two polarized gases at arbitrary values of the spin \(s\). It will be shown that inequality (1) is valid in the general case*.

2. Let there be, in a volume \(V_1\), a gas containing \(N\) atoms in a spin state with density matrix \(\hat A\), and in a volume \(V_2(=V_1)\), separated from \(V_1\) by an impermeable partition, a gas containing \(N\) of the same atoms in a spin state with density matrix \(\hat B\). In the general case \(A\) and \(\hat B\) are Hermitian positive definite \((2s+1)\)-row matrices with trace equal to \(1\)**. Up to additive terms of the type \(Nf(T)\), which do not interest us and depend on the temperature, the entropies of gases \(A\) and \(B\) before mixing have the form:

\[ S_A(N,V)=-kN\operatorname{Sp}\hat A\ln\hat A+kN\ln\frac{V}{N}; \]

\[ S_B(N,V)=-kN\operatorname{Sp}\hat B\ln\hat B+kN\ln\frac{V}{N}, \tag{2} \]

where

\[ \operatorname{Sp}\hat A\ln\hat A=\sum_{i=1}^{2s+1} a_i\ln a_i,\qquad \operatorname{Sp}\hat B\ln\hat B=\sum_{i=1}^{2s+1} b_i\ln b_i, \]

\(a_i\) and \(b_i\) are the eigenvalues of the matrices \(\hat A\) and \(\hat B\), respectively \((a_i \geq 0,\ b_i \geq 0)\). A formula of the type (2) for mixed quantum ensembles was first indicated by von Neumann \(\left({}^{2}\right)\). If the spin-relaxation time is much longer than the characteristic diffusion time (see \(\left({}^{1}\right)\)), then after complete mixing of the gases in the volume \(2V\) the density matrix will have the form

\[ \hat C=\frac{1}{2}(\hat A+\hat B). \tag{3} \]

Hence the change in entropy is

\[ \Delta S_{AB}(N)=S_C(N,2V)-S_A(N,V)-S_B(N,V)=kN\operatorname{Sp}\hat A\ln\hat A+ \]

\[ {}+kN\operatorname{Sp}\hat B\ln\hat B-kN\operatorname{Sp}(\hat A+\hat B)\ln\frac{\hat A+\hat B}{2}. \tag{4} \]

\[ \overline{\phantom{mmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmmm}} \]

* As in work \(\left({}^{1}\right)\), for definiteness we speak of spin, although all further conclusions apply to any discrete quantum number characterizing the internal state of the atom.

** For spin \(1/2\), \(A=\frac{1}{2}(1+\boldsymbol{\sigma}\mathbf P^{(A)})\), \(B=\frac{1}{2}(1+\boldsymbol{\sigma}\mathbf P^{(B)})\), where \(\boldsymbol{\sigma}\) is the Pauli vector operator, and \(\mathbf P^{(A)}\) and \(\mathbf P^{(B)}\) are the polarization vectors of the gases.

  1. Let us consider an important special case, when \(\hat A\) and \(\hat B\) correspond to pure ensembles with wave functions \(\psi^{(A)}\) and \(\psi^{(B)}\). For pure states one of the eigenvalues of the density matrix is equal to 1, the others are zero. Consequently, in formula (2) we must put \(\operatorname{Sp}\hat A\ln\hat A=\operatorname{Sp}\hat B\ln\hat B=0\). Solving further the standard problem of determining the eigenvalues of the Hermitian matrix \(\hat C\), we obtain

\[ c_1=\frac12\left(1+\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right),\qquad c_2=\frac12\left(1-\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right), \]

\[ c_3=c_4=\ldots c_{2s+1}=0. \tag{5} \]

The eigenvalues \(c_1\) and \(c_2\) correspond to the spin functions

\[ \psi^{(C)}_1= \frac{\psi^{(A)}+\psi^{(B)}e^{i\eta}} {\sqrt{2\left(1+\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right)}} , \qquad \psi^{(C)}_2= \frac{\psi^{(A)}-\psi^{(B)}e^{i\eta}} {\sqrt{2\left(1-\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right)}} , \tag{6} \]

where \(\eta=\arg\langle\psi^{(B)}|\psi^{(A)}\rangle\). Taking (3), (4), and (5) into account, we arrive at the relation

\[ \Delta S=-kN\left[ \left(1+\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right) \ln\frac{1-\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|}{2} +\right. \]

\[ \left. +\left(1-\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|\right) \ln\frac{1-\left|\langle\psi^{(A)}|\psi^{(B)}\rangle\right|}{2} \right], \tag{7} \]

obtained in \((^1)\) for \(s=1/2\).

  1. Let us return to the general expression (4). Suppose first that the matrices \(\hat A\) and \(\hat B\) commute. Then we can write

\[ \Delta S_{AB}(N)=kN\left( \sum_{i=1}^{2s+1} a_i\ln a_i+ \sum_{i=1}^{2s+1} b_i\ln b_i- \sum_{i=1}^{2s+1} (a_i+b_i)\ln\frac{a_i+b_i}{2} \right). \tag{8} \]

Since

\[ \sum_{i=1}^{2s+1} a_i\ln a_i\geq \sum_{i=1}^{2s+1} a_i\ln\frac{a_i+b_i}{2} \quad \text{(see }(^1,^3)\text{),} \]

it is clear that \(\Delta S_{AB}\geq 0\).

On the other hand, from the inequality \(a_i\ln\dfrac{a_i}{a_i+b_i}\leq 0\) it follows that \(\Delta S_{AB}(N)\leq 2kN\ln 2\). Thus, for commuting matrices \(\hat A\) and \(\hat B\) inequality (1) is proved. Moreover, according to (8), for \(\hat A=\hat B\) (the criterion of identity of gases) the quantity \(\Delta S=0\). If, however, \(\hat A\hat B=\hat B\hat A=0\) (the criterion of complete distinguishability of gases), then the entropy change is \(\Delta S=2kN\ln 2\).

  1. We now pass to the general case. Let the matrices \(\hat A\) and \(\hat B\), generally speaking, not commute with each other. We associate with the matrix \(\hat B\) two matrices \(\hat B^{(1)}\) and \(\hat B^{(2)}\), which we define as follows:

a) the eigenvalues of \(\hat B^{(1)}\) and \(\hat B^{(2)}\), up to a permutation of the indices numbering them, coincide with the eigenvalues of the matrix \(\hat B\);

b) \(\hat B^{(1)}\) and \(\hat B^{(2)}\) commute with each other and with the matrix \(\hat A\);

c) if the eigenvalues of \(\hat A\) are numbered in the order

\[ a_1\geq a_2\geq a_3\ldots \geq a_{2s+1}, \tag{9} \]

then the eigenvalues of \(\hat B^{(1)}\) and \(\hat B^{(2)}\) corresponding to the same basis functions are arranged in the order

\[ b^{(1)}_1\geq b^{(1)}_2\geq\ldots\geq b^{(1)}_{2s+1}, \tag{10a} \]

\[ b^{(2)}_1=b^{(2)}_{2s+1}\leq b^{(2)}_2=b^{(1)}_{2s}\leq\ldots\leq b^{(2)}_{2s+1}=b^{(1)}_1. \tag{10б} \]

The following is valid.

Theorem. Let \(\hat A\) and \(\hat B\) be Hermitian positive definite matrices of the same rank with trace equal to 1, and let \(F(x)\) be a twice differen-

function in the interval \(0 \leq x \leq 2\). Then, if in this interval \(\dfrac{d^2 F(x)}{dx^2} \geq 0\), then

\[ \operatorname{Sp} F(\hat A+\hat B^{(2)}) \leq \operatorname{Sp} F(\hat A+\hat B) \leq \operatorname{Sp} F(\hat A+\hat B^{(1)}) \tag{11} \]

(recall that \(\operatorname{Sp} F(\hat A+\hat B)=\sum_{i=1}^{2s+1} F(\lambda_i)\), where \(\lambda_i\) are the eigenvalues of the matrix \(\hat A+\hat B\)).

Proof. Consider the set of Hermitian positive-definite matrices \(\hat B\) related to one another by unitary transformations. All matrices of this set have the same eigenvalues. Therefore, if the matrix \(\hat A\) is fixed, we can put them in correspondence with the same matrices \(\hat B^{(1)}\) and \(\hat B^{(2)}\). Let the quantity \(\operatorname{Sp} F(\hat A+\hat B)\), defined on the set of matrices \(\hat B\), take an extremal value (maximum or minimum) at \(\hat B=\tilde B\). Denote

\[ Z=\operatorname{Sp} F(\hat A+\hat U^+ \tilde B \hat U)-\operatorname{Sp} F(\hat A+\tilde B), \]

where \(\hat U\) is a unitary matrix. For \(\hat U\) the representation \(\hat U=e^{i\varepsilon \hat H}\) is valid, where \(\varepsilon\) is a real quantity and \(\hat H\) is a Hermitian matrix. Accordingly, for small \(|\varepsilon|\),

\[ \hat A+\hat U^+\tilde B\hat U = \hat A+\tilde B-i\varepsilon(\hat H\tilde B-\tilde B\hat H) +\varepsilon^2 \hat H\tilde B\hat H -\frac{1}{2}\varepsilon^2(\hat H^2\tilde B+\tilde B\hat H^2) +o(\varepsilon^3). \tag{12} \]

Represent \(Z\) in the form

\[ Z=\sum_{i=1}^{2s+1} F'(\lambda_i)(\lambda_i'-\lambda_i)+o\bigl((\lambda_i'-\lambda_i)^2\bigr), \tag{13} \]

where \(\lambda_i\) and \(\lambda_i'\) are the eigenvalues of the matrices \(\hat A+\tilde B\) and \(A+\hat U^+\tilde B\hat U\), respectively, and \(F'(x)=dF(x)/dx\).

If \(\tilde B\) corresponds to a maximum of the quantity \(\operatorname{Sp}F(\hat A+B)\), then, independently of the sign of \(\varepsilon\) and of the specific form of \(\hat H\), \(Z\leq 0\); in the case of a minimum, always \(Z\geq 0\). This means that the quantity \(Z\), and consequently also the differences \(\lambda_i'-\lambda_i\), cannot be of order \(\varepsilon\). According to perturbation theory,

\[ \lambda_i'-\lambda_i = 2\varepsilon\, \operatorname{Im}\sum_{k=1}^{2s+1} H_{ik}\tilde B_{ki} +o(\varepsilon^2), \tag{14} \]

where \(H_{ik}\) and \(\tilde B_{ki}\) are the matrix elements of \(\hat H\) and \(\hat B\) in the representation of the eigenstates of the matrix \(\hat A+\tilde B\). The requirement

\[ \operatorname{Im}\sum_{k=1}^{2s+1} H_{ik}\tilde B_{ki}=0 \]

(taking into account the Hermiticity of \(\hat H\) and \(\hat B\)) gives

\[ \tilde B_{ki}=\tilde b_i \delta_{ik}. \tag{15} \]

It follows from this that the matrices \(\tilde B\) and \(\hat A\) must commute. In this case \(\lambda_i=a_i+\tilde b_i\).

Using perturbation theory, we now find \((\lambda_i'-\lambda_i)\) to second order in the parameter \(\varepsilon\). According to (12) and (15),

\[ \lambda_i'-\lambda_i = \varepsilon^2 \sum_{l=1}^{2s+1} |H_{il}|^2 \tilde b_l -\varepsilon^2 \sum_{l=1}^{2s+1} |H_{il}|^2 \tilde b_i +\varepsilon^2 \sum_{l\ne i}^{2s+1} \frac{|H_{il}|^2(\tilde b_l-\tilde b_i)^2}{a_i-a_l+\tilde b_i-\tilde b_l} = \]

\[ = -\varepsilon^2 \sum_{l\ne i}^{2s+1} |H_{il}|^2 \frac{2(a_i-a_l)(\tilde b_i-\tilde b_l)} {a_i-a_l+\tilde b_i-\tilde b_l}. \tag{16} \]

Hence

\[ Z=-\varepsilon^2 \sum_{i=1}^{2s+1}\sum_{l>i}^{2s+1} \frac{F'(a_i+\widetilde b_i)-F'(a_l+\widetilde b_l)} {a_i+\widetilde b_i-a_l-\widetilde b_l}\, |H_{il}|^2(a_i-a_l)(\widetilde b_i-\widetilde b_l). \tag{17} \]

Since, by definition,

\[ \frac{d^2F(x)}{dx^2}\geqslant 0, \]

it is clear that all quantities

\[ K_{il}\equiv \frac{F'(\lambda_i)-F'(\lambda_l)}{\lambda_i-\lambda_l}\geqslant 0. \]

As a result, from the maximum condition \(Z\leqslant 0\) it follows that, for all \(i\) and \(l\),

\[ (a_i-a_l)\bigl(\widetilde b_i^{(\max)}-\widetilde b_l^{(\max)}\bigr)\geqslant 0. \tag{18} \]

Similarly, from the minimum condition \(Z\geqslant 0\) it follows that, for all \(i\) and \(l\),

\[ (a_i-a_l)\bigl(\widetilde b_i^{(\min)}-b_l^{(\min)}\bigr)\leqslant 0. \tag{19} \]

It is easy to see that conditions (18) and (19) are completely equivalent to the definitions of the matrices \(\widehat B^{(1)}\) and \(\widehat B^{(2)}\), respectively, with respect to \(A\) (see (9), (10a), (10b)). Thus, \(\widehat B^{(\max)}=\widehat B^{(1)}\), \(\widehat B^{(\min)}=\widehat B^{(2)}\).

Thereby relation (11) has been proved.

  1. If in the interval \(0\leqslant x\leqslant 2\)

\[ \frac{d^2F(x)}{dx^2}\leqslant 0, \]

then in formula (11), obviously, the replacement \(\widehat B^{(1)} \leftrightarrows \widehat B^{(2)}\) should be made. In particular,

\[ -\operatorname{Sp}(\widehat A+\widehat B^{(1)})\ln\frac{\widehat A+\widehat B^{(1)}}{2} \leqslant -\operatorname{Sp}(\widehat A+\widehat B)\ln\frac{\widehat A+\widehat B}{2} \leqslant \]

\[ \leqslant -\operatorname{Sp}(\widehat A+\widehat B^{(2)})\ln\frac{\widehat A+\widehat B^{(2)}}{2}. \tag{20} \]

Hence, on the basis of (4), for the change of entropy we can write (for \(\widehat B\ne \widehat B^{(1)},\widehat B^{(2)}\))

\[ \Delta S_{AB^{(1)}}(N)<\Delta S_{AB}(N)<\Delta S_{AB^{(2)}}(N). \tag{22} \]

Since, in accordance with the conclusions of section (4), for the quantities \(\Delta S_{AB^{(1)}}(N)\) and \(\Delta S_{AB^{(2)}}(N)\) relation (1) is valid (\(\widehat A\), \(\widehat B^{(1)}\), and \(\widehat B^{(2)}\) commute), it is clear that from (21) there follows the validity of (1) for arbitrary density matrices \(A\) and \(B\) (of course, under the mixing conditions indicated above). At the same time, the equalities written earlier, \(\widehat A=\widehat B\) and \(\widehat A\widehat B=0\) (for pure ensembles, respectively, \(\psi^{(A)}=\psi^{(B)}\) and \(\langle\psi^{(A)}|\psi^{(B)}\rangle=0\)), in the general case constitute necessary and sufficient conditions for the minimal and maximal change of entropy upon mixing polarized gases.

The author expresses deep gratitude to B. N. Valuev and M. I. Podgoretskii for discussion and valuable comments.

Joint Institute for Nuclear Research
Dubna, Moscow Region

Received
30 III 1970

REFERENCES

  1. V. L. Lyuboshits, M. I. Podgoretskii, DAN, 194, No. 3 (1970).
  2. J. von Neumann, Mathematical Foundations of Quantum Mechanics, Ch. 5, “Nauka,” 1964.
  3. L. Brillouin, Science and Information Theory, Ch. 2, Moscow, 1960.

Submission history

UDC 536.751:539.121.42:530.145.63