Decoherence as an Inherent Characteristic of Quantum Mechanics ()
1. Introduction
The measurement problem in quantum mechanics is an unresolved problem in modern physics, and a subject of considerable debate [1] [2]. The microscopic world in conventional quantum mechanics is treated differently from the macroscopic world in classical mechanics. However, these two worlds are, in fact, linked to each other. Therefore, it may seem that the present formulation of quantum mechanics is insufficient to describe nature. Nevertheless, for many decades, it has been unclear how exactly to formulate the measurement problem, which has led to several definitions of this problem.
For a better understanding of the discussion in subsequent sections, we first provide some background on the problems that exist between classical physics and quantum mechanics, and the theories suggested to solve them. In the present work, we examine a Stern-Gerlach-like experiment, where the spin in the z-direction of an electron S is measured, to clarify these problems.
In classical mechanics, the relation between the state of a system under investigation and the physical quantities associated with it is trivial. However, certain assumptions are needed to connect these in quantum mechanics. In its standard formulation, the eigenstate-eigenvalue link [3] [4] states that eigenvalues and eigenstates have a one-to-one correspondence except in the case of degeneracy. Thus, in our Stern-Gerlach-like experiment, we assume that
(1)
(2)
where
and
represent the eigenstates of S with eigenvalues
and
, respectively.
However, some studies have suggested that this condition is unnecessary [5]. Nevertheless, if we do not agree with the eigenstate-eigenvalue link, then it follows that standard quantum mechanics is insufficient to describe nature. To better understand this, we examine the above assumption in more detail. Note that because the assumption eigenstate
eigenvalue is a part of Born’s rule [6], and discarding this assumption implies discarding Born’s rule. Conversely, discarding the assumption eigenvalue
eigenstate implies that there may be states other than the eigenstate that correspond to the measured value of the concerned physical quantity. For instance, let
be the value obtained after the measurement of a quantity. If the state obtained directly after this measurement is not the eigenstate corresponding to
, then it is possible to obtain a measured value that is different from
even after the first measurement. Thus, if we discard the assumption eigenvalue
eigenstate, then the standard theory of quantum mechanics appears to be insufficient.
We introduce an ideal measurement device M and assume that S and M interact by obeying the Schrödinger equation. Let
be the neutral state of M before measurement, and
and
be the states of M after measurement with measured values
and
, respectively. Subsequently, taking into account the eigenstate-eigenvalue link, an ideal measurement process
is defined as
(3)
(4)
Let the initial state of S be
. Then, the state obtained after the measurement process is given by
(5)
Note that either the measured value
or
is expected as a result of the measurement in Equation (5). However, the measurement process and the eigenstate-eigenvalue link contradict each other because the initial state of S is not the eigenstate corresponding to the eigenvalues
or
.
Thus, the above example shows that it is necessary to adopt a modification of standard quantum mechanics. The most traditional approach is to adopt the projection postulate [7]. As will be discussed in Section 2, the transition from a superposition of states to an eigenstate is necessary so that the expectation values in quantum mechanics obey the classical equations of motion. However, this transition cannot be described by the Schrödinger equation because it is not a unitary process. By adopting the projection postulate, the quantum states not only develop unitarily by obeying the Schrödinger equation but also non-unitarily owing to the collapse of the wave packets during the measurement process. For example, a state
changes to the corresponding eigenstates
after the measurement with corresponding measured values
. Thus, in the Stern-Gerlach-like experiment, a non-unitary change such as
(6)
is permitted in addition to the unitary development represented by Equation (5).
Although the projection postulate is a type of nonlocal requirement, there are several studies that demonstrate its consistency with special relativity [8]. Note that classical information cannot be transferred during the collapse of a wave packet.
In contrast to the above description, the collapse of a wave packet is not assumed in the many-worlds interpretation of quantum mechanics [9] [10]. In this interpretation, even macroscopic states maintain coherent superpositions, and therefore, the assumption that only one outcome can be obtained from one appropriate measurement process, which is usually regarded as a matter of course, is discarded.
Decoherence [11] [12] may be regarded as one of the most successful theories to explain the quantum measurement process without any additional postulates. Moreover, it is frequently applied to the many-worlds interpretation [13]. Decoherence was first proposed by Zeh et al. [14] [15], which was followed by the important work of Zurek [16]. The efforts of these authors ensured that the decoherence theory was actively studied. In this theory, the observed decay of interference is explained by the interactions between the system and the environmental degrees of freedom. For example, let the state
that represents a unified system consisting of an observed system and a measurement device be described by the superposition of the two states
and
, such that:
(7)
Next, we suppose that this state interacts with the state
representing the environmental degrees of freedom. The interaction dynamics between these states are given by
(8)
(9)
The density matrix
of the interacting system is then
(10)
Note that because the environmental degrees of freedom are very large, the states
and
resulting from the interaction are approximately orthogonal if
and
are distinguishable. Therefore, the reduced density matrix
of the unified system, which is obtained by tracing out the environmental degrees of freedom, becomes
(11)
Thus, we observe the decay of interference in the system. Moreover, the preferred basis problem has also been studied in this framework by Zurek [17], who also presented a solution for this problem.
However, certain problems remain with the decoherence theory. First, the unitary interaction between the system and the environment does not lead to a complete removal of the interference terms. The reduced density matrix given by Equation (11) does not indicate whether the system is in state
or
because reduced density matrices represent improper mixtures [18] that only provide a probability distribution. This implies that the coherence has been delocalised into a larger system including the environment [2]. The next problem may be more severe. Note that because the coherent terms disappear after in the interaction between the system and the environment, obtaining an outcome for a superposition of states implies direct violation of the eigenstate-eigenvalue link.
Considering the above arguments, it seems necessary to remove certain natural conditions from or add certain new unnatural conditions to standard quantum mechanics to explain the measurement process. In other words, the quantum measurement problem is reduced to the problem of determining these conditions.
Note that in the present study, we do not delve into a discussion regarding what the measurement problem actually is. Instead, we define our problem in the form of the following question: is it possible to explain the measurement process within the framework of pure quantum mechanics? In this study, pure quantum mechanics refers to the theory of quantum mechanics without any additional postulates, such as the projection postulate. We demonstrate that the answer to this question is, in fact, yes; thus, we can indeed explain the measurement process within the framework of pure quantum mechanics.
The remaining paper is organised as follows.
In Section 2, the relations between the quantum mechanical and classical equations of motion are briefly reviewed. It is shown that the transition from a superposition of states to an eigenstate is necessary to ensure that the expectation values in quantum mechanics obey the classical equations of motion; specifically, decoherence is necessary to connect the classical and quantum worlds.
In Section 3, several Bell-type inequalities are reviewed. The fact that these inequalities do not hold true for quantum mechanics indicates the presence of some sort of nonlocality. Although it is unlikely that the universe is nonlocal, the measurement problem of quantum mechanics is not a problem of explaining the nonlocality itself. The question we should rather ask is whether such theories of quantum mechanics are reasonable enough to explain the measurement process and experiments that demonstrate the reasonableness of its own. The Kochen-Specker theorem is also discussed in Section 3. It is well known that this theorem prohibits counterfactual definiteness of quantum mechanics. Nevertheless, it is important to consider the meaning of counterfactual definiteness in relation to the nonlocality.
Section 4 presents the main findings of this study. We show that decoherence is an inherent characteristic of quantum states caused by the quantum uncertainty relation, and it can explain the measurement process within the framework of pure quantum mechanics. It is also shown that there is no inconsistency between decoherence and the counterfactual definiteness discussed in Section 3.
Section 5 concludes the paper.
2. Relation between Quantum Mechanical and Classical Equations of Motion
In this section, we show that the transition from a superposition of states to an eigenstate is necessary to ensure that the quantum mechanical expectation values obey the classical equations of motion.
2.1. Schrödinger Equation, Path Integral Formulation, and Hamilton-Jacobi Equation
The time evolution of physical quantities characterising macroscopic objects is described by the equations of motion in classical mechanics. Conversely, the operators corresponding to the physical quantities characterising microscopic objects do not evolve with time but the states evolve according to Schrödinger equation in the Schrödinger picture of quantum mechanics. Thus, the relation between Schrödinger equation and classical equations of motions is nontrivial. However, it can be visualised easily by adopting the path integral formulation.
2.1.1. Relation between Schrödinger Equation and Hamilton-Jacobi Equation
It has been shown that the phase of a wave function obeys the classical Hamilton-Jacobi equation under certain assumptions [19] [20]. Let
be the position at time t of a particle moving in a real field
and
be its wave function, where
.
obeys the Schrödinger equation such that
(12)
If we separate the wave function
into its absolute value
and phase
, such that
(13)
then Equation (12) can be separated into its real and imaginary parts. The real part is given by
(14)
and the imaginary part is given by
(15)
Let us first consider the imaginary part in Equation (15). The right-hand side of Equation (15) multiplied by A yields
(16)
Assuming that the momentum p of the particle is
, then Equation (15) multiplied by A results in the continuity equation
(17)
where
is the velocity of the particle and
.
Furthermore, ignoring the terms of
, Equation (14) becomes
(18)
This is the classical Hamilton-Jacobi equation, assuming that S is the action and
is the momentum of the particle.
Reasonableness of these assumptions based on the similarity between quantum mechanics and optics has been discussed in several textbooks [21] [22].
2.1.2. Path Integral Quantisation
Note that in this section, we omit the subscript i in
for simplicity. Adopting the path integral quantisation method [23] makes the discussion in the previous section clearer. In this theory, the wave function
at time
can be described as evolving from the wave function
at time
, (
), such that
(19)
where
is the Lagrangian and
(20)
with
. Differentiating
with respect to
, we obtain
(21)
where
(22)
On the other hand, applying the Hamiltonian operator
, where
, on the wave function
in Equation (19), we obtain
(23)
where the terms of
or higher are ignored. We can see that
on the right-hand side of Equation (23) is the classical Hamiltonian function with
substituted by
on comparing Equation (23) with
(24)
and the Schrödinger equation
(25)
Substituting Equations (21) and (23) into Equation (25), we obtain the Hamilton-Jacobi equation
(26)
Then, classical mechanics states that
and
must satisfy Hamilton’s equations of motion:
(27)
(28)
As shown above, the classical equations of motion can be derived from the Schrödinger equation in the limit
without any analogy, if the path integral quantisation method is applied to the wave function. However, it is worth noting that the derived classical equations of motion contain no information about the initial state of the system because the information
has been lost in the above formalism; specifically, the discussion here is formal and hence does not reflect the state of the system. So far, all we have shown with the aid of quantum mechanics is that classical quantities obey classical equations of motion. Therefore, we still need to investigate the equations that the corresponding quantities in quantum mechanics (i.e. the expectation values of an operator) obey in the classical limit.
2.2. Ehrenfest Theorem
The Ehrenfest theorem [24] demonstrates how quantum mechanical expectation values obey the classical equations of motion. Although Ehrenfest [25] originally proposed the theorem in the Schrödinger picture, in this section we employ the Heisenberg picture. Note that because the operators corresponding to the physical quantities depend explicitly on time in the Heisenberg picture, the relation is rather straightforward. Let
and
be the expectation values of the coordinate
and momentum
, respectively; then, the equations obeyed by
and
in quantum mechanics should be the same as the classical Hamilton’s equations of motion under certain conditions.
Let us suppose that a quantum mechanical system is described by coordinates
and their respective conjugate momenta
. Operator
, which does not depend on time in the Schrödinger picture, evolves in the Heisenberg picture obeying
(29)
where
is the Hamiltonian operator and
represents a commutator. Because the commutator between
and
is
(30)
therefore,
satisfies
(31)
(32)
Next, substituting
with
and
in Equation (29) and using Equations (31) and (32), we obtain the evolution equations for the respective expectation values as
(33)
(34)
Furthermore, if we assume
(35)
(36)
where H is the classical Hamiltonian, we obtain
(37)
(38)
We note that the above equations are in the form of Hamilton’s equations of motion in classical mechanics.
Note that the assumptions in Equations (35) and (36) are justified if
is quadratic in
and
or if the dispersion of the measured
and
values are small. The dispersion is expected to be small for eigenstates and large for a superposition of states. Therefore, the superposition of states must evolve into eigenstates to ensure that the expectation values of quantum mechanical quantities obey the classical equations of motion.
3. Bell-Type Inequalities and the Kochen-Specker Theorem
In this section, we review Bell-type inequalities and the Kochen-Specker theorem. These theorems indicate that the world described by quantum mechanics is considerably different from the macroscopic world we are familiar with.
It has been experimentally confirmed [26] - [32] that quantum mechanical expectation values violate Bell-type inequalities, strongly suggesting that the quantum mechanical world is nonlocal or nonseparable. D’Espagnat [18] defined separability as follows: If a physical system remains, during a certain time, mechanically (including electromagnetically, etc.) isolated from other systems, then the evolution of its properties during this whole time interval cannot be influenced by operations carried out on other systems. Conversely, the Kochen-Specker theorem implies that quantum mechanics is counterfactually indefinite. Nevertheless, the meaning of counterfactual definiteness should be considered in relation to the nonlocality of quantum mechanics; specifically, the Kochen-Specker theorem forbids quantum mechanics to be locally counterfactually definite. In other words, quantum mechanics is not forbidden to be nonlocally counterfactually definite by Bell-type inequalities or the Kochen-Specker theorem. This fact is important to the consisitency of discussion in Subsection 4.2.
3.1. EPR-Bohm Experiment
First, we introduce the Einstein-Podolsky-Rosen (EPR)-Bohm experiment [33] [34], in which the spins of two spin 1/2 particles, labelled as 1 and 2 in Figure 1, are measured. The sum of their spins should be zero, and their initial state
can be written as
(39)
where
and
are the spin eigenstates in the z direction. These satisfy
(40)
(41)
where
is the spin operator in the z direction. The spin operator in the r direction, which is perpendicular to the direction of the particles’ motion (i.e. along x direction) and makes an angle
with the z direction in the yz-plane, is given by
(42)
The eigenstates of
are
and
, given by
(43)
which satisfy the following equations:
(44)
(45)
By using Equation (43),
can be rewritten as
(46)
Let us suppose that two observers simultaneously perform measurements on the spins of particles 1 and 2 along the directions z and r, respectively. Rewriting
as
(47)
we can easily calculate the probabilities of observing the spin combinations
,
,
, and
, which are given by
(48)
(49)
(50)
(51)
respectively.
It should be noted that these equations hold true for the measurements in which the angle between the spins of the two particles is
.
3.2. Bell’s Inequality and CHSH Inequality
3.2.1. Derivation
We examine the EPR-Bohm experiment in which the spin of particle 1 along directions a or a’ and the spin of particle 2 along directions b or b’ are measured. Let A, A’, B, and B’ be the measured values of the spin divided by
, respectively. In this section, we assume separability: the measurement of the spin of particle 1 never affects the measurement of the spin of particle 2 and vice versa. We also assume counterfactual definiteness: the spin value is determined separately before the measurement. In other words, each spin is assumed to have a definite value even if the measurement has not actually been performed. Thus, A, A’, B, and B’ assume a definite value +1 or −1 each.
Based on these assumptions, we define the quantity M as
(52)
Note that because one of the terms on the right-hand side of Equation (52) is always 0 and the other term is always +2 or −2, M is always +2 or −2. Therefore, the average value
obtained after several measurements of M satisfies
(53)
This is the CHSH inequality [35].
Next, we examine the case in which A, A’, B, and B’ are probabilistically determined by hidden local variables [2] [18] [36]. Let us suppose that the measured values depend on a set of hidden local variables represented by
. Then, A, A’, B, and B’ depend on
and assume a value of either +1 or −1 each. Let
be the probability that
has a value
for a given
. The average value
of
is then given by
(54)
Thus,
satisfies
(55)
Similar inequalities are satisfied by the average values of
, and
. The average value
of AB can be written as
(56)
where
is the probability distribution of
, and
satisfies
(57)
Again, similar inequalities are satisfied by the average values of
and
.
Next, let us consider
:
(58)
From inequality (57), it follows that the absolute value of Equation (58) satisfies
(59)
or
(60)
Note that the fact that the sum of the spins of particles 1 and 2 is zero when
has not been used in the derivation of inequality (60). If we take this into account and put
, then we can substitute
into inequality (60) to obtain
(61)
This was the first inequality proposed by Bell in 1964 [37].
3.2.2. Contradiction between CHSH Inequality and Quantum Mechanics
In this section, we show that quantum-mechanical expectation values violate the CHSH inequality (53).
We calculate the quantum-mechanical expectation value
of M defined in Equation (52) using Equations (48) to (51). Let
and
, and
be the angles that directions
, and
make with the z direction in the yz-plane, respectively. The expectation value of the first term of M then becomes
(62)
The expectation values of the remaining three terms of M can be similarly calculated. Thus,
can be written as
(63)
We calculate the maximum and minimum values of
using Equation (63) to check whether it satisfies the CHSH inequality.
can be extremised with respect to
, and
, thus yielding three critical conditions. Differentiating equation (63) with respect to
, we obtain
(64)
which yields
(65)
or
(66)
Differentiating with respect to
, we obtain
(67)
which yields
(68)
or
(69)
Finally, differentiating with respect to
, we obtain
(70)
which yields
(71)
or
(72)
, as per Equation (63), has a maximum or minimum value when Equations (65), (68), and (72) are satisfied, such that
(73)
(74)
Therefore, we can conclude that quantum-mechanical expectation values violate the CHSH inequality (53).
3.3. Wigner’s Inequality and Its Variations
Note that we again assume counterfactual definiteness and nonseparability in the derivation of the inequalities in this section.
3.3.1. Wigner’s Inequality
We re-examine the EPR-Bohm type experiment introduced in Section 3.1, where the spin was measured along two of the three directions: a,
and
. Let A and B be the definite values of the spin along a divided by
of particles 1 and 2, respectively. Similarly,
and
are the definite values along
, and
and
the definite values along
. Let
be the angle between a and
,
between
and
, and
between
and a. Moreover, let
be the probability that
and
, and other probabilities can be defined similarly.
Note that because the sum of the spins of particles 1 and 2 along the same direction is zero,
can be written as
(75)
where
is the probability that the definite values are
, and
, irrespective of whether they are measurable. Similarly,
(76)
and
(77)
Using these equations, we obtain
(78)
Recalling that probability cannot be less than zero, we obtain Wigner’s inequality [38], such that
(79)
Using Equation (48) to confirm that the quantum-mechanical expectation value violates Wigner’s inequality (79), we obtain
(80)
When
, inequality (80) becomes
(81)
Moreover, if we use
(82)
inequality (81) can be rewritten as
(83)
The above inequality is violated when
(84)
3.3.2. Mermin’s Inequality
Mermin [39] suggested a variation of Wigner’s inequality under the same conditions.
Mermin computed the probability that the spins of particles 1 and 2 measured in different directions have different signs. Let us consider separately the cases where 1) all of them have the same sign, that is,
or 2) two of them have the same sign and the other has a different sign. For example,
.
In case 1), it can be shown trivially that the probability of the spins of the two particles being different is 1. In case 2), the corresponding probability is 1/3. Therefore, the probability that the spins of particles 1 and 2 measured in different directions have different signs exists between 1 and 1/3. This can be expressed as
(85)
However, when all the angles between a,
and
are
, then
can be calculated using Equations (49) and (50), and becomes
(86)
which shows that quantum mechanical probabilities violate Mermin’s inequality.
3.3.3. Albert’s Inequality
Albert [40] [41] also derived an inequality under the same conditions as Wigner’s inequality.
Note that the number of combinations of
is 8 considering that
,
and
. These are
(87)
As can be easily checked, each of the above combinations satisfies at least one of the following three conditions:
and
. Therefore, the sum of the probabilities that one of these three conditions is satisfied is 1 or more. In other words, we can write
(88)
where at least one term on the right-hand side must be 1/6 or more. However, when all the angles between a,
and
are
, every term on the right-hand side of condition (88) is calculated to be 1/8 using Equations (49) and (50). This again shows that quantum mechanical expectation values violate Albert’s inequality (88).
3.4. Kochen-Specker Theorem
In the previous section, we derived a series of Bell-type inequalities assuming not only separability but also counterfactual definiteness. Because quantum mechanical expectation values violate these inequalities, it implies that quantum mechanics must be non-separable or counterfactually indefinite or both. Interestingly, the Kochen-Specker theorem [42] solely requires the assumption of counterfactual definiteness.
First, we define the function
of an operator
as follows:
is expanded using its eigenvalues
and the projection operators
on the respective states corresponding to
such that
(89)
Then,
can be defined as
(90)
Let us suppose that every physical quantity A has a definite value1 [A] belonging to the spectrum of its corresponding operator
. In this section, we introduce an assumption that we call “FUNC’’, which plays an important role in the derivation of the Kochen-Specker theorem. According to FUNC, if an operator
is defined as
, then the definite values [A] and [B] satisfy
.
Using FUNC, we derive a sum rule for the commuting operators
and
in an N-dimensional Hilbert space. We also define an operator
, whose eigenvectors include all the vectors that form an orthonormal basis by simultaneously diagonalising
and
.
can be expanded as
(91)
where
represents the eigenvalues of
and
represents the projection operators onto the eigenvectors corresponding to
. If functions f and g satisfy
and
, then
can be written as
(92)
where
(93)
Therefore, the definite values of the left- and right-hand sides of the above equation also satisfy
(94)
On the other hand, the sum of
and
becomes
(95)
Note that the second and fourth steps in Equation (95) are a consequence of FUNC. Thus, combining Equations (94) and (95) we obtain the sum rule
(96)
The Kochen-Specker theorem can be proved using Equation (96). Let
be the orthonormal eigenstates of a given physical quantity, and
(97)
be the projection operator onto
, which satsifies
(98)
where
is the identity operator. By applying the sum rule (given by Equation (96)) on Equation (98), we obtain
(99)
Note that because all definite values belong to the spectrum of their corresponding operators,
and
or 1. Thus, one
must be 1 and the remaining 0 to satisfy Equation (99). In summary, if it is possible to assign every physical quantity a definite value satisfying FUNC, then one of the projection operators onto its eigenstate must be assigned a definite value 1 and the remaining must be assigned a definite value 0. Kochen and Specker [42] proved that it is not possible to assign such definite values to all physical quantities.
We reconsider the assumption FUNC. In FUNC, it is not possible to know which physical quantities are being observed simultaneously. Although it may seem natural that the definite values satisfy FUNC, Bell [43] insisted that contextual definite values do not need to satisfy FUNC. This implies that the Kochen-Specker theorem does not deny the existence of contextual definite values. We can say that the spin in each direction has a contextual definite value in the EPR-Bohm experiment introduced in Section 3.1, provided the definite value of the spin in the r direction of particle 2 varies in accordance with the direction of the spin of particle 1, which was measured simultaneously. Furthermore, it is known that quantum mechanics must be nonlocal under such an assumption [44] [45].
3.5. Nonlocality and Weak Values
In view of the discussion in the previous sections, we need to check whether the counterfactual definiteness assumed during the derivations of the Bell-type inequalities is local. We consider the case in which the observed values are determined by local hidden variables, as shown in Equation (54). Note that Equation (58) is necessary for the derivation of CHSH inequality, and the density matrices for AB and AB’ must be the same to satisfy Equation (58). Therefore, we can conclude that the counterfactual definiteness assumed in this case is local. Thus, nonlocal counterfactual definiteness (i.e. existence of contextual definite values) is not forbidden in the above discussion. Conversely, if quantum mechanics is counterfactually indefinite, then we need to introduce nonlocality to explain the correlation between EPR pairs. Therefore, quantum mechanics must be nonlocal irrespective of whether it is counterfactually definite or indefinite, although it may seem difficult to appreciate this fact.
Weak measurements [46] clearly demonstrate the nonlocality of quantum mechanics. As confirmed by many experiments, the measured values in a weak measurement agree with the corresponding weak values. Nevertheless, the physical meaning of these concepts remained unclear for a long time. However, if we consider the nonlocality of quantum mechanics, then the physical meaning of weak measurements and weak values can be completely understood within the framework of a conventional quantum mechanical approach [47].
4. Decoherence as an Inherent Characteristic of Quantum Mechanics
As stated in the Introduction, most studies in this field have yielded a negative answer to the question regarding whether the measurement process can be explained within the framework of pure quantum mechanics. However, in this section, we demonstrate that the answer is affirmative, that is, we can indeed explain the measurement process within the framework of pure quantum mechanics. We propose a new decoherence theory, in which the uncertainty of microscopic objects gives rise to decoherence as an inherent characteristic of pure quantum mechanics. Note that because this decoherence exists prior to a measurement, the eigenstate-eigenvalue link can be maintained. Moreover, we do not intend to explain the nonlocality or counterfactual indefiniteness of quantum mechanics by means of other concepts, as it is beyond the scope of this work. However, we do attempt to illustrate how the measurement process can be understood within the scope of pure quantum mechanics.
We examine three experiments in this section. First, in Section 4.1, we examine a Stern-Gerlach-like experiment with an electron to illustrate our idea of decoherence. In Section 4.2, we apply our theory to an EPR pair of electrons and show that the correlation between spatially separated particles is not a result of wave packet collapse or similar processes. In Section 4.3, the double-slit experiment with electrons is examined to demonstrate the effectiveness of our theory for cases with continuous eigenvalues. This experiment also illuminates how pure quantum mechanics is able to demonstrate that electrons behave as interfering particles, that is, particles whose detection rate is consistent with interference.
4.1. Stern-Gerlach-Like Experiment
We examine a Stern-Gerlach-like experiment with an electron S whose spin is measured in the z direction, as shown in Figure 2. Note that
and
are the eigenstates of the electron with eigenvalues
and
, respectively. S initially travels along the x-axis and enters an inhomogeneous magnetic field along the z-axis, where a magnetic force acts on it. When S exits the magnetic field, its momentum in the z direction is −p if its spin is
and +p if the spin is
. We define the momentum eigenstates
and
with eigenvalues −p and +p, respectively. Furthermore,
is defined as the momentum eigenstate having 0 momentum. Note that because the spin and momentum operators along the same direction commute with each other, the state of S can be described as a simultaneous eigenstate of these two operators. We also define a unitary operator
that represents the interaction between S and the magnetic field, such that
(100)
(101)
![]()
Figure 2. Stern-Gerlach-like experiment.
We define the initial state
of S to be
(102)
The state
after the interaction between S and the magnetic field can be written as
(103)
When S reaches one of the detectors, the entire event is recorded. If we ignore the evolution of S’s state between the magnetic field and the detector, then
is the state of S just before detection. The naïve density matrix
of S is defined as
(104)
Note that
and
differ in physical contents, as explained in the following.
Next, we consider the uncertainty relation. Because we want to measure S’s momentum in the z direction, we must allow some uncertainty in its position in the same direction. To account for this uncertainty, we introduce the density matrix
of S translated to a distance
in the z direction, which is defined as
(105)
where
is the translation operator in the z direction.
satisfies the following equation
(106)
and is defined by
(107)
for the momentum operator
in the z direction.
Next, we define the average density matrix
of S as
(108)
where
is the uncertainty in S’s position along the z direction and from Equation (105) we can write
(109)
We want to know what is observed by the macroscopic detector; hence, we set
(110)
which leads to
(111)
Therefore, the average density matrix, which describes the state of S to be detected, loses its interference terms and becomes
(112)
The above equation demonstrates the decoherence prior to the measurement process.
Note that
is not a consequence of the nonunitary (or unitary) time evolution from the state
, but
is exactly the density matrix corresponding to
to be detected.
Note also that if condition (110) is weakened, then the interference terms in
would be retained to a certain extent. Performing the integral Equation in (108), we obtain
(113)
Because we do not adopt the projection postulate in this study, we observe in accordance with the eigenstate-eigenvalue link that the first term on the right-hand side of this equation does not contribute to the probability that S will be detected as a particle with momentum −p or +p in the z direction. Therefore, there is a
probability, which vanishes in the limit
, that S will be detected as a particle with momentum −p or +p in the z direction if we observe its position with an uncertainty
.
In studies that propose that the environment causes decoherence, an equation similar to Equation (112) is obtained by taking the partial trace of Equation (104). The state described by Equation (104) is a pure state; hence, the state described by the density matrix after the partial trace is an improper mixed state, which only provides the probability distribution of the outcome. By contrast, Equation (112) represents a proper mixed state and describes the state itself to be detected with the macroscopic detector. Therefore, we can conclude that the state to be detected is not given by Equation (103) but is either
or
. In this calculation, we have not used any additional postulates such as the projection postulate. Thus, it is worth noting that this decoherence is not a result of the interaction between S and the detector or other environmental factors, but is rather due to the uncertainty relation.
Note also that the density matrix of S is not always written as shown in Equation (112). If we want to measure another observable of S, then we need to take an average the density matrix over its conjugate observables. Consequently, we can obtain the average density matrix that has a diagonal form in this observable, as illustrated in the remainder of this section.
Let us suppose that an ideal macroscopic measuring device M that measures S’s energy is employed instead of the aforementioned detector. Furthermore, S is prepared in its neutral state
and is at either of the two energy levels
and
, in accordance with its spin, after it exits the magnetic field.
is a unitary operator that transforms
to
or
, which are the eigenstates whose eigenvalues are
and
, respectively. Thus, we have
(114)
(115)
The initial state of the unified system is represented by
, which is defined as
(116)
Then, the state of S just prior to detection by M can be written as
(117)
and its density matrix
is given by
(118)
Note that because we want to measure the energy of S, we require a time interval. Therefore, we define the density matrix
averaged over the measurement time interval
as
(119)
where
is the Hamiltonian density of S, which satisfies
(120)
(121)
Therefore, we can write
(122)
with
(123)
To obtain a macroscopic result, we set
(124)
which leads to
(125)
Therefore, the average density matrix in this case becomes
(126)
Thus, we observe that equation (126) is diagonal in S’s energy.
4.2. EPR-Bohm Experiment
In this section, we re-examine the EPR-Bohm experiment, which was briefly reviewed in Section 3.1. We adopt the same setup as in the previous section, where particles 1 and 2 enter inhomogeneous magnetic fields along the z and r directions, respectively. When the particles exit the magnetic fields, each particle gains a momentum in its respective direction of the magnetic field. Initially, neither particle has a momentum in the z or r direction. If we define the zero momentum state of the particles as
, then the state
before entering the magnetic field can be written as
(127)
where
is defined in Equation (39). The state
after interaction between the electrons and magnetic fields becomes
(128)
where the unitary operator
represents the interaction, and p and q are the momenta in the z and r directions, respectively. As discussed in the previous section, we must allow some uncertainty in the position of the particles along the corresponding directions. Therefore, the density matrix
that describes the state to be measured is defined as
(129)
where
is the density matrix of particle 1 translated to a distance
in the z direction and particle 2 translated to a distance
in the r direction. By using
(130)
(131)
where
and
are the momentum operators of particle 1 in the z direction and particle 2 in the r direction, respectively, and
can be written as
(132)
Similar to the previous section, because we want to know what is observed by the macroscopic detectors, we set
(133)
which leads to
(134)
(135)
Therefore, the interference terms disappear and the average density matrix takes a form similar to that in Equation (112):
(136)
Because Equation (136) describes a proper mixed state, the state to be detected is not a superposition but rather one of the following states:
,
,
, and
. It is worth noting that the correlation between the spins of the two particles should not be a result of the measurement process, as it is with a wave packet collapse, because Equation (136) represents the density matrix of the electrons prior to the measurement process. Therefore, we should not regard the EPR experiment as evidence for instantaneous propagation during a wave packet collapse.
Conversely, we need to consider the relation between the discussions in this and the previous section. Note that because we can obtain only one average density matrix for an observed pair of electrons, as discussed earlier, we do not need to assume that the state prior to the measurement possesses a definite value of the spin in each direction. In other words, the probable counterfactual definiteness is not local but contextual. As discussed in the previous section, contextual counterfactual definiteness is not forbidden by Bell-type inequalities or the Kochen-Specker theorem. The spin in either direction is not fixed in the initial state given by Equation (39), whereas the spin of each electron in the respective directions is fixed in state described by Equation (136). The transformation between Equations (39) and (136) is owing to the uncertainty relation. However, as stated in Section 1, we do not intend to explain the nonlocality of quantum mechanics by means of other concepts. We only attempt to illustrate how measurement processes can be understood within the scope of pure quantum mechanics in this study.
4.3. Double-Slit Experiment
In this section, we examine the double-slit experiment, where electrons are emitted from a source at certain time intervals (see Figure 3). The electrons travel along the x-axis through a double slit in the
plane and eventually arrive somewhere on the screen in the
plane. The slits are separated in the z direction such that the two slits are positioned at
and
. We define the eigenstate
of the z coordinate operator
as
(137)
Moreover, we define
as the state of the electron at
, and
and
as the states obtained after the electron passes through slits 1 and 2, respectively. Thus,
(138)
The wave function
is defined as
(139)
with the normalisation
(140)
The probability density that the electron is observed at
is given by
(141)
and
can be defined in the same way as in Equation (139), which gives
(142)
(143)
Then, we have
(144)
and the probability density
is written as
(145)
4.3.1. Decoherence on the Screen
In this section, the electrons are not assumed to be observed just after passing through the slits. Although the interference terms are included in the probability density
, each electron behaves as a particle on the screen. Therefore, we illustrate here that the density matrix of the electrons to be observed on the screen is not given by
but is rather proportional to
(146)
In contrast to the discussion in the previous sections, we need to allow some uncertainty in the momentum in the z direction because we want to know the position in this direction. Therefore, we define the average density matrix
that describes the state of the electron to be observed on the screen as
(147)
where
is defined as
(148)
with
(149)
is the operator that transforms the momentum and satisfies
(150)
where
is the eigenstate of the momentum in the z direction. Thus, we can write
(151)
In this experiment, we want to know where the electron impinges on the screen, therefore we set
(152)
which leads to
(153)
Using the above equation, we can introduce a delta-function in Equation (151) as follows:
(154)
such that Equation (151) becomes
(155)
Note that Equation (155) is the desired form of the density matrix. The state that Equation (155) describes is a proper mixture, and hence, we can predict that each electron will behave as a particle. However, because
is a probability density that includes the interference terms, the electrons form a striped interference pattern when they impinge on the screen.
4.3.2. Decoherence Near the Slits
Next, we examine the case in which the electrons are assumed to be observed near either of the slits, namely, at
. In this case,
(156)
which follows from the definitions of
and
. Therefore, equation (155) becomes
(157)
which shows that the interference terms have disappeared. If we now observe the electron on the screen again, its probability density is no longer given by
but rather by
.
4.4. Summary
We examined three fundamental experiments in this work. We studied the Stern-Gerlach-like experiment to explain how the uncertainty relation between the position and momentum of a particle introduces decoherence prior to a measurement. The EPR-Bohm experiment was considered to conclude that the correlation of the EPR pair is not a consequence of the instantaneous propagation during a wave function collapse. We also discussed its relation to the contextual counterfactual definiteness of quantum mechanics. Finally, we showed that our theory can be applied to experiments with continuous eigenvalues, such as the double-slit experiment with electrons. Thus, we demonstrated how pure quantum mechanics describes the fact that electrons behave as interfering particles, namely, particles whose detection rate is consistent with interference.
5. Conclusions
In this study, we demonstrated that decoherence is one of the inherent characteristics of pure quantum mechanics. Quantum states describe both the wave as well as particle behaviour of microscopic objects, specifically, the microscopic objects propagate as waves and are observed as particles. In this context, it is worth paying careful attention to the meaning of the phrase observing the wave nature of a state. It is equivalent to saying identifying the state propagating as a wave by observing many particles. As shown in Section 4.3, there is essentially no difference between observing the electrons on the screen and near the slit. Irrespective of where the electrons are observed, they are observed as particles, which helps us determine the amplitude of the corresponding state.
Therefore, we conclude that the quantum measurement process can be explained within the framework of pure quantum mechanics. Moreover, we believe that our study can be applied to a more general discussion about the quantum-to-classical transition in nature.
NOTES
1Note that in this subsection, definite values are denoted with [ ].