The Bell Inequality, Inviolable by Data Used Consistently with Its Derivation, Is Satisfied by Quantum Correlations Whose Probabilities Satisfy the Wigner Inequality
1. Introduction
Bell constructed an inequality from the computation of three cross-correlations of three simultaneously existing random variables [1] , and applied it to an experiment that produced only two variables per random realization. The properties of the third variable result from its mathematical definition. The purpose of the third variable was to provide insight into the random quantum mechanical state of entanglement, and evidence as to whether or not predicted correlations could be accounted for by random but causal variables. The mathematical construction uses correlations of readouts equaling ±1 (as detector clicks are labeled) that are functions of variables that might be reasonably assumed to be randomly occurring initial conditions. (The experimental schematic of classic Bell experiments is shown in Figure 1.) To analyze physical aspects of the situation, Bell postulated one measurement on the left side of the apparatus and two measurements at mutually exclusive settings on the right side [2] . However, only one particle is produced on each side, and each is destroyed by observation. A key question immediately arises: how is the inequality to be applied to the experimental situation that it has been designed to address?
There are only two ways known to the author to increase the number of experimental variables from two to three to enable application of the Bell inequality. One way is to add a polarizer and two detectors in each polarizer output on one side in Figure 1. A final detection then reveals which of the four possible paths on that side of the apparatus was taken using an analysis known as retrodiction. In this case the probability of the last path segment depends conditionally on the random selection of the preceding path segment, and the resulting three correlations are unequal [3] . This is described by the quantum principle that two successive polarization measurements on a photon are non-commutative.
Bell specifically rejected the above choice of realization of variables for application of his inequality [2], and stated that the second random readout at a different polarizer setting was to be realized as the readout that would have occurred in the same random trial if that alternate setting had been used in place of the first setting: “the ‘spin’ of each particle is measured once only.” Thus, the construction of the three random-variable Bell inequality, based on identical hidden variables for each of three observations, one of which is unperformed, is inapplicable to experimental data, since one cannot undo results at one instrument setting in a random trial to obtain results that would have occurred at a different instrument setting.
Figure 1. Schematic of Bell experiment in which a source sends two particles (photons most often used) to two detectors having angular settings $\theta_A$ and $\theta_B$ (denoted as a and b in Bell’s notation) and alternative settings $\theta_A'$ and $\theta_B'$. While one measurement operation on the A-side, e.g. at setting $\theta_A$, commutes with one on the B-side at $\theta_B$, any additional measurements at either $\theta_A'$ or $\theta_B'$ are non-commutative with prior measurements at $\theta_A$ and $\theta_B$, respectively. This figure was drawn by the author and modified in notation for use in Ref. 4, as well as other papers.
However, it will be shown in Section 2 that the inequality that Bell derived holds for physical data under far more general conditions than Bell originally realized [4] [5] , and in a form leading to its satisfaction under realizable experimental conditions. (This is also true for the four variable version of the inequality.) Given that three data sets, random or deterministic, have been experimentally or mathematically obtained, so as to be writeable on paper, the same algebraic steps that Bell used to derive his statistical inequality may be applied to them. The Bell inequality again results, but is now identically satisfied by cross-correlations of three finite sets of data with all items equal to ±1. Thus, if applied to actual physical data for three variables, the Bell inequality is not statistical or probabilistic, but is an identity-inequality of algebra that may then be applied to random or deterministic situations. (This same generally unrecognized fact holds for the four variable inequality.) The correlation estimates that result when applying the inequality to three data sets of ±1’s may now statistically converge to three different functional forms. It is these functional forms that are experimentally verifiable, and not whether the identity-inequality is satisfied, since it must be satisfied by any three data sets. Finally, since the data are random in the case of Bell experiments, the resulting correlations may be expressed in terms of the probabilities that give rise to them, yielding an inequality in probabilities, the Wigner inequality, that must also now be satisfied. This inequality is often presented as logically independent of the Bell inequality but this cannot be so when both are applied to the same physical situation.
Given that the general form of the Bell inequality is purely algebraic involving three physical data sets, and that it must be identically satisfied by such data sets, the remaining problem is to create a procedure for acquiring appropriate data from Bell experiments that create data in pairs. Since the inequality under Bell’s assumptions cannot be experimentally realized, a procedure must be found that provides data at two mutually exclusive settings for a B-side particle in Figure 1 corresponding to two fixed outputs on the A-side. Such data may be acquired by conducting experimental runs [6] at $b$ and $b'$ while using fixed setting $a$ to allow two B-side data sets to be obtained for each output at setting $a$ on the A-side. The data pairs must be shifted so that the $a$-outputs from the two runs match. Three data sets are then obtained since the data sequences at $a$ from the two runs are identical after shifting. The two experimental trials at mutually exclusive settings of B are now probabilistically independent except for their conditional dependence on a common output at A. (This situation is known as “conditional independence” in mathematics [7].) The two cross-correlated observations on the B-side now have a different form than their correlation with the output at A. The number of data sets now equals the number of variables in Bell’s inequality derivation. The data are correlated precisely as Bell’s original variables, but the hidden variables for each correlation (if assumed to exist) must now be different since they occur in different trials.
To understand key aspects of this more simply, consider flipping a loaded coin with load 1. After a given flip, one could ask the equivalent of Bell’s question: suppose the load had been load 2 on that same flip, what would the outcome have been? This is clearly an unanswerable question in the case of causal random variables as assumed in Bell’s representation, unless they are all under total control and can be fully analyzed, as is unrealistic in random experiments. The common way of addressing this situation is to flip the coin a large number of times with each loading and determine the probabilities of heads and tails for the two cases. It should be observed that in the case of continuous variables leading to a pair of discrete events, a range of variable values leads to the same outcome, but a change in any decimal place in a variable at the boundary between outcomes could lead to a change in outcomes. Thus, in a world of finite precision instruments, causality does not necessarily imply predictability.
The incorrect assumption by Bell, in the absence of actual calculation, that the third correlation in his statistical inequality had the same functional form as the first two in either theoretical or experimental realization has no logical basis, and has been a fundamental source of confusion for more than fifty years. (The same problems discussed above affect the four variable inequality but are necessarily more complex to deal with.) This simplifying assumption, inconsistent with the derivation of the inequality, has led experimentalists to compute correlations in individual pairs. In the case of a wide-sense spatially stationary process [8] for which any number of variables may be measured and all pairs have a correlation of the same form, the Bell inequality would not be violated. This special form of stochastic process, commonly used in optics as an approximation, does not hold in the Bell experiment case.
2. The Constant in Bell’s Inequality in Three Variables Results from the Use of Three Variables
The facts underlying the above discussion will now be exhibited in detail beginning with a review of Bell’s derivation of the inequality. Bell considered the output of an ideal source of entangled spins. In experiments, photons have been used in place of spins due to the development of very efficient Bell-correlated photon sources [9] . From the experimental setups (see Figure 1) two photons at a time emerge traveling in different directions. Given Bell’s rejection of added polarizers and detectors that would allow obtaining three measurements at once, only two polarization paths at a time through the apparatus may be obtained.
Measurements on the A-side of the apparatus at angular setting $a$ are represented in Bell’s notation by the function $A(a,\lambda)$, and on the B-side at angular setting $b$ by $B(b,\lambda)$. The variables $\lambda$ are random variables with a probability density $\rho(\lambda)$ assumed to determine the results of measurements represented by functions A and B, postulated to be deterministic. The random results of measurements might then be interpreted causally as due to uncontrolled initial conditions sampled randomly. The detector clicks associated with variables (polarization directions in the photon case) A and B are assigned label-values equal to ±1 with the additional requirement $A(a,\lambda) = -B(a,\lambda)$, so that required measurement outcomes resulting from entanglement are fulfilled [1].
To obtain additional conditions on the correlations of readouts, Bell assumed an output at one setting on the A side of the apparatus and corresponding outputs at two alternative settings on the B-side using the same hidden variable values for each. Since only one measurement is to be performed per side, given Bell’s prescription, two experimental realizations of photons must ultimately be used to obtain such data but that would imply different values for the assumed hidden variables in the two realizations. Thus, correlations satisfying Bell’s conditions are physically unrealizable. (A more general derivation of the inequality below that is not based on a specific representation of hidden variables eliminates this difficulty.) In Bell’s derivation the correlation of an A-side with a B-side measurement is given by
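$$C(a,b) = \int \rho(\lambda)\, A(a,\lambda)\, B(b,\lambda)\, d\lambda. \tag{2.1}$$
The difference between two such correlations is
$$\begin{aligned} C(a,b) - C(a,b') &= \int \rho(\lambda)\,\bigl[A(a,\lambda)B(b,\lambda) - A(a,\lambda)B(b',\lambda)\bigr]\, d\lambda \\ &= \int \rho(\lambda)\, A(a,\lambda)B(b,\lambda)\,\bigl[1 - B(b,\lambda)B(b',\lambda)\bigr]\, d\lambda, \\ \bigl|C(a,b) - C(a,b')\bigr| &\le \int \rho(\lambda)\,\bigl[1 - B(b,\lambda)B(b',\lambda)\bigr]\, d\lambda = 1 - C(b,b'), \end{aligned} \tag{2.2}$$
where $C(b,b')$ denotes the correlation of the two B-side readouts, and $b'$ is used for the second variable consistent with Bell’s explicit prescription that two alternative measurements are theoretically considered as occurring on one side of the inequality [2]. The final correlation follows from the fact that
$$\bigl[A(a,\lambda)B(b,\lambda)\bigr]\,\bigl[A(a,\lambda)B(b',\lambda)\bigr] = B(b,\lambda)B(b',\lambda), \tag{2.3}$$
since $A(a,\lambda)^2 = 1$. (A similar result holds for the fourth correlation in the four variables inequality [10].) Thus, the third correlation is obtained from the variable values occurring in the first two correlations $C(a,b)$ and $C(a,b')$. This widely unrecognized but crucial fact has recently also been noted by Hess [11] in a review of Bell’s theorem and related topics. Hess found that it was also established in a random variables context by Vorobev [12]. Closely related results have been obtained in [13] [14], and possibly others, though they are little known to the community at large.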
While in the last line of (2.2) $C(b,b')$ superficially appears to be the result of an independent measurement process and to have the same dependence on hidden variables as do the other correlations, its computation in fact reuses the data from the previous correlations, data pair by data pair. Thus, it would not be surprising if the correlation $C(b,b')$ had a different functional form from the others even under Bell’s theoretical construction. If a result at $b$ is obtained and continuous causal variables are involved in its outcome, then given that the ranges of those variables are now restricted due to causing that particular outcome, possible outcomes at $b'$ are now affected. Thus, a result at $b'$ would be conditionally dependent in probability on an outcome at $b$ as well as $a$, and this does not appear to have been considered, given Bell’s notation. While such results may be computed, based on assumed mathematical models of hidden variables, they are still un-measurable unless the variables can be physically controlled with mathematical precision. The author proposes an experimental resolution of this problem, based on a more general form of Bell’s inequality now to be given.
The possible laboratory realization of (2.2) is conceptually facilitated if one considers that to compute Bell’s desired correlation functions, three data sets must somehow be acquired and written on paper, one data set for each of Bell’s variables. (Outputs occur at settings $a$, $b$, and $b'$, with that at $b'$ ultimately recorded on the opposite side at $a' = b'$, thus resulting in a minus sign before the final correlation as in Bell’s version due to the requirement of entanglement.) Note that while correlations can be directly measured in some optical experiments, they are not measured in Bell experiments. Only detector clicks are observed from which correlations are later computed. Assume that the data sets, random or deterministic, are labeled by instrument settings $a$, $b$, and $b'$. The corresponding data set items are denoted by subscripted variables $a_i$, $b_i$, and $b'_i$ with N items in each set. Each datum equals ±1. The data actualization of (2.2) begins with
$$a_i b_i - a_i b'_i = a_i b_i\,(1 - b_i b'_i), \tag{2.4}$$
where the unusual factorization holds only for the specific values of the variables considered, each equal to ±1. Summing (2.4) over i from 1 to N, dividing by N, and taking absolute values of both sides yields
$$\left|\frac{1}{N}\sum_{i=1}^{N} a_i b_i - \frac{1}{N}\sum_{i=1}^{N} a_i b'_i\right| \le \frac{1}{N}\sum_{i=1}^{N} |a_i b_i|\,(1 - b_i b'_i) = 1 - \frac{1}{N}\sum_{i=1}^{N} b_i b'_i, \tag{2.5}$$
the same result as (2.2) but now seen to hold for correlation estimates using any finite data sets. Further, it is independent of the assumption of hidden variables on which it is commonly thought to depend. Note that (2.5) does not depend on whether the processes yielding the data are local, nonlocal, random or deterministic: no assumptions have been made regarding the physical character of the data other than that each item equals ±1, and all three data sets are available to compute the cross-correlations. Thus, result (2.5) holds even if there is nonlocal “pickup” between detectors so that the B setting affects A, etc. Note that the correlation estimates may converge to three different functional forms and satisfy (2.5). Thus, the resulting inequality is identically satisfied if Bell’s mathematical steps are applied to actual physical data, and just as in (2.3), data pair by data pair,
$$(a_i b_i)(a_i b'_i) = b_i b'_i,$$
so that the final correlation is determined by reused data from the first two correlations. The third correlation does not result from an independent data acquisition process.
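The identity character of the three-variable inequality can be checked numerically. The following is a minimal sketch, not part of the original derivation: it tests |Σ a_i b_i − Σ a_i b'_i| ≤ N − Σ b_i b'_i for every possible triple of short ±1 data sets and for one long triple whose items are deliberately made dependent on one another. The function names, the seed, and the dependence rules used to generate the long data sets are illustrative assumptions.

```python
import itertools
import random

def pair_sum(x, y):
    """Sum of element-wise products of two equal-length sequences of +/-1."""
    return sum(p * q for p, q in zip(x, y))

def inequality_gap(a, b, b_alt):
    """N - sum(b*b') - |sum(a*b) - sum(a*b')|, kept in integers to avoid rounding.

    Dividing by N gives (right side - left side) of the inequality in
    correlation estimates, so a non-negative value means it is satisfied.
    """
    n = len(a)
    return n - pair_sum(b, b_alt) - abs(pair_sum(a, b) - pair_sum(a, b_alt))

# Exhaustive check over every triple of data sets of length 3 (8**3 = 512 triples).
sequences = list(itertools.product((-1, 1), repeat=3))
min_gap_exhaustive = min(
    inequality_gap(a, b, c) for a in sequences for b in sequences for c in sequences
)

# Spot check with long data sets that are deliberately NOT independent
# (hypothetical dependence rules): b copies a most of the time, and b_alt
# depends on both a and b.
rng = random.Random(0)  # arbitrary seed, for repeatability only
n_items = 10_000
a_data = [rng.choice((-1, 1)) for _ in range(n_items)]
b_data = [x if rng.random() < 0.9 else -x for x in a_data]
b_alt_data = [x * y if rng.random() < 0.7 else -(x * y) for x, y in zip(a_data, b_data)]
gap_dependent = inequality_gap(a_data, b_data, b_alt_data)

print(min_gap_exhaustive, gap_dependent >= 0)
```

The gap is never negative in either check, and the exhaustive minimum is 0 (reached, for example, when all three data sets are identical), whatever rule couples the three data sets.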
Under casual examination, the last line of (2.2) might suggest that the variables used to compute the three correlations may be measured and correlated in three separate variable pairs, since their dependence on hidden variables is the same in the final correlation integrals. However, the physical procedure used to obtain the data indicates a different conclusion. The constant in the inequality arises from the fact that the same value of $a_i$ multiplies $b_i$ and $b'_i$, the same value of $b_i$ multiplies $a_i$ and $b'_i$, and the same value of $b'_i$ multiplies $a_i$ and $b_i$. Thus, the three variable values must all be available at the time the cross-correlations are calculated. (A similar algebraic condition holds in the case of the four variable inequality that is also easily shown to be an inequality-identity.) While the variables are symmetrically treated in an algebraic sense, their condition of physical acquisition is quite asymmetric and, as will be seen, results in different functional forms for the correlations.
Failure to consider the effects of the measurement procedure on the algebraic relations between variables has led to the processing of measurements for correlations in individual pairs [15], and thus mathematical inconsistency [16] [17] with inequality derivation. Further, following Bell, the third correlation has been assumed to have the same form as the first two on which it depends. The unstated assumption is that the process involved is characterized by a kind of spatial stationarity: correlations of all pairs of variables have the same form and depend on coordinate angular differences. Such an assumption is inconsistent with the fact that the functional form of the correlations depends on how the measurements are obtained due to their being non-commutative. The result is that if the three Bell correlations all had the same functional form in spite of the constraint imposed on the third correlation by the inequality, that form would be different from the cosine form.
Given the difficulty of applying a three variable inequality to a random process producing two variables per realization, the mathematical conditions under which the inequality holds have in practice been neglected. Six random variables have been acquired in three random trials to obtain three independent correlations, whereas the three variable inequality, as demonstrated above in two derivations, depends on the use of three cross-correlated random variables to obtain three correlations.
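The difference between six variables from three separate trials and three cross-correlated variables can be made concrete with a small constructed example. The sketch below is illustrative only: the data values are invented by hand, and the variable names are assumptions. Three separately acquired pairs of data sets can give correlation estimates that break the inequality, whereas three shared data sets cannot.

```python
def correlation(x, y):
    """Average of element-wise products of two equal-length +/-1 sequences."""
    return sum(p * q for p, q in zip(x, y)) / len(x)

def satisfies_inequality(c_ab, c_ab_alt, c_bb_alt):
    """Three-variable inequality |C(a,b) - C(a,b')| <= 1 - C(b,b')."""
    return abs(c_ab - c_ab_alt) <= 1 - c_bb_alt

# Six hand-made data sets from three separate "runs" (hypothetical values).
run1_a, run1_b = [1, -1, 1, 1], [1, -1, 1, 1]            # correlation +1
run2_a, run2_b_alt = [1, 1, -1, 1], [-1, -1, 1, -1]      # correlation -1
run3_b, run3_b_alt = [1, -1, -1, 1], [1, -1, -1, 1]      # correlation +1

# Correlations taken from three independent pairs: nothing ties them together.
separate_ok = satisfies_inequality(
    correlation(run1_a, run1_b),
    correlation(run2_a, run2_b_alt),
    correlation(run3_b, run3_b_alt),
)

# Correlations taken from only three data sets, each reused in two products.
a, b, b_alt = run1_a, run1_b, run2_b_alt
shared_ok = satisfies_inequality(
    correlation(a, b), correlation(a, b_alt), correlation(b, b_alt)
)

print(separate_ok, shared_ok)
```

With these values the separate-pair correlations are +1, −1 and +1, so the left side is 2 and the right side is 0, while the three shared data sets give 1, 0 and 0 and the inequality holds with equality.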
3. How Can Bell’s Inequality Be Applied to Experiments?
Bell constructed (2.2) to apply to results of experiments schematized in Figure 1. This originally pertained to entangled spins for which plus and minus values corresponded to vector components on the z-axis. In the optical case, plus and minus values correspond to labels applied to counts having wave polarizations along vertical and horizontal axes, respectively. A factor of 2 then multiplies the angular difference in the arguments of the correlations compared with the spin case. The formalism in the two cases is close to interchangeable. However, it is the optical case that has become commonly used in practice due to the high efficiency of the sources. The formalism used below describes the spin case originally treated by Bell, while the verbiage corresponds to the experimental optical case.
The correlations $C(a,b)$ and $C(a,b')$ require data at mutually exclusive settings $b$ and $b'$ of a polarization beam splitter. Two separate experiments are required to obtain such data. To apply (2.2) requires that the setting $a$ be the same on each trial, and that outputs at $b$ and $b'$ be recorded for each output value at setting $a$. Conditions for applicability of the outcomes of (2.2) and (2.5) are then satisfied, and one finds that the resulting correlation $C(b,b')$ has a different form from that of $C(a,b)$ and $C(a,b')$, a fact not recognized by Bell, or later by experimentalists.
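The two-run procedure can be sketched as a simulation. This is one possible reading of the matching step, under stated assumptions: outputs follow the standard spin-singlet statistics, in which the B-side output equals the A-side output with probability sin²((a − b)/2), and trials from the two runs are paired in order of occurrence within each A-output value. The function names, angles, seed, and number of trials are illustrative choices, not taken from the text.

```python
import math
import random

def simulate_run(a, b, n_trials, rng):
    """Return (A output, B output) pairs of +/-1 for one run at settings a, b.

    Assumed singlet statistics: P(B output == A output) = sin^2((a - b) / 2).
    """
    p_equal = math.sin((a - b) / 2) ** 2
    pairs = []
    for _ in range(n_trials):
        a_out = rng.choice((-1, 1))
        b_out = a_out if rng.random() < p_equal else -a_out
        pairs.append((a_out, b_out))
    return pairs

def match_on_a_output(run_b, run_b_alt):
    """Pair trials of the two runs so that their A-side outputs coincide."""
    triples = []
    for sign in (1, -1):
        b_vals = [b for a_out, b in run_b if a_out == sign]
        b_alt_vals = [b for a_out, b in run_b_alt if a_out == sign]
        # zip stops at the shorter list, so unmatched trials are dropped.
        triples.extend((sign, b, b_alt) for b, b_alt in zip(b_vals, b_alt_vals))
    return triples

def correlation(xs, ys):
    return sum(x * y for x, y in zip(xs, ys)) / len(xs)

rng = random.Random(1)  # arbitrary seed, for repeatability only
a, b, b_alt = 0.0, math.pi / 3, 2 * math.pi / 3
triples = match_on_a_output(
    simulate_run(a, b, 100_000, rng), simulate_run(a, b_alt, 100_000, rng)
)
a_out = [t[0] for t in triples]
b_out = [t[1] for t in triples]
b_alt_out = [t[2] for t in triples]

c_ab = correlation(a_out, b_out)
c_ab_alt = correlation(a_out, b_alt_out)
c_bb_alt = correlation(b_out, b_alt_out)
expected_c_bb_alt = math.cos(a - b) * math.cos(a - b_alt)
gap = (1 - c_bb_alt) - abs(c_ab - c_ab_alt)
print(c_ab, c_ab_alt, c_bb_alt, expected_c_bb_alt, gap)
```

In this sketch the two A–B estimates approach −cos(a − b) and −cos(a − b'), the B–B estimate approaches the product cos(a − b)cos(a − b') rather than a cosine of b − b', and the gap between the two sides of the inequality stays non-negative.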
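The correlation $C(a,b)$ is easily derived using well known quantum mechanical probabilities resulting from entanglement of spins [17]. (This results in the angular difference being divided by 2 in (3.1) below compared to the photon case.) The subscripted pluses and minuses of the probabilities below indicate ±1 outputs at instrument settings $a$ and $b$,
$$P_{++}(a,b) = P_{--}(a,b) = \tfrac{1}{2}\sin^2\frac{a-b}{2}, \qquad P_{+-}(a,b) = P_{-+}(a,b) = \tfrac{1}{2}\cos^2\frac{a-b}{2}. \tag{3.1}$$
From these probabilities
$$C(a,b) = P_{++} + P_{--} - P_{+-} - P_{-+} = \sin^2\frac{a-b}{2} - \cos^2\frac{a-b}{2} = -\cos(a-b), \tag{3.2}$$
where $C(a,b')$ follows immediately by replacing $b$ with $b'$.
Computation of $C(b,b')$ requires the conditional probabilities $P(b_\pm \mid a_\pm)$ and $P(b'_\pm \mid a_\pm)$ obtainable from joint probabilities (3.1) already written in terms of conditional probabilities resulting from entanglement since $P_{++}(a,b) = P(b_+ \mid a_+)\,P(a_+)$ with $P(a_+) = 1/2$. The data from separate runs at independent settings $b$ and $b'$ are correlated and conditionally dependent on each of the outputs of ±1 at $a$. Thus, for the +1 case,
$$C(b,b' \mid a_+) = \sum_{j,k} b_j\, b'_k\, P(b_j, b'_k \mid a_+), \tag{3.3}$$
where the subscripted variables equal plus or minus 1, and in the notation of (3.3), $a_+$ is indicated as having output +1. Since the probabilities describe outputs in independent trials and at different variable settings, the probability $P(b_j, b'_k \mid a_+)$ factors [7]. Thus,
$$C(b,b' \mid a_+) = \Bigl[\sum_{j} b_j\, P(b_j \mid a_+)\Bigr]\Bigl[\sum_{k} b'_k\, P(b'_k \mid a_+)\Bigr]. \tag{3.4}$$
Using the conditional probabilities obtained from (3.1), one obtains
$$\sum_{j} b_j\, P(b_j \mid a_+) = \sin^2\frac{a-b}{2} - \cos^2\frac{a-b}{2} = -\cos(a-b), \tag{3.5}$$
and from (3.4),
$$C(b,b' \mid a_+) = \cos(a-b)\cos(a-b'). \tag{3.6}$$
The same result is obtained for $C(b,b' \mid a_-)$ so that [18]
$$C(b,b') = \cos(a-b)\cos(a-b'). \tag{3.7}$$
Inserting (3.7) and Bell correlations into (2.2) yields:
$$\bigl|-\cos(a-b) + \cos(a-b')\bigr| \le 1 - \cos(a-b)\cos(a-b'). \tag{3.8}$$
Thus, Bell’s inequality is satisfied.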
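The satisfaction of the inequality can be checked over a grid of settings. The sketch below assumes the spin-case correlations C(a,b) = −cos(a − b) and C(a,b') = −cos(a − b'), and compares two candidate forms for the third correlation: the conditionally independent product cos(a − b)cos(a − b'), and, for contrast, a form cos(b − b') that the third correlation would take if it were assumed to share the cosine form of the other two. The function names and the 10-degree grid are illustrative assumptions.

```python
import math

def c_ab(a, b):
    """Assumed A-B correlation for the spin case."""
    return -math.cos(a - b)

def gap(a, b, b_alt, third_correlation):
    """(1 - C(b,b')) - |C(a,b) - C(a,b')|; non-negative when the inequality holds."""
    return (1 - third_correlation) - abs(c_ab(a, b) - c_ab(a, b_alt))

grid = [k * math.pi / 18 for k in range(36)]  # settings in 10-degree steps

# Third correlation from conditionally independent runs sharing the A output.
min_gap_conditional = min(
    gap(a, b, bp, math.cos(a - b) * math.cos(a - bp))
    for a in grid for b in grid for bp in grid
)

# Third correlation assumed to have the same cosine form as the other two.
min_gap_same_form = min(
    gap(a, b, bp, math.cos(b - bp))
    for a in grid for b in grid for bp in grid
)

print(min_gap_conditional, min_gap_same_form)
```

For the product form the gap equals 1 − xy − |x − y| with x = cos(a − b) and y = cos(a − b'), which is non-negative because (1 − xy)² − (x − y)² = (1 − x²)(1 − y²) ≥ 0. For the same-form case the gap becomes negative, for example at a = 0°, b = 60°, b' = 120°.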
4. The Wigner Inequality in Probabilities Results from the Bell Inequality When Both Apply to the Same Physical Situation
4.1. Bell Variable Re-Definition
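Two of the three variables in (2.4) have been given labels $b_i$ and $b'_i$ since as Bell indicated, they represent values of alternative measurements on the right-hand B-side of a Bell experiment apparatus (Figure 1). However, the final term in (2.5) may be written
$$\frac{1}{N}\sum_{i=1}^{N} b_i b'_i = -\frac{1}{N}\sum_{i=1}^{N} a'_i b_i, \tag{4.1}$$
where $a'_i = -b'_i$, for detector settings $a'$ and $b'$ that are equal but on opposite sides of the apparatus. (Given the properties of entanglement, measurements at the same angular settings on opposite sides of a Bell apparatus have opposite signs, i.e., $A(a,\lambda) = -B(a,\lambda)$ and $A(a',\lambda) = -B(a',\lambda)$. The above sign change is also achieved in (2.2) by replacing $B(b',\lambda)$ with $-A(b',\lambda)$.) Thus, using Bell measurements in (2.5) with the $b'$ variable in the right-most term replaced by an equal setting on the opposite side, one has
$$\left|\frac{1}{N}\sum_{i=1}^{N} a_i b_i - \frac{1}{N}\sum_{i=1}^{N} a_i b'_i\right| \le 1 + \frac{1}{N}\sum_{i=1}^{N} a'_i b_i, \tag{4.2}$$
and
$$\bigl|C(a,b) - C(a,b')\bigr| \le 1 + C(a',b), \tag{4.3}$$
assuming convergence of the correlation estimates as $N \to \infty$.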
4.2. Probabilities Corresponding to the Correlations
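Since probabilities must exist that determine the correlations between each of the variable pairs in (4.3), the correlations may be written in terms of those probabilities. The notation to be used for the probabilities is $P_{xy}(a,b)$, where x and y equal +1 or −1 and indicate the outputs at instrument setting angles $a$ and $b$ respectively. In the quantum mechanical case after assuming that (perfectly) entangled particle pairs produce the measurements, the probabilities are symmetrical as in (3.1) so that
$$P_{++}(a,b) = P_{--}(a,b) \quad\text{and}\quad P_{+-}(a,b) = P_{-+}(a,b), \tag{4.4a}$$
with normalization condition
$$2P_{++}(a,b) + 2P_{+-}(a,b) = 1. \tag{4.4b}$$
Then
$$C(a,b) = P_{++}(a,b) + P_{--}(a,b) - P_{+-}(a,b) - P_{-+}(a,b) = 2P_{++}(a,b) - 2P_{+-}(a,b), \tag{4.4c}$$
so that
$$C(a,b) = 4P_{++}(a,b) - 1 = 1 - 4P_{+-}(a,b). \tag{4.4d}$$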
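The passage from symmetric, normalized probabilities to a correlation can be illustrated numerically. The sketch below assumes the standard spin-singlet joint probabilities, ½ sin²((a − b)/2) for equal outputs and ½ cos²((a − b)/2) for opposite outputs, and checks that the correlation computed directly from the four probabilities agrees with the expressions 1 − 4P₊₋ and 4P₊₊ − 1 that follow from symmetry and normalization. The function names and the sample angles are illustrative assumptions.

```python
import math

def singlet_probabilities(a, b):
    """Joint probabilities keyed by (A output, B output), assumed singlet form."""
    half = (a - b) / 2
    p_same = 0.5 * math.sin(half) ** 2   # outputs equal: (+,+) or (-,-)
    p_diff = 0.5 * math.cos(half) ** 2   # outputs opposite: (+,-) or (-,+)
    return {(1, 1): p_same, (-1, -1): p_same, (1, -1): p_diff, (-1, 1): p_diff}

def correlation_from_probabilities(probs):
    """Expectation of the product of the two +/-1 outputs."""
    return sum(x * y * p for (x, y), p in probs.items())

a, b = 0.3, 1.1  # arbitrary sample settings in radians
probs = singlet_probabilities(a, b)
total = sum(probs.values())
c_direct = correlation_from_probabilities(probs)
c_from_p_diff = 1 - 4 * probs[(1, -1)]
c_from_p_same = 4 * probs[(1, 1)] - 1
print(total, c_direct, c_from_p_diff, c_from_p_same, -math.cos(a - b))
```

All three expressions for the correlation agree with −cos(a − b), and the four probabilities sum to 1.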
The result for
is similar since it will immediately be shown that
and
. (4.4e)
This follows from (3.1):
(4.5a)
with an equal result for
. Similarly,
equals
(4.5b)
with an equal result for
leading to normalization
. Then as in (4.4d):
(4.5c)
Inserting results (4.4d) and (4.5c) into the Bell inequality yields the Wigner inequality [19] [20]
(4.6)
The last step follows from the use of the variable
(
) on the opposite side of the apparatus from
to reverse the sign of the output due to entanglement, i.e.,
. The Wigner inequality follows:
(4.7)
The Wigner inequality is usually derived from purely probability assumptions [19] based on entanglement that do not address the fact that the right-side probability is conditional on the left-hand side outcomes, given the conditions necessary to obtain two independent measurements on one side of the apparatus. The probability calculations to obtain the Wigner inequality ultimately result from the operations necessary to obtain a physical realization of Bell’s variables.
5. Quantum Mechanical Probabilities Satisfy the Wigner Inequality
It now must be shown that the probabilities required by the experimental conditions for applicability of Inequality (4.6) satisfy the inequality. From (3.1)
(5.1a)
and
(5.1b)
From (4.5b)
(5.2)
Note that
.
Inserting (5.1a) (and its equivalent for
) together with (5.2) in (4.7) one obtains:
(5.3)
Thus, the Wigner inequality is satisfied by quantum probabilities corresponding to a performable Bell experiment.
6. Computational Counter-Example
It has been shown above that the Bell inequality is satisfied by any three laboratory data sets without the assumptions of locality and hidden variables commonly thought to be necessary to construct it. It holds immediately for any three data sets that can be written on paper. That does not in itself, however, imply that independent random processes using common boundary conditions can produce Bell correlations. Nevertheless, a number of researchers have claimed achievement of such derivations [16] . (Bell correlations have also been computed based on very small information transfer from detector A to detector B [21] .) The articles referred to are not physical derivations of the correlations, which are effectively limited by our unsettled understanding of photons and their relation to electromagnetic waves. However, derivations of Bell correlations based on more physical principles have also been given [22] [23] .
It thus seems fitting to conclude this logical assessment of the Bell and Wigner inequalities with a short algorithmic counter-example yielding a Bell cosine correlation. It will be developed from two independently constructed random variables and probabilities followed by a third random process that determines whether or not a corresponding two photon event has occurred. The latter is analogous to the spontaneous two-photon down-conversion process used in the source for classic Bell experiments. The model developed was suggested by an example in a Papoulis monograph [8] that begins with a non-stationary random process in two continuous angle variables and imposes a condition that makes the process stationary.
Assume two functions using arbitrarily chosen settings
and
:
(6.1)
with product
. (6.2)
If $a$ and $b$ are independent random variables with the same probability density and zero mean, the average of (6.2) is
(6.3)
where it will be assumed for use below that
.
Expression (6.3) needs to be converted to one with variables equal to ±1 as necessary to associate photon detection counts with the computation:
. (6.4)
The factors being multiplied now equal ±1. But the average of (6.4) is not equal to that of (6.3). To accomplish this last step an uncertainty in count-pair production is introduced by assuming a Poisson process defined by the probability of a twin-photon event occurrence:
. The probability of a zero or non-event is approximated by
. The average value of the product (6.4), its value averaged by its rate of occurrence, followed by averaging over
and b using
yields (6.3):
(6.5)
In the above derivation, detector efficiency does not occur and values of
and
may be freely chosen. Thus, entanglement is not the sole generator of Bell cosine correlations. It should also be observed that Bell’s formalism does not represent the above example since the variables
and
now result from a ratio of two functions controlled by separate, qualitatively different random processes. The overall process is more complex than that implied by the Bell notation. The physical processes considered in [22] [23] are even more complex.
7. Conclusion
The inequality originally derived by Bell is a statistical expression in three cross-correlated random observables, each correlation assumed to be a function of two of the same three random variables. Two of the three observables occur at mutually exclusive settings on one side of a Bell apparatus but were defined by Bell to be alternative results of which only one could be observed. Both observables depended on the same hidden variable values. Thus, in Bell’s formulation, the correlations in the inequality were not all actually observable since the mutually exclusive alternatives occurred for the same values of the hidden variables. This situation is greatly simplified when it is discovered that the same inequality in correlations that Bell derived from a purely theoretical construction holds as an identity for correlation estimates using any three finite data sets, corresponding to Bell’s three assumed observables. However, the data sets may now be local, nonlocal, random, deterministic, or nonsensical. The Bell inequality is an expression in algebra identically satisfied by data with values of ±1. This implies that for laboratory data, when used consistently with Bell’s derivation, there can be no Bell inequality violation. The same conclusion easily follows in the four variables case. The logical result is that there is no Bell’s theorem as ordinarily understood.
The more general derivation of the inequality given above leads to an experimental procedure to obtain data for all the correlations, even the one involving the unobservable alternative that served in Bell’s original formulation. When two mutually exclusive settings on one side of a Bell-experimental apparatus are employed in different experimental runs, the same inequality in correlations is now applicable.
There appear to be very few ways that three data observations on two particles destroyed by observation can be acquired. While measurements at alternate mutually exclusive settings for deterministic variables are routinely obtained in laboratories, the case of alternate random variables and associated probabilities requires that an experiment be repeated many times at each setting of interest. The novelty in the Bell experiment case is that outputs at each of the two alternate settings are correlated with each other through their correlation with a common output at a third variable setting. The three variables’ correlations are now consistent with the resultant inequality in correlations.
Historically hidden in the statistical derivation of the inequality, the identity-inequality beginning with correlation estimates implies a corresponding inequality in the probabilities that produce them. In the quantum mechanical case that is the subject of the Bell theorem, quantum mechanical probabilities produce correlations that satisfy the Bell inequality while the probabilities satisfy the Wigner inequality that follows from it. The conclusion is made foolproof by the fact, as stated, that the Bell inequalities in either three variables (the original form), or four, are easily proven to be algebraic identities in the number of variables used to compute them.
The fact that the Bell inequalities in both three and four variables are algebraic identities has been unrecognized by experimentalists who have sought to test the inequalities experimentally. Because the underlying basis of the inequalities in cross-correlation has not been recognized, correlations of independently observed data pairs have been inserted into them. The correlations, which must differ in form when obtained consistently with the inequality derivation and the quantum mechanical experiment, then all have the same functional form, and inequality violation results. Only for a kind of wide-sense stationary process would the procedure used yield correlations that satisfied the inequalities, but they would then have a different functional form.
Finally, the Bell correlation is shown to be computable using an independent computer simulation without assuming physical variable characteristics commonly thought necessary to obtain cosine correlations. The observables use two functions for their definition and two qualitatively different random processes in the overall construction, the second reminiscent of that occurring in physical Bell sources. It is doubtful that this two-part process is adequately represented by Bell’s probability notation. This example also shows that entanglement is not a unique condition for the production of cosine correlations.
From the above, the Bell theorem as usually understood does not exist since the inequality is identically satisfied under the conditions of derivation. Thus, theoretically predicted correlations that violate the Bell inequality represent no three data sets that can exist.
Acknowledgements
The presentation of the material given above has been influenced by many discussions of the issues with Joe Foremen, personal communications with Karl Hess and Armen Gulian, and critiques of earlier papers on this topic by Michael Hall. Frank Lad pointed out ambiguities in the initial version of this paper that hopefully were eliminated in revision.