Commodity Arbitrage and the Law of One Price: Setting the Record Straight ()
1. Introduction
Arbitrage and the Law of One Price (LOP) are basic implications of price theory. The failure to respond to risk-free profit opportunities would reject basic assumptions like profit and wealth maximization. But claims about arbitrage and the LOP are mixed. At least part of this confusion is the result of some authors using their own definitions for “arbitrage” and the “Law of One Price”. Imagine the confusion in physics if some researchers used their own special definitions for technical terms like “neutron” or “muon”!
There is a strong consensus that arbitrage is effective in financial markets and, as a result, that the LOP holds in financial markets. For example [1] says that: “Evidence in financial markets of an opportunity for pure arbitrage, and therefore a violation of the law of one price, is considered an anomaly…”. While [2] says the following:
Arbitrage is one of the fundamental pillars of financial economics. It seems generally accepted that financial markets do not offer risk-free arbitrage opportunities, at least when allowance is made for transaction costs.
This literature uses “arbitrage” and the “Law of One Price” as they are defined in dictionaries and encyclopedias.
On the other hand, the consensus is that arbitrage and the LOP fail in commodity markets. But this consensus is based on research that does not use the terms “arbitrage” and the “Law of One Price” as they are defined in dictionaries and encyclopedias. [3] describes that consensus as follows:
The Law of One Price states that international relative price differentials should be arbitraged away so that identical goods in different countries should sell for the same price when expressed in a common currency. Yet the evidence from the empirical literature shows that not only are relative prices quite different across countries, but also such deviations are highly volatile and persistent.
But the evidence that [3] refers to uses retail markets where price differentials do not represent risk-free profits as is required by the definition of arbitrage and the Law of One Price found in the relevant dictionaries and encyclopedias.
2. Literature Review
Subsection 2.1 reviews how dictionaries and encyclopedias devoted to economics define “arbitrage” and the “Law of One Price”. Subsection 2.2 reviews the research that uses retail prices to test the LOP. Section 2.3 reviews the literature on Border Effects, which also uses retail prices. Section 2.4 reviews the literature using commodity auction prices.
2.1. Definitions
Dictionaries and encyclopedias specializing in economics clearly define arbitrage as a “risk-free” transaction1. For example, The New Palgrave Dictionary of Economics begins the discussion of arbitrage as follows: “An arbitrage opportunity is an investment strategy that guarantees a positive payoff in some contingency with no possibility of a negative payoff….”. Wikipedia (25 May 2010) says the following: “When used by academics, an arbitrage is a transaction that involves no negative cash flow at any probabilistic or temporal state and a positive cash flow in at least one state; in simple terms, it is a risk-free profit”.
These definitions imply that, for a commodity price differential to reject effective arbitrage, it must represent a risk-free profit.
The following quote from The Penguin Dictionary of Economics is a fairly typical definition of the Law of One Price:
The law, articulated by Jevons, stating that “In the same open market, at any moment, there cannot be two prices for the same kind of article.” The reason is that, if they did exist, arbitrage should occur until the prices converge.
The New Palgrave Dictionary of Economics does not have a separate heading for the LOP. The law is discussed on page 189 under the heading of Arbitrage. “The assertion that two perfect substitutes (for example, two shares of stock in the same company) must trade at the same price is an implication of no arbitrage that goes under the name of the law of one price”.
These definitions of the Law of One Price imply that, for a price differential to reject the LOP it must represent a risk-free profit.
The auction prices used in the financial literature and by [4] [5] and related research using commodity auction prices are consistent with these definitions of arbitrage and the LOP because they can represent risk-free profits. That is not true of the research referred to by [3] .
2.2. Retail Prices and the LOP
One cannot test the LOP as it is defined in dictionaries and encyclopedias in retail markets because retail price differentials do not represent risk-free profits. Suppose one grocery store sells seedless red grapes at $0.99 per pound and a store across the street sells them for $2.00 a pound. That price differential does not reject effective arbitrage and the LOP because one cannot buy them at $0.99 and sell them for $2.00 for a risk-free profit2.
Several articles study international price convergence in retail markets and claim that they are testing the LOP. None find strong support for the LOP. Recent examples using individual prices rather than indexes include [6] - [10] 3.
The most influential is probably [6] . It uses prices for identical goods in duty free stores on Scandinavian ferries. Reference [9] describes the results in [6] as follows: they “find that the LOOP does not even hold for identical goods sold at the same location as long as these goods are denominated in different currencies…..”.
But arbitrage is not possible on those Scandinavian ferries or similar ferries around the world. No one on a Scandinavian ferry can buy a particular brand of vodka using Swedish kronor and then turn around and sell it back to the duty free store for Finnish markka. As a result, the price differentials in [6] do not reject effective arbitrage and the LOP because they do not represent risk-free profits.
The same problem applies to all supposed tests of the LOP using retail prices; those price differentials do not represent risk-free profits. Retail stores sell at retail prices. They do not buy at retail, they buy at wholesale. Put bluntly, at the retail level all goods are “non-traded” because no one, not even duty free stores on Scandinavian ferries, both buys and sells at retail.
There is another problem with the use of retail prices. The exchange rates typically used to convert foreign retail prices into domestic equivalents are auction prices. For example [6] uses end-of-month rates from the IMF’s International Financial Statistics. Expectations drive auction prices like exchange rates. They do not drive retail prices.
Mixing retail commodity prices and auction exchange rates in this way artificially increases the volatility of the converted commodity retail prices and reduces the correlation between domestic and foreign prices. Using auction commodity prices and auction exchange rates eliminates this mixture of retail and auction prices.
Although arbitrage is not possible at the retail level, there are economic links between international retail commodity markets. One link works from the domestic retail market through the domestic wholesale market to the foreign wholesale market and then through that market to the foreign retail market.
Another link operates from the domestic retail market through domestic production to domestic auction markets like those in Table 1 then through those markets to foreign auction markets, foreign production and finally foreign retail markets. These indirect links between international retail markets are not very strong in the short run, but they presumably provide the long-run link that we see in tests of the Law of One Price and Purchasing Power Parity using retail price indexes4. However using auction prices produces much stronger short-run links.
2.3. Border Effects
Several articles estimate Border Effects using retail indexes or prices. They include [12] - [18] 5. They appeal to either arbitrage or the LOP, or both, and reject them6.
Reference [13] for example says the following:
In Figure 2, we repeat the exercise for 1990. The comparison with Figure 1 is striking. The between-country distribution has diverged from the two within country distributions. Japanese prices expressed in US dollars have risen even more relative to US prices. The violation of the law of one price became even more severe.
But the price differentials [13] refers to do not reject effective arbitrage and the Law of One Price as they are defined in the relevant dictionaries and encyclopedias because their retail price differentials do not represent risk-free profits. In addition, in constructing those differentials they use auction exchange rates, which are driven by expectations, and retail prices, which are not driven by expectations.
Given the weak links between international retail markets discussed above, we should expect relatively wide borders at the retail level. But those wide borders do not reject
Table 1. Products, intervals, and sources.
aWeekly. Original Sources: **Financial Times (UK). ¨Journal of Commerce (N.Y.). †Samuel Montague.
Several articles use commodity prices and exchange rates from auction markets to evaluate commodity arbitrage and the LOP. See [4] [5] [11] [21] - [24] . With the exception of [11] [21] they all use monthly grain prices7. This research is much more supportive of effective arbitrage and the LOP.
3. New Evidence
To produce risk-free profits, commodity arbitrage normally requires three “simultaneous” transactions8: 1) buying a commodity spot or with a nearby futures contract in one location, 2) arranging for the shipping of the commodity to a second location and 3) selling the commodity in that second location when it is due to arrive. The first and third transactions normally require active auction markets. The second requires active markets where arbitragers can find the necessary transportation as needed.
Forward contracts pose practical problems for testing commodity LOP because forward prices are difficult to find. I have neither the time nor the resources to do so. Like most previous work using commodity auction prices, I am forced to use spot prices. Fortunately, as [26] and [27] show, while forward exchange rates are highly biased estimates of future spot exchange rates, forward commodity markets provide better estimates of future commodity prices. So while my price differentials also do not represent risk-free profits, they come much closer to doing so than research using retail prices.
3.1. Grains
The top half of Table 1 describes the sources for grain prices. Between the US and Rotterdam prices cover over 25 years. Earlier research covers only 10 to l1 years. Freight rates for wheat vary with the size of the ship and at times more than one rate is published. These are the same as in [4] . Unlike prices, freight rates are not spot. Footnotes in World Wheat Statistics describe the freight rates as follows: “Estimated mid- month rates based on current chartering practices for vessels to load six weeks ahead.” Freight rates are forward, not spot, prices. These forward rates imply that there is an active forward market for shipping grain.
These grain prices have several advantages: 1) “identical” products, 2) all prices in dollars, 3) freight rates are available for wheat, 4) for Rotterdam wheat prices cover an unusually large number of years, about 25, 5) export prices are free on board (FOB) and import prices include certificates, insurance and freight (CIF), (For Japan, prices do not include insurance.) and 6) most important of all, arbitrage should be possible in all these markets.
The first part of Table 1 describes the types of grains, data sources, ports and the acronyms used later to identify the different grains. A “J”, “P”, “G” or “R” attached to an acronym indicates the port. For example, DNSR is Dark Northern Spring wheat at Rotterdam.
For trade with Japan, I use the same intervals used earlier by [4] [5] [24] . For a visual inspection of that data, see [4] . I do not extend the Japanese data for two reasons: 1) starting in 1982 several months of Japanese prices are missing and 2) in the 1990s Japan erected non-tariff barriers to wheat imports that created artificial price differentials9.
Wheat is a “heavy” grain. For wheat, one metric ton equals 36.7437 bushels. For corn, one metric ton equals 39.3682 bushels. Since a metric ton of corn takes up more space than a metric ton of wheat, I would expect freight rates for corn to be slightly higher than for wheat. Unfortunately I do not have separate freight rates for corn. I apply the freight rates for wheat to corn. Since the difference should be small and the two freight rates should move together, applying freight rates for wheat to corn should not be a serious problem.
Reference [22] also uses corn prices between the US and Rotterdam, but their data cover only six years. The corn prices in Table 1 start as soon as they are available from UNCTAD’s Handbook of Statistics and cover about 11 years. They end in mid 1998 because about then Europe began to impose restrictions on importing genetically modified foods and most corn grown in the United States is genetically modified.
Some grain prices are missing. Except for DNSR during the early 1990s, there are never more than two missing months in a row. When only one month is missing, it is replaced with the previous month. When two months in a row are missing, the first month is replaced with the preceding month and the second month is replaced with the following month10.
In the early 1990s, about 18 months of data are missing for DNSR and then a few months later about another six months are missing. As Table 1 indicates, there is a second source for the missing data, but where they overlap the two sources do not always agree. It is possible to use the USDA data to fill in the missing data from World Grain Statistics, but the replacement produces unusual error terms that require long lags in tests for unit roots and cointegration. Those long lags reduce the significance of the tests. As an alternative, where there are USDA prices for DNSR, I report separate results using those prices for Rotterdam with freight rates and Gulf prices from World Grain Statistics.
3.2. Rubber, Metals and Petroleum
The latter half of Table 1 describes the additional prices, relevant ports, intervals and sources. Rubber and metal prices are for Wednesdays and were supplied by [11] and [21] who describe them in more detail.
Rubber prices may not be for identical products. For each Wednesday, the Journal of Commerce quotes a single price for rubber in London. In another section it quotes US prices for several different grades that on a given day can range in price from 26.25 to 42.75 cents per pound. As a result, there is a strong possibility that the rubber prices are not for identical products.
All petroleum prices, both at US and foreign ports, are monthly averages of closing daily prices in dollars. Foreign prices are converted into US dollars using the closing dollar price of the foreign currency for that day. To the best of my knowledge, this is the first time that a wide range of petroleum prices have been used to test the LOP.
3.3. Test Equations
There are freight rates for wheat. The direction of trade for grains is known because the United States is a major exporter and US prices are FOB while foreign prices are CIF. For grains one can account for the relevant shipping costs by computing the price differentials as the foreign forward CIF price minus the spot US FOB price plus the freight rate. That approach produces a test equation where ut should represent insurance, certificates and bid-ask spreads.
(1)
is the log of the dollar price of grain at time t in a foreign port one month forward. is the log of the spot price in the appropriate US port plus the freight rate for loading at time t. After accounting for other costs, should represent potential risk-free profits.
Two critical parts of that test equation are missing: ft, the freight rate for loading at time t, and, the forward price for t + 1. But ft−1 can serve as a proxy for ft because ft−1 is the price for loading in a bit less than two weeks.
Fortunately forward commodity prices appear to be reasonable estimates of future spot commodity prices. As a result, the future spot price can serve as a proxy for.
Replacing ft with ft-1 and with provides a measure of price differentials for grains.
(2)
where εt is the additional error due to using ft−1 as a proxy for ft and as a proxy for.
The lack of information about freight rates and the direction of trade for the other products requires a different measure of price differentials for those products.
(3)
Equation (3) is the price differential normally used in the commodity LOP and Borders literature.
Using Equation (3) with grain prices provides consistent tests for all products and also provides some insight regarding the importance of time and transportation costs.
Like most auction prices, these have unit roots. Reference [24] shows that and are cointegrated and that the corresponding differentials are stationary. To save space I do not replicate their results or report similar results for and.
Reference [29] shows that and are cointegrated and that are stationary, which implies that the price differentials in Equations (2) and (3) are stationary. Given that stationarity, the next step is to estimate half lives for all pairs of commodities.
3.4. Half Lives
Following [30] , I estimate half lives using Equation (4).
(4)
Xt is the price differential using either Equation (2) or (3). When Xt is AR (1), half lives are calculated using Ln(0.5)/Ln(b). When Xt is AR(2) or longer, half lives are calculated using impulse responses as [31] suggest.
Estimates of Equation (4) use both Equations (2) and (3) for grain prices and just Equation (3) for all other prices. Estimates use a variety of error tests: LM, Arch and two Q tests. There is also a Jarque-Bera test for normality. The only test that is significant is the one for normality. As expected, residuals have fat tails.
To save space, I omit the estimates of Equation (4). They are in [29] . Tables 2-4 report the half lives implied by those estimates.
Table 2 reports the half lives for grains implied by. The average half life is four months.
Table 3 reports the half lives for grains implied by. While the average half life in Table 2 is four months, in Table 3 it is only 1.1 months. Omitting transportation costs and ignoring time substantially increases half lives for grains and, one would presume, does the same for other half lives.
Table 4 reports the half lives for petroleum products, rubber and metals using. The average half life in Table 4 is 1.1 months11.
As [6] points out, when using retail prices, half lives for price differentials commonly run between 3 to 5 years. Although they have the best retail data, [6] cannot reject at even the 10% level that their differentials have unit roots, which would imply that their half lives are infinite.
Table 2. Half lives for grains measured in months using Equation (3).
Table 3. Half lives for grains measured in months using Equation (2).
Table 4. Half lives measured in months for petroleum products, rubber and metals.
On the other hand, the half lives in Tables 2-4 are measured in months, not years. Many of them are measured in fractions of a month. I conjecture that with better data they would be measured in weeks, days or even fractions of a day like the deviations from CIP.
Half lives in Tables 2-4 do not show any obvious signs of border effects. Half lives between international ports like New York and Rotterdam are not obviously larger than between New York and Los Angeles. The next section, for the first time, uses auction prices to evaluate “Border Effects”.
3.5. Border Effects
Although the Borders literature uses other explanatory variables, it concentrates on how distance and borders affect the variance or standard deviation of changes in logs of ratios of prices like. The log of is usually denoted Pt with the first difference denoted ∆Pt. That literature presumably does not use half lives for Pt because it is often impossible to reject an infinite half life for retail price differentials even when the products are identical.
Standard deviations of ∆Pt and half lives of Pt are both natural measures of market integration, but they measure different things. Consider the spectrum for DPt. The area under the spectrum for DPt is the variance. The half life for Pt reflects how quickly that spectrum falls as frequency falls. As a result, half lives for Pt and variances for ∆Pt can be very different.
An extreme case would be where variances in DPt are small and spectra flat. Small standard deviations reject segmentation. Flat spectra imply infinite half lives. Or variances for DPt could be very large and spectra fall rapidly as frequency goes to zero. Now standard deviations would support segmentation and half lives reject segmentation. The bottom line is that neither half lives nor standard deviations are ideal measures of integration. Here the two measures agree; both standard deviations and half lives imply that international commodity auction markets are highly integrated.
Although there are many auction markets for commodities, there are far fewer auction markets than retail markets. As a result, the number of commodities evaluated here is far smaller than in most of the Borders literature. To avoid unnecessary reductions in degrees of freedom, tests use just borders and distance to evaluate Border Effects12. When there is a border, the border dummy is one. Otherwise it is zero. Distance is the logarithm of the distance measure. There are three measures of distance: 1) distance by ship (Ship), 2) distance as the crow flies (Crow), and 3) distance by highway (Highway)13. For Ship, distance is measured as the shortest distance by sea. The source for Ship is PORTWORLD.COM. For Highway it is MAPQUEST.COM. For Crow it is GeoBYTES.com/CityDistanceTool.htm.
Between portslike New York and Rotterdam, Ship is clearly the appropriate way to measure distance. Highways do not exist, and both great circle routes and “as the crow flies” can seriously understate the actual distance traveled. As a result, I use only Ship between international ports. I use all three measures between ports in the United States because it is not clear which is appropriate. To move products by ship from Los Angeles to New York by the shortest route they must go through the Panama Canal. The alternative is to ship by train or truck, which is much shorter but more expensive per mile.
It seems unlikely that grains or petroleum products would be shipped between New York and Los Angeles. If grain prices are higher in New York than Los Angeles, grain shipments from the Midwest would be diverted from Los Angeles to New York rather than shipped to Los Angeles by train or truck and then sent to New York by ship.
Something similar applies to petroleum products. Oil refineries are scattered around the United States with production and refineries concentrated in Louisiana, Oklahoma and Texas. As a result, conventional arbitrage probably does not operate between New York and Los Angeles. If petroleum prices are relatively high in New York, shipments from Oklahoma or Louisiana are diverted from Los Angels to New York rather than sent to Los Angeles and then shipped to New York. This difference in market structure between international and domestic ports may help explain the “negative” Border Effects reported below where borders appear to increase integration.
3.5.1. Borders and Half Lives
I use Equation (5) to evaluate Border Effects using half lives.
(5)
Zj is the log of a half life, e.g., for Diesel between NY and Rotterdam. Dj is the log of distance, e.g., the log of Ship between the two ports. Bj is a border dummy that is one when there is a border and zero otherwise14. Table 5 reports the estimates of Equation (5) using half lives.
On the left-hand side of Table 5, all half lives are for logs of. On the right- hand side, half lives for just grains are for logs of while all other half lives are for logs of. The first column on both sides excludes distance. The second excludes the border. The third includes both.
In the first third of Table 5 all distance is by Ship. In the next third distance between just US ports is by Crow. In the last third distance between just US ports is by Highway. Although cumbersome, this arraignment should reveal how different measures of distance affect the results.
There is no evidence in Table 5 of a border effect in the sense that borders inhibit trade. Estimates of b are never positive and significant. When Dj is included, b is usually negative. How distance is measured does not appear to be important. The results for distance are similar in all three parts of Table 5.
Distance itself is important. In the left-hand side, when B is excluded, d is always significant. When B is included, most of the significance disappears. That result suggests some multicolinearity. That multicolinearity largely disappears in the right-hand side. Whether or not B is included, in the right-hand side d is always significant at the 1 percent level15.
When using half lives and spot auction prices, there is some suggestion that borders might increase integration. Standard deviations produce even stronger evidence of such an effect.
3.5.2. Borders and Standard Deviations
Presumably the Borders literature uses variances or standard deviations rather than half lives because it is difficult to reject a unit root for retail Pt. As pointed out earlier, auction Pt are stationary.
To save space, I do not report the standard deviations. The relevant tables are in [29] . But some of their characteristics are interesting.
Table 5. Estimates of Equation (5) using half lives.
aSignificant at 10% level. bSignificant at 5% level. cSignificant at 1% level.
These standard deviations are much smaller than the ones for retail prices. For example, the average variance for ∆Pt in Table 2 of [32] is 2.12. The implied standard deviation of 1.46 is almost 10 times larger than the largest of the standard deviations using auction prices. The large standard deviations using commodity retail prices are at least partly the result of mixing sticky retail prices and volatile auction exchange rates. Using auction commodity prices solves this problem.
Including freight rates and lagging export prices generally reduces half lives for grains, but generally increases standard deviations16. The average for grains using, is 0.03. The average using is 0.05. This difference illustrates the point made earlier that half lives and standard deviations measure different things. Although they measure different things, here they produce similar results. Both reject wide borders for commodity auction markets.
Table 6 is identical to Table 5 except that in Table 6 the Zj are standard deviations rather than half lives. When Bj appears alone in Table 6, b is always negative, but never significant. When Dj appears alone, d is always positive, but significant only when by ship. When Dj and Bj appear together, d is positive and usually significant. But b is always negative and significant in Table 6.
With half lives in Table 5, b is occasionally negative, but never negative and significant. In Table 6, all b are negative and some are significant at the 1% level. Excluding tin, silver and rubber produces similar, but slightly weaker, results.
Table 6. Estimates of Equation (5) using standard deviations.
aSignificant at 10% level. bSignificant at 5% level. cSignificant at 1% level.
Using Ship, Crow or Highway makes little difference. Using squares, square roots or reciprocals produces similar results. For auction prices, Border Effects appear to be negative. Standard deviations for ∆Pt are generally smaller when there is a border. Why?
The market structure in the US is the most likely explanation for the negative Border Effect. Shipping products from where they are produced to where the price is higher is more cost effective than conventional arbitrage between New York and Los Angeles. If prices are higher in New York than in Los Angeles, goods are not shipped from LA to NY. Instead grain or petroleum products that would have been shipped to LA are shipped to NY instead.
4. Summary
Dictionaries and encyclopedias clearly define arbitrage as a “risk-free” transaction. They also make it clear that the Law of One Price depends on effective arbitrage. As a result, for commodity price differentials to reject effective arbitrage and the Law of One price, they must represent risk-free profits.
Using retail prices, the Borders literature and most of the commodity LOP literature claim that arbitrage is ineffective and that the LOP fails. This view of commodity arbitrage and the LOP seems to be the conventional wisdom. Pointing out that that conventional wisdom is mistaken because the retail price differentials used in that literature do not represent risk-free profits is a major objective of this article.
Using auction prices where arbitrage is possible, both the finance literature and the commodity literature support effective arbitrage and the Law of One Price. Using a wider range of commodity auction prices over longer intervals than before, my results reinforce that support for effective commodity arbitrage and the Law of One Price and also reject Border Effects.