The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.
The Fed’s Entry into Check Clearing Reconsidered Jeffrey M. Lacker, Jeffrey D. Walker, John A. Weinberg T he Federal Reserve’s check clearing business has been a significant part of its operations since its founding and has set the precedent for other payment services offered by the Federal Reserve (Fed) to depository institutions. While there is considerable debate about the proper role for the Fed in the modern payment system, there seems to be much less disagreement concerning the Fed’s entry into check collection.1 Many scholars believe that when the Fed took on the check payment function during the first decade of the institution’s existence, its entry served to enhance the efficiency of the check payment system. Indeed, Fed documents on its role in the payment system speak of the “breakdown of the check collection system” around the turn of the century.2 According to the conventional view, check collection prior to the founding of the Fed was decentralized and defective in a number of ways. By centralizing the system, the Fed was able to eliminate many of the defects. Our purpose in this article is to reexamine the facts concerning the Fed’s entry into check clearing and to evaluate the conventional view in light of those facts. We find that the evidence of inefficiency in the pre-Fed check collection system is inconclusive. Further, inefficiency would imply that there was some form of market failure, yet most discussions of check clearing in the early part of this century are vague or silent on possible sources of market failure. Absent a clearly articulated explanation of why participants in the check collection system failed to achieve efficient results, we find the conventional view to be unconvincing. The authors wish to thank Ned Prescott, John Walter, Alex Wolman, and Tom Humphrey for helpful comments. The views expressed do not necessarily reflect those of the Federal Reserve Bank of Richmond or the Federal Reserve System. The authors remain solely responsible for the contents of this article. 1 See, for example, Benston and Humphrey (1997). of Governors of the Federal Reserve System (1990). 2 Board Federal Reserve Bank of Richmond Economic Quarterly Volume 85/2 Spring 1999 1 2 Federal Reserve Bank of Richmond Economic Quarterly We propose a different interpretation of the Fed’s entry, one based on the network characteristics of check collection. Under this alternative view, preFed check collection arrangements were relatively efficient, and the complaints many observers voiced about the excessive costs of the pre-Fed system should be understood as complaints about the distribution of the system’s costs rather than about its aggregate costs. Moreover, this interpretation explains why the founders felt compelled to give the Reserve Banks check clearing powers, given the reserve requirements in the Federal Reserve Act. New light is also shed on the par collection controversy, and the tortuous process, spanning several years, by which the Reserve Banks established their check collection service. Hence, the economics of network organization gives a coherent account of the facts concerning check collection prior to the founding of the Fed and the process by which the Fed entered the check clearing business. Before presenting our view of the Fed’s entry into check clearing, we present some of the key facts and review the conventional view. The relevant facts concern the system before the Fed and the means by which the Fed became a significant provider of check clearing services. Section 1 discusses nonpar remittance and the importance of correspondent relationships for clearing outof-town checks in the pre-Fed system. Nonpar remittance occurs when a bank on which a check is drawn pays a collecting bank less than the par value of the check. Correspondents are banks, usually larger city banks, that perform a variety of services, including check collection, for other banks. The Fed’s ability to successfully penetrate the check clearing market was dependent on Congress giving it the proper authority. While the Federal Reserve Act authorized the Reserve Banks to clear checks, the Fed’s initial attempts at inducing fully voluntary participation by member banks were unsuccessful, as we discuss in Section 2. It was not until Congress granted the Reserve Banks a competitive advantage—the sole right to present by mail at par—that the Fed was able to become a significant participant in the market. Sections 3 and 4 deal with the conventional view, according to which many of the features of the correspondent banking system represent inefficiencies that existed because of nonpar payment. Chief among these features was the observation that occasionally a check would pass through the hands of many widely dispersed intermediaries on its way to the paying bank. Such “circuitous routing” is often cited as unambiguous evidence of inefficiency. However, circuitous routing is quite consistent with our alternative view of the pre-Fed system as efficient, as we discuss in Section 5. In Section 6 we reconsider the Fed’s entry into check clearing in light of our alternative view. The Fed was able to gain market share because it enjoyed a legal privilege in presentment that was unavailable to private collecting banks. Exercising that privilege had the effect of shifting the allocation of the common costs of check clearing away from collecting banks and toward small country banks and taxpayers. Lacker, Walker, Weinberg: Check Clearing Reconsidered 1. 3 CHECK CLEARING BEFORE THE FOUNDING OF THE FED By the mid-nineteenth century, the use of checks had become a prominent means of payment in American banking and commerce. As early as 1855, the value of checkable deposits exceeded the value of bank notes in circulation (Spahr 1926, p. 84). In this earlier period, however, there were distinct geographic differences in payment practices. Checks were used primarily for payments within cities, and notes were used predominantly in the countryside.3 Payments between the country and the city and across geographic regions were made using bank drafts. A bank draft is like a check, except it is drawn on an account held by one bank with another. Hence, a Midwesterner wishing to make a purchase from an East Coast city would go to his local bank and purchase a draft drawn on that bank’s balances held with a bank in an eastern city. Toward the end of the nineteenth century, checks began to replace drafts as means of payment in interregional transactions, a change that was nearly complete by the turn of the century (Preston 1920, p. 566). During this same period, the check also replaced the bank note as a means of payment among people in the countryside. By 1900, the value of demand deposits was more than quadruple the value of currency in circulation (Friedman and Schwartz 1963, p. 705). In the period just prior to the founding of the Fed, the check had become a dominant payment instrument for making both long distance and local payments. Check clearing involves the delivery of items to the banks on which they are drawn. Many observers have pointed out that, in the United States prior to the existence of the Fed, clearing was affected by the different legal treatments accorded to different forms of delivery.4 While a paying bank was obligated to make payment (remit) at par for checks presented in person (over the counter), there was no such obligation for checks presented through the mail.5 Banks were free to extract a presentment fee (exchange charge) from their payments on such indirect presentments. This practice of nonpar banking is the focus of many discussions of the Fed’s entry into the check collection business. The prospects of receiving less than par for a check gives a collecting bank an incentive to find a way of getting the check to the bank on which it is drawn without mailing it directly. Alternative means of transport and presentment will be used, presumably, if the total cost does not exceed the presentment fee. 3 Spahr (1926, p. 60); Federal Reserve Bank of Richmond (1922, p. 1). (1926, pp. 103–05); Baxter (1983, p. 559); Duprey and Nelson (1986, p. 20); Summers and Gilbert (1996, p. 4); Weinberg (1997, p. 38); Gilbert (1998, pp. 123, 129); James (1998, p. 143). 5 See the discussion in Spahr (1926, pp. 103–04). The requirement that checks presented over the counter be paid at par had its origins in English common law. 4 Spahr 4 Federal Reserve Bank of Richmond Economic Quarterly For a check drawn on a nearby bank, in-person presentment was relatively cheap. Hence the clearing and settlement of local checks presented few challenges for the banking system of the United States.6 In the larger cities a number of banks typically would have frequent business with one another, making it worthwhile for them to form cooperative clearing organizations such as clearinghouses.7 In such an arrangement, the banks’ representatives would meet daily at a designated location to exchange items drawn on each other, avoiding the duplicative cost of bilateral contacts. By mutual agreement among participants, presentment at the clearinghouse was taken to be equivalent to presentment in person at the paying bank’s premises.8 In less densely populated areas, a typical bank had a relatively small number of banks nearby from which it regularly received checks. Maintaining a clearinghouse arrangement among the smaller number of banks did not tend to be economical. Instead, rural banks usually made direct exchanges with other banks in the same general area (Spahr 1926, p. 98). Such direct presentations were made once or twice a week, or daily, depending on the distance between the banks and on the volume of checks flowing between them. Settlement of local checks, either in the city clearinghouses or among banks in rural communities, could be made through the exchange of currency or by debiting interbank balances (Cannon 1900, pp. 36– 46). The clearing of out-of-town, or interregional, checks was accomplished via a network of bilateral agreements on clearing terms that generally took the form of correspondent banking relationships.9 This relationship involved a bank in a larger city serving as a correspondent bank for a bank in a smaller city or town. The latter bank, the respondent, would hold balances with its correspondent. When the respondent received a check drawn on a bank in the city or area where the correspondent conducted business, the respondent would send the check to the correspondent, who would, in turn, present the check directly to the paying bank. The correspondent would receive payment from the paying bank and credit the amount to the respondent’s account. Often, the correspondent would credit the respondent for the par value of the check even if the paying bank did not remit at par (Spahr 1926, pp. 101, 111). Sometimes the respondent agreed, in return, to remit at par on checks sent to it by its correspondent. The correspondent’s main form of compensation was typically the interest margin it could earn on the funds held as balances by the respondent bank (Spahr 1926, pp. 101, 111–12). It was not uncommon for a correspondent 6 Spahr (1926, p. 98); Duprey and Nelson (1986, p. 19). (1900, pp. 148–54); Spahr (1926, pp. 79–82). 8 Hallock (1903, p. 59); Spahr (1926, pp. 104–05). 9 For descriptions of correspondent banking at the Fed’s founding, see Watkins (1929, Ch. 6), Spahr (1926, pp. 99–101). 7 Cannon Lacker, Walker, Weinberg: Check Clearing Reconsidered 5 to pay presentment fees to its respondents while paying them par for all the checks it collected for them. Often a bank would act as correspondent for banks outside the region, offering to collect checks drawn on any bank in the neighboring territory, with the proceeds credited to the account of the distant bank. There appears to have been active competition for collection business between correspondent banks, as evidenced by the many advertisements in bank directories from the late nineteenth century.10 The importance of correspondent relationships is reflected in the magnitude of interbank balances. In the years preceding the Fed’s founding, the amount of deposits at national banks that were held for other banks was roughly 40 percent as large as national bank deposits held for individuals (Watkins 1929, pp. 10–18). These interbank balances were held predominantly at banks in the larger cities and especially in the major financial centers. Correspondent banks, then, were linked together into a network of banks through which checks were collected. When a bank received a check drawn on an out-of-town correspondent bank—what might be termed a regular interregional check—it would be presented through the established clearing arrangement.11 When a bank received a check drawn on a distant bank with which it did not have an established relationship—what might be termed an irregular interregional check—the check would most likely be sent on to a correspondent. If a correspondent received from a respondent a check drawn on a paying bank with which the correspondent did not have a relationship, then the correspondent would typically send the check to one of its correspondent banks located near the paying bank.12 In fact, to aid routing, bank directories listed each bank’s correspondents; the bank holding the check could look for correspondents it shared with the paying bank.13 The next bank receiving the check might present it directly to the paying bank. Such indirect routing had two advantages for the correspondent. First, it avoided having to pay a presentment fee to a paying bank from which it received no compensating benefit. Second, it saved the cost of sending a single item to the paying bank instead of bundling the item with others being sent on a normal shipment (Cannon 1900, p. 76). 10 Williams (1901). Typical notices: “Prompt and careful attention given to collections throughout Mississippi and Alabama” (First National Bank of Meridian, Mississippi); “Unsurpassed facilities for handling collections, especially Oklahoma and Kansas” (Kansas National Bank, Wichita, Kansas); “Send us your Southwest Collections” (Home Savings Bank & Trust Co., Phoenix, Arizona). 11 The term “correspondent” is sometimes used narrowly to refer to a small number of distant banks that a bank formally designates to receive items drawn on it. A bank typically had no more than two or three correspondents in this sense, and they were listed in banking directories. The term is often used more broadly, however, to refer to any bank with which a bank regularly exchanges items by mail. We will use the term “correspondent” in this broader sense. 12 Cannon (1900, p. 76); Spahr (1926, pp. 111–12). 13 Williams (1901). 6 Federal Reserve Bank of Richmond Economic Quarterly Irregular interregional items, then, were often handled by sending them to banks with whom the correspondent conducted regular interregional business. Concerns about the operation of the check clearing system prior to the founding of the Fed focused largely on these irregular interregional checks. In such a system, there were inevitably cases in which the next bank to receive a check was not a correspondent of the paying bank. Hence, the process of sending the check to a correspondent might be repeated more than once. Furthermore, some mistakes in judgment and in handling of items were inevitable. Accordingly, there are documented examples of checks traveling circuitous routes, through the hands of many banks, in making their way from the bank of first deposit to the paying bank.14 Settlement of an irregular interregional check began when the paying bank remitted to the bank that finally presented the check. If the presenting bank was a correspondent for the paying bank, then settlement could be made by debiting the paying bank’s account balances. If there was no account relationship between the paying and presenting banks, then payment would typically be made in the form of a draft on the paying bank’s account with one of its correspondents, often a New York bank (most banks maintained a relationship with at least one New York bank).15 The presenting bank could send the draft to its New York correspondent who would credit the amount to the presenting bank’s account. The presenting bank’s New York correspondent would, in turn, present the draft to the paying bank’s correspondent through the New York clearinghouse. In this way, New York reserve balances served increasingly as a universal settlement medium. This use of bank drafts in settlement stands in contrast to the greater use of specie earlier in the nineteenth century.16 Collecting and paying banks incurred a variety of expenses in the process of clearing and settling interregional checks. The resource costs incurred by collecting banks, including the costs of recordkeeping and postage, were estimated to have amounted on average to about 3/4 of 1 percent to 1 percent of the value of collected items (Spahr 1926, p. 113). Other costs borne by collecting banks resulted from the negotiated arrangements for clearing and collection. For instance, many collecting banks gave credit to their account holders at the time of an item’s deposit. By doing so, the collecting bank would incur the float costs that accrued until settlement was received from the paying bank. In addition, national banks faced a tax of 1/2 of 1 percent on deposits. These costs, together with the presentment fee charged by some paying banks (1/8 of 1 percent on average), added roughly another 3/4 of 1 percent to the costs incurred by collecting banks (Spahr 1926, p. 114). On the other side of the 14 We discuss the examples of circuitous routing below. (1900, p. 46); Spahr (1926, p. 100); Watkins (1929, p. 104); Williams (1901, passim). Alternatively, payment could be made in specie or banknotes. 16 Preston (1920, p. 565); Spahr (1926, pp. 45–51). 15 Cannon Lacker, Walker, Weinberg: Check Clearing Reconsidered 7 transaction, paying banks incurred clerical costs in receiving and remitting for checks presented on them (Langston 1921, pp. 13–39). It seems apparent that a substantial share of the cost of clearing interregional checks was borne by collecting banks (Spahr 1926, p. 113). Customers of collecting banks typically received the full par value for their deposits of out-of-town checks. Even if the collecting bank passed along some charge to the depositor of the check, there appears to be little evidence of systematic price discrimination by businesses between customers paying for goods and services with out-of-town checks and those paying by other means. Most observers conclude that competition for both local deposits and correspondent business drove city banks to absorb much of the cost of collecting country checks.17 2. THE FED’S ENTRY INTO CHECK CLEARING A flurry of banking and monetary reform proposals around the turn of the century ultimately led to the passage of the Federal Reserve Act on December 23, 1913. The central motive of the Act, and many other reform proposals as well, was to prevent recurrent financial panics of the kind typical of the late-nineteenth and early-twentieth centuries in the United States. The Federal Reserve System was designed to prevent such panics by providing “an elastic currency” that allowed a relatively rapid expansion of the supply of notes when needed.18 Nationwide Federal Reserve check clearing was not envisioned in the earliest versions of the Act introduced in Congress.19 The bill introduced by Senator Carter Glass in February 1913, for example, required Reserve Banks to accept from their members at par any checks drawn on any other member’s deposit at a Reserve Bank. This would have allowed member banks to settle obligations to other member banks with checks drawn on their own Reserve Bank deposits, much the way interbank obligations were settled using drafts drawn on bank deposits at New York banks. Ultimately, however, the final version of the Federal Reserve Act allowed Reserve Bank check clearing. Each Reserve Bank was required to accept checks written by depositors at member banks, and was permitted, though not required, to accept checks written by depositors at member banks of other Reserve Banks.20 17 Watkins (1929, p. 106); Fellows (1940, p. 18); Miller (1949, p. 11); Duprey and Nelson (1986, p. 20); Summers and Gilbert (1996, p. 4). 18 See Willis (1923) or Timberlake (1978), for example. 19 See Stevens (1996) for an account of the legislative history of the check clearing provisions of the Federal Reserve Act. 20 Board of Governors (1915, pp. 23– 44). Section 16 of the Federal Reserve Act as finally passed on December 23, 1913, stated, in part, “Every Federal reserve bank shall receive on deposit at par from member banks or from Federal reserve banks checks and drafts drawn upon any of its depositors, and when remitted by a Federal reserve bank, checks and drafts drawn by 8 Federal Reserve Bank of Richmond Economic Quarterly It was a number of years before the Reserve Banks successfully penetrated the check collection business. On March 4, 1915, the Federal Reserve Board announced that it had directed the Reserve Banks to establish a “voluntary reciprocal plan” for intradistrict check clearing.21 Member banks would be able to collect at par from all other member banks in their district that joined the plan. A member bank joining the plan had to agree to accept their own checks at par. Details of the scheme were left up to the individual Reserve Banks. The hope was that many member banks would join, making the plan attractive to those that at first held out. The voluntary reciprocal plan proved disappointing, however. By late July the Reserve Banks were reporting that most banks said they would not join unless the plan were made mandatory. Reported participation in the plan peaked in October 1915 at only 2,456 banks, out of about 7,600 member banks, and withdrawals exceeded additions every month thereafter.22 A new plan was introduced early the next year. On May 1, 1916, the Federal Reserve Board released a circular to member banks—Circular No. 1, Series of 1916—detailing a “compulsory plan” to be put into effect in June (later postponed to July 15).23 Member banks would now be required to pay at par on checks presented to them by their Reserve Bank. Presentation by a Reserve Bank through the mail would be construed as presentation at their counters. The Reserve Bank would defray the paying bank’s cost of sending payment, either in notes or acceptable checks on other banks, if the bank’s reserve balance was insufficient. Each Reserve Bank would accept checks from any depositor in any other Federal reserve bank or member bank upon funds to the credit of said depositor in said reserve bank or member bank. Thus each Reserve Bank was required to accept on deposit at par from member banks or other Reserve Banks checks and drafts drawn upon any of its depositors. Section 16 went on to state, “The Federal Reserve Board . . . may at its discretion exercise the functions of a clearing house for such Federal reserve banks, or may designate a Federal reserve bank to exercise such functions, and may also require each bank to exercise the functions of a clearing house for its member banks.” Section 13 adds that “Any Federal Reserve Bank may receive from any of its member banks . . . checks and drafts upon solvent member banks, payable upon presentation; or solely for exchange purposes, may receive from other Federal reserve banks . . . checks and drafts upon solvent member or Federal reserve banks, payable upon presentation.” Thus each Reserve Bank was permitted to accept from its member banks checks or drafts drawn upon solvent member banks, or, “solely for exchange purposes,” checks or drafts from other Federal Reserve Banks drawn upon solvent member banks or other Federal Reserve Banks. 21 Board of Governors of the Federal Reserve System, Federal Reserve Bulletin (1915, pp. 6–9) [hereafter “Federal Reserve Bulletin”]. 22 Federal Reserve Bulletin (1915, pp. 192–95); Spahr (1926, pp. 174). 23 Federal Reserve Bulletin (1916, pp. 259–60, 262–64). Each Reserve Bank issued its own circular detailing the operation of the plan in its district; Chicago’s is reprinted in the Federal Reserve Bulletin (1916, pp. 312–14). Regulation J was released later in 1916, incorporating the September 7, 1916, amendment (see below) and superceding Circular No. 1. To this day Regulation J codifies the Board’s requirement that member banks accept checks at par (see Board of Governors [1998], 12 CFR 210.9). Lacker, Walker, Weinberg: Check Clearing Reconsidered 9 its member banks at par if they were drawn on other member banks or on nonmember banks that paid at par, although a small per-item service charge was allowed.24 The plan was nationwide in scope; each Reserve Bank would accept at par checks drawn on banks in other districts if they were Federal Reserve members or they agreed to pay at par. Member banks were still free to collect checks outside of the Federal Reserve System as they saw fit, and they were free to carry balances with other banks for purposes of clearing. They were still free to charge presentment fees as well, but the Board’s Circular prevented them from charging presentment fees to the Reserve Banks. The original Federal Reserve Act did not permit the Reserve Banks to accept checks drawn upon nonmembers. Early in 1916 the Board recommended to Congress various changes in the Act, including alteration of Section 13 of the Act to allow Reserve Banks to accept “checks and drafts payable upon presentation within its district.”25 The amendment passed without change on September 7, 1916. The amendment also removed the qualification that Reserve Banks could only accept checks drawn on “solvent” banks, further expanding the field of acceptable checks. The Fed’s strategy was to make the service as attractive as possible by increasing the number of banks on which they could collect.26 Although nonmember banks still could not deposit checks directly with the Reserve Banks, they were entitled to send checks to the Fed through their correspondents that were member banks, and many apparently did (Spahr 1926, p. 197). The Board viewed the inability to accept checks directly from nonmember banks as an impediment to the success of the compulsory plan: “Any clearing and collection plan to be effective must be so comprehensive as to include all checks.”27 The Reserve Bank check collection service was at first intended as a benefit of membership, but the Board decided that it would be better to offer clearing services to nonmembers to entice them to remit at par on checks sent to them by the Reserve Banks through the mail. As part of a package of suggested amendments sent to the Congress in December 1916, the Board included an amendment to Section 16 that would permit nonmember banks to use the Fed’s clearing service, provided they agreed to pay their own checks at par and kept a “compensating balance” with the Reserve Bank. The amount of the compensating balance was to be determined by the Federal Reserve Board. 24 The Board’s circular stipulated that Reserve Banks keep an accurate account of the cost of the clearing service and that the Board would fix the charge by rule. Charges ranged from 0.9 cents to 2.0 cents per item. Charges were ultimately lowered and then abolished on July 1, 1918 (Spahr 1926, pp. 192–93, 211). 25 Federal Reserve Bulletin (1916, pp. 323–24). 26 In discussing the compulsory plan in June 1916, the Board said “it is thought that in the near future checks upon practically all banks throughout the United States can be handled at par by Federal Reserve Banks” (Federal Reserve Bulletin 1916, p. 263). 27 Federal Reserve Bulletin (1917, p. 100). 10 Federal Reserve Bank of Richmond Economic Quarterly Congress ultimately enacted the suggested check clearing provisions, but not without a fight that revealed the depth of opposition to the Fed’s check collection plans.28 After being called back into session in April, Congress again took up the Board’s suggested amendments. Representatives of the American Bankers Association took the opportunity to lobby for a bank’s right to charge presentment fees, even against the Reserve Banks. As a result, Senator Hardwick of Georgia introduced an amendment that would have added to Section 16 the proviso that nothing in the Act shall be construed as prohibiting a member or nonmember bank from “making reasonable charges, but in no case to exceed 10 cents per $100 . . . for collection or payment of checks and drafts. . . .”29 As drafted, the Hardwick Amendment would have effectively negated the provision of Circular No. 1 requiring banks participating in the compulsory plan to pay at par on checks sent by the Reserve Banks through the mail. The Hardwick Amendment was ruled out of order in the House, where the Board’s suggested amendments were passed on May 5. The Senate then approved the bill but included the Hardwick Amendment, despite the opposition of the Federal Reserve Board. In conference, Glass prevailed upon the conferees to modify the Amendment in two ways. The Federal Reserve Board was given the authority to determine and regulate the charges, and a clause was added stating that “no such charges shall be made against the Federal reserve banks.” The House passed the resulting bill, as did the Senate, after the reading of a letter from President Wilson that described the original Hardwick Amendment “as most unfortunate and as almost destructive of the function of the Federal reserve banks as a clearing house for member banks.”30 The Board’s ban on charging presentment fees against Reserve Banks was now law.31 As part of the compulsory plan, the Board directed Reserve Banks to maintain so-called “par lists” consisting of the nonmember banks in their districts that accepted checks at par. The par list and the Fed’s campaign for universal par presentment became the center of the celebrated “par collection controversy” (see Spahr [1926], Ch. 7). At first, only banks that explicitly agreed to remit at par were added to the System’s list, but in early 1919 a concerted effort was begun to expand the list. Reserve Banks took aggressive measures to 28 See Spahr (1926, p. 200) and Wyatt (1944). contemporary account of the legislative action appears in “The Hardwick Amendment” (1917, pp. 40 – 41). 30 Miller (1949, p. 20); Congressional Record (1917, p. 3761). 31 At the request of the Federal Reserve Board, the Attorney General issued an opinion on the scope of the new language in Section 13. He said that the Federal Reserve had no power to regulate the exchange charges of nonmember banks who were not depositors under the clearing system, and if nonmembers insisted on making charges, the Reserve Banks could not handle checks drawn on them, since Section 13 now prohibited such charges (Federal Reserve Bulletin 1918, pp. 367–70). 29 A Lacker, Walker, Weinberg: Check Clearing Reconsidered 11 attempt to collect at par on all banks in their districts.32 Some Reserve Banks put recalcitrant banks on the par list without their explicit permission; they accumulated their checks and had them presented directly over the counter, where banks were generally required to pay par.33 The nonmember par list grew from about 10,000 in December 1918 to over 19,000 at the high-water mark in November 1920, leaving only about 1,700 nonpar banks.34 The opposition to the Federal Reserve’s methods was fierce in some quarters, however. Some banks refused to cooperate, and the resulting litigation— including cases that reached the Supreme Court—established limits on the measures the Reserve Banks could employ to obtain par remittance.35 Checks could no longer be accumulated for presentation at the counter to “coerce” banks into paying par. Banks could pay checks at their counter by draft rather than lawful money. Moreover, the Supreme Court ruled that the Federal Reserve was under no congressional mandate to bring about universal par clearance. In response, the Board ordered the Reserve Banks to cease using agents other than banks in making collections and to stop accepting checks drawn on nonpar banks.36 Banks withdrew from the par list until nonpar banks numbered nearly 4,000.37 Nonpar banking persisted thereafter, chiefly in small one-bank towns or in small towns with only nonpar banks.38 The number of nonpar banks declined sharply in the early 1970s and finally sank to zero in 1980.39 3. THE CONVENTIONAL VIEW Many payments system researchers have described the pre-Federal Reserve check collection system as inefficient in the sense that real resource costs were higher than they would have been under a centrally run system. Likewise, it is conventionally argued that the Fed-imposed clearing arrangements resulted in lower real resource costs than private arrangements. Walter Spahr (1926) 32 See Harding (1921); Tippetts (1924, pp. 635–36); Tippetts (1929, pp. 277–80); Preston (1920, pp. 571–78). 33 Under some state laws, however, the bank was not required to pay in legal tender but could pay instead with a draft. See Spahr (1926, pp. 284–86). 34 Spahr (1926, p. 248) displays data compiled from various issues of the Federal Reserve Bulletin. 35 The major cases are described in Spahr (1926, pp. 249–82, 284–86) and Tippetts (1924, 1929). The key decision came in the “Richmond case,” Farmers and Merchants National Bank of Monroe, North Carolina, et al. v. Federal Reserve Bank of Richmond, Virginia, in which the U.S. Supreme Court upheld the constitutionality of a North Carolina law that authorized state banks to charge a presentment fee of no more than 1/8 of 1 percent and specifically allowed payment by draft for checks presented over the counter by a Federal Reserve Bank, post office, or express company. 36 Federal Reserve Bulletin (1923, pp. 903–04, 1194). 37 Federal Reserve Bulletin (1928, p. 535). 38 Stevens (1998, p. 19); Jessup (1967, p. 26). 39 The reasons for the decline of nonpar banking have not, to our knowledge, been studied. 12 Federal Reserve Bank of Richmond Economic Quarterly compiled the most comprehensive and widely cited statement of this view 12 years after the founding of the Federal Reserve. Most recent proponents of the conventional view echo Spahr. Spahr claimed that one of the most serious problems with the pre-Fed system was the excessive cost of collecting country checks and that nonpar checking was to blame. The practice of nonpar checking had its origins in earlier times when banks incurred significant shipping costs in remitting specie to distant banks for settlement. The amount of the charges had been steadily declining during the 50 years immediately prior to the Fed as transportation costs fell; nevertheless, nonpar remittance practices remained a significant source of revenue to some country banks and a significant source of irritation to some city banks in 1914. The most widespread criticism of exchange charges was that technological advances had removed most of the country banks’ settlement costs, eliminating the once-valid justification for the fees. Spahr describes the fees as “excessive” and out of proportion to the costs incurred by the levying banks (Spahr 1926, p. 241). Besides imposing excessive costs, presentment fees were said to cause costly and inefficient collection practices. Chief among these was circuitous routing: some checks were sent to banks in roundabout ways through a number of different banks in order to avoid presentment fees, resulting in excessive postage and clerical costs and extended check float. Although there are no data on the extent of circuitous routing to avoid exchange charges, the literature contains a number of examples. Cannon (1900) describes an example of a check drawn on The Peconic Bank of Sag Harbor, Long Island, and deposited in a bank in Hoboken, New Jersey. The now-famous check traveled from the Hoboken bank to a New York City bank, and then to banks in Boston, Tonawanda, Albany, Port Jefferson, Far Rockaway, New York City (again, but a different bank), Riverhead, and Brooklyn, before finally arriving at the Sag Harbor bank. James Hallock (1903) cites the Sag Harbor check and three others as well. Spahr (1926) cites the Sag Harbor check, Hallock’s Stonington, Connecticut, check, plus two additional examples. Other writers typically cite Cannon’s Sag Harbor check.40 Most of the pre-Fed writers on check clearing were city bankers; there is almost no early academic literature on the subject. Nonetheless, virtually all authors around that time supported the thrust of Spahr’s argument. In his 1890 annual report, the Comptroller of the Currency conveys an early official opinion on exchange charges by saying a conservative estimate of their total amount “would constitute a heavy burden upon the commercial interests of the country.” Hallock (1903, p. 17) asserts that the “avoidance of collection 40 See, for example, Conway and Patterson (1914, p. 324), Miller (1949, p. 10), or Baxter (1983, p. 560). Lacker, Walker, Weinberg: Check Clearing Reconsidered 13 charges is the motive for shunting a check up and down the country” and that “the practice is not unusual.” Oliver Sprague (1910, p. 42) observes that “collections and payments are subject to delay and involve heavy expense.” Similarly, turn-of-the-century Banker’s Magazine and A.B.A. Journal articles, some of which Spahr references in his book, confirm the common perception that bankers saw a need for reform. An array of historical descriptions of the U.S. payments system written since 1914 have either referenced or endorsed Spahr’s evaluation of the pre-Fed clearing system.41 Many of the founders of the Federal Reserve System shared Spahr’s assessment. In a debate prior to the Federal Reserve Act, Carter Glass revealed that he thought centralized clearing by the government would reallocate costs in a welfare-improving way: Precisely how much difficulty and cost will be incurred by the Federal Reserve Banks in carrying out the provisions of this section cannot be precisely calculated. It can, however, be positively stated that such expenditures will be very much less than those incurred by banks at the present day in carrying through their exchanges. The proposed provision will eliminate the numerous and well-founded complaints of unjust charges for exchange; and, while it will prevent certain banks from profiting as they do by exchange transactions it will correspondingly benefit the community.42 Glass expresses the idea here that although exchange-charging country banks will be made worse off, a Federal Reserve clearing system will make others better off by lowering their clearing costs. He appears to have in mind the proposition that aggregate costs will be lower overall, and thus the community will benefit. H. Parker Willis, Glass’s advisor and the first Secretary to the Federal Reserve Board, made similar criticisms of the old system in his post-1914 works. For example, Willis wrote the lead article in the March 1914 American Economic Review on the new Federal Reserve legislation and referred to exchange charges as “extortion.”43 41 For example, Conway and Patterson (1914) explain the “disadvantages” of the old methods, highlighting circuitous routing, excessive exchange charges, and unnecessary check float. Gidney (1916, p. 607) states, “Important economies are expected to be effected in the total cost of check collection, through having checks reach the paying bank by a reasonably direct route and after having passed through relatively few banks. . . .” Kemmerer (1928) describes a “defective exchange and transfer system,” adding “large shipments of currency” as another inefficiency. Tippetts (1929) gives a similar account, calling the system “the source of a number of evils.” Watkins (1929) emphasizes that exchange charges “operated under the old system to lessen its efficiency” and goes on to argue that such charges caused unduly large bankers’ balances. See also Miller (1949), Jessup (1967), Duprey and Nelson (1986), and Moore (1990). 42 U.S. Congress, House (1913) pp. 55–56. 43 For Willis’s opinion on Spahr (1926) and Cannon (1900), see Willis, et al. (1933), p. 238. W. P. G. Harding, then Governor of the Federal Reserve Board, stated that “the establishment of a universal country wide par-collection system” would result in the “elimination of the burdensome delays and expenses incident to the old indirect routing system” (Harding 1921, p. 338). 14 Federal Reserve Bank of Richmond Economic Quarterly In the conventional view, the Fed’s entry was a struggle by the progressive forces of banking reform against the vested interests of nonpar bankers. Duprey and Nelson (1986, p. 18) write: At the turn of the century . . . the private banking sector was widely acknowledged to have produced an inefficient and counterproductive arrangement for collecting checks beyond the local level. The invisible hand wasn’t working. This failure to produce an adequate solution for collecting out-of-town checks efficiently was one reason that the Congress, as part of its banking reform measures developed between 1908 and 1913, gave the Federal Reserve System both a regulatory role and an operating role in check clearing and collection. They go on to argue that the Fed’s quest for a universal par clearance system was frustrated by “stiff opposition and competition” from the correspondent banking system.44 Nonetheless, the Fed did enjoy “some success in improving the efficiency of intercommunity check collection”—the Fed reduced the number of nonpar banks “and probably helped limit abuses” in their practices. Summers and Gilbert (1996), drawing on Spahr, note “widespread dissatisfaction” with the settlement of interregional transactions pre-Fed and cite enhancing payment system efficiency as an important purpose for creating the Federal Reserve. In a similar vein, Gilbert (1998) concludes that, based on the fall in reserve holdings at banks joining the system, “evidence from the period when the Fed was founded suggests that the Fed’s services improved payments system efficiency.”45 4. A PROBLEM WITH THE CONVENTIONAL VIEW One reason to question the conventional view is that (with only a few exceptions) most writers do not explain why the participants in the check collection system were unable to implement efficiency improvements, if they were available. In other words, it is not clear why there would be a market failure in check 44 Duprey and Nelson (1986, p. 19). (1998, p. 137). One view which we do not discuss at length here is that Congress wanted the Fed to collect checks in order to prevent disruptions in check collection that accompanied financial panics, like the one in 1907 that occasioned widespread suspension of cash payments at banks (Corrigan 1983, pp. 345–48). While preventing financial panics was clearly the central motive behind the provisions moving reserve accounts over to the Reserve Banks, we know of no evidence that any of the founders perceived the functioning of the check collection system during panics, per se, as a motive for granting check clearing powers to the Reserve Banks. Moreover, it is hard to see why the rediscounting and open market powers of the Reserve Banks should not be sufficient to prevent financial disruption. Walter (1988, p. 57) reports that in congressional debate on the Federal Reserve Act there is no mention of the Fed providing check collection services to produce a safer payment system. “Senator Bristow and O. M. W. Sprague agreed, in an exchange during Senate hearings, that the problems with inter-city check collection during panics were caused by a lack of a lender of last resort” (U.S. Congress, Senate, Senate Committee on Banking and Currency 1913, pp. 512–13), cited in Walter (1988). 45 Gilbert Lacker, Walker, Weinberg: Check Clearing Reconsidered 15 collection. One might argue that it was beyond the capability of participants to create the Federal Reserve’s clearing system, since the Fed is a collective nonprofit institution.46 And yet the Reserve Bank’s check collection activities employed precisely the same technology and organizational techniques employed by the private sector. The Reserve Banks were essentially correspondent banks for the members of their clearing system, and their relationship with their respondents was organized in essentially the same way as private correspondent relationships. The Reserve Banks did set up a wire transfer system for moving funds rapidly between Reserve Banks, but private banks had been moving funds via wire transfer prior to the founding of the Fed (Langston 1921, pp. 168–72). Private clearinghouses were collective nonprofit institutions set up by banks, often endowed with quasi-regulatory power over their members.47 In fact, in the decades prior to the founding of the Fed, clearinghouses were making moves to expand their clearing activities to encompass checks drawn on country banks.48 The Boston clearinghouse had been clearing New England country checks for years.49 In short, banks could have set up the equivalent of the Federal Reserve Bank clearing system on their own. Presumably they would have done so if it would have made some participants better off without making others worse off—for example, if it would have appreciably lowered the costs of collecting checks. The fact that they did not do so seems to suggest that it would not have lowered costs. If the Reserve Bank clearing system lowered check clearing costs, why couldn’t the private sector do the same? Some believe one possible answer is that nonpar country banks enjoyed monopoly power. Presentment fees were set inefficiently high in order to extract rents from collecting banks. Such fees, it is said, “can lead to costly and complicated countermeasures” to avoid nonpar transactions.50 This view envisions a bank facing a choice between mailing directly to a nonpar bank with which it does not have a correspondent relationship and sending the check on to a correspondent who can present it at par. But it is not clear that this was always the case. In fact, presentment fees were often paid to nonpar banks by their correspondents. These collecting banks appeared to pay fees willingly in exchange for the respondent banks’ reserve balances (Spahr 1926, p. 111). Hence, presentment fees were often voluntarily agreed to as a component of a broader correspondent-respondent relationship. It is not clear how to 46 Duprey and Nelson (1986) seem to advocate this position when they argue that “wellestablished rivalries” between city and country banks somehow prevented banks from voluntarily agreeing to a mutually beneficial national clearing system. 47 Gorton (1985); Gorton and Mullineaux (1987). 48 Cannon (1900); Hallock (1903); Spahr (1926, pp. 119–30); Duprey and Nelson (1986, pp. 22–23). 49 Federal Reserve Bulletin (1916, p. 317). The Federal Reserve Bank of Boston took over the operations of the Boston clearinghouse by unanimous consent in 1916. 50 McAndrews (1995, p. 56). See also Gilbert (1998). 16 Federal Reserve Bank of Richmond Economic Quarterly reconcile this market power view with the documented features of correspondent relationships. Check collection routes were determined by the pattern of correspondent relationships. Thus, it is the market for correspondent relationships—as opposed to the market for the clearance of a particular check—that is relevant to the question of market power. What a country bank had to offer as a respondent— reserve balances—was available from other country banks as well. It appears unlikely that small country banks exercised any monopoly power in the market for correspondent relationships. 5. AN ALTERNATIVE VIEW The conventional view sees the pre-Fed check collection system as inefficient and disjointed, with much of the inefficiency being driven by the practice of nonpar remittance. An alternative interpretation is possible if we view the check collection system as a whole and focus on its network characteristics.51 While such terms as “network effect” or “network externality” are used widely by economists in reference to a variety of market settings, a check collection system (or any other payment clearing system) literally is a communication network. Two key characteristics are central to understanding the organization and performance of network communications industries: joint benefits and common costs. For most goods, a unit of consumption provides benefits to a single user. Some other goods or services provide simultaneous benefits to many people. For most such goods, a musical performance for example, one person’s utility does not depend on whether anyone else is partaking. One unit of a communication service, however, necessarily involves two “consumers”: a sender and a receiver. Neither party derives a benefit from communication unless the other one does. Similarly, the clearing of a check provides benefits jointly to both the payor and the payee. Note that the presence of joint benefits affects the criterion for judging whether provision of a unit of a good is economically efficient. For an ordinary good, we would say that a unit’s provision is efficient if the buyer’s willingness-to-pay exceeds the incremental resource costs of the good. For a network communication service, we would say that the provision of a unit of service is efficient if the sum of the willingness-to-pay of the sender and the receiver (or payor and payee in the case of a payment instrument) exceeds the incremental cost.52 51 For a discussion of the characteristics of network services as applied to payment systems, see Weinberg (1997) and Lacker and Weinberg (1998). James McAndrews (1995) first suggested that check collection at the founding of the Fed should be viewed as a network communications industry, analogous to credit card and ATM clearing networks. See also Summers and Gilbert (1996, pp. 6–7). 52 This distinction is emphasized by Baxter (1983). Lacker, Walker, Weinberg: Check Clearing Reconsidered 17 The fact that check clearing services provide joint benefits to pairs of users implies that there are common costs even at the level of the individual unit of service. Common costs are costs that cannot be uniquely attributed to the provision of service to particular users or groups of users. The incremental cost of a unit of a payment clearing service is common to the payor and payee in that it cannot be uniquely attributed to either. In network services markets, common costs tend to exist at a variety of levels. The technology for such services often includes substantial fixed infrastructure costs. The physical transport of items such as checks involves common costs since the cost of a trip cannot be attributed to particular items or particular pairs of senders and receivers. Many common costs in such markets are fixed relative to the quantity of a service provided. For example, the cost of a transportation node facility, such as a terminal, or (in the case of checks) a bank branch, often cannot be uniquely attributed to any particular item passing through it. The presence of substantial common costs implies that it is impossible to specify precisely an individual user’s share of total costs. Consequently, there is some ambiguity in determining the “right” price for a particular user to face. Efficiency requires that no individual or group pay less than its incremental cost, defined as the cost of extending service to the group in question given the level of services provided to all other users. Otherwise they might inefficiently overuse the service.53 If all users pay incremental cost, however, the service will not recover all of the costs that are common across groups of users. In order to cover common costs, the service must charge some users more than their incremental cost. There are often many ways to allocate common costs, all of which are consistent with efficient provision of the service.54 The presence of joint benefits and common costs gives rise to what are often called “network effects.” One person’s participation in a network brings benefits to that person as well as to all others who wish to communicate with him. It is important to note, however, that while these benefits may be external to the individual’s action, they are internal to the network in which he participates. A network’s participants, as a group, may have an incentive to shift common costs away from some individual participants: those who place 53 This principle must be modified in the presence of joint benefits: see discussion on the next page. 54 We use the term “efficient” in the sense of Pareto efficiency. An allocation is Pareto efficient if no party can be made better off without making some other party worse off. Some readers may be familiar with the Ramsey cost allocation principle, which states that each price charged by a multi-product provider should be set at a markup over costs that is inversely proportional to demand elasticity. This would appear to prescribe a uniquely efficient allocation of common costs. The Ramsey allocation is optimal, however, only under a particular assumption about the way in which social benefits are calculated, specifically, by adding up the utilities of individual agents. The Pareto criterion is less restrictive, and there will tend to be multiple Pareto efficient allocations. 18 Federal Reserve Bank of Richmond Economic Quarterly a relatively low private value on participation but who bring large external benefits to other participants. Efficient participation may even require that an individual pay less than their incremental cost. To be more precise, consider an existing network whose incremental cost of adding a particular new member is cn , while the mutual benefit to existing members of adding this new member is vn . For the potential new member, the corresponding incremental costs and benefits are ci and vi . It is efficient to add this member if vn + vi > cn + ci . Suppose that the network charges the new member the price p. The new member is willing to participate if vi > ci + p. The network is willing to add the new member if vn + p > cn . Thus, any price satisfying vi − ci > p > cn − vn induces efficient participation. Note that if vi − ci < cn , then the network must charge less than the network’s incremental cost of adding the new member, because the new member’s private net benefit from joining is low. Participation is efficient nonetheless because of the value, vn , the member brings to the network. Since the new member’s participation in the network brings joint benefits to all network participants, both cn and ci are common costs. The price determines who bears the common costs. 6. PRE-FED CHECK CLEARING When seen in terms of the allocation of the common costs of a network service, a very different view of the pre-Fed check collection system emerges. The centerpiece of the conventional view is the practice of nonpar collection and the resulting circuitous routing of checks. The level of presentment fees, however, helps determine the allocation of the common costs of check collection. Moreover, circuitous routing is not obviously wasteful, given the common costs of shipping check bundles. And complaints about excessive costs appear to be motivated by dissatisfaction with the allocation of costs among participants rather than the overall level of aggregate costs. Correspondent banking relationships were central to the clearing of checks before the founding of the Fed and can be understood quite clearly in terms of common costs. The correspondent relationship bundled together a number of distinct functions: the respondent used the correspondent to clear checks drawn on banks in the vicinity of or that had relationships with the correspondent; the correspondent presented checks drawn on the respondent; and the respondent held balances with the correspondent, which were used to settle clearings in either direction (Spahr 1926, p. 111). Settlement via interbank balances has clear advantages over settlement by remittance of specie or exchange draft, since the common cost of correspondent balances serves both investment and settlement functions (Watkins 1929, pp. 3–5). Holding balances with a bank in a financial center, where they could earn interest, was preferable to holding sterile reserves in the vault. Combining several items into a single Lacker, Walker, Weinberg: Check Clearing Reconsidered 19 shipment saved shipping costs. Selecting a limited number of correspondents was advantageous because it economized on the fixed costs associated with any given relationship; bilateral clearing and settlement arrangements with the universe of depository institutions would obviously be far too costly. The overall advantage of a correspondent relationship is that certain common costs are spread among a number of distinct payment services rather than duplicated across multiple service providers. Presentment fees can be easily understood as a means of allocating common costs. The presentment fee was the price paid to the paying bank for accepting presentment by mail rather than over the counter, where the paying bank was obligated to pay at par. Presentments, whether over the counter or by mail, were generally paid by debits to correspondent balances or by drafts. Mailing drafts to presenting banks involved postage costs, and so for some checks there was a positive incremental cost to the paying bank of accepting mail presentment. Otherwise the paying bank was largely indifferent about the means by which checks arrived for payment (Spahr 1926, pp. 99–101). In terms of our earlier notation, interpreted here as the incremental benefits and costs of accepting presentment by mail rather than over the counter, vi was approximately zero and ci was slightly positive. A collecting bank, on the other hand, was likely to place considerable value on having a means to avoid the real resource costs of making over-the-counter presentments at long distances. Thus we would expect a large value for −cn , the positive cost savings associated with mail presentment to country banks. Apart from costs, the collecting bank should be relatively indifferent about means of presentment, so vn should be approximately zero as well. If q is the presentment fee, then using our earlier notation, q = −p. With this change of variables, the condition for efficient participation is now ci − vi < q < vn − cn . The presentment fee must exceed the paying banks’ cost of participating, net of benefits, ci − vi > 0. Similarly, the presentment fee must not exceed vn −cn ≈ −cn > 0, the net incremental benefit to the collecting bank of adding the paying bank to the mail presentment network. The net benefit of switching to mail presentment is vn + vi − cn − ci ≈ −cn − ci , which is positive when the cost savings to the presenting bank, −cn , exceeds the incremental cost to the paying bank, ci . It seems likely that for many country checks, direct presentment was more costly than postage for remittance, and therefore mail presentment was economically efficient. Under the property rights inherent in the pre-Fed check clearing, the paying bank was free to choose the presentment fee q. In this setting one would predict that paying banks would set q as high as possible. We therefore should have observed q = −cn , presentment fees equal to the collecting bank’s net willingness 20 Federal Reserve Bank of Richmond Economic Quarterly to pay to avoid over-the-counter presentment.55 For checks drawn on country banks the cost savings associated with mail presentment were substantial, and thus −cn was large. When the paying bank was nearby, the resource cost of over-the-counter presentment was likely to be quite low, and the collecting bank’s willingness to pay for mail presentment would be correspondingly small. This is consistent with country bank presentment fees that are close to the cost to collecting banks of making a direct presentment through an agent or an express company, and the general absence of presentment fees in the city. The presentment fee effectively passes the collecting bank’s cost savings on to the country paying bank as an inducement to participate via mail presentment. All of the joint benefits of mail presentment, −cn −ci = q−ci , accrue to the paying bank. Although some banks complained that presentment fees exceeded the direct outlays of the paying bank for remittance (postage, the cost of exchange, and so on),56 there is no economic efficiency reason why they should not. It is clearly possible for the presentment fee to exceed ci , the paying bank’s direct outlays for remittance, without violating the condition for efficient participation. Thus presentment fees that are “excessive” in this sense are not necessarily evidence of monopoly power as some have claimed.57 Did the system in fact economize on the costs of moving checks? Much of the commentary on this topic has argued the contrary, based on welldocumented instances of circuitous routing. But it is not at all obvious that the examples of circuitous routing constitute strong evidence of excessive costs. Cannon (1900, p. 76), commenting on the famous Sag Harbor check, writes that The reason why banks forward checks in this apparently unreasonable way, often getting the items far out of their regular course, is easy to explain. It sometimes appears cheaper to the one who has the check in hand to enclose it with other items to some regular correspondent, who, assumedly, is nearer the bank on which the check is drawn, than to hunt up a special correspondent for it alone. This reasoning suggests that the cost comparison implicit in the conventional interpretation of the circuitous routing examples is not the relevant one. The appropriate comparison for the bank holding the check is between the cost of sending the item directly to the paying bank (or a “special correspondent” for this check alone) and the cost of including the item with a batch of checks being sent to an established correspondent in the hope that the correspondent 55 An alternative property rights regime in which collecting banks were entitled to mail presentment at par would result in q = ci − vi . In this case the cost savings from mail presentment would accrue to the collecting banks rather than to the paying banks. 56 Spahr (1926, p. 240–43). 57 Stevens (1998), for example. Lacker, Walker, Weinberg: Check Clearing Reconsidered 21 would be better able to get the check to its ultimate destination. For an irregular check—one drawn on a bank with whom one does not have a correspondent relationship—the incremental cost of sending the check directly to the paying bank would include the postage on the letter, along with the cost of preparing a separate shipment. The incremental cost of adding a check to a regular shipment to a nearby correspondent was probably negligible. Note also that settlement was probably less costly through established correspondent relations. Consequently, the latter was almost certainly less costly than direct presentment for a bank holding a check drawn on a country bank. Critics of pre-Fed check clearing implicitly attribute to the wayward check all of the common cost of the shipment to the correspondent.58 The bank deciding where to send an item would not consider the costs incurred by the next bank to hold the check. The next bank could send it on to another correspondent, which could send it on again to one of its correspondents, and so on. Could this lead to excessively costly check routes? From a social point of view, the correct cost comparison is between the expected cost of sending the check along to the next correspondent bank (including the cost of sending it to subsequent correspondents) and the cost of a more direct route. Again, there is no reason to believe that the expected cost of indirect routing was not almost always less than that of direct routing of irregular interregional checks. The fact that some items ended up following routes that look excessively costly ex post does not mean that routing choices were inefficient ex ante. Some mistaken routing choices were inevitable in a decentralized system with thousands of banks. The average costs of such mistakes constitute a valid part of the social cost of the pre-Fed system. The mere existence of such costs, however, does not imply inefficiency for the system as a whole. Only if the costs of the decentralized system exceeded the costs of creating a centralized system would inefficiency be implied. None of the critiques of the pre-Fed system has presented evidence on this dimension.59 Evidence that it is possible to reduce some collection costs is not, by itself, conclusive evidence that an alternative arrangement would be superior. Yet as implied in the comments by Carter 58 A possibility we do not pursue here is that the cost of postage does not represent the social cost of mailing checks. Since postage rates, then as now, are uniform across destinations, it is quite likely that postage on the irregular country checks in question was lower than incremental social cost, in the sense that the total postal revenue on shipments to the country bank’s location failed to cover the incremental cost of service to that location. This would provide yet another reason to question the cost comparison implicit in the circuitous routing evidence. It would also cast doubt on the social value of the movement to shift to direct mail presentment as opposed to direct presentment by an agent such as an express company. 59 There appear to be no available estimates of the frequency of such circuitous routing. Cannon (1900), Hallock (1903), and Spahr (1926) cite only eight examples between them. Referring to the Sag Harbor check, James (1998, p. 144) notes that “Given the paucity of other examples, one might be suspicious of this example’s general applicability.” 22 Federal Reserve Bank of Richmond Economic Quarterly Glass above, to build an alternative system that reduces the costs associated with collecting certain checks would itself be costly. Similarly, some pre-Fed writers, such as Cannon (1900) and Hallock (1903), used evidence on the costs of collecting country checks to argue for the value of creating and operating a country clearinghouse. As noted above, the frustratingly slow pace of such efforts suggests that the costs of creating a country clearinghouse exceeded the available cost savings. If the status quo was efficient, why was there so much dissatisfaction with it? Here, it is important to note that complaints about the system are almost entirely voiced from the point of view of city banks. It was argued that the cost of collecting country checks was too high, and it was widely acknowledged that a par collection system would reduce the earnings of the nonpar country banks.60 In other words, par collection would reduce net costs for city banks and raise net costs for nonpar country banks. The complaints of the city banks seem to have been driven by dissatisfaction with the allocation of costs implied by the status quo arrangements, rather than by dissatisfaction with aggregate cost.61 What city banks wanted was for presentment through the mail to have parity with direct presentment, in which case the paying bank would be obligated to pay at par, without deducting a presentment fee. Such a regime would have inevitably shifted costs toward country banks and away from city banks. Although city banks were generally unable to obtain par presentment legislation, the Federal Reserve Board ultimately granted that right to Reserve Banks by regulatory fiat. 7. THE FED’S ENTRY INTO CHECK CLEARING RECONSIDERED According to the conventional view, self-evident inefficiencies motivated and rationalized the Fed’s entry into the check collection industry. The process involved a struggle between a progressive reform effort and the vested interests of nonpar banks. Our alternative view suggests a very different interpretation 60 Most of Spahr’s (1926, pp. 240–43) “arguments for par collection” amount to claims that costs borne by collecting city banks could and should be reduced. For example, he argues that par collection under the Federal Reserve would “relieve trade . . . of the burden” of clearing costs because “the costs would fall to the Federal Reserve Banks and reduce to that extent the earnings that go to the government” (Spahr 1926, p. 240). That is, the government should defray collecting bank costs. 61 City banks’ complaints about the costs of collecting country checks should also be viewed in the context of secular trends in the structure of the banking industry. The total number of banks in the country grew from 12,424 in 1902 to 26,765 in 1914, and many of these new banks were small country banks (U.S. Treasury, Office of the Comptroller of the Currency 1903, 1915). Thus there were a growing number of country banks to contend with. Lacker, Walker, Weinberg: Check Clearing Reconsidered 23 of the process by which the Fed entered the check collection industry. In our view, the motive was to attract membership, and the process involved a struggle over the allocation of the common costs of check collection. All national banks were required to join the Federal Reserve System, but for state-chartered banks, membership was optional. From the beginning, attracting members, and their reserve balances, was viewed as critical to the success of the institution (White 1983, p. 130). A key perceived defect of the previous system was the “pyramiding” of reserves in financial centers, which left the latter vulnerable to sudden widespread withdrawals. Through rediscounting the Reserve Banks would provide an elastic supply of balances in response to rapid demand shifts, preventing financial panics (White 1983, pp. 63–125). In this context, the Reserve Bank check clearing service authorized by the Act could help attract members, as the early leaders of the Federal Reserve clearly understood. A mid-1915 report to the Reserve Bank Organizing Committee spelled out the link between Reserve Bank check clearing services and the membership question: It must be borne in mind that the banking power of the United States will divide more sharply than it has ever done before into two groups—members and non-members. It is the intent of the Act itself to bring non-members into the system. But so long as there is any considerable body of non-member banks, the two groups will of necessity be in competition with one another, producing two parallel clearing systems. . . . (T)he domestic exchange business of the Federal reserve system must be so arranged as to offer constant inducements to non-members to enter the system. At the same time, members must find it more profitable to use the Federal reserve system than to make collections as at present. The situation is more complex when it is taken into consideration that member banks are in a position to deal on favorable terms either with the Federal reserve banks and their members or with non-members.62 The Fed’s check clearing service should aim to reduce costs to members and attract nonmembers to join the system. To do so, they would need to attract the check clearing business of their members, who were under no obligation to clear through the Fed. The reserve requirements in the Federal Reserve Act, while essential to the monetary goals of the Act, made it more difficult to attract members. Prior to the founding of the Fed, banks kept reserves with correspondents in addition to specie and notes in their vaults. Correspondent balances could be used to satisfy legal reserve requirements, up to a limit, under state laws and the National Bank Act (Watkins 1929, pp. 67, 96). Moreover, correspondents would often grant 62 Preliminary Committee on Organization (1914, pp. 58–59). The organizing committee consisted of the Comptroller of the Currency and the Secretaries of the Treasury and Agriculture. The Preliminary Committee on Organization reported to them and was chaired by H. Parker Willis. 24 Federal Reserve Bank of Richmond Economic Quarterly immediate credit for deposited checks, and these counted toward the respondent’s required reserves. Under the Federal Reserve Act, reserve requirements had to be met with balances held at the Reserve Banks; correspondent balances would no longer count. If the Reserve Banks did not offer check clearing services, member banks would have to hold separate correspondent balances in order to clear checks, and these balances would have a higher opportunity cost, since they would no longer do double duty. Willis (1923, p. vi) described the implications in dramatic terms: It was recognized that, without these powers, [referring to Reserve Bank check clearing authority] the reserve banks would become merely the holders of dead balances carried for the member banks without any service to them; and, since the business public abhors an idle or unnecessary institution, just as nature is traditionally said to abhor a vacuum, it would not submit long to the needless burden created by such emergency institutions designed to put out financial fire. Failure to offer attractive check clearing services to justify member bank reserve balances would threaten support for the System.63 The reserve requirement provisions of the Federal Reserve Act were to be phased in over three years, so the System had time to develop a strategy. According to the original Act, member bank required reserves would be transferred over from correspondents in annual steps from November 16, 1914, to November 16, 1917.64 These provisions were revised by the amendments of June 21, 1917, in connection with measures to aid financing of the government’s war effort, lowering the requirements but making them effective immediately.65 The Fed’s struggle to establish its check clearing service is readily understandable from our alternative perspective. The first national check clearing venture, the “voluntary reciprocal plan” initiated by the Board in March 1915, was unsuccessful, never attracting more than a third of the member banks. In exchange for agreeing to accept mail presentment at par, member banks were able to clear at par on members that joined. This was essentially a voluntary clearinghouse, modeled after the city clearinghouses. That it failed should be no surprise, given the terms that were offered. Reciprocal par presentment allocates common costs according to each bank’s outlays. This allocation successfully attracts members where the cost of over-the-counter presentment is low, as among city banks. In this case, a bank joining the scheme gives up little in presentment fee income. But where over-the-counter presentment costs are 63 Stevens (1998) also discusses the role of reserve requirements in the evolution of the Fed’s check clearing activities. 64 Starting on November 16, 1914, correspondent balances could count towards a maximum of 6/15 of required reserves in the first year, 5/15 the second year, 4/15 the third year, 3/15 the third year, and not at all after November 16, 1918. 65 The Board gave banks until July 15, 1917, to comply (Federal Reserve Bulletin 1917, pp. 508–09). Lacker, Walker, Weinberg: Check Clearing Reconsidered 25 high and banks have the right to charge for remittance outside the scheme, this allocation of common costs may fail to induce participation; some prospective members would have no incentive to join, even if their participation would be worthwhile. Two-thirds of member banks apparently did not want to join under the reciprocal par presentment cost allocation. Some might have brought large benefits to the other participants in the scheme by lowering presentment costs, even though they themselves placed a relatively low value on participation. As we noted above, these are just the types of banks that charge presentment fees. The Fed needed to find a way to induce their participation. With its second venture, the “compulsory” plan, the Fed found a solution to the problem: under Circular No. 1 (May 1, 1916), members were required to accept mail presentment at par. Having failed to induce more than a third of their members to voluntarily give up charging presentment fees against the Reserve Banks, the Fed prohibited such fees outright. The compulsory plan was more attractive to members than the voluntary-reciprocal plan on two counts. First, the Reserve Banks were offering to clear checks on any member bank, regardless of how many member banks joined the scheme. The Reserve Banks immediately had over 7,000 par endpoints. Second, the cost of joining the second plan was much lower for many banks. Under the previous plan they would have had to give up presentment fees. Under the new plan, the Board had already taken away their right to charge presentment fees against the Reserve Banks. Joining was less of a sacrifice now. Success was not yet assured. In about a year, reserves would be transferred over from correspondents to the Reserve Banks under the new reserve requirements. As we noted above, correspondent balances were a key component of the bundle of mutual clearing services that made up the typical correspondent banking relationship. Settling cleared checks by crediting or debiting correspondent balances was less costly than remitting specie or exchange drafts. Members would need to retain some correspondent balances to clear checks on nonmembers, and such balances would no longer do double duty as required reserves. The new reserve requirements would break apart some of the shared common costs built into pre-Fed correspondent banking arrangements, raising member bank costs.66 The Fed’s strategy now was to rebuild that cost sharing around member bank balances at Reserve Banks. The objective was to offer to collect at par checks drawn on every bank in the country.67 In theory, member banks would no longer need external correspondent balances, the Reserve Banks having taken over all the essential clearing functions connected with them. To this end, the Board sought the amendments passed on September 7, 1916, allowing Reserve Banks to clear checks drawn on nonmembers. But the 66 Martin, et al. (1915, pp. 369–70). clearing and collection plan to be effective must be so comprehensive as to include all checks” (Federal Reserve Bulletin 1917, p. 100). 67 “Any 26 Federal Reserve Bank of Richmond Economic Quarterly Reserve Banks had trouble getting nonmember banks to accept presentment at par, as one would expect; the Reserve Banks had nothing of value to offer in exchange. This led the Board in early 1917 to seek amendments allowing the Reserve Banks to accept checks deposited by nonmembers. Perhaps reciprocal check clearing privileges would entice nonmembers to give up their presentment fee income. By early 1917, however, the Board’s ban on presentment fees against the Reserve Banks made clear that check collection costs would be reallocated as a by-product of the Fed’s strategy to eliminate presentment fees. Bankers who would be disadvantaged by such a reallocation mobilized to push the Hardwick Amendment. The resulting legislative battle exposed the divergent interests in the allocation of check collection costs. The Hardwick Amendment was effectively defeated following a “nationwide campaign . . . by the Credit Men’s Association, the mail-order houses, manufacturers, jobbers, wholesalers and merchants in the large centers.”68 Payees, in other words, lobbied in favor of shifting costs towards payor banks.69 An ensuing opinion by the Attorney General spelled out the new distribution of property rights.70 Reserve Banks could not pay presentment fees. Nonmember banks could decide for themselves whether to charge fees, but assessing fees against a Reserve Bank for mail presentment was the equivalent of not accepting mail presentment from them. In this environment, the Reserve Banks attempted to exercise as much leverage as possible to persuade nonmembers to pay at par. During the period from early 1919 through 1923, the Reserve Banks resorted to a number of costly collection techniques, such as sending Reserve Bank employees to present over the counter or hiring local agents to make direct presentation. These were characterized in litigation as outside the bounds of customary banking practice; in many cases the expenditures on such techniques exceeded the presentment fees that were avoided and thus would not have been undertaken by private sector collecting banks.71 The list of banks accepting par presentment was naturally largest in the presence of such measures. When court decisions struck them down, Reserve Bank leverage was commensurately reduced and the par list shrank. 68 “The Hardwick Amendment” (1917, p. 40). See also Tippetts (1929, pp. 272–74). that costs were shifted to taxpayers as well, since the Fed stopped recovering its costs when collection fees were eliminated in June 1918 (Spahr 1926, pp. 192–93). 70 Federal Reserve Bulletin (1918, pp. 367–70). 71 Spahr (1926); Tippetts (1929). 69 Note Lacker, Walker, Weinberg: Check Clearing Reconsidered 8. 27 CONCLUSION The Reserve Banks were able to achieve what reformers had been unable to bring about—a more centralized clearing of interregional checks. Although early reform-minded writers, like Cannon (1900) and Hallock (1903), had argued that the cost savings from eliminating what they saw as inefficiencies in clearing irregular interregional checks would exceed the cost of setting up a more centralized arrangement, no such schemes had emerged. The Fed succeeded where earlier efforts failed, under our alternative view, because the Board arrogated the right to present at par on member banks. No such unilateral ability to reallocate property rights was available to private sector collecting banks. Reallocating property rights had the effect of shifting the common costs of check collection away from collecting banks that used the Reserve Bank system, toward member banks and, after fees were eliminated in 1918, toward federal taxpayers. The par presentment right granted to the Reserve Banks by the Board in Circular No. 1, Series of 1916 (now Regulation J), is essentially a barrier to competition in the sense that it allows the Reserve Banks to offer check collection services at lower costs than competitors. Other banks could obtain the right to present at par, but they would have to offer paying banks material compensation in order to do so. The Reserve Banks did not need to offer any compensation. Note that this barrier to competition persists today in the form of differential presentment times (Lacker and Weinberg 1998). Private sector collecting banks must present by 8:00 a.m. in order to obtain same-day funds (at par), while Reserve Banks can present until 2:00 p.m. for same-day funds; the Reserve Banks have a “six-hour monopoly.” Interestingly, the Board of Governors recently revisited the presentment time differential.72 The statement announcing the Board’s decision to retain the competitive advantage notes that any equalization would reallocate costs in a way that would be disadvantageous to some segment of the industry. Moving the private presentment time later, for example, would make collecting banks better off and paying banks worse off.73 Ironically, the Fed’s original entry into check collection appears to have been accomplished by reallocating the common costs of check collection in just this fashion. 72 Board 73 Board of Governors of the Federal Reserve System (1998). of Governors of the Federal Reserve System (1998, p. 12). 28 Federal Reserve Bank of Richmond Economic Quarterly REFERENCES Baxter, William F. “Bank Interchange of Transactional Paper,” Journal of Law and Economics, vol. 26 (October 1983), pp. 541–88. Benston, George J., and David B. Humphrey. “The Case for Downsizing the Fed,” Banking Strategies (January/February 1997), pp. 30–37. Board of Governors of the Federal Reserve System. “Collection of Checks and Other Items by Federal Reserve Banks and Availability of Funds and Collection of Checks” (Docket No. R–1009). Available: http://www.bog.frb.fed.us/boarddocs/press/BoardActs/1998/19981209/R-1009.pdf [March 3, 1999]. . Regulations. “Regulation J, Collection of Checks and Other Items by the Federal Reserve Banks and Funds Transfers through Fedwire,” as amended 1998. . “The Federal Reserve in the Payments System,” Federal Reserve Bulletin, vol. 76 (May 1990), pp. 293–98. . Seventh Annual Report, 1920. Washington: Government Printing Office, 1921. . Annual Report of the Federal Reserve Board, 1914. Washington: Government Printing Office, 1915. Cannon, James G. Clearing-Houses: Their History, Methods and Administration. London: Smith, Elder, & Co., 1900. Congressional Record. Washington, D.C., 1917. Conway, Thomas, Jr., and Ernest M. Patterson. The Operation of the New Bank Act. Philadelphia: J. B. Lippencott Co., 1914. Corrigan, E. Gerald. “Background Materials to the Statement of E. Gerald Corrigan,” in The Role of the Federal Reserve in Check Clearing and the Nation’s Payments System. Finance and Urban Affairs, U.S. Congress, House of Representatives. Subcommittee on Domestic Monetary Policy of the Committee on Banking, and the Subcommittee on Commerce, Consumer, and Monetary Affairs of the Committee on Government Operations. Hearing, 98 Cong. 1 Sess. Washington: Government Printing Office, 1983. Duprey, James N., and Clarence W. Nelson. “A Visible Hand: The Fed’s Involvement in the Check Payments System,” Federal Reserve Bank of Minneapolis Quarterly Review, vol. 10 (Spring 1986), pp. 18–29. Lacker, Walker, Weinberg: Check Clearing Reconsidered 29 Federal Reserve Bank of Richmond. “The Development of Par Collections by Federal Reserve Banks,” Letters to College Classes in Economics and Banking Discussing The Practical Operations of the Federal Reserve System, no. 6 (May 1922). Fellows, E. G. The Exchange Charge on Checks. Dissertation, Rutgers University, 1940. Friedman, Milton, and Anna Jacobson Schwartz. A Monetary History of the United States: 1867–1960. Princeton: Princeton University Press, 1963. Gidney, Ray. “Notes: Federal Reserve Clearings and Collections,” Journal of Political Economy, vol. 24 (June 1916), pp. 606–08. Gilbert, R. Alton. “Did the Fed’s Founding Improve the Efficiency of the U.S. Payments System?” Federal Reserve Bank of St. Louis Review, vol. 80 (May/June 1998), pp. 121–42. Gorton, Gary. “Clearinghouses and the Origin of Central Banking in the United States,” Journal of Economic History, vol. 45 (June 1985), pp. 277–83. , and Donald J. Mullineaux. “The Joint Production of Confidence: Endogenous Regulation and Nineteenth Century Commercial-Bank Clearinghouses,” Journal of Money, Credit, and Banking, vol. 19 (November 1987), pp. 457–68. Hallock, James C. Clearing Out-of-Town Checks. St. Louis: Author, 1903. Harding, W. P. G. “Reply to Senate Resolution Relative to Check Clearing,” in Board of Governors of the Federal Reserve System, Seventh Annual Report, 1920. Washington: Government Printing Office, 1921, pp. 335–57. “The Hardwick Amendment,” Journal of the American Bankers Association (July 1917), pp. 40–41. James, John A. “Commentary,” Federal Reserve Bank of St. Louis Review, vol. 80 (May/June 1998), pp. 143–50. Jessup, Paul F. The Theory and Practice of Nonpar Banking. Evanston, Ill.: Northwestern University Press, 1967. Kemmerer, Edwin W. The ABCs of the Federal Reserve System. Princeton: Princeton University Press, 1928. Lacker, Jeffrey M., and John A. Weinberg. “Can the Fed be a Payment System Innovator?” Federal Reserve Bank of Richmond Economic Quarterly, vol. 84 (Spring 1998), pp. 1–25. Langston, L. H. Practical Bank Operations. New York: Ronald Press Co., 1921. Martin, Wm. McC., Frederic H. Curtiss, and Pierre Jay. “Report of the Federal Reserve Agents’ Committee on Clearings,” Federal Reserve Bulletin, vol. 1 (November 1915), pp. 369–72. McAndrews, James J. “Commentary,” Federal Reserve Bank of St. Louis Review, vol. 77 (November/December 1995), pp. 55–59. 30 Federal Reserve Bank of Richmond Economic Quarterly Miller, Melvin C. The Par Check Collection and Absorption of Exchange Controversies. Cambridge: The Bankers Publishing Co., 1949. Moore, Carl H. The Federal Reserve System. London: McFarland & Co., 1990. Preliminary Committee on Organization. Report to the Reserve Bank Organizing Committee. New York: Reserve Bank Organizing Committee, June 1, 1914. Preston, Howard H. “The Federal Reserve Banks’ System of Par Collection,” Journal of Political Economy, (July 1920), pp. 565–90. Spahr, Walter Earl. The Clearing and Collection of Checks. New York: Bankers Publishing Co., 1926. Sprague, Oliver M. History of Crises under the National Banking System, U.S. National Monetary Commission, Senate Document No. 538, 61 Cong. 2 Sess. Washington: Government Printing Office, 1910. Stevens, Ed. “Non-Par Banking: Competition and Monopoly in Markets for Payments Services,” Working Paper 9817. Cleveland: Federal Reserve Bank of Cleveland, November 1998. . “The Founders’ Intentions: Sources of the Payments Services Franchise of the Federal Reserve Banks,” Working Paper 03–96. Cleveland: Federal Reserve Bank of Cleveland, Financial Services Working Paper Series, December 1996. Summers, Bruce J., and R. Alton Gilbert. “Clearing and Settlement of U.S. Dollar Payments: Back to the Future?” Federal Reserve Bank of St. Louis Review, vol. 78 (September/October 1996), pp. 3–27. Timberlake, Richard H. The Origins of Central Banking in the United States. Cambridge, Mass.: Harvard University Press, 1978. Tippetts, Charles S. State Banks and the Federal Reserve System. New York: D. Van Nostrand Co., 1929. . “The Par Remittance Controversy,” American Economic Review, vol. 14 (December 1924), pp.629–48. U.S. Congress, House of Representatives, House Report 69, 63 Cong. 1 Sess. Washington: Government Printing Office, 1913. U.S. Congress, Senate, Senate Committee on Banking and Currency, Hearings on H. R. 7837 (S. 2639), 63 Cong. 1 Sess., November 6, 1913. Washington: Government Printing Office, 1913, pp. 512–13. U.S. Treasury, Office of the Comptroller of the Currency. Annual Report, 1914. Washington: Government Printing Office, 1915. . Annual Report, 1902. Washington: Government Printing Office, 1903. Lacker, Walker, Weinberg: Check Clearing Reconsidered 31 Walter, John R. An Examination of the Factors Motivating the Federal Reserve System’s Entrance into the Provision of Check Collection Services. Dissertation, University of Delaware, 1988. Watkins, Leonard L. Banker’s Balances. Chicago: A. W. Shaw, 1929. Weinberg, John A. “The Organization of Private Payment Networks,” Federal Reserve Bank of Richmond Economic Quarterly, vol. 83 (Spring 1997), pp. 25–43. White, Eugene Nelson. The Regulation and Reform of the American Banking System, 1900–1929. Princeton: Princeton University Press, 1983. Williams, Charles R. The Bankers Encyclopedia. Chicago: The Bankers Encyclopedia Co., 1901. Willis, Henry P. “The Federal Reserve Act,” American Economic Review, vol. 4 (March 1914), pp. 1–24. Willis, Henry Parker. The Federal Reserve System: Legislation, Organization and Operation. New York: The Ronald Press Co., 1923. Willis, H. Parker, John M. Chapman, and Ralph West Robey. Contemporary Banking. New York: Harper & Brothers Publishers, 1933. Wyatt, Walter. “The Par Clearance Controversy,” Virginia Law Review, vol. 30 (June 1944), pp. 361–97. A Forward-Looking Monetary Policy Reaction Function Yash P. Mehra T he Federal Reserve’s reaction function, which summarizes how the Federal Reserve (Fed) alters monetary policy in response to economic developments, plays an important role in macroeconomic and policy analyses. It can be helpful in predicting actual policy actions, thereby serving as a benchmark for assessing the current stance and the future direction of monetary policy. Also, in macro models, the reaction function is central in evaluating Fed policy and determining effects of other macro policies or economic shocks, implying macroeconomic performance may itself depend upon the conduct of monetary policy. Consequently, there is considerable interest in identifying the nature of actual policy pursued by the Fed and determining whether the estimated reaction function fostered or hindered macroeconomic stability.1 Although numerous monetary policy reaction functions have been estimated, in this article I estimate one that sheds new evidence on the nature of Fed policy since 1979. In particular, I present and estimate a forward-looking The views expressed are the author’s and not necessarily those of the Federal Reserve Bank of Richmond or the Federal Reserve System. 1 See, for example, Clarida, Gali, and Gertler (1998b) and Taylor (1998). Clarida, Gali, and Gertler estimate a forward-looking version of the Taylor rule for the post–World War II U.S. economy and find that the interest rate policy in the Volcker-Greenspan period was more responsive to changes in expected inflation than it had been in the pre-Volcker period. They then embody these estimated policy rules within a standard business cycle model and analyze the statistical properties of inflation and output. They show that the estimated pre-Volcker rule permits larger fluctuations in inflation and real output (and therefore greater macroeconomic instability) than does the Volcker-Greenspan rule. Similarly, Taylor (1998) argues that U.S. interest rate policy during the period 1986 to 1997 is well predicted by the Taylor rule and that this period in U.S. monetary policy had the greatest degree of economic stability. Both inflation and real output had smaller fluctuations during this period. He then identifies two other periods, 1879 to 1914 and 1960 to 1979, when policy deviated significantly from the Taylor rule in that the nominal interest rate was not very responsive to both inflation and real output fluctuations. During those periods macroeconomic performance was not as good. Federal Reserve Bank of Richmond Economic Quarterly Volume 85/2 Spring 1999 33 34 Federal Reserve Bank of Richmond Economic Quarterly monetary policy reaction function that predicts the actual path of the funds rate during most of the period from 1979 to 1997. The distinguishing characteristic of this policy reaction function is that policy responds to movements in longterm inflationary expectations as evidenced by the behavior of the bond rate, an issue discussed first in Goodfriend (1993) but ignored in the recent empirical work on estimated monetary policy rules.2 I also examine whether the policy reaction function has changed significantly during the 1990s, especially during Alan Greenspan’s tenure as Fed Chairman. Finally, since this reaction function predicts actual policy actions fairly well, I discuss whether policy during the most recent period 1997Q1 to 1998Q2 is consistent with prior Fed behavior. This period is of interest because during this period the Fed did not adjust the funds rate in response to above-trend real growth. The policy reaction function that I consider here has both backward- and forward-looking components. It assumes that the funds rate responds to actual inflation, increases in expected future inflation, expected output gap, and the bond rate. The funds rate response to the bond rate captures the influence of long-term inflationary expectations on policy. The empirical work here, which focuses on the behavior of the funds rate over two sample periods, 1960Q2 to 1979Q2 and 1979Q3 to 1998Q2, broadly supports this specification. However, policy responses differ across these sample periods. The most significant difference is that the funds rate has responded to movements in the bond rate after 1979 but not before. This indicates that since 1979 the Fed has been very sensitive to long-term expected inflation; so much so that for most of this period the nominal funds rate has moved more than one-for-one with actual inflation. Hence the real as well as the nominal funds rate increased in response to inflation. That is not the case in the pre-1979 period, when the nominal funds rate did not adjust one-for-one with actual inflation. In that period the real funds rate declined in response to actual inflation. The policy reaction function given here tracks the actual behavior of the funds rate more closely since 1979 than it does in the period before. The sample period 1979Q3 to 1998Q2 spans the tenures of Paul Volcker and Alan Greenspan as Fed Chairmen. The results, however, indicate that the policy reaction function has not changed much between the Volcker and Greenspan periods. Finally, policy during the most recent subperiod 1997Q1 to 1998Q2 is consistent with prior Fed behavior. While the U.S. economy has grown at a very strong rate during this period, actual inflation has fallen steadily and 2 Clarida, Gali, and Gertler (1998a) and Clarida, Gali, and Gertler (1998b) estimate forwardlooking versions of the Taylor rule for G7 countries including the United States, but they do not examine the role of the bond rate. Nor do they examine issues relating to the stability of the reaction function and the ability to predict actual policy. Mehra (1997), on the other hand, does consider the response of policy to the bond rate and finds that the bond rate is significant in the reaction function. Y. P. Mehra: Monetary Policy Reaction 35 long-term inflationary expectations as measured by the behavior of the bond rate have remained well behaved. Furthermore, there is also some evidence that the economy’s underlying trend growth rate may have increased somewhat during the ’90s. Once we consider together the influences of all these economic factors on the funds rate target, then the actual funds rate, which hovered around 5.5 percent during this subperiod, is not too different from the value predicted by the policy reaction function. On a more intuitive level, the surprisingly good performance of the economy on the inflation front combined with well-behaved long-term inflationary expectations and higher estimates of trend growth worked to neutralize the tighter policy response indicated by above-trend growth. Accordingly, the absence of any policy move during this subperiod is not out of line with prior Fed behavior. 1. THE MODEL AND THE METHOD A Forward-Looking Specification The policy reaction function considered here builds upon the work in Taylor (1993), Mehra (1997), and Clarida, Gali, and Gertler (1998b). The particular specification estimated here can be derived using the following two equations: FR∗t = rr + a11 INFLt−1 + a12 (INFLt−1 − INFL∗ ) + a21 (EINFLt+1 − INFLt−1 ) + a31 (BRt − EINFLt+k ) + a41 EGAPt+k , and FRt = (1 − ρ)FR∗t + ρFRt−1 + vt ; 0 ≤ ρ ≤ 1, (1) (2) where FRt is the actual funds rate; FR∗t is the Fed’s nominal funds rate target for period t; INFL is the inflation rate; INFL∗ is the Fed’s inflation target; GAP is the output gap; BR is the bond rate; rr is the economy’s underlying equilibrium real interest rate; vt is a stochastic disturbance term; and E is the expectations operator. Equation (1) specifies the economic determinants of the funds rate target. It assumes that the Fed has a target for inflation and a target for the level of output. It also assumes that the Fed pays attention to actual inflation as well as to the expected change in its future direction. Equation (1) thus hypothesizes that the funds rate target each period is determined as a function of the real rate of interest, actual inflation, and gaps between actual inflation and expected output and their respective target levels. Since the Fed is concerned with the expected future direction of inflation, equation (1) also posits that the funds rate target depends upon the change in expected future inflation and the bond rate. The other important assumption implicit in (1) is that the economy’s underlying real rate of interest and the Fed’s short-term target for inflation are constant in the short run. 36 Federal Reserve Bank of Richmond Economic Quarterly Equation (2) specifies the actual funds rate as a weighted average of the last-period funds rate and the current-period funds rate target. It assumes the Fed smoothes short-run changes in interest rates and hence the actual funds rate adjusts gradually to the target implied by economic fundamentals specified in (1) (Goodfriend 1991). The magnitude of the parameter ρ measures the degree of interest rate smoothing in Fed behavior. In equation (2) there is also a shock term vt , indicating that the Fed may deviate transitorily from its systematic rule in (1). I, however, assume these transitory deviations from the policy rule are not serially correlated, as in true “policy shocks.” If we substitute (1) into (2), we get (3), which is the policy reaction function investigated here. FRt = (1 − ρ)(rr − a12 INFL∗ ) + ρFRt−1 + (a11 + a12 )(1 − ρ)INFLt−1 + a21 (1 − ρ)(EINFLt+1 − INFLt−1 ) + a41 (1 − ρ)(EGAPt+1 ) + a31 (1 − ρ)(BRt − EINFLt+1 ) + vt , (3) where all variables are defined as before. Equation (3) is the short-run policy reaction function where the funds rate in period t is determined in part by its actual value in the previous period and in part by previous, current, and expected future values of economic factors. The reaction function indicates that the funds rate rises if actual inflation increases, if future inflation is expected to increase, if the bond rate moves relative to its value consistent with the Fed’s current forecast of near-term expected future inflation, or if the expected future output gap is positive. The parameters aij (1 − ρ), i = 2, 3, 4, j = 1, 2, measure these short-run responses, their magnitudes being determined in part by the degree of interest rate smoothing in Fed behavior. If the Fed does not smooth interest rates, then ρ is zero and the funds rate adjusts each period in response to changes in its economic determinants. The coefficients aij , i = 2, 3, 4, j = 1, 2, then measure the responses of the funds rate target to economic fundamentals within each period. Consequently, the period t responses are the long-term responses. The key feature of the short-run reaction function (3) is that the funds rate is assumed to respond to long-term inflationary expectations imbedded in the bond rate. Goodfriend (1993) has convincingly argued that in order to establish and maintain credibility to low inflation, the Fed has reacted to the long bond rate. However, since an expected future inflation variable is already included in the reaction function, the bond rate should influence the funds rate only if it contains information beyond that which is already imbedded in the Fed’s current forecasts of the future inflation rate. As a result, the funds rate target is assumed to respond to deviations of the bond rate as measured from the expected future inflation rate. We define the steady state as the one in which the Fed has achieved its short-term objectives for inflation and real output and in which the public’s expectations of inflation are stabilized whereby long-term expected inflation Y. P. Mehra: Monetary Policy Reaction 37 equals the constant inflation target. Clearly the bond rate has no role in the steady state because the long-term expected inflation also equals the constant target inflation. Under this definition of the steady state (with EGAPt+1 = 0, EINFLt+1 = INFLt−1 = INF∗ , a31 = 0, a11 = 1), the policy reaction function (1) has the property that the nominal funds rate target equals the inflation rate plus the economy’s underlying real rate of interest, i.e., rr + INFLt .3 The policy reaction function (3) has thus embedded in it the Fisher relation as applied to the nominal funds rate target, indicating that economic fundamentals such as the inflation rate and the real rate of interest are the steady-state determinants of the funds rate target. The component rr + (a11 + a12 )INFL in (1) is of interest, however, for one more reason. It provides information about the long-run response of the nominal funds rate to inflation.4 In particular, the estimated coefficient (a11 + a12 ) that appears on the level of actual inflation in (1) measures the net response of the funds rate to three inflation variables.5 If its estimated value is above unity during a given sample period, it indicates that, as a result of the Fed’s shortterm reactions to inflation indicators, both the nominal and real funds rate target rose in response to inflation. On the other hand, if the estimated value is less than unity, then it indicates the real funds rate target declined in response to inflation. This information can be useful in assessing whether or not monetary policy is neutral during a given sample period. It is also worth pointing out that the short-term policy reaction function (3) studied here is similar in some respects to the Taylor rule recently estimated in Taylor (1998). The policy rule estimated there is given in (4): FR∗t = rr + INFt + d1 (INFt − INF∗ ) + d2 GAPt ; FRt = FR∗t , (4) where INF is measured by the average inflation rate over the past four quarters and all other variables are defined as before. First note that the reaction function (3) studied here collapses to the policy rule (4) if we substitute actual for expected output gap and set ρ = 0, a11 = 1.0, a12 = d1 , a21 = a31 = 0, a41 = d2 . In other words, according to the policy rule (4), the Fed does not smooth interest rates, responds only to actual inflation and the output gap, and ignores altogether the behavior of forward-looking inflation indicators in setting the funds rate target. Therefore one can interpret the policy rule estimated here as one that relaxes the restrictions implicit in the estimated Taylor rule. 3 I am implicitly assuming that the bond rate has been stripped of its assumed constant real rate component. In the empirical work, the bond rate variable (BR − EINFLt+1 ) used has been demeaned. 4 This is the long-run funds rate equation estimated in Mehra (1997) and is also the steadystate component of the Taylor rule (Taylor 1993). 5 To see this result, rewrite equation (1) so that all three inflation variables are in levels. Then the coefficients that appear on levels of these three variables (INFL, EINFL, BR) sum to (a11 + a12 ). 38 Federal Reserve Bank of Richmond Economic Quarterly Estimating the Forward-Looking Reaction Function The reaction function (3) contains unobserved expected values of inflation and output gap. In estimating (3), I replace unobserved expected values with actual values and assume that the Fed’s expectations of these variables are rational and hence uncorrelated with time t − 1 information known to the central bank as in (5): INFLt+1 = E(INFLt+1 /It−1 ) + vpt+1 GAPt+1 = E(GAPt+1 /It−1 ) + vyt+1 , (5) where vpt+1 and vyt+1 are forecast errors that are uncorrelated with t − 1 dated information used by the central bank to forecast inflation and the output gap. If we eliminate the unobserved expected values from (3), we can rewrite (3) as (6): FRt = (1 − ρ)(rr − a12 INF∗ ) + ρFRt−1 + (a11 + a12 )(1 − ρ)INFLt−1 + a21 (1 − ρ)(INFLt+1 − INFLt−1 ) + a41 (1 − ρ)GAPt+1 + a31 (1 − ρ)(BRt − INFLt+1 ) + vvt , (6) where vvt+1 = vt − (a21 − a31 )(1 − ρ)vpt+1 − a41 (1 − ρ)vyt+1 . The composite error term, vvt , is serially uncorrelated as both vpt+1 and vyt+1 are serially uncorrelated. But it is correlated with period t + 1 values of actual inflation and output gap. That is, the disturbance term in (5) is correlated with the righthand side explanatory variables. However, it can be verified that vv satisfies orthogonality conditions expressed in (7): E(vvt /It−1 ) = 0. (7) That is, the composite error term vvt is uncorrelated with t−1 dated information used by the central bank to forecast one-period-ahead inflation and the output gap. That suggests equation (5) can be estimated by instrumental variables, using variables in the information set It−1 as instruments. In particular, I follow Clarida, Gali, and Gertler (1998b) and estimate (5) using Hansen’s (1982) generalized method of moments estimator. Under the identifying assumptions (7), this procedure produces efficient instrumental variables estimates. Furthermore, the procedure generates a test of identifying restrictions (7) used to estimate the model parameters.6 6 I performed the generalized method of moments (GMM) estimation using the statistical package Regression Analysis for Time Series (version 4). The GMM is an efficient instrumental variables estimator. If we specify the list of instruments that are assumed to be uncorrelated with the disturbance term (vvt in [7]) and if we know exactly the covariance matrix of this disturbance term, then the GMM estimator is in fact the generalized instrumental variables estimator of the form given below: β = [(X0 Z)(Z0 ΩZ)−1 (Z0 X)]−1 (X0 Z)(Z0 ΩZ)−1 Z0 y, (a) Y. P. Mehra: Monetary Policy Reaction 39 Data, Definition of Economic Variables, and Empirical Specifications of the Funds Rate Equation The empirical work here estimates the reaction function over two sample periods, 1960Q2 to 1979Q2 and 1979Q3 to 1998Q2. It is widely believed that in the second subperiod the Fed made serious efforts to reduce the trend rate of inflation and contain inflationary expectations and that this deflationary policy was set in motion when Paul Volcker became Fed Chairman in late 1979. It is also believed that such policy has continued through the current regime of Alan Greenspan. Hence the estimated monetary policy reaction function is likely to differ between pre- and post-1979 periods.7 With regard to data and definitions, the funds rate variable (FR) is the average quarterly value of the effective funds rate. Inflation (INFL) is measured by the behavior of the (chain-weighted) GDP deflator. The output gap variable (GAP) is measured as the excess of actual over potential GDP. I consider two alternatives about the Fed’s estimate of potential GDP. In one I follow the evidence in Hodrick and Prescott (1997) that potential GDP has a smoothly varying trend and that this trend is well approximated by passing real GDP through the Hodrick-Prescott (HP) filter with the smoothness parameter λ set at 1600.8 Taylor (1998) also estimates policy rules with series on the potential output generated with the HP filter. Alternatively, following Clarida, Gali, and Gertler (1998b), I also consider results using potential GDP estimated from a fitted quadratic function of time. This specification assumes that trend GDP is where X is the matrix of observations on the explanatory variables, Z is the matrix of observations on instruments, Ω is the covariance matrix of the disturbance term, and y is the vector of observations on the dependent variable. In the special case where Ω = σ 2 I, so that the disturbance term vvt is both homoscedastic and serially uncorrelated, the estimator (a) reduces to the simple instrumental variables estimator. However, in practice we do not know the form of Ω. But as Hansen (1982) shows, it is possible to compute consistent estimators using a procedure that imposes little structure on the matrix Ω. In particular, the estimates here are generated using a two-step procedure. In step one, the policy reaction function is estimated using the instrumental variables with Ω = σ 2 I, and the residuals are computed. In step two, the matrix (Z0 ΩZ) is estimated using residuals as suggested in Hansen (1982) and the GMM estimator is computed replacing the component (Z0 ΩZ)−1 in (a) by its estimated value. In addition to specifying the list of instruments, one has to specify the length of lags on the instruments. Moreover, if the model suggests the presence of serial correlation, then one can take that into account in specifying the matrix Ω. The empirical work here is performed using four lags of instruments. Since the policy reaction function focuses on one-step-ahead expected values, the disturbance term under that assumption is serially uncorrelated. 7 The empirical work in previous research also indicates that policy responses differ significantly across these two sample periods (Mehra 1997; Clarida, Gali, and Gertler 1998b; and Taylor 1998). 8 The magnitude of the smoothness parameter λ determines the variability of the trend component. The larger the value of λ, the smaller the variability of its trend component. If λ is chosen to be infinity, then the filtered series approaches the least squares fit of a linear trend model. Hodrick and Prescott (1997), however, show that small changes in the value of the smoothness parameter chosen (λ = 1600) do not much alter the business cycle properties of real GDP. 40 Federal Reserve Bank of Richmond Economic Quarterly deterministic as opposed to being stochastic. The long-term bond rate (BR) is measured here by the nominal yield on ten-year U.S. Treasury bonds. In some previous studies, including Mehra (1997), lagged money growth is significant when included in the reaction function. In order to investigate this issue, I also include money growth in the policy reaction function. As in previous studies, money is defined by M1 until 1982Q3 and by M2 thereafter. Moreover, as in previous studies, money growth for the period 1979Q3 to 1982Q3 is included interacting with a slope dummy variable that is defined to be unity over this subperiod and zero otherwise. This formulation is consistent with the popular view that the Fed’s “new operating procedures” paid considerable attention to M1, and consequently such procedures may have been a source of movements in the funds rate target during this period. McCallum and Nelson (1998) also report that the New Operating Procedure dummy is generally significant when included in policy rules estimated there.9 Given the above-noted considerations, the policy reaction function estimated here is expressed in the following form: FRt = (1 − ρ)a0 + ρFRt−1 + a1 (1 − ρ)INFLt−1 + a2 (INFLt+1 − INFLt−1 ) + a3 (BRt − INFLt+1 ) + a4 GAPt+1 + a5 M1t−1 + vvt , (8) where a0 = rr − a12 INFL∗ , a1 = (a11 + a12 ), a2 = a21 (1 − ρ), a3 = a31 (1 − ρ), and a4 = a41 (1 − ρ). M1 in (8) is money growth measured by the behavior of M1 nd all other variables are defined as before. The instrument list consists of a constant and four lagged values of the funds rate (FR), the inflation rate (INFL), the output gap (GAP), the bond rate (BR), and the growth rate of real GDP. For the subperiod 1960Q2 to 1979Q2, the instrument list also included four lagged values of M1 growth. For the other sample period the instrument list includes money growth interacting with a slope dummy that is defined to be 1 over 1979Q3 to 1982Q3 and zero otherwise.10 2. EMPIRICAL RESULTS Estimates of the Forward-Looking Reaction Function Table 1 presents GMM estimates of the short-run monetary policy reaction function (8) for two sample periods, 1960Q2 to 1979Q2 and 1979Q3 to 1998Q2. 9 In their empirical work, the New Operating Procedure dummy is simply an intercept dummy, defined to be unity over 1979Q3 to 1982Q3 and zero otherwise. Here, the New Operating Procedure dummy is a slope dummy on M1 growth. 10 The choice of instruments is motivated by the view that the Fed’s forecast of expected inflation and the output gap depends upon the past history of inflation, the output gap, real growth, monetary growth, and the bond rate. In addition, the history of policy actions measured by the past behavior of the funds rate is also relevant. As is clear, many of the economic variables included in the Fed’s information set are consistent with the Phillips curve and monetarist views of the inflation process. Y. P. Mehra: Monetary Policy Reaction 41 Table 1 GMM Estimates of the Forward-Looking Reaction Function Panel A: HP Trend a1 a2 a3 a4 a5 ρ a0 x21 J 1960Q2–1979Q2 0.71 (3.8) 0.34 (3.3) −0.11 (1.0) 0.20 (3.6) −0.04 (1.0) 0.76 (9.6) 2.6 (2.9) 2.3 (0.13) 24.0 (0.15) 1979Q3–1998Q1 0.64 (4.2) 0.26 (2.2) 0.27 (7.4) 0.41 (4.8) 0.28 (7.5) 0.69 (13.5) 4.6 (11.7) 5.3 (0.02) 17.9 (0.27) Sample Period Panel B: Quadratic Time Trend 1960Q2–1979Q2 0.70 (3.2) 0.32 (2.6) −0.06 (0.5) 0.12 (4.1) −0.05 (1.0) 0.77 (8.5) 2.5 (2.5) 1.8 (0.17) 22.4 (0.21) 1979Q3–1998Q1 1.2 (8.7) 0.40 (4.2) 0.46 (7.1) 0.26 (7.2) 0.26 (6.9) 0.59 (12.1) 2.9 (7.4) 2.8 (0.09) 13.1 (0.60) Notes: The coefficients (t-values in parentheses below) reported above are from the funds rate equation (8) of the text: FRt = (1 − ρ)a0 + ρFRt−1 + (1 − ρ)a1 INFLt−1 + a2 (INFLt+1 − INFLt−1 ) + a3 (BRt − INFLt+1 ) + a4 GAPt+1 + a5 M1t−1 , where FR is the federal funds rate; INFL is the inflation rate; M1 is M1 growth; GAP is the output gap; and BR is the bond rate. For the sample period 1979Q3–1998Q1, money growth is included interacting with a slope dummy variable D∗ M1, where D is a dummy that is 1 over 1979Q3–1982Q3 and 0 otherwise. The instrument set consists of a constant, four lagged values of the funds rate, inflation, the bond rate, money growth, output gap, and real GDP growth. J is the test of overidentifying restrictions and is distributed Chi-squared. x12 is the Chi-squared statistic that tests the null hypothesis a1 = 1. Significance levels of these statistics are reported in parentheses below. The constant term a0 is (rr − a12 INF∗ ), where rr is the real rate of interest and INF∗ is the Fed’s inflation target. Panel A in Table 1 contains results that occur when potential output is measured with the HP trend, and Panel B contains results that occur when instead the quadratic trend is used. For all variables the coefficients reported are the shortrun coefficients, with the exception of the one for actual inflation. For that variable the coefficient reported is the long-run coefficient a1 ; the short-run coefficient can be recovered by multiplying the reported coefficient by (1 − ρ), i.e., a1 (1−ρ) in (8).11 The J-statistic reported there tests the null hypothesis that 11 As noted before, the policy reaction function (8) is nonlinear in parameters, with ρ appearing in front of many variables including FRt−1 . One could estimate (8) with or without imposing these nonlinear restrictions. If restrictions are imposed, then one gets estimates of the long-term coefficients and the short-term smoothness parameter ρ. Given long-term estimates, the shortterm coefficients are recovered by multiplying the estimated long-term coefficients by (1 − ρ). Alternatively, one may ignore these nonlinear restrictions and estimate directly the following version of equation (8): FRt = d0 + d1 FRt−1 + d2 INFLt−1 + d3 (INFLt+1 − INFLt−1 ) + d4 (BRt − INFLt+1 ) + d5 GAPt+1 + d6 M1t−1 + vvt , 42 Federal Reserve Bank of Richmond Economic Quarterly orthogonality restrictions imposed under GMM estimation are consistent with data. If this statistic is small, then it indicates the restrictions are not rejected by the data; therefore GMM estimates are consistent. If one focuses on post-1979 estimates, one can see that all estimated coefficients have expected signs and are generally statistically significant (see t-values in parentheses below estimates in Panels A and B, Table 1). Those estimates indicate that the funds rate rises if actual inflation rises, if future inflation is expected to increase, if output is expected to be above trend, or if the current-period bond rate moves relative to the expected future inflation rate. The estimated short-run coefficients (a2 , a3 ) that appear on two future inflation indicators are positive, indicating the funds rate target responded to expected changes in the future direction of actual inflation. In particular, the estimated short-run coefficient (a3 ) on the bond rate is 0.3 to 0.4, which indicates the funds rate increases 30 to 40 basis points in response to 1 percentage point increase in the bond rate.12 The M1 growth dummy is significant, which shows the influence of “new operating procedures” on the funds rate. The pointestimate of the coefficient that appears on the lagged funds rate (ρ) is between 0.6 to 0.8, which indicates the presence of considerable interest rate smoothing in Fed behavior. Finally, the reported J-statistic is small, which indicates the restrictions imposed in deriving GMM estimates are consistent with data.13 The results do not change qualitatively between two alternative measures of potential output used here. However, magnitudes of individual coefficients that measure responses of the funds rate target to different economic fundamentals are sensitive to the measure of trend used. The one difference to highlight is the long-run, estimated coefficient (a1 ) that appears on the level of actual inflation. This coefficient is greater than unity if output gap is estimated with the quadratic trend, suggesting that in the post-1979 period, the real funds rate target rose in response to inflation. However, the point estimate of a1 falls below unity if output gap is estimated instead with the HP trend (compare estimates of a1 in Panels A and B, Table 1). Nonetheless, as discussed later, the point where all variables are defined as before. The estimated coefficients di , i = 0,1,2,3,4,5,6, are then estimates of short-term coefficients. The long-term coefficients can then be recovered by multiplying the short-term estimates by [1/(1 − d1 )]. Both these procedures yield qualitatively similar results. The empirical work reported here is based mostly on estimates generated using nonlinear restrictions. 12 Since the Fed smoothes interest rates in the short run, the long-term estimated response measured by a3 /(1 − ρ) is stronger. In the HP trend case, the estimated long-run response is 0.26/(1 − 0.69), i.e., 0.87. 13 The overall fit of the regression as measured by the standard error of estimate is better when four lagged values of the instruments are used in estimation than it is when two or three lagged values are used. The results, however, do not change qualitatively if the policy reaction function is instead estimated using two to three lagged values of the instruments. In particular, the bond rate remains significant in the post-1979 period. The J-statistic continues to confirm the condition in (7) that the error term is not correlated with instruments. Y. P. Mehra: Monetary Policy Reaction 43 estimate of a1 has remained above unity in most other subperiods ending in the ’90s (see estimates of a1 in Tables 3 and 4 for various subperiods). Together these estimates indicate that during most of the Volcker-Greenspan period the real funds rate target increased in response to actual inflation.14 If one focuses on pre-1979 estimates, one can see that, like the post-1979 estimates, these estimates indicate that the funds rate target rises in response to actual inflation, to increases in expected future inflation, and to the positive expected output gap. But pre-1979 estimates differ significantly from post-1979 estimates in several ways. First, the bond rate is not significant, indicating that in the pre-1979 period the Fed did not adjust the target in response to movements in the bond rate. This result on the absence of long-term inflationary expectations on policy is robust to changes in the measure of trend used. Second, money growth is also not significant.15 Third, the long-run, estimated coefficient (a1 ) that measures the response of the funds rate to inflation is economically less than unity, indicating that the funds rate did not adjust one-for-one with inflation. That is, the real funds rate declined in response to actual inflation prior to 1979. This decline occurred despite the evidence here that in the pre1979 period the funds rate is responsive to movements in near-term expected future inflation (see estimates of a2 in Table1). The absence of response to the bond rate may explain why the real funds rate target declined in response to actual inflation during the pre-Volcker period. Assessing the Predictive Content of the Policy Reaction Function: Pre- and Post-1979 The key hypotheses posited about Fed behavior here are that the fund rate responds to economic fundamentals specified in (3). The estimates discussed above lend support to these hypotheses. In order to assess further the empirical plausibility of these hypotheses, I examine how well the policy reaction function predicts the actual behavior of the funds rate during the two sample periods. The predictive content is evaluated with the following regression: FRt = c + dPFRt + eet , (9) 14 Table 1 also reports estimates of the constant term a = rr − a INFL∗ in (8). But, as 0 12 is obvious, it is not possible to recover estimates of the assumed constant real rate rr without information about the Fed’s constant inflation target. 15 In the forward-looking reaction function estimated here, money growth, when included, is generally not significant, with the exception of the brief “new operating procedures” period 1979Q3 to 1982Q3. This result is in contrast with the one reported in many previous studies, where money growth is significant. One explanation of these different results is that the reaction functions in these studies are backward-looking. Hence money growth may be significant in these studies not because the funds rate responds to money growth but because past money is giving information about future economic factors that are omitted from the reaction function. 44 Federal Reserve Bank of Richmond Economic Quarterly where PFR is the funds rate predicted by the policy reaction function and ee is the disturbance term. The predicted values used in (9) are the dynamic within-sample simulated values of the policy reaction function (8), generated using actual values of explanatory variables. However, in order to highlight the importance of the effect of interest rate smoothing on the funds rate target, I also generate predictions of the funds rate with the smoothing parameter ρ in (8) set to zero. The predicted funds rate is an unbiased predictor of the actual funds rate if c = 0 and d = 1.16 Table 2 reports estimates of the regression (9) for two sample periods, 1961Q1 to 1979Q2 and 1981Q1 to 1998Q1. The results are reported for both measures of the output gap and with and without accounting for the effect of interest rate smoothing (see Panels A and B). (Figures 1 through 4 chart predicted and actual values of the funds rate for the HP filtered output gap.) If one focuses on post-1979 sample results with smoothing, one can see that the coefficient that appears on the predicted fund rate variable PFR in (9) is close to unity. x12 is the Chi-squared statistic that tests the null hypothesis that (c, d ) = (0, 1). This statistic is not significant at the 5 percent level, suggesting the predicted funds rate is an unbiased predictor of the actual funds rate. The result is not sensitive to the measure of trend used or to the presence of interest rate smoothing in Fed behavior (compare results in Panels A and B, Table 2). Figures 2 and 4 tell the same story, which is that the funds rate moves closely with the level determined by economic fundamentals as specified in (1). The results, however, are different in the pre-1979 period. When the focus is on results without smoothing, the coefficient that appears on the predicted funds rate PFR is significantly below unity and this result is not sensitive to the measure of the output gap. If one allows for the effect of interest rate smoothing on the funds rate, the coefficient that appears on the predicted funds rate does however move closer to unity (compare Figure 1 with Figure 3). But the predicted funds rate is still a biased predictor of the actual funds rate in one specification of the output gap (compare estimates in Panels A and B, Table 2). Hence the hypothesis that the funds rate target is a function of economic variables as specified in the policy reaction function (1) is a better description of Fed policy in the post-1979 period than it is in the previous period. 16 The result that the predicted funds rate may be a biased predictor of the actual funds rate in (9) does not imply that the stochastic disturbance term in the estimated policy reaction function (8) is biased and the estimation procedure used here is therefore invalid. The reason is that the result above—that the predicted funds rate is a biased predictor—may arise because the predicted values used in regression (9) are dynamic, not static. The policy reaction function (8) instead is estimated using actual values of explanatory variables including the lagged funds rate. Hence the disturbance term, though unbiased in (8), may appear biased in (9) if predicted values used are dynamic. Y. P. Mehra: Monetary Policy Reaction 45 Table 2 In-Sample Predictability of the Forward-Looking Reaction Function Panel A: Actual and Predicted Funds Rate, Without Smoothing Sample Period HP Trend c d Quadratic Trend x21 (2) c d x21 (2) 1961Q1–1979Q2 2.7 (3.5) 0.46 (4.9) 38.5∗ 2.2 (3.5) 0.53 (5.7) 25.9∗ 1981Q1–1998Q1 1.0 (1.3) 0.89 (10.1) 1.8 0.64 (1.6) 0.93 (20.9) 2.6 Panel B: Actual and Predicted Funds Rate, With Smoothing 1961Q1–1979Q2 0.4 (1.2) 0.89 (18.9) 5.3∗∗ 0.2 (0.2) 0.88 (8.2) 4.3 1981Q1–1998Q1 −0.27 (0.5) 1.0 (16.7) 0.5 0.2 (0.6) 0.96 (22.7) 0.8 *Significant at the 5 percent level. **Significant at the 10 percent level. Notes: The coefficients (t-values in parentheses below) reported above are from regressions of the form FRt = c + dPFRt , where FR is the actual funds rate, and PFRt is the predicted funds rate. The predicted values are generated using the policy rule (8) of the text, with ρ set to zero (see Panel A) and with ρ set to its estimated value (see Panel B). The predicted values used in Panel B regressions are within-sample but dynamic. x12 is the Chi-squared statistic that tests the null hypothesis (c, d ) = (0, 1). Has the Reaction Function Changed during the Greenspan Period? It is commonly believed that the Fed under Alan Greenspan has maintained an anti-inflationary stance set in motion when Paul Volcker became the Fed Chairman. In fact, as discussed above, the forward-looking reaction function (8) estimated here is consistent with the actual path of the funds rate over the Volcker-Greenspan period, indicating the policy reaction function may not have changed much over this period. Nevertheless, I investigate this issue further by comparing the reaction function between Volcker and Greenspan periods, 1979Q4 to 1987Q3 and 1987Q3 to 1998Q2. The main problem with estimating separate reaction functions over the Volcker and Greenspan periods is that those estimates may be subject to the small sample bias. In fact, during the Greenspan period both the bond rate and inflation have not varied much. In view of these considerations I estimate the reaction function using the technique of rolling regressions, which generates somewhat larger subsamples. In particular, I begin with estimating the reaction function over the Volcker period 1979Q4 to 1987Q3 and then continually reestimate it advancing the end date by four quarters, keeping the start date fixed. The resulting estimates of key coefficients 46 Federal Reserve Bank of Richmond Economic Quarterly Figure 1 Actual and Predicted Funds Rate, With Smoothing; HP Trend Pre 1979 Figure 2 Actual and Predicted Funds Rate, With Smoothing; HP Trend Post 1979 Y. P. Mehra: Monetary Policy Reaction Figure 3 Actual and Predicted Funds Rate, Without Smoothing; HP Trend Pre 1979 Figure 4 Actual and Predicted Funds Rate, Without Smoothing; HP Trend Post 1979 47 48 Federal Reserve Bank of Richmond Economic Quarterly are reported in Tables 3 and 4. Table 3 reports estimates using the HP filtered output gap, and Table 4 gives estimates using the quadratic trend output gap. If we focus on short-run estimated coefficients, ai , i = 2, 3, 4, they all have theoretically expected signs in all subsamples considered here and are generally statistically significant (t-values not reported). The estimated sizes of the individual coefficients are relatively stable over different subsamples, with few exceptions. Of particular interest is the long-run, estimated coefficient a1 that appears on the level of actual inflation. It is mostly above or close to unity in all subsamples, with the exception of those ending in 1996 and 1997. In those two subsamples the magnitude of the estimated coefficient is sensitive to the measure of the output gap used: it falls below unity if the output gap is estimated with the HP trend (compare estimates of a1 in Tables 3 and 4). These estimates thus suggest that the Fed during most of the Greenspan period has responded aggressively enough to expected inflation indicators that the real funds rate has increased in response to actual inflation. Predicting the Funds Rate during the Greenspan Period The results discussed in the previous section indicate that estimates of the individual coefficients that measure responses of the funds rate to economic fundamentals do display considerable subsample variability during the VolckerGreenspan period. But despite such variability, a cursory look at Figure 2 indicates that the policy reaction function here tracks the actual behavior of the funds rate fairly well in the ’80s and the ’90s. In this section I provide additional evidence on this issue by examining the out-of-sample predictive performance of the reaction function over the period 1988Q1 to 1998Q2, which for the most part spans the current tenure of Alan Greenspan as the Fed Chairman. Table 5 presents the predicted values of the funds rate. The predicted values are the dynamic one-year-ahead forecasts of the funds rate that are conditional on actual values of the economic fundamentals and are generated using rolling regressions over the forecast period. Panel A in Table 5 presents the predicted values generated with the HP trend and Panel B presents the predicted values with the quadratic trend. (Figures 5 and 6 chart the quarterly values for this period.) Actual values of the funds rate, prediction errors, and summary error statistics are also presented. As shown, the reaction function tracks the actual funds rate fairly well over the forecast period. The mean value of the prediction error is very small and the root mean squared error is about 0.4 of a percentage point. The average annual prediction errors are not statistically significant, with the exception of the year 1995. In 1995 the prediction error is positive, and it is more than twice the root mean squared error. During that year real growth decelerated from its rapid pace of the previous year, but the Fed did not lower the funds rate in response to such a slowdown. Nevertheless, since 1995 the magnitude of the prediction error has steadily declined. Y. P. Mehra: Monetary Policy Reaction 49 Table 3 Rolling Regression Estimates of the Reaction Function during Volcker-Greenspan Periods, GDP Deflator and HP Trend Sample Period Ends in a1 a2 a3 a4 ρ a0 1987Q4 1988Q4 1989Q4 1990Q4 1991Q4 1992Q4 1993Q4 1994Q4 1995Q4 1996Q4 1997Q4 1.0 1.0 1.1 1.2 1.2 1.2 1.4 1.5 1.1 0.8 0.7 0.6 0.6 0.5 0.6 0.6 0.6 0.7 0.7 0.5 0.4 0.4 0.48 0.45 0.37 0.35 0.35 0.35 0.39 0.39 0.33 0.29 0.28 0.25 0.28 0.40 0.32 0.36 0.41 0.39 0.38 0.38 0.41 0.32 0.42 0.43 0.50 0.50 0.51 0.56 0.57 0.59 0.61 0.64 0.66 3.6 3.4 3.5 3.1 3.1 2.7 1.9 1.6 3.2 4.1 4.5 17.4 21.6 19.3 16.0 14.1 15.0 16.1 15.5 14.7 16.1 17.1 1987Q3–1998Q1 0.7 0.1 0.28 0.35 0.81 4.7 17.8 (0.27) J(sl) (0.29) (0.12) (0.20) (0.38) (0.52) (0.45) (0.37) (0.41) (0.47) (0.38) (0.31) Notes: The coefficients above are GMM estimates of the forward-looking reaction function given in Table 1. Unless stated otherwise the estimation period for all regressions begins in 1979Q3 and ends in the year as shown in the first column above. All reported coefficients have significant t-values (not reported), with one exception: a2 is not significant over 1987Q3–1998Q2. Table 4 Rolling Regression Estimates of the Reaction Function during Volcker-Greenspan Periods, GDP Deflator and Quadratic Trend Sample Period Ends in a1 a2 a3 a4 ρ a0 1987Q4 1988Q4 1989Q4 1990Q4 1991Q4 1992Q4 1993Q4 1994Q4 1995Q4 1996Q4 1997Q4 1.2 1.2 1.1 1.3 1.3 1.4 1.5 1.5 1.3 1.2 1.3 0.7 0.7 0.6 0.6 0.7 0.7 0.7 0.7 0.5 0.5 0.5 0.54 0.54 0.47 0.47 0.49 0.48 0.49 0.49 0.43 0.44 0.45 0.17 0.17 0.25 0.23 0.26 0.29 0.30 0.29 0.30 0.31 0.28 0.39 0.39 0.50 0.48 0.50 0.54 0.54 0.54 0.59 0.59 0.57 3.3 3.1 3.6 3.2 3.0 2.6 2.3 2.2 3.1 3.3 2.9 14.6 15.4 14.3 14.5 14.1 15.3 14.3 14.1 13.7 12.6 12.1 1987Q3–1998Q1 1.4 0.2 0.28 0.20 0.73 2.5 17.5 (0.15) Notes: See notes for Table 3. J(sl) (0.49) (0.42) (0.50) (0.48) (0.52) (0.43) (0.50) (0.52) (0.55) (0.63) (0.60) 50 Federal Reserve Bank of Richmond Economic Quarterly Table 5 Actual and Predicted Funds Rate: 1988Q1–1998Q2 Panel A: HP Trend Panel B: Quadratic Trend Year Actual Funds Rate Predicted Error Predicted Error 1988 1989 1990 1991 1992 1993 1994 1995 1996 1997 7.6 9.2 8.1 5.7 3.5 3.0 4.2 5.8 5.3 5.5 7.8 8.7 8.2 6.0 4.3 3.1 4.2 4.9 5.1 5.4 −0.20 0.50 −0.01 −0.30 −0.80 −0.01 0.00 0.90∗ 0.20 0.10 7.9 8.6 8.1 6.0 4.1 2.9 4.2 5.1 5.4 5.9 −0.30 0.60 0.00 −0.30 −0.60 0.10 0.00 0.70 −0.10 −0.40 1997Q1 1997Q2 1997Q3 1997Q4 1998Q1 1998Q2 5.3 5.5 5.5 5.5 5.5 5.5 5.3 5.3 5.5 5.6 5.3 5.2 0.0 0.2 0.0 −0.1 0.2 0.3 5.9 6.5 6.7 6.2 5.7 5.9 −0.60 −1.0 −1.2 −0.7 −0.2 −0.04 Mean Error (1988–1997) Root Mean Squared Error 0.00 0.44 −0.02 0.41 *The prediction error is twice the root mean squared error. Notes: The predicted values above are generated using the forward-looking reaction functions reported in Tables 3 and 4. The predicted values are the dynamic, one-year-ahead forecasts, generated using rolling regressions over sample periods that all begin in 1979Q3 but end in the year before the forecast period. Table 5 also presents the quarterly values of the predicted funds rate for the most recent period 1997Q1 to 1998Q2. During this period the U.S. economy has grown at an above-trend growth rate, while inflation has steadily declined. The Fed, however, did not adjust the funds rate target during this subperiod, which remained at 5.5 percent. The funds rate predicted by the reaction function is 5.4 percent if the Fed’s estimate of trend GDP is generated with the HP filter, and it is 6.2 percent if instead the quadratic trend is used. There is a difference in prediction because in this subperiod the Fed’s estimate of trend GDP is 3.0 percent if the HP trend is used and 1.9 percent if instead the quadratic trend is employed to estimate the output gap. Consequently, the magnitude of the positive output gap is smaller with the HP trend than with the quadratic, thereby suggesting a less tight response to observed above-trend growth. If one believes the popular view that the economy’s underlying trend growth rate Y. P. Mehra: Monetary Policy Reaction Figure 5 Actual and Predicted Funds Rate, Out-of-Sample; HP Trend Figure 6 Actual and Predicted Funds Rate, Out-of-Sample; Quadratic Trend 51 52 Federal Reserve Bank of Richmond Economic Quarterly may have increased in the ’90s, then one may have more confidence in the reaction function estimated with the HP filtered output gap than in the one with the quadratic. Therefore, the absence of policy response to recent abovetrend growth is not inconsistent with prior Fed behavior if one takes seriously the proposition that the economy’s trend growth in recent years may be higher than 2 percent estimated using the quadratic trend. 3. CONCLUDING OBSERVATIONS In this article I estimate a forward-looking monetary policy reaction function that quite accurately predicts the actual behavior of the federal funds rate since 1979. The key property of the estimated reaction function in the VolckerGreenspan period is that the funds rate target is responsive to movements in long-term inflationary expectations evidenced by the behavior of the bond rate. During this period the Fed has responded aggressively enough to future inflation indicators that the real funds rate target increased in response to actual inflation. Such is not the case in the pre-1979 period, when the real funds rate target declined in response to inflation. It has been suggested that the U.S. economy experienced macroeconomic stability in the Volcker-Greenspan period because the interest rate policy pursued during this period was very responsive to expected inflation; so much so that the real funds rate increased in response to inflation (Clarida, Gali, and Gertler 1998b; Taylor 1998). The results here indicate that the Fed’s willingness to move the funds rate preemptively to react to movements in the bond rate may explain why the funds rate was more responsive to inflation in the Volcker-Greenspan period than it had been in the previous period. REFERENCES Clarida, Richard, Jordi Gali, and Mark Gertler. “Monetary Policy Rules in Practice: Some International Evidence,” European Economic Review, vol. 42 (June 1998a), pp. 1033–67. . “Monetary Policy Rules and Macroeconomic Stability: Evidence and Some Theory,” NBER Working Paper 6442, March 1998b. Goodfriend, Marvin. “Interest Rate Policy and the Inflation Scare Problem 1979–1992,” Federal Reserve Bank of Richmond Economic Quarterly, vol. 79 (Winter 1993), pp. 1–24. . “Interest Rates and the Conduct of Monetary Policy,” CarnegieRochester Conference Series on Public Policy, vol. 34 (Spring 1991), pp. 7–30. Y. P. Mehra: Monetary Policy Reaction 53 Hansen, Lars Peter. “Large Sample Properties of Generalized Method of Moments Estimators,” Econometrica, vol. 50 (July 1982), pp. 1029–54. Hodrick, Robert J., and Edward C. Prescott. “Postwar U.S. Business Cycles: An Empirical Investigation,” Journal of Money, Credit, and Banking, vol. 29 (February 1997), pp. 1–16. McCallum, Bennett T., and Edward Nelson. “Nominal Income Targeting in an Open-Economy Optimizing Model,” Mimeo. A paper prepared for the Riksbank-IIES Conference on Monetary Policy Rules. June 1998. Mehra, Yash P. “A Federal Funds Rate Equation,” Economic Inquiry, vol. XXXV (July 1997), pp. 621–30. Taylor, John B. “An Historical Analysis of Monetary Policy Rules,” National Bureau of Economic Research conference on Monetary Policy Rules, January 1998. . “Discretion versus Policy Rules in Practice,” Carnegie-Rochester Conference Series on Public Policy, vol. 39 (December 1993), pp. 195– 214. Mercantilists and Classicals: Insights from Doctrinal History Thomas M. Humphrey E conomists typically view their discipline as a progressive science in which superior new ideas relentlessly supplant inferior old ones in a Darwinian struggle toward the truth. Thus it came as something of a shock when Milton Friedman challenged this belief in the May 1975 issue of the American Economic Review. In response to the question “What have we learned in the past 25 years?”, Friedman argued that what monetary economists have learned since 1950 are hardly new ideas but rather a rediscovery of old ideas inherited from David Hume and his contemporaries more than 200 years ago. Three years later, the British economist Ivor F. Pearce shocked his readers even more. He denied that the Keynesian Revolution had contributed a single new or useful idea to monetary economics. Instead, he insisted that “human history is guided not by new ideas, for there are none,” but rather by “some ephemeral sub-group of . . . old ideas.” Such old ideas, “often believed to be new,” are “seized upon as the . . . solution to whatever difficulties immediate experience has made to seem important, and congealed into a crust of dogma by endless repetition and obeisance” (Pearce 1978, p. 93). The above sentiments express what every doctrinal historian knows, namely that much of what passes for novelty and originality in monetary theory and policy is ancient teaching dressed up in modern guises. To be sure, the increasing application of mathematical modeling has given these concepts greater rigor and precision. Likewise, better data and more powerful empirical techniques have improved our statistical estimates of the relevant quantitative magnitudes. Still, the basic ideas themselves often remain much the same. Thus instead Tom.Humphrey@rich.frb.org. An earlier version of this article appeared in this Bank’s 1998 Annual Report. For valuable comments the author is indebted to his Richmond Fed colleagues Alice Felmlee, Marvin Goodfriend, Bob Hetzel, Rowena Johnson, Elaine Mandaleris, and Ned Prescott. The views expressed do not necessarily reflect those of the Federal Reserve System. Federal Reserve Bank of Richmond Economic Quarterly Volume 85/2 Spring 1999 55 56 Federal Reserve Bank of Richmond Economic Quarterly of a steady progression of new paradigms, we see repeated cycles of existing ones whose periodic rise and fall perpetually casts them in and out of fashion. By itself, this recycling of established ideas need be no cause for alarm. Theories may survive because experience indicates that they possess a high degree of validity and because no better theories have been found. The trouble is, however, that sound theories are not the only ones to survive. Unsound theories may coexist with the sound ones. Unfortunately, policymakers and the general public are in no position to realize as much. Preoccupied by the pressing problems of the day, they have neither the time, inclination, or training, nor indeed the duty to trace the history of the ideas they employ or endorse. They have no reason to be aware of earlier policy debates in which sound theories were distinguished from fallacious ones. The result is that policymakers may subscribe to old theories under the mistaken impression that those theories are new. Worse, they may unwittingly deploy policies whose underlying theory has been challenged and found wanting in earlier policy debates. Here is where the doctrinal historian can help. His comparative advantage lies in identifying the origin and tracing the evolution of rival monetary doctrines across a succession of writers, events, episodes, and policy controversies. Each such incident constitutes a test, or observation, of the relative strengths and weaknesses of the competing doctrines. While no single test can yield conclusive results, many such tests may do so. Taken together, they reveal which doctrine has emerged from past experience as the more robust analytically. By demonstrating as much, the historian specifies those ideas that seem to offer the most effective basis for public policy. Of course, there is no assurance that the policymaker will heed the doctrinal historian and employ the best ideas. On the contrary, he may reject them or temporarily accept and subsequently abandon them. Here again the historian has something to say. His study of the forces influencing the receptivity and implementation of ideas permits him to predict a doctrine’s prospective success or failure. In this manner, the unique perspectives of doctrinal history may prove their worth. This article puts those perspectives to work. It shows that from a broad standpoint much of the history of monetary theory reduces to a struggle between opposing mercantilist and classical camps. Mercantilists, with their fears of hoarding and scarcity of money together with their prescription of cheap (low interest rate) and plentiful cash as a stimulus to real activity, tend to gain the upper hand when unemployment is the dominant problem. Classicals, chanting their mantra that inflation is always and everywhere a monetary phenomenon, tend to prevail when price stability is the chief policy concern. Currently, the classical view is in the driver’s seat. By all rights it should remain there since it long ago exposed the mercantilist view as fundamentally flawed. It is by no means certain, however, that the classical view’s reign is secure. For history reveals that, whenever one view holds center stage, the T. M. Humphrey: Mercantilists and Classicals 57 other, fallacious or not, is waiting in the wings to take over when the time is ripe. In this manner, the mercantilism of John Law and Sir James Steuart gave way to the classicism of David Hume and David Ricardo, the Currency School’s classicism bowed to John Maynard Keynes’s mercantilism, the mercantilist doctrines of Keynes’s disciples yielded to Milton Friedman’s classical monetarism, and so forth. Even today, with central bankers in several nations expressing commitment to the classical goal of price stability and monetarists advocating systematic, zero-inflation rules for monetary policy, mercantilist undercurrents still run strong. Supply-siders who argue that monetary policy must be accommodative to allow tax cuts to work their magic echo mercantilist opinion. So too do those who contend that, with global competition and rapid technological progress holding inflation in check, monetary policy is free to pursue nonprice objectives such as boosting growth and achieving full employment. Finally, observers who believe that monetary policy is powerless to stimulate the currently depressed Japanese economy harbor mercantilist fears of unspent hoards of idle cash. The following paragraphs attempt to spell out the core propositions of the original mercantilist and classical views and to establish the centrality of those propositions in the famous Currency School-Banking School and Keynesian-monetarist controversies—the two leading monetary policy debates of the nineteenth and twentieth centuries.1 From this doctrinal historical exercise, three themes emerge. First, with some exceptions, classicals tend to be quantity theorists; mercantilists, anti-quantity theorists. Second, classicals prefer rules; mercantilists, discretion. Third, for all their cogency, classicals may be doomed to face a perpetual mercantilist challenge. As long as some observers continue to believe, rightly or wrongly, that inflation and deflation are nonmonetary, or real, phenomena and that unemployment is a monetary one capable of correction by the central bank, the debate will be unending. 1. MERCANTILIST AND CLASSICAL MONETARY DOCTRINES The roots of the debate trace back to the original mercantilist writers of the preclassical era 1550–1770. Those writers argued that a nation’s stock of precious metals constituted the source of its plenty (wealth), power, prestige, and prosperity. For countries possessing no gold mines, augmentation of those conditions required the accumulation of specie through foreign trade. Accordingly, mercantilists advocated protectionist policies in the form of export 1 Additional famous policy debates pitting mercantilists and classicals include (1) the Swedish Bullionist controversy (1755–1765), (2) the English Bullionist-Antibullionist, or Bank Restriction, dispute (1797–1821), (3) the Bimetallism debate (1880–1896), and (4) the German hyperinflation debate (1922–1923). 58 Federal Reserve Bank of Richmond Economic Quarterly promotion and import restriction schemes to obtain a permanent trade balance surplus matched by corresponding persistent inflows of specie from abroad. This policy prescription was of course the mercantilists’ main claim to fame. But the hallmark that secures them a permanent niche in the history of monetary doctrines was their contra- or anti-quantity theory of money.2 They used that theory to deny that money determines prices and to tout the employment benefits of money-stock expansion fueled either by specie inflows or by paper money creation should those inflows languish. Consisting of at least seven propositions, the mercantilists’contra-quantity theory held that (1) money stimulates trade, (2) real cost-push forces determine the price level and the inflation rate, (3) the interest rate is a purely monetary variable whose level, high or low, is proof of the scarcity or abundance of money, (4) idle hoards absorb any cash not employed in driving trade, (5) causality runs from prices and real activity to money such that the money stock passively adapts to the needs of trade, (6) overissue is impossible when the money stock is backed by the nominal value of real property, and (7) discretion outperforms rules in the conduct of monetary policy. John Law (1671–1729) The clearest and most emphatic statements of the foregoing propositions came from John Law and Sir James Steuart, two economists writing near the close of the mercantilist era.3 Of the two, Law’s name is synonymous with the moneystimulates-trade doctrine that forms the central core and theme of his 1705 Money and Trade Considered; with a Proposal for Supplying the Nation with Money. Writing against the backdrop of a chronically depressed and underemployed Scottish economy (his home country), he argued that a shortage of metallic money was to blame, that a bank-issued paper currency must replace the deficient metallic one, and that the resulting expansion of the stock of paper notes would permanently increase the level of output and employment without raising prices.4 His argument stemmed from his assumptions of (1) the 2 Because anti-quantity theory elements also characterize the fixed-exchange-rate, smallopen-economy case of the modern monetary approach to the balance of payments, some observers may be tempted to equate mercantilism with that approach. In fact, however, the two theories differ markedly. First, the monetary approach applies the quantity theory, rather than its opposite, to closed-economy and inconvertible-paper floating-exchange-rate regimes. By contrast, mercantilists, with few exceptions, tended to apply the anti-quantity theory indiscriminately to all regimes. Second, the monetary approach rejects the mercantilist money-stimulates-trade doctrine. 3 On Law’s monetary theory, see Murphy (1997, Chs. 6 and 8) and Hutchison (1988, pp. 134–40). On Steuart’s theory, see Eltis (1986), Hutchison (1988, pp. 341–51), Meek (1967), and Skinner (1981). 4 Law’s fear of monetary shortage under a metallic standard is incompatible with the monetary approach to the balance of payments. The latter sees a small open economy, like Scotland, taking its price level as given from the closed world economy with money then flowing in through the balance of payments to support that price level such that no monetary shortage occurs. Of these T. M. Humphrey: Mercantilists and Classicals 59 availability of idle resources at unchanged resource prices and (2) constant returns to scale in production. Given these conditions, it followed that the economy’s long-run aggregate supply curve was perfectly horizontal up to the point of full employment. It likewise followed that money-induced increases in aggregate commodity demand would, via rightward shifts along the supply curve, generate matching increases in equilibrium real output without raising prices. Indeed, Law suggested that the price level might even fall if scale economies in production rendered the aggregate supply curve negatively sloped.5 In no case, however, would expansion of the stock of paper money raise prices. Having argued that causation runs from money to output, Law perceived that it could be made to run in the opposite direction too. With appropriate financial linkages put in place, output could induce the very monetary means of its own expansion. Indeed, Law thought this outcome was assured provided that banks issued money on productive loans secured by claims to future product or its equivalent. Coaxed forth by real output in this fashion, the paper money stock would grow in step with the real demand for it such that its purchasing power would be preserved unchanged. To ensure that the nominal money stock automatically expanded equally with the real demand for it, he advocated that paper notes be backed dollar-for-dollar with the nominal value of land. Collateralized by land, money would, he thought, enjoy stability of value. When economic development or cyclical recovery brought more land into cultivation, the money stock, secured by the extra land, could expand to meet the growing needs of trade at unchanged prices. Here was the prototype of the real bills doctrine later attacked so vigorously by classical writers. As for the doctrine that low interest rates spell monetary ease and high rates monetary tightness, Law accepted it without reservation. Anticipating Keynes’s liquidity preference theory of interest, Law saw interest rates as the price of money’s use, a price that varied inversely with the quantity available to use. Being purely monetary phenomena, low rates unambiguously signified an abundance of money and high rates a scarcity of it. Law, an ardent advocate of low rates, argued that they reduced the businessman’s cost of capital and so spurred investment and real activity. For him, money exerted its stimulus through indirect interest rate channels as well as through direct expenditure ones. Sir James Steuart (1721–1780) To Law’s doctrines, Steuart in his 1767 An Enquiry into the Principles of Political Oeconomy added four more. First was his explicit rejection of a monetary for a real cost-push theory of inflation. Tracing a causal chain from the degree two propositions, Law recognized the first but denied the second. He also argued, contrary to the monetary approach, that expansion of the domestic stock of paper money would, by stimulating production of goods for export, improve a country’s trade balance. See Murphy (1997, Ch. 8). 5 See Blaug (1996, p. 16). 60 Federal Reserve Bank of Richmond Economic Quarterly of competition in labor markets to wage rates to unit labor cost to product prices, he concluded that cost and competition determine the prices of all goods and thus the price level as a whole. Likewise, he held that the monopoly power of producers determines their profit margins as embodied in the profit markup component of individual and aggregate prices. In other words, he alleged that the same real forces—market power and cost—that govern relative prices account for absolute prices as well. He advanced a relative price theory of the absolute price level.6 Steuart’s second contribution was his doctrine of the hoards which he used to bolster his denial that money determines prices. He argued that idle hoards of specie absorb excess cash from circulation just as they release into circulation additional coin to correct a monetary shortage. Consequently, there can be no monetary excess or deficiency to spill over into the commodity market to affect prices. The hoarding-dishoarding mechanism ensures as much.7 For those occasional increases in the money stock that do manage to elude the hoarding mechanism and spill over into the commodity market, he argued, like Law, that they produce matching shifts in commodity demand along a horizontal supply schedule such that equilibrium real output alters at unchanged prices. Third was his reverse causation doctrine according to which causality runs from prices to money and its circulation velocity rather than vice-versa as in the quantity theory. Positing a two-step process, he said that cost and competition first determine prices. Then, with prices settled, the circulation velocity of coin adjusts to render the existing stock sufficient to accommodate the prevailing level of real activity at the given prices.8 If the money stock is excessive, wealth-holders remove the excess from active circulation and either hold it idle so that velocity falls or melt it down into plate and ornaments such that the money stock contracts. Conversely, if coin is deficient, the resulting recourse to paper substitutes and other expedients allows transactors to economize on coin whose velocity therefore rises. Via such devices, velocity adjusts to ensure that the stock of coin is just enough to purchase all the goods offered for sale at the predetermined level of prices. In this way, causation runs from prices to money and velocity. Here is the origin of the notion that changes in the stock of circulating media (coin and its paper substitutes) merely validate price changes that have already occurred and do nothing to produce such changes. 6 On Steuart’s cost-push theory, see Screpanti and Zamagni (1993, p. 53). all mercantilists were as sanguine as Steuart on hoards. Indeed they were somewhat ambivalent on the subject. Hoards to them could be either desirable or undesirable. On the one hand, hoards, by draining excess cash from circulation, would tailor the remaining stock precisely to the needs of trade. On the other hand, if output and so the needs of trade were expandable under the impact of a monetary stimulus, such hoards, by removing the source of that stimulus, could unduly constrain real activity. Even so, such hoards would see to it that no monetary excess ever developed to spill over into the commodity market to bid up prices. 8 See Screpanti and Zamagni (1993, p. 53). 7 Not T. M. Humphrey: Mercantilists and Classicals 61 Finally, there was Steuart’s uncompromising stance on the perennial issue of rules versus discretion in the conduct of policy. Like all mercantilists, Steuart sided with discretion. Monetary rules, whether of fixed or feedback variety, met with his skepticism as did all self-correcting adjustment mechanisms, natural or designed. To him, nothing but discretionary fine-tuning would do.9 Such enlightened intervention was the hallmark of his omnipotent, everactive, benevolent statesman whose job was to manipulate the volume of real activity in the national interest.10 Steuart’s statesman alone possessed the detailed knowledge necessary to conduct what today is known as a successful cheap-money, full-employment policy. The gap between actual and potential output, the monetary injection required to close the gap, and the interest rate necessary to draw the required metal from idle hoards: all revealed themselves to the statesman’s astute and vigilant scrutiny. So too did the ever-changing circumstances to which he tailored his actions. These propositions formed the core of mercantilist monetary theory which Law and Steuart deployed to analyze the underemployed economies of their time. Of the two writers, only Law, the paper money mercantilist, was able to translate his theory into action. His famous Mississippi scheme, which merged France’s national bank of issue with a trading and land development firm (the Mississippi Company) while simultaneously promising to reduce the French public debt, involved paper money expansion on a mammoth scale.11 The resulting spectacular inflationary boom and collapse of Law’s system had three consequences.12 It revealed that the initial output stimulus of a monetary expansion eventually vanishes leaving only inflation in its wake. It served to discredit paper money and financial innovation schemes for many years to come. It, together with the similar debacle of the assignats, a nominally land-backed paper currency issued by the French revolutionary government to inflationary excess in the years 1794 to 1796, provoked classicals to reject mercantilist trade and monetary theory root and branch. 9 Steuart of course never resorted to such modern terminology. Nevertheless, the concepts were his. 10 On Steuart’s statesman, see Eltis (1986) and Skinner (1981). 11 Law denied that the monetary expansion was excessive on the grounds that much of it went to redeem outstanding government bonds and equity claims to his trading firm. Since to him bonds and stocks shared money’s characteristic as a transactions medium, he saw all three instruments as exerting the same influence on spending. In his view, money swapped for bonds and equities leaves the total supply of financial purchasing power—money, bonds, and stocks— unchanged. Such monetary issue therefore is noninflationary. He erred. Bonds and stocks hardly qualify as transactions media and thus are far from perfect substitutes for money in spending. Monetizing them can be inflationary. See Niehans (1990, p. 51). 12 See Murphy (1997) for an exhaustive account of the rise and fall of Law’s system. 62 Federal Reserve Bank of Richmond Economic Quarterly Classical Counterpropositions Denouncing the mercantilist identification of wealth with precious metals, Adam Smith observed that national wealth consists not of specie or bullion but rather of stocks of productive resources—land, labor, and capital—and the efficiency with which they are used. With respect to the mercantilist prescription of protectionism as the path to opulence, both Smith and David Ricardo noted that wealth-enhancing, efficient resource allocation requires not protectionism but rather free trade in order to exploit comparative advantages stemming from specialization and division of labor.13 Price-Specie-Flow and Quantity Theory Propositions Other classicals joined the attack. David Hume (1752) used his price-specieflow mechanism to demonstrate the impossibility of the mercantilist goal of a permanently favorable trade balance and corresponding persistent specie inflow. Hume (pp. 62–63) noted that the additional specie, by raising domestic prices relative to foreign ones and so discouraging exports and spurring imports, would render the trade balance unfavorable and reverse the specie flow.14 The resulting drain of monetary metal would continue until domestic prices fell to the level consistent with trade balance equilibrium. Similarly, Hume (pp. 33, 37, 48) showed that the mercantilist fear of scarcity of money was unwarranted since any quantity of money, via a proportionate adjustment in the price level, could drive the trade of a nation. To prove as much, Hume (pp. 62–63) advanced a rigid version of the quantity theory according to which an exogenously given one-time reduction in the stock of money has no lasting effect on real activity but leads ultimately to a proportionate change in the money price of goods. Distinction between Absolute and Relative Prices Hume’s classical followers immediately seized upon his quantity theory and deployed it against the mercantilists. David Ricardo applied it to refute costpush theories of the price level.15 Accusing cost-pushers of confounding relative prices (market exchange ratios) with the absolute, nominal, or general level of prices, Ricardo flatly denied that a rise in costs—wage costs in particular— could raise general prices without an accompanying expansion of the money 13 Thus a follower of Smith might attribute Scotland’s penury not to monetary deficiency and the absence of banks, but rather to lack of specialization and division of labor resulting from a small population. 14 Cesarano (1998) argues that Hume actually rejected the price-specie-flow mechanism and its attendant changes in relative national price levels for the monetary approach to the balance of payments. By contrast, the standard view emphasized here holds that neither Hume nor his classical followers subscribed to the approach’s proposition of instantaneous purchasing power parity, or law of one price. 15 See Ricardo (1951–1973, I, pp. 46, 61–63, 104–05, 126, 302–03, 307–08, 315). T. M. Humphrey: Mercantilists and Classicals 63 stock. True, he did acknowledge that a wage hike might raise the prices of labor-intensive goods and so require consumers to spend more on those goods. But he also insisted that without accommodating increases in the money stock to foster spending, consumers would have less to spend on capital-intensive goods whose prices would therefore fall. The upshot was clear. Given a constant money stock, any wage-induced rise in some relative prices would be offset by compensating falls in others leaving the general average of all prices unchanged. Short-Run Nonneutrality and Long-Run Neutrality Propositions Classicals reserved their severest criticism for John Law’s money-stimulatestrade doctrine. Hume insisted that the doctrine holds in the short run but not the long.16 At first, money-stock changes indeed affect output and employment. Eventually, however, the output stimulus vanishes and only higher prices remain. Law’s doctrine holds in the short run because prices are temporarily sticky, or inflexible, in response to money stock changes. Such stickiness Hume attributed to the imperfect information price-setters possess on money-stock changes and their resulting failure to perceive and act upon the changes. Distribution effects constituted for him another source of temporary nonneutrality, or transitory influence on real activity, inasmuch as new money is initially concentrated in few hands and only gradually becomes dispersed throughout the economy.17 With prices sticky and money’s circulation velocity given, it follows that changes in the money stock are absorbed by output which accordingly deviates temporarily from its natural equilibrium level. Prices only begin to adjust when price-setters discover that their inventories of goods and labor are abnormally high or low. Eventually, monetary and price-perception errors are corrected as are initial distribution effects. At that point, the price level fully adjusts to the new money stock and output returns to its natural equilibrium level. Here is the source of the classical doctrine of the short-run nonneutrality and long-run neutrality of money.18 16 See Hume ([1752] 1955, pp. 37–38, 47–48). recognized still other sources of short-run nonneutrality including sticky nominal interest rates, fixed nominal charges such as rents and taxes, fixed nominal incomes of wage earners and rentiers, confusion of relative price for absolute price changes, market size encouragement to specialization and division of labor, and deliberate efforts on the part of organized groups to maintain real incomes. See Humphrey (1993, pp. 251–63). 18 Hume ([1752] 1955 pp. 39–40) admitted that money might exhibit long-run supernonneutrality. Being partly unanticipated (perhaps because agents formulate their expectations adaptively in a backward-looking way), a steady succession of money stock changes might perpetually frustrate the attempt of prices to catch up and therefore permanently affect the level of real output. 17 Classicals 64 Federal Reserve Bank of Richmond Economic Quarterly Classical Case for Rules Four remaining mercantilist arguments clamored for demolition. Classicals were glad to oblige. First was the mercantilist claim that discretion was superior to rules. Classicals countered with the opposite claim that rules replaced destabilizing activist intervention with smoothly operating, or stabilizing, automatic adjustment mechanisms. Unlike Steuart, classicals held a low opinion of the knowledge, capabilities, and motivation of the policy authorities. In particular, classicals, especially Ricardo, John Wheatley, and other Bullionist critics of the Bank of England, feared that central bankers operating under the kind of floating exchange rate, inconvertible paper regime prevailing in England during the Napoleonic Wars, would, if left to their own discretion, pursue inflationary policies. Since classicals regarded stability of the value of money as the overriding policy objective, they advocated rules obligating policymakers to achieve that goal. One such rule was the gold standard. By requiring the maintenance of a fixed currency price of gold, this rule, provided that the gold price of goods also remained fairly steady, was tantamount to stabilizing the money price of goods. And with the price level stable, money could function reliably as a unit of account and medium of exchange. In so doing, it could make its maximum contribution to the efficient operation of the real economy and cease to be a source of financial crises and panics. Say’s Law of Markets Next in line for rejection was the mercantilist claim that deficient aggregate demand condemns cash-poor economies to perpetual unemployment. Not so, wrote the classicist Jean-Baptiste Say in his 1803 Traité d’économie politique. The value of goods produced equals the cost of the inputs absorbed in their fabrication. It follows that the very act of production creates, in the form of factor payments, incomes sufficient to buy the goods off the market. And those incomes indeed will be spent. The insatiability of wants together with the unlikelihood that rational people would hoard their savings indefinitely in the form of sterile money ensures as much. Far from going unspent, saving automatically translates itself into investment. People deposit their savings with banks to earn interest. Those intermediaries, upon lending the saving to capitalist entrepreneurs to finance investment projects, guarantee that it enters the spending stream just as surely as if it were consumption spending. The upshot is that full-capacity supply creates its own demand such that mercantilist fears of general gluts and permanent stagnation are unfounded. Say’s Law of Markets identifies the natural level of real activity with full employment.19 19 Perhaps too cavalierly, classicals dismissed or minimized the problem of unemployment. To them joblessness, while it certainly occurred from time to time, was necessarily short-lived and T. M. Humphrey: Mercantilists and Classicals 65 Real Interest Rate As for the mercantilist argument that the interest rate is purely a monetary phenomenon, Hume, Ricardo, and Henry Thornton all repudiated it.20 They contended (1) that the natural equilibrium rate of interest is a real magnitude determined by productivity and thrift, and (2) that money, being neither of those variables, cannot affect the natural rate whose level is therefore resistant to monetary control. True, they conceded that a one-time monetary injection could temporarily depress the loan rate of interest below its equilibrium level. But they stressed the transience of this effect. They pointed out that the monetary injection puts upward pressure on prices. And since with higher prices more loans are needed to finance a given real quantity of investment projects, it follows that loan demands increase. The rise in loan demands reverses the initial fall in the loan rate and restores it to its natural level thereby frustrating attempts to keep it low. Supplementing the price-induced rise in loan demand is a fall in loan supply. For as prices rise, people need more cash, or coin, to mediate hand-to-hand transactions. The resulting conversion of notes and deposits into coin precipitates a cash drain from banks that diminishes bank reserves. To protect their reserves from depletion, banks raise their loan rates. Or what is the same thing, they contract their loan supply. The contraction of loan supply combines with the rise in loan demand to restore the interest rate to its natural equilibrium level determined by productivity and thrift. Criticism of Backing Theories of Money Last but not least was Law’s idea of a land-collateralized paper money stock. Henry Thornton was merciless in his criticism. He excoriated the plan on the grounds that it would fail to limit the money supply and in so failing would render the price level indeterminate.21 The plan’s flaw, wrote Thornton, is that it ties money to the nominal or dollar value, rather than to the fixed physical acreage, of land. By anchoring each dollar to another dollar, it sets up a dynamically unstable price-money-price feedback loop whose elements are free to expand or contract without limit. The result is that any random shock which raises land’s price would, by raising land’s value, increase money’s backing and so justify an expansion of its supply. The consequent expansion would further bid up land’s price thereby justifying still further increases in the money self-correcting through automatic wage, price, and interest-rate reductions. Only their inflationist, full-employment-at-any-cost counterparts of the Birmingham School, especially the Attwood brothers, Thomas and Matthias, were gravely concerned with it. 20 See Hume (1752, pp. 47–59); Ricardo (1951–1973, I, pp. 363–64; III, pp. 88–89, 91, 92; IV, p. 233; V, p. 445); Thornton ([1802] 1939, pp. 253–56). 21 Thornton ([1811] 1939, p. 342). He ([1802] 1939, pp. 244, 253–56) applies the same criticism to the real bills doctrine which ties the issue of bank money (notes and checking deposits) to the nominal volume of commercial paper that borrowers offer as collateral for bank loans. 66 Federal Reserve Bank of Richmond Economic Quarterly stock which would raise prices again and so on ad infinitum. In short, backing money with the nominal value of land—or, for that matter, with commercial paper representing the nominal value of goods in the process of production and distribution—would destabilize prices rather than stabilize them. Price stability required another principle of monetary limitation. Thornton’s refutation of the nominal backing idea completed the list of the original classical rebuttals of mercantilist monetary doctrine. Having contested this doctrine once, however, classicals and their descendants were called upon to counter it repeatedly throughout the nineteenth and twentieth centuries. Mercantilist views, despite their devastating initial rejection, reemerged to form the Banking School position in the famous Currency School-Banking School controversy that took place in England in the mid-1800s. Most of the usual suspects—cost-push, hoarding, reverse causality, discretion, nominal backing— appeared in the Banking School’s roundup. In opposing them, classicals, in their Currency School guise, found occasion to deploy the same quantity theoretic, price-specie-flow concepts they had earlier deployed against Law and Steuart. 2. CURRENCY SCHOOL-BANKING SCHOOL DEBATE (1830–1850) Ending a 24-year experiment with inconvertible paper, Britain had restored the gold convertibility of her currency in 1821. The ensuing Currency SchoolBanking School debate focused on whether the note component of such a convertible, gold-standard currency required statutory regulation to prevent overissue.22 The Currency School’s classical predecessors, notably David Ricardo, Henry Thornton, and others, had assumed that a convertible currency needed no such protection. If the currency were convertible, they reasoned, any excess note issue which raised British prices relative to foreign prices would be converted into gold to make cheaper purchases abroad.23 The resulting loss of specie reserves would immediately force banks to contract their note issue thus quickly arresting the drain and restoring the money stock and prices to their pre-existing equilibrium level. Given smooth and rapid adjustment (monetary self-correction), convertibility alone was its own safeguard. A series of monetary crises in the 1820s and 1830s, however, convinced the Currency School that adjustment was far from smooth and that convertibility per se was by no means a guaranteed safeguard to overissue. It was an inadequate safeguard because it allowed banks, commercial and central, too 22 For classic accounts of the Currency School-Banking School debate, see Viner (1937, Ch. 5), Fetter (1965, Ch. 6), Robbins (1958, Ch. 5), and Mints (1945, Ch. 6). For recent interpretations, see O’Brien (1975, pp. 153–59) and Schwartz (1987). 23 With the exception of John Wheatley, classicals held that national price levels could deviate temporarily from their purchasing power parity, or long-run equilibrium, levels. T. M. Humphrey: Mercantilists and Classicals 67 much discretion in the management of their note issue. Banks, facing no minimum required reserve ratio and willing to sacrifice safety for profit, could and did continue to issue notes even as gold was flowing out, delaying contraction until the last possible moment, and then contracting with a violence that sent shock waves throughout the economy. Currency School’s Monetary Rule What was needed, the Currency School thought, was a rule removing the note issue from the discretion of bankers and placing it under strict regulation. To be effective, this rule should require the banking system to contract its note issue one-for-one with losses of gold reserves so as to put a gradual and early stop to specie drains. Such a rule would embody the Currency School’s principle of metallic fluctuation according to which a mixed currency of paper and coin should be made to behave exactly as if it were wholly metallic, automatically expanding and contracting to match inflows and outflows of gold.24 Departure from this rule, the Currency School argued, would permit persistent overissue of paper. Such overissue, by forcing a protracted efflux of specie through the balance of payments, would in turn endanger the gold reserve, threaten gold convertibility, compel the need for sharp contraction, and thereby precipitate financial panics. Such panics would be exacerbated if internal gold drains coincided with external ones as domestic money holders, alarmed by the possibility of imminent suspension of cash payments, sought to convert paper currency into gold. No such consequences would ensue, the School felt, if the currency conformed to the metallic principle. Forced to behave like gold (regarded by the School as the stablest of monetary standards), the currency would be spared those sharp procyclical fluctuations in quantity that amplified disturbances arising from real shocks. The Currency School scored a triumph when its monetary rule was enacted into law. The Bank Charter Act of 1844 embodied its prescription that, except for a small fixed amount of notes issued against government securities, bank notes were to be backed by an identical value of gold. In modern terminology, the Act established a marginal gold reserve requirement of 100 percent behind note issues. With notes rigidly tied to gold in this fashion, their volume would start to shrink as soon as specie drains signaled the earliest appearance of overissue. Monetary overexpansion would be corrected automatically, swiftly, and gently before it could do much damage. Here was a practical policy application of Hume’s quantity theoretic, specie flow doctrines. Here was the notion of a channel of influence running from note overissue to rising prices to trade deficits to gold drains to corrective reductions in the note issue, reductions that 24 O’Brien (1975, p. 153) credits Joplin, Drummond, Page, Pennington, and McCulloch with the simultaneous enunciation of the metallic principle. 68 Federal Reserve Bank of Richmond Economic Quarterly restore general prices to their target equilibrium level. Here too was the classical preference for rules—in this case a 100 percent gold reserve requirement rule—rather than discretion in the conduct of banking policy. Banking School The rival Banking School flatly rejected the Currency School’s prescription of mandatory 100 percent gold cover for notes. Indeed, the Banking School denied the need for statutory note control of any kind. Instead, the School argued that a convertible note issue was automatically regulated by the needs of trade and required no further limitation. This conclusion stemmed directly from the real bills doctrine and the law of reflux which together posited guaranteed safeguards to overissue obviating the need for monetary control. The School’s real bills doctrine stated that the money stock could never be inflationary or deflationary if issued by way of collateralized loans advanced to finance transactions in the nominal volume of real goods and services. Similarly, the law of reflux asserted that overissue was impossible because any excess notes would be returned instantaneously to the banks for conversion into coin or for repayment of loans. Both doctrines embodied the notions of a passive, demand-determined money supply and of reverse causality running from prices and economic activity to money rather than vice versa as in the Currency School’s view.25 According to the reverse causality hypothesis, changes in the level of prices and production induce corresponding shifts in the demand for bank loans which banks accommodate via variations in their note issue. In this way, prices help determine the note component of the money stock, the expansion of which is the result, not the cause, of price inflation. As for the price level itself, the Banking School attributed its determination to factor incomes or costs (wages, interest, rents, etc.), thus positing a cost-push theory of price movements. The importance of cost-push theorizing to the Banking School cannot be overestimated. It even led Thomas Tooke, the School’s leader, to argue that high-interest-rate tight-money policies were inflationary since they raised the interest component of business costs, costs that passed through into higher prices.26 Mercantilist Ideas The concepts of cost inflation, reverse causality, and passive money are the hallmarks of an extreme anti-quantity theory of money to which the Banking School 25 Because these doctrines are consistent with those of the monetary approach to the balance of payments, Skaggs (1999) interprets the Banking School as early anticipators of that approach. Even so, the School hardly derived its conclusions from the logic of the monetary approach. The conclusions may have been the same, but they were reached by a different route. 26 On Tooke’s interest cost-push theory and Knut Wicksell’s definitive critique of it, see Humphrey (1998, pp. 60–64). T. M. Humphrey: Mercantilists and Classicals 69 adhered. Additional mercantilist hallmarks included the School’s propositions (1) that international gold movements are absorbed by idle hoards of excess specie reserves without affecting the volume of money in active circulation, (2) that gold drains stem from real shocks to the balance of payments rather than from domestic price inflation, (3) that changes in the stock of money are offset by compensating changes in the stock of money substitutes leaving the total circulation unchanged, and (4) that discretion is superior to rules in the conduct of monetary policy. The Banking School put these propositions to work in its critique of the classical monetary doctrines of the Currency School. Those doctrines, of course, contended that note overissue is the root cause of domestic inflation and specie drains. In opposing them, the Banking School argued as follows: Overissue is impossible since the stock of notes is determined by the needs of trade and cannot exceed demand. Therefore, no excess supply of money exists to spill over into the goods market to bid up prices. In any case, causality runs from prices to money rather than vice versa. Finally, specie drains stem from real rather than monetary shocks to the balance of payments and are totally independent of domestic price-level movements. These arguments severed all but one of the links in the Currency School’s monetary transmission mechanism running from money to prices to the trade balance, thence to specie flows and their impact on the monetary base, and finally back again to the money stock. The final link was broken when the Banking School asserted that gold flows come from idle hoards—buffer stocks of excess specie reserves—and not from the volume of money in circulation. Falling solely on the hoards, gold drains would find their monetary effects neutralized (sterilized) by the implied fall in excess reserves. To ensure that these hoards would always be sufficient to accommodate gold drains, the Banking School recommended that the Bank of England hold larger metallic reserves. With regard to the Currency School’s prescription that discretionary policy be replaced by a fixed rule, the Banking School rejected it on the grounds that rigid rules would prevent the banking system from responding to the needs of trade and would hamper the central bank’s power to deal with financial crises. Finally, the Banking School asserted the impossibility of controlling the monetary circulation via control of the gold and bank note component alone since limitation of that component would simply induce the public to resort to money substitutes (deposits and bills of exchange) instead. In other words, the circulation is like a balloon; when squeezed at one end, it expands at the other. More generally, the Banking School questioned the efficacy of base control in a financial system that could generate an endless supply of money substitutes. The Currency School, however, rejected this criticism on the grounds that the volume of deposits and bills was rigidly constrained by the volume of gold and notes and therefore could be controlled through the latter alone. In short, the total circulation was like an inverted pyramid resting on a gold and bank 70 Federal Reserve Bank of Richmond Economic Quarterly note base, with variations in the base inducing equiproportional variations in the superstructure of money substitutes. In counting deposits as part of the superstructure, the Currency School excluded them from its concept of money. It did so on the grounds that deposits, unlike notes and coin, were not generally acceptable in final payments during financial crises. Evaluation In retrospect, the Currency School erred in failing to define deposits as money to be regulated like notes. This failure enabled the Bank of England to exercise discretionary control over a large and growing part of the circulating medium, contrary to the School’s intentions. The School also erred in failing to recognize the need for a lender of last resort to avert liquidity panics and domestic cash drains. By the end of the nineteenth century it was widely recognized that the surest way to arrest an internal drain was through a policy of liberal lending. Such drains were caused by panic-induced demands for high-powered money (gold coin and Bank of England notes) and could be terminated by the Bank’s announced readiness to satiate those demands. The Currency School nevertheless remained opposed to such a policy, fearing it would place too much discretionary power in the hands of the central bank. These shortcomings in no way invalidated the School’s monetary theory of inflation which was superior to any explanations its critics had to offer. As for the Banking School, it rightly stressed the importance of checking deposits in the payments mechanism. But it was wrong in insisting that the real bills doctrine, which tied note issues to loans made for productive purposes, would prevent inflationary money growth. Like Henry Thornton, the Currency School triumphantly exposed this flaw by pointing out that rising prices would generate a growing demand for—and corresponding nominal collateral backing of—loans to finance the same level of real transactions. These loan demands, when accommodated in the form of deposit and note creation, would enlarge the money stock. In this way inflation would justify the monetary expansion necessary to sustain it and the real bills criterion would fail to limit the quantity of money in existence. Also, by 1900 Knut Wicksell and Irving Fisher had rigorously demonstrated the same point made by Thornton in 1802, namely that an insatiable demand for loans and a corresponding inexhaustible supply of eligible bills results when the loan rate of interest is below the expected rate of profit on capital. In such cases, the real bills criterion provides no bar to overissue. 3. THE KEYNESIAN REVOLUTION AND MONETARIST COUNTER-REVOLUTION (1936–1985) Classicals won the Currency-Banking dispute. Their victory lasted until exclassical John Maynard Keynes, having defected to the opposite side, routed T. M. Humphrey: Mercantilists and Classicals 71 them in 1936.27 But they regained their crown when monetarists (with help from the new classical school) dislodged Keynesian macroeconomics in the 1970s and 1980s. Keynes launched his attack in the midst of the Great Depression when the stark conditions of stagnation, poverty, and mass unemployment mocked the classical notion of a self-equilibrating, fully employed economy. Clearly the time was ripe for a mercantilist revival. That revival took the form of the Keynesian Revolution with the leader’s General Theory as its bible. In that book, Keynes replaced the full capacity, quantity theoretic doctrines of the classicals with at least four propositions inherited from Law and Steuart. Keynes’s Mercantilist Propositions First, like Law, he argued that in times of mass unemployment the primary stimulative effects of expansionary monetary policy fall on real output and employment rather than on prices. That is, they do so unless negated by liquidity traps and interest-insensitive investment demand schedules, both of which cause velocity reductions to absorb the impact of monetary expansion. Absent such phenomena, however, Keynes’s model implied that monetary stimuli affect real activity rather than prices. Like Law, he stressed that the stimulus works through an interest rate channel. More money means lower interest rates, a cheapened cost of capital, and thus a rise in investment spending. The increased investment induces additional rounds of consumption spending causing aggregate demand to rise by a multiple of the new investment spending. With idle resources available to draw upon, production expands to meet the increased aggregate demand. In expounding his interest rate transmission mechanism, Keynes praised his mercantilist forebears for anticipating it. Indeed, the “Notes on Mercantilism” section of his General Theory argues that the notion of a linkage running from money to interest rates to investment to output constituted the rationale for the mercantilists’ advocacy of export surpluses financed by specie inflows. Second, like Steuart, Keynes held that product prices, individual and aggregate, are determined by unit labor cost plus a markup to cover profits and nonlabor costs. Here is the mercantilist notion of the price level as a nonmonetary phenomenon.28 True, Keynes admitted that monetary expansion through its stimulus to employment might, because of diminishing returns to labor, raise unit labor costs and so prices. But he tended to minimize or disregard money’s price-raising effects. Instead, he treated the price level as an institutional datum 27 Before he abandoned classicism, Keynes was one of its luminaries. Both his 1923 A Tract on Monetary Reform and his 1930 A Treatise on Money are squarely in the classical tradition. He returned to the classical fold shortly before his death in 1946. 28 Keynes applied this notion to a closed economy. He was not referring to the case where, with foreign prices given and the exchange rate fixed, the real terms of trade drives the price level in a small open economy. 72 Federal Reserve Bank of Richmond Economic Quarterly governed by nominal wage rates which autonomous forces—union wage-setting policy, worker money illusion, and the like—render downwardly inflexible at low levels of employment. By expressing prices in terms of exogenously given factor costs, he pointed the way to a cost-push theory of the price level. His immediate followers, Joan Robinson, Nicholas Kaldor, and Richard Kahn, certainly interpreted him this way and accordingly denied money a role in price determination.29 Third, Keynes restated Steuart’s doctrine of hoarding in the form of his concept of the liquidity trap. The trap, he wrote, might come into operation in deep depressions when the interest rate falls to a level so low that everybody unanimously believes it cannot stay there but must return to its conventional normal height. At the floor rate, all are indifferent between holding cash or earning assets whose prices, which vary inversely with the interest rate, are expected to fall. Indeed, asset prices are expected to fall by an amount such that the resulting anticipated capital loss just equals (and so offsets) the interest return on the assets. As there is no advantage to holding such assets instead of zero-yield cash, the latter becomes a perfect substitute for the former in individuals’ portfolios. At this point, the demand for money becomes insatiable and infinitely sensitive to the slightest change in interest rates. Keynes called this pathological condition absolute liquidity preference. When this condition rules, no increase in the money stock, no matter how large, can reduce the interest rate. Suppose the central bank expands the money stock by purchasing bonds on the open market. Such bidding puts incipient upward pressure on bond prices. But the slightest rise of the latter induces bondholders to sell to the central bank and then to hoard the cash proceeds. Since at the floor rate of interest the demand for money is insatiable and the willingness to sell bonds absolute, no amount of open market operations can overcome absolute liquidity preference and reduce interest rates. And with rates at their irreducible minimum, they cannot fall any lower to stimulate real activity. Here is Keynes’s expression of the mercantilist fear that monetary expansion cannot be counted upon to stimulate spending because the new money may disappear into idle hoards. Fourth, Keynes found still another obstruction to block the interest rate channel. Even if monetary injections were successful in lowering interest rates, those injections still might fail to stimulate real activity if investment spending were unresponsive to the lower rates. If so, then two obstacles—an interestinsensitive investment schedule as well as a liquidity trap—could render monetary policy ineffective in a depression. In both cases, a rise in the money stock would be offset by a fall in velocity leaving total spending unchanged. With 29 On 324–30). the cost-push pricing theories of Keynes and his followers, see Tavlas (1981, pp. T. M. Humphrey: Mercantilists and Classicals 73 variable velocity absorbing the impact of money stock changes, none would be transmitted to nominal income. The rigid links connecting money to nominal income and prices as postulated by the classics would be severed or severely weakened. Steuart had said exactly the same thing in 1767. Post-Keynesian Extensions To Keynes’s own mercantilist doctrines, Keynes’s followers writing in the inflationary post–World War II period added others. Some interpreted inflation as a cost-push phenomenon emanating from union bargaining strength, business monopoly power, oligopoly administered prices, commodity shortages, supply shocks, and other real and institutional forces putting upward pressure on factor costs and profit mark-ups. Then too, “cheap money” advocates held that expansionary monetary policy could be used to peg interest rates at low levels so as to minimize the interest burden of the public debt while simultaneously stimulating real activity. An alternative version of the same argument, associated with the Phillips curve trade-off approach to policy questions, held that monetary policy could peg the unemployment rate at permanently low levels at the cost of a stable (nonaccelerating) rate of inflation. Underlying all these arguments were the presuppositions (1) that full employment is the dominant policy concern, (2) that the employment benefits of monetary stimuli exceed their inflationary costs, and (3) that disinflationary monetary policy, because entrenched inflation is so resistant to it, would produce intolerably large and protracted reductions in output and employment. John Law of course held similar presuppositions, as did other mercantilists.30 There remained the mercantilist ideas of reverse causation, passive money, and futility of base control of money and of inflation. Nicholas Kaldor supplied these ideas in his 1982 The Scourge of Monetarism. Representing the peak of post-Keynesian skepticism of the relevance of the quantity theory, Kaldor’s Scourge denied the possibility of base control given the central bank’s duty to guarantee bank liquidity and the financial sector’s ability to engineer changes in the turnover velocity of money via the manufacture of money substitutes. Kaldor’s transmission mechanism runs from trade unions to wages to prices to money and thence to bank reserves. Unions determine wages, wages determine prices, prices influence loan demands, and loan demands, via their accommodation in the form of bank-created checking deposits, determine the money stock, with central banks permissively supplying the necessary reserves. Far from exerting an activating influence, money appears at the end of the causal chain. 30 On the mercantilists’ policy goal of full employment, see Grampp (1952). 74 Federal Reserve Bank of Richmond Economic Quarterly Monetarists’ Response to Keynes and the Keynesians: The Classical Comeback Even as Keynesianism was riding high, critics were sniping at it from the sidelines. Eventually these criticisms would culminate in a monetarist counterrevolution that would dethrone mercantilist doctrines and restore classical ones. At least eight mileposts mark the route of the classical comeback. First came the theory of the real balance effect. Enunciated by Gottfried Haberler, A. C. Pigou, and Don Patinkin, it denied that Keynesian liquidity traps and interest-insensitive investment schedules could bar full employment.31 That is, it denied they could do so provided (1) wealth in the form of real money balances influences consumers’ spending decisions, and (2) prices possess some downward flexibility. The latter condition should hold in a slump since a depressed economy implies an excess supply of goods exerting downward pressure on prices. Lower prices in turn raise the real value, or purchasing power, of cash balances in consumers’ wealth portfolios. The rise in real cash balances stimulates consumption spending until full employment is reached. Indeed, it is unnecessary to wait for falling prices to activate the real balance effect. The central bank can achieve the same result directly by increasing the money supply. In principle, then, Say’s Law holds and money is hardly powerless to affect aggregate demand even under extreme Keynesian conditions. Keynes might have realized as much had he incorporated real balances into his consumption function. Second came the empirical work of Clark Warburton, Milton Friedman, and Anna Schwartz confirming money’s power to affect spending. Contrary to Keynes’s claim that idle hoards and offsetting velocity movements might negate money’s impact on nominal expenditure, Warburton established that (1) an erratic money stock through its impact on spending had been the chief factor causing most U.S. recessions, (2) money’s initial impact was on output, and (3) with a lag, prices eventually adjusted to fully absorb the money stock change.32 Friedman and Schwartz (1963) then corroborated Warburton by showing that a one-third contraction of the money stock caused or intensified the Great Depression of the 1930s. These studies, together with Friedman’s findings that persistent inflation is largely or solely the result of excessive monetary growth, effectively reestablished the classical doctrine of the short-run nonneutrality and long-run neutrality of money. They also showed that classical doctrine could account for the Great Depression. Third came Karl Brunner’s and Allan Meltzer’s 1967 critique of the LawKeynes theory of interest rates as a policy guide. That theory claimed that 31 See Haberler (1941, pp. 242, 389, 403), Pigou (1943, 1947), and Patinkin (1948, 1965). Warburton (1966) for a collection of his relevant papers, many published between 1944 and 1953. 32 See T. M. Humphrey: Mercantilists and Classicals 75 the interest rate, a purely monetary variable, accurately measures the degree of monetary ease or tightness. Brunner and Meltzer disagreed. The rate, they said, is an unreliable indicator of monetary ease or tightness. It is unreliable because it registers the impact of nonmonetary determinants—notably business loan demands—as well as monetary ones. The rate might be low or high not because money was easy or tight but rather because loan demand was weak or strong. Neglect of this important consideration could lead to perverse, destabilizing policy. For example, in times of depression, when slack business loan demands rendered the rate low, the authorities, misinterpreting the low rate as signifying easy money, might contract the money stock and thereby intensify the depression. Contrariwise, in times of inflation when booming credit demands rendered the interest rate high, the authorities, misinterpreting the high rate as signaling tight money, might expand the money supply and so escalate the inflation. By confounding the effects of loan demands with those of monetary ease or tightness, the central bank would engineer a perverse, procyclical monetary policy. This critique did much to discredit the Law-Keynes theory of the interest rate.33 Milton Friedman’s case for monetary rules constituted the fourth monetarist milestone. Friedman (1960) argued that long and variable time lags render discretionary countercyclical monetary policy destabilizing. Because such lags make forecast errors inevitable, the central bank cannot predict the short-run impact of its moves. The result is that expansionary actions aimed at fighting recessions may take effect at precisely the wrong time when the economy is booming just as contractionary anti-inflation actions may hit the economy when it is already mired in a slump. Friedman’s solution was to recommend a rigid rule fixing the money stock’s growth rate equal to the trend growth rate of output. Such a rule would operate as an automatic stabilizer working to restore aggregate spending to its long-run noninflationary full-employment path. Inflationary spending that outruns the rule-determined money stock could not be sustained and must slacken. Conversely, spending that falls short of money stock growth, as in recessions, would eventually quicken under the impact of the monetary stimulus. In this way, such rule-induced corrections would ensure that money acts to smooth cyclical fluctuations in spending and that long-run aggregate demand grows at the same trend rate as real output such that prices remain stable. The fifth milestone, and the one that more than any other turned the tide in favor of the classicals, was the stagflation experience of the 1970s. That episode saw the simultaneous appearance of rapid monetary growth, rising 33 As did a related critique attributing high rates to the inflationary anticipations of the public. Embodied in the inflation-premium component of interest rates and fueled by premonitions of policy permissiveness, such anticipations would be realized if the central bank, in a misguided attempt to lower rates, subsequently engineered rapid monetary expansion. 76 Federal Reserve Bank of Richmond Economic Quarterly unemployment, and accelerating inflation—an impossible combination according to the predictions of John Law and the Keynesian school. This experience did much to discredit mercantilist beliefs that money stimulates trade and that the price level is independent of the money supply. Natural Rate Hypothesis The sixth milestone was the monetarists’ natural rate hypothesis according to which unemployment returns to its natural equilibrium level regardless of the inflation rate. Milton Friedman (1968) and Edmund Phelps (1967) established this conclusion with the aid of an expectations-augmented Phillips curve. They showed that when inflationary expectations are incorporated into the Phillips curve, no permanent inflation-unemployment trade-offs remain to be exploited. True, like David Hume, they acknowledged that short-run trade-offs might still exist. Unanticipated rises in inflation, by lowering real wages, could stimulate employment and output temporarily. But once the increased inflation was fully perceived, anticipated, and therefore incorporated into nominal wage rates, the resulting rise in real wages would restore unemployment to its natural equilibrium level. In this way, the adjustment of expected to actual inflation transforms downward-sloping Phillips curves into a vertical line at the natural rate of unemployment. The classicals were right. Inflationary stimuli are temporary, never permanent. One cannot use a higher stable rate of inflation to peg the unemployment rate at arbitrarily low levels since there are no permanent employment gains to be had at any steady rate of inflation. Such gains can be had, if indeed they are available at all, only at the cost of ever-accelerating inflation. Many Keynesians eventually came to accept the natural rate hypothesis. Even so, they still contended that disinflation was too costly to pursue. Their fear stemmed from early versions of the expectations-augmented Phillips curve.34 Those versions embodied the assumption that agents revise their inflationary anticipations downward in mechanical, or adaptive, error-learning fashion only when actual, reported inflation turns out to be lower than expected. Accordingly, if the authorities sought to eradicate inflationary expectations— an absolute requirement of any successful disinflationary policy—they would have to force actual inflation below expected inflation thereby inducing the latter to adjust toward the former as it converged on the desired target rate. This sequence required the central bank to employ contractionary monetary policy to raise unemployment above its natural level. The resulting excess unemployment would put downward pressure on the actual rate of inflation to which the expected rate would adjust with a lag. Through this long and painful error-learning adjustment process, both actual and anticipated inflation eventually would be squeezed out of the economy, albeit at the cost of much lost output and employment. 34 See Taylor (1997, pp. 278–79). T. M. Humphrey: Mercantilists and Classicals 77 Rational Expectations Lower the Cost of Disinflation The seventh monetarist/new classical milestone disposed of this Keynesian concern. Pairing John Muth’s (1961) seminal work on rational expectations with Friedman’s natural rate hypothesis, Robert Lucas (1972) and Thomas Sargent and Neil Wallace (1975) showed that if expectations are formed rationally rather than mechanically then disinflation need not be a painful drawn-out process. On the contrary, the unemployment cost of disinflation might be far less than Keynesians feared. For if people formed their anticipations rationally, they would take into account all systematic, and therefore predictable, future disinflationary policy actions and embody them in their price forecasts. Provided policymakers behaved in a nonhaphazard, credible fashion, actual and expected rates of inflation and disinflation would coincide such that no gap would develop between them. With no gap, there would be no need for excess unemployment to generate it. Consequently, inflation, actual and expected, would be brought to its zero target level with no cost in terms of excess unemployment. In actuality, of course, this conclusion proved to be a bit too facile and sanguine. In a world in which wages and prices are to some degree sticky or inflexible such that markets fail to clear instantaneously, even rationally expected disinflation would incur some unemployment cost. Nevertheless, the analysis showed that these costs could be much lower than Keynesians feared. Time Inconsistency Case For Rules The last milestone was the time inconsistency argument which strengthened the classical case for rules by showing how they reinforce policy credibility. Enunciated by Finn Kydland and Edward Prescott (1977) and by Robert Barro and David Gordon (1983a, b), the argument is simplicity itself. Suppose a discretionary, fine-tuning central bank wants to eradicate inflationary expectations so it can have a favorable temporary inflation-unemployment trade-off to exploit. The bank announces its intention to pursue a policy of price stability. It assumes people will believe the announcement and revise their inflation predictions accordingly. The announcement, however, lacks credibility. Private agents realize that once they formulate and act upon such new price predictions, the bank will be tempted to renege on its promise and create a surprise inflation in order to boost output and employment. Such knowledge induces the rational public to discount the announcement and to maintain inflationary expectations at levels high enough to remove the bank’s temptation to cheat. The result is that equilibrium unemployment is no lower than it otherwise would be, and yet equilibrium inflation is too high. What prevents inflation from immediately dropping to zero at the natural rate of unemployment is the central bank’s inability to promise credibly not to create surprise inflation. Needed is something to convince the public that the central bank will not succumb to the temptation to inflate. That something is a monetary rule replacing the bank’s discretionary power with a 78 Federal Reserve Bank of Richmond Economic Quarterly precommitment binding it irrevocably to price stability.35 In demonstrating as much, the time inconsistency argument reinforced the classical case for rules.36 The cumulative effect of the foregoing developments was to shift mainstream monetary opinion away from the extremes of Keynesian mercantilism toward classical monetarism. Not all Keynesian doctrines were abandoned, of course. Nor were all monetarist ones embraced. On the contrary, mainstream opinion assimilated an eclectic amalgam of competing views. But a new consensus definitely had emerged. After four or five decades of mercantilist dominance, the classical view was at the wheel once again. 4. CONCLUSION Three centuries of monetary controversy and experience have established certain hard-won classical truths. Inflation and deflation are monetary rather than cost-push phenomena. There are no long-run inflation-output trade-offs to exploit; central banks cannot permanently peg real variables at disequilibrium levels. Attempts to do so produce explosive, ever-worsening inflation or deflation. Money-stock changes at best affect output and employment temporarily. The output effect vanishes when prices adjust; all that remains is a changed rate of inflation. Stability of the value of money is a prerequisite of an efficiently functioning real economy. All nonnegligible inflation rates violate this prerequisite and are therefore harmful. Monetary rules contribute to such stability. Presently these truths are in the driver’s seat. The proof is that many central bankers now view their primary mission as providing a stable price-level environment within which businesspeople can receive accurate market signals and allocate resources efficiently. Still the classical wisdom, though ruling, is hardly secure. For mercantilist views continue to abound. Even today, some economists still insist that it is better to live with inherited inflation than to fight it because disinflation is too costly to pursue. Others echo Steuart’s costpush theory, attributing the disinflation of the 1990s to such nonmonetary forces as increased global competition, rapid technological progress, falling computer and health-care costs, weakened power of labor unions, and the like. Still others evoke the Steuart-Keynes image of liquidity traps in holding that monetary policy is powerless to stimulate the currently depressed Japanese economy. Commentators even parrot Law’s monetary theory of interest when they cite Japan’s low interest rates as proof that the country is awash with money when the opposite is true. And always there are those who argue that, with prices 35 Alternatively, an established reputation as a zealous inflation fighter would do. time consistency case for rules differs a bit from Friedman’s argument. He sees rules as overcoming the central bank’s inability to predict the short-run impact of its actions. By contrast, the time inconsistency argument holds that rules are good for commitment reasons even when central bankers have full knowledge of the impact of their moves. 36 The T. M. Humphrey: Mercantilists and Classicals 79 determined by real considerations, monetary policy should be free to pursue nonprice objectives such as achieving full employment and maximizing real growth. The challenge then is to ensure that the classical truths will not be forgotten. But that is a tall order given that memories fade, that central bank leadership changes, that the current generation of economists familiar with the Keynesianmonetarist controversy is passing from the scene, that revisionist scholars can be counted upon to reinterpret the record radically, and that future generations may well be as reluctant as the present one to study the lessons of the past. The task of countering these influences and preserving the classical wisdom falls to the doctrinal historian. As curator of the stock of eclipsed and unfashionable ideas, he has his work cut out for him. An even more important challenge is to embed, or lock, the classical truths into enduring institutional arrangements that allow no room for mercantilist policy alternatives. To this end, proponents of the classical view propose a variety of possible arrangements. These include (1) congressional mandates for price stability, (2) formal contracts between elected governments and central banks fixing quantitative targets for price-level behavior, (3) guaranteed independence for central bankers to insulate them from the political pressure to inflate, and (4) the appointment of conservative, inflation-averse central bankers committed to the goal of price stability. The trouble is, however, that none of these proposed arrangements can assure that classical policies will reign supreme for all time. Mandates can be changed, contracts terminated, guarantees revoked, and appointments altered. The upshot is that it is too early to declare a permanent victory for the classical view. Indeed, there may always be a market for the opposing view that central banks need not and must not be bound to the goal of price stability. For better or worse, that view will challenge the classical view whenever the public perceives unemployment or sluggish real growth rather than inflation to be the dominant economic problem. Still, the inherent cyclicality of ideas suggests an inevitable classical response to that challenge. Classicism, in short, will return to prominence to be confronted anew. For history shows it to be nothing if not resilient. Over long spans of time, it has proved resistant to the kinds of economic shocks that occasionally propel mercantilists to prominence. That is one of the chief insights of doctrinal history. 80 Federal Reserve Bank of Richmond Economic Quarterly REFERENCES Barro, Robert J., and David B. Gordon. “A Positive Theory of Monetary Policy in a Natural Rate Model,” Journal of Political Economy, vol. 91 (August 1983a), pp. 589–610. . “Rules, Discretion, and Reputation in a Model of Monetary Policy,” Journal of Monetary Economics, vol. 12 (June 1983b), pp. 101– 22. Blaug, Mark. Economic Theory in Retrospect, 5th ed. Cambridge: Cambridge University Press, 1996. Brunner, Karl, and Allan H. Meltzer. “The Meaning of Monetary Indicators,” in G. Horwich, ed., Monetary Process and Policy: A Symposium. Homewood, Ill.: R. D. Irwin, 1967. Cesarano, Filippo. “Hume’s Specie-Flow Mechanism and Classical Monetary Theory: An Alternative Interpretation,” Journal of International Economics, vol. 45 (June 1998), pp. 173–86. Eltis, Walter. “Sir James Steuart’s Corporate State,” in R. D. Collison Black, ed., Ideas in Economics. London: Macmillan, 1986. Fetter, Frank W. Development of British Monetary Orthodoxy, 1797–1875. Cambridge, Mass.: Harvard University Press, 1965. Friedman, Milton. “25 Years after the Rediscovery of Money: What Have We Learned?” Discussion, American Economic Review, vol. 65 (May 1975), pp. 176–79. . “The Role of Monetary Policy,” American Economic Review, vol. 58 (March 1968), pp. 1–17. . A Program for Monetary Stability. New York: Fordham University Press, 1960. , and Anna J. Schwartz. A Monetary History of the United States, 1867–1960. Princeton: Princeton University Press for the National Bureau of Economic Research, 1963. Grampp, William D. “The Liberal Elements in English Mercantilism,” Quarterly Journal of Economics, vol. 66 (November 1952), pp. 465–501. Haberler, Gottfried. Prosperity and Depression: A Theoretical Analysis of Cyclical Movements. Geneva: League of Nations, 1941. Hume, David. “Of Money,” “Of the Balance of Trade,” and “Of Interest,” (1752), in Hume’s Writings on Economics, E. Rotwein, ed. Madison, Wis.: University of Wisconsin Press, 1955. Humphrey, Thomas M. “Historical Origins of the Cost-Push Fallacy,” Federal Reserve Bank of Richmond Economic Quarterly, vol. 84 (Summer 1998), pp. 53–74. T. M. Humphrey: Mercantilists and Classicals 81 . Money, Banking and Inflation: Essays in the History of Monetary Thought. Aldershot, England: Edward Elgar Publishing Ltd., 1993. Hutchison, Terence W. Before Adam Smith: The Emergence of Political Economy, 1662–1776. Oxford: Basil Blackwell, 1988. Kaldor, Nicholas. The Scourge of Monetarism. Oxford: Oxford University Press, 1982. Keynes, John Maynard. The General Theory of Employment, Interest and Money. London: Macmillan, 1936. Kydland, Finn, and Edward Prescott. “Rules Rather Than Discretion: The Inconsistency of Optimal Plans,” Journal of Political Economy, vol. 85 (June 1977), pp. 473–91. Law, John. Money and Trade Considered, with a Proposal for Supplying the Nation with Money. Edinburgh, 1705. Lucas, Robert E., Jr. “Expectations and the Neutrality of Money,” Journal of Economic Theory, vol. 4 (April 1972), pp. 103–24. Meek, R. L. “Rehabilitation of Sir James Steuart,” in his Economics and Ideology and Other Essays. London: Chapman and Hall, 1967. Mints, Lloyd W. A History of Banking Theory. Chicago: University of Chicago Press, 1945. Murphy, Antoin E. John Law: Economic Theorist and Policy-Maker. Oxford: Clarendon Press, 1997. Muth, John F. “Rational Expectations and the Theory of Price Movements,” Econometrica, vol. 29 (July 1961), pp. 315–35. Niehans, Jurg. A History of Economic Theory: Classic Contributions, 1720– 1980. Baltimore: Johns Hopkins University Press, 1990. O’Brien, Denis P. The Classical Economists. New York: Oxford University Press, 1975. Patinkin, Don. Money, Interest, and Prices, 2d ed. New York: Harper and Row, 1965. . “Price Flexibility and Full Employment,” American Economic Review, vol. 38 (September 1948), pp. 543–64. Pearce, Ivor F. “Confrontation with Keynes,” in The Coming Confrontation: Will the Open Society Survive to 1989? London: The Institute of Economic Affairs, 1978. Phelps, Edmund S. “Phillips Curves, Expectation of Inflation and Optimal Unemployment Over Time,” Economica, vol. 34 (August 1967), pp. 254– 81. Pigou, Arthur C. “Economic Progress in a Stable Environment,” Economica, vol. 14 (August 1947), pp. 180–90. 82 Federal Reserve Bank of Richmond Economic Quarterly . “The Classical Stationary State,” Economic Journal, vol. 53 (December 1943), pp. 343–51. Ricardo, David. The Works and Correspondence of David Ricardo, Vols. I and II, edited by P. Sraffa with M. H. Dobb. Cambridge: Cambridge University Press, 1951–1973. Robbins, Lionel. Robert Torrens and the Evolution of Classical Economics. London: Macmillan, 1958. Sargent, Thomas J., and Neil Wallace. “Rational Expectations, the Optimal Monetary Instrument, and the Optimal Money Supply Rule,” Journal of Political Economy, vol. 83 (April 1975), pp. 241–54. Say, Jean-Baptiste. Traité d’économie politique: Ou, simple exposition de la manière dont se forment, se distribuent, et se consomment les richesses, Vol. 1. Paris: Deterville, 1803. Translated under the title A Treatise on Political Economy, 4th ed. Philadelphia: G. R. Elliot, 1834. Schwartz, Anna J. “Banking School, Currency School, Free Banking School,” in J. Eatwell, M. Milgate, and P. Newman, eds., The New Palgrave—A Dictionary of Economics, Vol. 1. London: Macmillan, 1987. Screpanti, Ernesto, and Stefano Zamagni. An Outline of the History of Economic Thought, translated by David Field. New York: Oxford University Press, 1993. Skaggs, Neil. “Changing Views: Twentieth-Century Opinion on the Banking School-Currency School Controversy,” History of Political Economy, vol. 31 (Summer 1999), pp. 361–91. Skinner, Andrew S. “Sir James Steuart: Author of a System,” Scottish Journal of Political Economy, vol. 28 (February 1981), pp. 20–42. Tavlas, George S. “Keynesian and Monetarist Theories of the Monetary Transmission Process: Doctrinal Aspects,” Journal of Monetary Economics, vol. 7 (May 1981), pp. 317–37. Taylor, John B. “Comment,” in Christina and David H. Romer, eds., Reducing Inflation: Motivation and Strategy. Chicago: University of Chicago Press, 1997, pp. 276–80. Thornton, Henry. An Enquiry into the Nature and Effects of the Paper Credit of Great Britain (1802). Together with His Speeches on the Bullion Report, May 1811. F. A. von Hayek, ed. London: George Allen & Unwin, 1939. Viner, Jacob. Studies in the Theory of International Trade. New York: Harper Brothers, 1937. Warburton, Clark. Depression, Inflation and Monetary Policy, Selected Papers 1945–1953. Baltimore: Johns Hopkins University Press, 1966.