The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.
Big Push in Distorted Economies WP 21-07 Francisco Buera Washington University in St. Louis Hugo Hopenhayn UCLA Yongseok Shin Washington University in St. Louis Nicholas Trachter Federal Reserve Bank of Richmond Big Push in Distorted Economies∗ Francisco Buera Hugo Hopenhayn Washington University in St. Louis UCLA Yongseok Shin Nicholas Trachter Washington University in St. Louis Federal Reserve Bank of Richmond March 26, 2021 Abstract Why don’t poor countries adopt more productive technologies? Is there a role for policies that coordinate technology adoption? To answer these questions, we develop a quantitative model that features complementarity in firms’ technology adoption decisions: The gains from adoption are larger when more firms adopt. When this complementarity is strong, multiple equilibria and hence coordination failures are possible. More importantly, even without equilibrium multiplicity, the model elements responsible for the complementarity can substantially amplify the effect of distortions and policies. In what we call the Big Push region, the impact of idiosyncratic distortions is over three times larger than in models without such complementarity. This amplification enables our model to nearly fully account for the income gap between India and the United States without coordination failures playing a role. ∗ Buera: fjbuera@wustl.edu. Hopenhayn: hopen@econ.ucla.edu. Shin: yshin@wustl.edu. Trachter: trachter@gmail.com. We thank Andy Atkeson, Ariel Burstein, Ezra Oberfield, Michael Peters, and participants at several seminars and conferences for comments and suggestions. We thank Eric LaRose, Reiko Laski, and James Lee for outstanding research assistance. The views expressed herein are those of the authors and do not necessarily represent the views of the Federal Reserve Bank of Richmond or the Federal Reserve System. 1 1 Introduction Many countries have industrialized and grown rapidly by adopting modern technologies. Why don’t poor countries adopt more productive technologies? What policies can effectively promote technology adoption? The standard view emphasizes the role of distortions or barriers to technology adoption (e.g., Parente and Prescott, 1999; Hsieh and Klenow, 2014; Cole et al., 2016; Bento and Restuccia, 2017). According to this view, eliminating the distortions is the obvious policy response. An alternative view emphasizes the role of complementarity and coordination failures: Firms in poor countries use unproductive technologies because other firms use unproductive technologies, even though the gains from adoption increase with the number of adopters so that they will all benefit from a coordinated decision to adopt. According to this view, policies can have large aggregate effects by coordinating firms’ decisions. This view has a long tradition in policy circles (e.g., RosensteinRodan, 1943; Hirschman, 1958) and is supported by more recent theoretical works (e.g., Murphy et al., 1989; Matsuyama, 1995; Ciccone, 2002). However, there have been few quantitative analyses, if any, of the coordination failure view of economic development. Our paper bridges these two paradigms in a quantitative framework. This paper makes two contributions. First, our theoretical analysis goes beyond determining the existence of multiple equilibria to show that, even if they do not exist, complementarity in technology adoption can amplify the effect of distortions and policies. Second, our quantitative analysis based on aggregate and micro-level data examines the empirical relevance of multiple equilibria and coordination failures and, more importantly, shows the economic significance of the amplification channel in the absence of multiple equilibria. In what we call the Big Push region, the impact of idiosyncratic distortions is over three times larger than in models without such complementarity. This amplification in our model helps account for the vast income gap between poor and rich countries without coordination failures playing a role. In our model, firms are ex-ante heterogeneous, produce differentiated goods, are subject to idiosyncratic distortions, and are connected to one another through inputoutput linkages. Firms first choose whether or not to pay a fixed cost and enter the market. Active firms can operate a traditional technology or, upon paying adoption costs, a more productive modern technology. We first theoretically analyze the conditions under which the complementarity in 2 firms’ technology adoption decisions can amplify the effect of policies and distortions and even support multiple equilibria. When more firms adopt the productive technology, the aggregate price falls, which has the direct effect of reducing the price and the profit of the marginal adopter. However, the lower aggregate price also means a larger demand for the marginal firm’s output, a lower cost of the intermediate input for production, and less expensive goods portion of the adoption costs, raising the marginal adopter’s gains. If the latter forces dominate the direct competition effect, we have complementarity. More heterogeneity in firm productivity implies that the density of the firms near the adoption margin is smaller, implying a weaker feedback effect from the marginal firm’s adoption to the aggregate economy and hence weaker complementarity.1 On the other hand, idiosyncratic distortions correlated with firm productivity (i.e., more productive firms being distorted more) compress the effective heterogeneity across firms, which strengthens complementarity. For simple versions of our model, we derive the conditions for amplification and equilibrium multiplicity. What contributes to amplification are, consistent with the discussions above: a small elasticity of substitution across differentiated goods, adoption costs in goods rather than labor, a high intermediate input intensity of the modern technology, small heterogeneity in firm productivity, and a close correlation between idiosyncratic distortions and firm productivity. Multiplicity is an extreme form of amplification, and indeed the conditions for the existence of multiple equilibria are stronger than those for amplification, a result corroborated by our quantitative model that exhibits potent amplification even without multiple equilibria. Next, we use aggregate and micro-level data from the United States and India and conduct a quantitative analysis. The United States is an undistorted benchmark, and India is a large developing country for which relevant micro-level data is available. The full version of our model has several layers, but it is tractable enough that most of the parameters can be identified transparently from the data. In spite of the potential presence of multiplicity, under the assumption that the data comes from an equilibrium where adopters and nonadopters coexist, the values of the key model parameters are obtained in closed form from the establishment size distribution. The quantitative analysis has three main results. First, although our calibration 1 This may explain why multiplicity results in this literature invariably come from models with homogeneous firms. In contrast, firm heterogeneity is essential for our quantitative strategy. 3 targets the moments from the establishment size distribution but not the income level of either country, the model generates nearly as large an income gap between the United States and India—a factor of 7—as in the data, due to the higher adoption costs and degrees of idiosyncratic distortions in India, whose impact is amplified by the complementarity. Second, the United States is in the unique equilibrium region, but India is in the multiplicity region. However, India is found to be in the good (or high adoption/output) equilibrium, and hence coordination failures do not explain why India is poorer than the United States. The final result is that the effect of idiosyncratic distortions on aggregate productivity and GDP can be highly nonlinear. When the complementarity in firms’ adoption decisions is strong enough, multiple equilibria appear above a threshold degree of idiosyncratic distortions, but eventually only the bad equilibrium survives with even more distortions. This implies that, for a heavily distorted economy in a unique bad equilibrium, a reform that reduces the distortions just enough to place it in the multiplicity region gives it a chance to coordinate to the good equilibrium featuring more adoption and vastly higher GDP. More important, even when the model has a unique equilibrium for any degree of distortions, the model elements responsible for the complementarity still amplify the impact of distortions. In an empirically plausible range of distortions, a small reduction in distortions can disproportionately increase adoption and aggregate productivity by several times more than in the standard models without complementarity (e.g., Hsieh and Klenow, 2009; Hopenhayn, 2014; Restuccia and Rogerson, 2017). These positive nonlinear effects with or without multiplicity are the Big Push in distorted economies, and we call this range of distortions the Big Push region. Our quantitative results offer two broader implications. First, the powerful amplification of the impact of distortions through complementarity in our model can help account for the huge income differences across countries. Second, the existence of the Big Push region, where a small reduction in distortions can unleash massive improvements, can be an explanation of why some distortion-reducing reforms are more successful than others. Related literature The idea that underdevelopment can result from coordination failures goes back to Rosenstein-Rodan (1943). It has been formalized by Murphy et al. (1989) in a model with aggregate demand spillovers and by Ciccone (2002) in 4 a model with differences in intermediate input intensities across technologies.2 Some empirical works have applied the idea of multiple equilibria and coordination failures to historical contexts (Davis and Weinstein, 2002, 2008; Redding et al., 2011; Kline and Moretti, 2014; Lane, 2019; Crouzet et al., 2020). The evidence so far is mixed, suggesting that the possibility of multiple equilibria depends on the details of the economic environment, a theme we emphasize. Such advances in the theoretical and empirical literature have not been actively followed by quantitative work with few exceptions. Valentinyi et al. (2000), although a theoretical work, makes the important point that multiplicity is overstated in representative agent models. Using a heterogeneous agent version of the two-sector model of Matsuyama (1991), in which the economies of scale that are external to individual producers cause multiplicity, they show that sufficient heterogeneity restores a unique equilibrium. Graham and Temple (2006) study a representative agent version of a similar two-sector model and find that a quarter of the world’s economies are stuck in a low output equilibrium. Caucutt and Kumar (2008) numerically explore a model in the theoretical literature.3 Relative to these papers, our contribution is to quantitatively analyze a richer, more granular model of coordination failures, bringing together elements emphasized in the theoretical literature and disciplining the analysis with micro-level data. More important, we find that, even in the absence of multiplicity, these model elements amplify the impact of idiosyncratic distortions and policies. Our model builds on widely used models of heterogeneous firms, including those of Hopenhayn (1992) and Melitz (2003). We extend the standard model to incorporate discrete technology adoption choices.4 Our modeling choice is partly motivated by the evidence in Holmes and Stevens (2014), who show wide variations in the size of plants, 2 Krugman (1992) and Matsuyama (1995) review the earlier papers on this topic and the more recent theoretical contributions. Additional examples include Okuno-Fujiwara (1988), Rodrı́guezClare (1996), and Rodrik (1996), which analyze open-economy models of coordination failures. 3 Owens et al. (2018) study a quantitative urban model in which residential externalities cause multiple equilibria at the neighborhood level. Another related literature explores the role of coordination failures in accounting for the Great Recession (Kaplan and Menzio, 2016; Schaal and Taschereau-Dumouchel, 2019) in the tradition of Cooper and John (1988), but this literature abstracts from micro-level heterogeneity. 4 Yeaple (2005) and Bustos (2011) also consider firms’ technology choice decisions, but they either ignore the possibility of multiplicity or make assumptions that happen to guarantee uniqueness. The small- vs. large-scale sector choice in Buera et al. (2011) can also be thought of as a technology choice, but that model also has a unique equilibrium. 5 even within narrowly-defined industries. In our model, small firms producing with the traditional technology coexist with large firms operating the productive modern technology, with the technology choice driven by and magnifying the underlying heterogeneity in firm-level productivity. Another important element of our model is the input-output linkages in the form of roundabout production as in Jones (2011), which helps make firms’ adoption decisions complementary in our model and amplifies the effect of distortions in general. Finally, following Restuccia and Rogerson (2008) and Hsieh and Klenow (2009), we introduce idiosyncratic distortions, which stand in for various types of frictions, including barriers to technology adoption as modeled by Parente and Prescott (1999) and Cole et al. (2016). The interaction between distortions and technology adoption in our model is related to the impact of distortions on productivity-enhancing investment in Bento and Restuccia (2017) and Bhattacharya et al. (2013). Our emphasis is the amplification of the effect of distortions through the complementarity in firms’ adoption decisions, which results in highly nonlinear effects of distortions with or without multiple equilibria. Relative to the distortion literature, our model is unique in its ability to generate large income differences across countries with moderate degrees of idiosyncratic distortions. 2 Setup The economy is populated by a mass L = 1 of workers and measure one of potential firms, each of them producing a differentiated good j.5 Workers supply their labor inelastically and use their labor income to consume a final good. The differentiated goods produced by firms are combined to produce an intermediate aggregate, Z X= η−1 η yj dj η η−1 , η>1, where η is the elasticity of substitution, governing how complementary are differentiated goods in the production of the intermediate aggregate. The intermediate aggregate can be transformed with a linear technology to produce the final consumption good and the intermediate input to be used by firms. Firms are heterogeneous in their productivity z, drawn from a cumulative 5 Thus, j indexes a differentiated good or a particular firm. 6 distribution F (z), with density f (z) = ∂F (z)/∂z. Based on their productivity, firms choose to be active or inactive. An active firm with idiosyncratic productivity z must incur κe units of labor to enter the market and operate. An active firm produces using technology i ∈ {t, m}, labor l, and intermediate aggregate x, according to y=z Ai l1−νi xνi , νi ∈ [0, 1] , (1 − νi )1−νi νiνi where νi is the intermediate input elasticity. Technology m, the “modern” technology, is more productive and intermediate input intensive than technology t, ν m the “traditional” technology. Specifically, we assume that Am / νm (1 − νm )1−νm > At / νtνt (1 − νt )1−νt and νm ≥ νt . The modern technology requires κa units of an adoption good. The adoption good is produced by a competitive fringe in the adoption good sector by combining labor and the intermediate aggregate using a Cobb-Douglas production function. We denote by Pa the price of the adoption good. Finally, firms are subject to idiosyncratic gross output distortions given by τ z −ξ , where ξ ∈ [0, 1] is the elasticity of distortions with respect to productivity, and τ is a budget-balancing scale parameter. With ξ > 0, low productivity firms are subsidized, and high productivity firms are taxed, which is the empirically relevant range in the misallocation literature as we show in Section 5. As is standard, the demand for differentiated good j and the price index of the intermediate aggregate are yj = P pj η Z X and P = pj 1−η 1 1−η . dj We assume the labor market is perfectly competitive with equilibrium wage w. Problem of Intermediate Input Producers An active firm with productivity z producing with technology i earns operating profits πio (z). The firm chooses the price p for its differentiated good and the amount of labor l and intermediate input m required for production. The problem is πio (z) = max τ z −ξ p p,l,x P p η X − wl − P x Ai s.t. z νi l1−νi xνi ≥ y = νi (1 − νi )1−νi 7 (1) P p η X. (2) From the first order conditions of this problem, we obtain expressions for the optimal price and input demands: η w1−νi P νi 1 , η − 1 Ai τ z 1−ξ (1−νi )η+νi η−1 η η P li (z) = (1 − νi ) τ XAη−1 z η(1−ξ)−1 , i η w νi w li (z). xi (z) = 1 − νi P pi (z) = (3) (4) (5) From these expressions, we obtain an expression for the maximized operating profit πio (z) 1 = η η−1 1 η w1−νi η−1 τ η P η(1−νi )+νi XAη−1 z η(1−ξ)−1 , i which is increasing in z provided η − 1/(1 − ξ) > 0. It is useful to analyze the effect of distortions ξ on a firm’s decisions in two steps. First, the firm acts as if having a lower productivity—lowered by a factor z −ξ , as evidenced in equation (3). This effect also translates into an equivalent drop in output (equation 2). Thus, this effect is captured by defining effective productivity as z 1−ξ . Second, as seen in equations (4) and (5), distortions imply a lower elasticity of factor demands and profits with respect to z than in the undistorted case, i.e., η − 1/(1 − ξ) ≤ η − 1. To see this, notice that we can rewrite the productivity term in equation (4) as z (1−ξ)(η−1/(1−ξ)) ; the first term in parentheses corresponds to the productivity reduction effect, while the second one corresponds to the lower demand elasticity resulting from the wedge. Finally, the profits of a firm with productivity z, given the optimal entry and adoption decisions, are π (z) = max inactive,active o {0, max {πto (z) , πm (z) − Pa κa } − wκe } . (6) The entry and adoption decisions are characterized by thresholds ze and za , where ze ≤ za . That is, a firm with productivity z will be active if and only if z ≥ ze , and will adopt the modern technology if and only if z ≥ za . This is because, under the assumption that η − 1/(1 − ξ) > 0, the operating profit πio (z) is increasing in productivity z for i ∈ {t, m}. 8 Problem of Adoption Good Producers A representative competitive firm producing the adoption good takes the adoption good price Pa as given and solves max La ,Xa γγ 1 γ Pa L1−γ a Xa − wLa − P Xa , (1 − γ)1−γ (7) where γ is the intermediate aggregate share in the production of the adoption good. Equilibrium We consider symmetric equilibria where all firms of a given productivity make the same decision. Definition 1. A symmetric equilibrium consists of entry and adoption decisions by differentiated goods producers, factor demands by producers of differentiated and adoption goods, and relative factor prices w/P and Pa /P = (w/P )1−γ such that (i) firms maximize profits and (ii) the markets for labor and intermediate aggregate clear Z za Z ∞ lm (z) dF (z) + (1 − F (ze )) κe lt (z) dF (z) + ze za P γ + (1 − F (za )) (1 − γ) κa = L , w Z za Z ∞ w 1−γ C+ xt (z) dF (z) + xm (z)dF (z) + (1 − F (za )) γ κa = X . P ze za (8) (9) Equation (8) is the labor market clearing condition, and the four terms in the lefthand side are the labor used for traditional technology production, modern technology production, entry costs, and adoption good production. Equation (9) states that the intermediate aggregate used for consumption, traditional technology production, modern technology production, and adoption good production must add up to the quantity produced on the right-hand side. Manipulating the equilibrium conditions, we can characterize the equilibrium by three equations in three variables, ze , za , and P/w, which are shown in the appendix. 3 Understanding Amplification and Multiplicity A firm’s incentives to adopt the modern technology can be affected by other firms’ adoption decisions. We show that the strength of this complementarity has important implications for the amplification of the impact of distortions and policies and also for the possibility of multiple equilibria and coordination failures. 9 Before examining the role that different parameters play in our model, we first consider the complementarity at a more abstract level for the case of a policy that subsidizes technology adoption. Let D (z; a, s) denote the net gains from adoption for a firm with productivity z, when the mass of adopters in the economy is a, and the adoption subsidy is s: o D (z; a, s) ≡ πm (z; a, s) − πto (z; a, s) − (1 − s) P (a) κa , (10) where πio (z; a, s) is the operating profit of a firm with productivity z using technology i, and where we made explicit the dependence on the aggregate variables a and s. The aggregate price index when the mass of adopters is a is P (a). We are assuming that the adoption costs are in units of goods only (γ = 1 in our model). A firm adopts the modern technology if D ≥ 0 and operates the traditional technology otherwise. Consistent with our model, we assume that D is increasing in z, so the adoption decision rule is characterized by a productivity threshold za : that is, a firm adopts the modern technology if and only if z ≥ za , with the corresponding mass of adopters being a = 1 − F (za ), where za satisfies D (za ; a, s) = 0 . (11) We now consider a small change in subsidy, ds. We define the direct effect as the impact on za holding fixed the value of a at the initial equilibrium level: dza ds direct =− Ds (za ; a, s) . Dz (za ; a, s) (12) The total effect of this policy includes the equilibrium response of a and is dza ds total = − Ds (za ; a, s) . Dz (za ; a, s) − Da (za ; a, s) f (za ) The ratio between the total and the direct effect is [dza /ds]total [dza /ds] direct = 1 1− Da (za ;a,s)f (za ) Dz (za ;a,s) , (13) which can be interpreted as a multiplier measuring the degree of the feedback effect. When this ratio is greater than 1, the feedback is positive, an indication of complementary in adoption decisions. The intensity of complementarities depends 10 on the feedback ratio: f (za ) r (za , a, s) ≡ Da (za ; a, s) . | {z } Dz (za ; a, s) | {z } incentive (14) feedback The feedback ratio can be interpreted as the product of two terms that determine the adoption complementarities. The incentive term is the effect of a change in the mass of adopters on the incentives (net gains) for adoption Da (za ; a, s). The feedback term shows how the changes in the net gains from adoption feed back into a change in the adoption threshold, f (za ) /Dz (za ; a, s). The expression 1/Dz (za ; a, s) measures how the productivity of the marginal adopter relates to changes in the incentives to adopt, and f (za ) is the density of firms at the adoption threshold. A positive feedback ratio r(za , a, s) implies a multiplier exceeding one and hence amplification. In addition, the closer the feedback ratio is to one, the higher the multiplier will be. The feedback ratio also plays an important role in the analysis of equilibrium determination and the possibility of multiplicity. An equilibrium for a given s consists of a threshold za and a = 1 − F (za ) satisfying equation (11). In examining the equilibrium, it is useful to consider the following mapping: T (a) = a0 |a0 = 1 − F (za ) and D (za , a, s) = 0 . This function is analogous to a best response map, giving the fraction of adopters T (a) when firms behave optimally in response to the equilibrium prices that prevail with an adoption rate a. An equilibrium is given by a fixed point of this mapping a = T (a) . The slope of this mapping is T 0 (a) = Da (za ; a, s) f (za ) , Dz (za ; a, s) which is precisely the feedback ratio r(za , a, s). The equilibrium is unique if T 0 (a) < 1 for all a. The possibility of multiple equilibria arises if there is a range where T 0 (a) > 1 followed by a range where T 0 (a) < 1, as shown in the left panel of Figure 1. In the right panel, we zoom in on the right-most equilibria of the left panel. We show the effect of a change in the subsidy rate s, breaking it into the direct and the total effects. Note that the condition for multiplicity is stronger than the condition for amplification, r(za , a, s) > 0. In the remainder of this section, we illustrate the role of various model elements 11 The equilibrium mapping T (a) Direct and total effects ! " ! " &'&() !"#$%& " " Figure 1: Equilibrium Mapping and Amplification: An Example in generating amplification and multiplicity using simple examples. 3.1 Simple Model I: Heterogeneity Consider the case with no intermediate inputs in the production of differentiated goods, νi = 0, and an exogenous population of active firms, i.e., no entry margin. Adoption costs are in units of goods only (γ = 1). We further assume that the differentiated goods producers’ productivity z follows a Pareto distribution, z ∈ [1, ∞) ∼ 1 − z −ζ . There is no idiosyncratic distortion. In this case, the net gains from adoption presented in equation (10) are 1 D(z; a, s) = η η−1 η η−1 η−1 z η−1 − (1 − s)P (a)κa , P (a)η X(a) Aη−1 − A m t and the feedback ratio presented in equation (14) becomes r (za , a) = [(1 − η)P a + Xa ] ζ , η−1 (15) where P a = −(∂P/∂a)(a/P ) and Xa = (∂X/∂a)(a/X) are the elasticities of the price index and the aggregate demand with respect to the mass of adopters. We dropped the dependence of the feedback ratio on s, as the subsidy rate does not show up in this particular formulation. As in the general case, the feedback ratio is the product of the incentive and the feedback terms. The incentive term measures the impact of the mass of adopters on the profitability of the marginal adopter. It is a function of the adoption elasticities of 12 the price index and the aggregate demand with respect to the adoption rate, P a and Xa , and the demand elasticity, η. More adoption affects a firm’s profit through two channels. First, it shifts the demand in proportion to the change in the intermediate aggregate X. This is captured by the elasticity Xa . The second channel is through the impact on the price index. This is a negative competitive effect, whose strength is governed by the demand elasticity η. In addition, the decline in the price index has a direct effect on the net gains from adoption by lowering the cost of adoption. The net effect of the price index on the feedback ratio is captured by the term (1 − η)P a in equation (15). The feedback term in equation (15) captures the response of the mass of adopters to the increase in the net gains from adoption for the marginal adopter and depends on the demand elasticity η and the elasticity of the density of the productivity distribution to the mass of adopters ζ. In the feedback term, η − 1 measures the elasticity of the net gains from adoption with respect to productivity z, and its inverse measures the response of the adoption threshold to the changes in the net gains from adoption. The impact of changes in this threshold on the mass of adopters depends on the density of firms at this point, captured by the parameter ζ. To further understand the feedback ratio, we now analyze the adoption elasticities of the price index P a and the aggregate demand Xa : P a = Xa = 1 1 − η−1 ζ η−1 1− a Am η−1 ζ 1− h Aη−1 t Aη−1 m Aη−1 t ! M(a) , 1− η−1 ζ 1− Aη−1 m a η−1 ζ i where M (a) = + ∈ (0, 1) is the / 1−a modern sector’s share of total value added and employment. The demand elasticity η has a negative effect on the adoption elasticities P a and Xa . A higher elasticity of substitution η implies that the additional contribution of the marginal, less productive, adopter will be smaller relative to the contribution of the infra marginal, and more productive, adopters. On the other hand, a more elastic density of the productivity distribution (higher ζ) implies that the productivity of the marginal adopter declines less as the mass of adopters rises. This naturally results in larger elasticities with respect to the mass of adopters.6 In making these arguments, we are holding fixed the values of Aη−1 /Aη−1 and M(a), which also t m depend on η and ζ. One interpretation is that we are implicitly adjusting At /Am and the aggregate adoption rate a by changing the adoption cost κa . 6 13 Using the expressions for the adoption elasticities, we can express the feedback ratio in terms of the model parameters: r (za , a) = 2−η η−1 ζ +1−η η−1 1− Aη−1 t Aη−1 m ! M (a) . (16) For the effect of policies to be amplified, the multiplier in equation (13) must be greater than one. This requires a positive feedback ratio. In this case, the feedback ratio is positive if and only if 2−η >0 , (17) since the last three terms in the right-hand side of equation (16) are strictly positive.7 When η is low, the negative competition effect is small, and thus the positive effect on the gains from adoption of a larger mass of adopters dominates, generating a positive feedback ratio and hence amplification.8 While the existence of amplification depends solely on η, the degree of amplification does depend on ζ, the ratio Am /At , and M(a). Intuitively, a higher density of marginal firms (high ζ) and a larger difference in technology Am /At give larger amplification. Finally, we can obtain a necessary condition for multiplicity, i.e., the condition T 0 (a) > 1. Using the fact that the last two terms in the right-hand side of equation (16) are strictly less than 1, a necessary condition for multiplicity is 2−η > 1 . 1+ζ (18) This implies that the necessary condition for multiplicity is stricter than the condition for amplification—that is, amplification is possible even in the absence of multiple equilibria. Multiplicity requires that complementarity is strong enough to countervail the negative effect of heterogeneity on the density of marginal adopters, as well as that on the elasticities P a and Xa . This may explain why most multiplicity results in the literature come from models with homogeneous producers (ζ → ∞). 7 For aggregates to be finite, ζ + 1 − η must be positive. With limit pricing, as in Murphy et al. (1989), the negative competition effect is inoperative. As a result, amplification and multiplicity do not require restrictions on η. 8 14 3.2 Simple Model II: Idiosyncratic Distortions We now introduce idiosyncratic output distortions (controlled by ξ) into the simple example with heterogeneity above and explore how they affect complementarity and amplification. We retain all other simplifying assumptions: no entry margin, νi = 0, and γ = 1. As we show, distortions have two effects. First, distortions compress the distribution of the effective productivity of firms. In particular, it is as if the tail parameter of the productivity distribution were ζ/(1 − ξ) instead of ζ. Second, distortions create a wedge between the value of output produced by firms and their profitability and hence their factor demand, reducing the elasticity of profits and labor demand with respect to firms’ productivity. With distortions, the feedback ratio is now r (za , a) = [(1 − η)P a + Xa ] ζ/(1 − ξ) . η − 1/(1 − ξ) (19) Comparing the feedback ratio with the one with no distortions in equation (15), we see that the feedback term is adjusted by the distortion parameter ξ. This adjustment can be understood as a more concentrated firm productivity distribution, i.e., one with a tail parameter ζ/(1 − ξ) and a lower elasticity of the net gains from adoption with respect to firms’ productivity. Both adjustments make the feedback term larger, holding fixed the adoption elasticities P a and Xa . We can also derive expressions for the two adoption elasticities: P a = 1−ξ 1 − η−1 ζ 1− 1 ! Aη−1 t Mv (a) , 1−ξ 1 η − 1−ξ ζ − 1−ξ ζ (η − 1) 1− Xa = ηεP a − (η − 1) Aη−1 t Ml (a) εP a , Mv (a) where Mv (a) and Ml (a) are the modern firms’ shares of total value added and employment, respectively.9 The effect of distortions on the elasticity Pa is analogous to what we obtained 9 In particular, Ml (a) = Mv (a1+ξ/ζ ) and v M (a) = Aη−1 a1− t (1−ξ)(η−1) ζ . (1−ξ)(η−1) (1−ξ)(η−1) 1− ζ ζ Aη−1 1 − a1− + Aη−1 m a t 15 in the model with no distortions, adjusted for the compression of the effective productivity distribution with the tail parameter ζ/(1 − ξ). Accordingly, with more distortions, and for a given level of Mv (a), the elasticity of the price index with respect to the mass of adopters is higher. The effect of distortions on the elasticity of the aggregate demand Xa involves two effects. The first effect is the same as the one for P a , since the aggregate demand is a function of the price index. This implies that the compression of the productivity distribution increases both elasticities. The second is the effect that distortions have on the aggregate demand, creating a wedge between firms’ input demand and revenues. Here, holding fixed the modern firms’ shares and P a , an increase in distortions reduces Xa . However, one can see that the net effect of ξ on Xa is positive: The larger the degree of distortions, the more responsive the aggregate demand is to an increase in the mass of adopters. The relative effect of distortions on the two elasticities, P a and Xa , depends crucially on the mass of adopters, a, through its effect on the modern firms’ share of value added and employment, Mv (a) and Ml (a). The modern firms’ share of employment is smaller than that of value added, i.e., l M (a) ≤ Mv (a), with strict inequality when a < 1. This is because distortions introduce a wedge between the value of the output produced by the modern firms and their profitability and hence factor demands. This effect is clearest in the limit with an arbitrarily small mass of modern firms, where the employment share of the modern firms is arbitrarily small relative to their value added share, i.e., lima→0 Ml (a) /Mv (a) → 0, as the modern firms are infinitely productive and hence face arbitrarily large distortions. Here, the elasticity of the price index is smaller than that of the aggregate demand, P a < Xa . At the other extreme where the mass of adopters is close to one, we have Ml (a) /Mv (a) ≈ 1, and the elasticity of the price index is larger than that of the aggregate demand, P a > Xa . Using the expressions for the adoption elasticities, we rewrite the feedback ratio: r (za , a) = l l ξ M (a) − η v −1 η−1 ζ +1−η M (a) ! ζ +1−η Aη−1 t × 1 − η−1 Mv (a) . η(1 − ξ) − 1 Am M (a) 1 − (η − 1) M v (a) (20) Again, a multiplier exceeding one (i.e., amplification) requires r (za , a) > 0, which 16 is equivalent to 1 − (η − 1) Ml (a) ξ(η − 1) > Mv (a) ζ +1−η Ml (a) η v −1 . M (a) (21) This condition is automatically satisfied when the mass of adopters is small, a ≈ 0, because then Ml (a) /Mv (a) ≈ 0 and ζ +1−η is assumed to be positive for aggregates to be finite. Thus, with idiosyncratic distortions, an economy with high adoption costs (i.e., a small mass of adopters) will feature a multiplier greater than one for adoption subsidies. To the contrary, distortions tend to make amplification less likely when the mass of adopters is large, a ≈ 1, since condition (21) is stronger than condition (17) when Ml (a) /Mv (a) ≈ 1. How do distortions affect the necessary condition for multiple equilibria? As before, because multiplicity requires T 0 (a) > 1, the condition for multiplicity is stronger than that for amplification. From equation (20), holding fixed Mv (a), more distortions relax the condition for multiplicity. In the limiting case of lima→1 Ml (a)/Mv (a) = 1, the necessary condition for multiplicity is 2−η > 1 1+ ζ 1−ξ , which is weaker than condition (18) for ξ > 0. How ξ reduces the denominator of the right-hand side of equation (19) is the dominant force. When a < 1, we have Ml (a)/Mv (a) < 1, which further relaxes the necessary condition for multiplicity. 3.3 Discussion of the Role of Additional Elements In this section, we briefly describe two additional elements of the model. We first discuss the role of intermediate input in production. The use of intermediate input strengthens the role of complementarities in adoption, making both amplification and multiple equilibria more likely. We then describe the role of the entry margin and discuss its relevance for the size of firms in the data. Intermediate Input in Production. One important difference between our benchmark quantitative model and the simple models explored in Sections 3.1 and 3.2 is the use of differentiated goods. While in the simple models differentiated goods are used only as intermediate input for the production of the final good (to be 17 used for consumption and adoption costs), in the quantitative model an aggregated quantity of the differentiated goods is also used as an intermediate input in the production of the differentiated goods themselves (i.e., roundabout production). The importance of intermediate input in production is governed by the intermediate input intensity in the production of the adoption good, γ, and the intermediate input intensity of the traditional and the modern technologies in the production of the differentiated goods, νt and νm , respectively. In the simple examples, we set the intensity of the intermediate aggregate in the production of the adoption good to be equal to one, i.e., γ = 1. If γ = 0 in the simple examples, the effect of policies would be dampened by a negative equilibrium feedback, and neither amplification nor multiplicity would arise. A higher γ leads to stronger complementarity: More adoption reduces the adoption costs by more when a larger fraction of the costs is in units of goods rather than labor. The same logic applies to the intensity of the intermediate aggregate in the production of the differentiated goods themselves. Consider first the case where the intermediate input intensity in the production of the differentiated goods is the same for both the traditional and the modern technologies: νt = νm = ν. The use of the intermediate aggregate as an input in the production of the differentiated goods strengthens complementarities and the equilibrium feedback for two reasons. First, more adoption now reduces the price of firms’ intermediate input and hence their production costs, partly offsetting the negative competition effect on firms’ profits, which in turn relaxes the restrictions on the elasticity of substitution across differentiated goods η. For instance, when ν = 1 in the simple examples, the necessary condition for amplification and multiplicity is always satisfied for any γ ∈ [0, 1], independently of η. Second, as in the simple examples, the strength of the net effect of more adoption on the gains from adoption depends on η. When the intermediate aggregate is itself an input in the production of differentiated goods, the strength of the net effect also depends on ν, through a standard intermediate input multiplier 1/ (1 − ν), as in Jones (2011). When the modern technology uses intermediate input more intensively, that is, νm > νt as in our quantitative model, there is an additional positive effect of more adoption on firms’ incentive to adopt the modern technology. More adoption reduces the price of the intermediate input relative to labor, raising the relative profitability of the technology that uses the intermediate input more intensively. In this case, 18 amplification and multiplicity can arise even when the production of the adoption goods requires only labor (γ = 0), which shares some similarity with the result in Ciccone (2002). In summary, the goods share of the adoption costs and the intermediate input intensity in the differentiated goods production both contribute to amplification and multiplicity, but neither is absolutely necessary for such outcomes. Entry Margin. Relative to the simple models explored above, our quantitative model also has the firm entry margin: A firm must pay a fixed labor cost κe > 0 to become active. If firms’ productivity follows a Pareto distribution, the entry margin is summarized by the productivity of the marginal entrant ze , which is given by " " (η − 1) ζ 1+ ze = max 1, 1 + ζ − (η − 1) Am At η−1 ! −1 a 1− η−1 ζ ## 1 ζ 1 ζ κe . (22) The productivity of the marginal entrant ze and the average size of firms zeζ are an increasing function of the fraction of adopters a, the cost of entry κe , and the ratio of the productivity shifter of the modern to tradition technology, Am /At . 4 Identification of Model Parameters Section 3 shows that several features of the model can generate multiple equilibria. In general, parameter identification is not granted when a model features multiple equilibria (see, for example, Jovanovic, 1989), because the mapping from the data to the model parameters may not be unique. In this section, we construct an argument that allows us to uniquely identify the parameters of the model despite potential multiplicity. The key assumption for our identification strategy is that traditional and modern firms coexist in the data. However, the strategy does not presuppose multiplicity. Rather, once the parameters are uniquely identified from the data, we check whether or not the model has any other equilibrium for those parameter values. To keep things as simple as possible, we provide our constructive argument for the case where the intermediate input elasticity is the same for the modern and the traditional technology, i.e., νt = νm = ν, and the adoption good production only uses intermediate input (γ = 1). In addition, following the common practice in the literature, we maintain the assumption in Section 3 that firm-level productivity 19 distribution F is Pareto with a tail parameter ζ. We set the elasticity of substitution across differentiated goods η from outside of the model using estimates common in the literature on demand or profit share estimation. We also assume that there is no distortion ξ = 0. However, our identification strategy also holds without these parameter restrictions. Our goal here is to identify the following six parameters: The technology parameters At and Am , the entry and adoption costs κe and κa , the parameter of the productivity distribution ζ, and the intermediate input elasticity ν. We normalize the productivity of the modern technology to one, Am = 1. This leaves us with five parameters. For identification, we rely on the size distribution of establishments in the data G(l), where size is defined as the number of employees as well as the intermediate input share in the data. We begin by showing that, for a given η, we can identify the intermediate input elasticity ν directly from the intermediate input share in the data: ν= η × intermediate input share. η−1 (23) The other four parameters are identified from the establishment size distribution in the data. In particular, we rely on the implications of the theory for the relationship between the log of employment (log l) and the log of the fraction of establishments with size larger than l as illustrated in Figure 2. We hereafter refer to this relationship simply as the log-log relationship. Our identification strategy relies on the three thresholds in the figure, which must exist under the assumption that both traditional and modern technologies are used in the economy: (i) The size of the smallest entrant lt ≡ lt (ze ), (ii) The size of the largest establishment using the traditional technology ¯lt ≡ lt (za ), and (iii) The size of the smallest establishment operating the modern technology lm ≡ lm (za ). From the slope in the right tail of the log-log relationship, i.e., l > lm , we identify the tail parameter of the productivity distribution ζ for a given η: ζ d log (1 − G(l)) d log (1 − G(l)) =− ⇒ ζ = −η . η d log l d log l (24) Given the value of the intermediate input elasticity ν, the size of the smallest 20 log lt = log [(η − 1)(1 − ν)κe ] log ¯lt log lm log l −ζ log zzae log fraction w/ emp. > l (η − 1) log AAmt − ηζ Figure 2: Identification from the Establishment Size Distribution establishment is a simple function of the entry cost, pinning down κe : lt = [(η − 1)(1 − ν)κe ] ⇒ κe = lt . (η − 1)(1 − ν) (25) The theory implies that there should be a gap in the size distribution of establishments, if both the modern and the traditional technologies are operated in the economy.10 In particular, there should be no establishment larger than the largest establishment using the traditional technology ¯lt but smaller than the smallest establishment operating the modern technology lm ; i.e., G(l) = G (lm ) = G ¯lt for l ∈ ¯lt , lm . The difference between these two employment levels is a function of the relative productivity of the two technologies, Am /At , which, given the knowledge of η and the normalization of Am = 1, identifies At : log lm − log ¯lt = (η − 1) log Am At 1 ¯ η−1 lt ⇒ At = . lm Finally, to identify the adoption cost κa , we use the equilibrium condition relating the ratio of the two thresholds to the ratio of the adoption and the entry costs: κe 10 za ze η−1 = G ¯lt P w Aη−1 m Aη−1 t −1 κa ⇒ κa = − η−1 Aη−1 m ζ Aη−1 t −1 P w This is akin to the concept of missing middle in Tybout (2000). 21 κe . 5 Calibration We use data from the United States and India to calibrate the parameters of the model. We think of the United States as an economy that is distortion-free (ξ = 0) and therefore informative about the tail parameter of the firm productivity distribution ζ, among others. India is a large developing economy with relatively good data availability and is informative about the degree of idiosyncratic distortions ξ in particular. This choice is consistent with the evidence on the relationship between productivity and idiosyncratic distortions in the United States and India in Hsieh and Klenow (2014, p.1059). For our benchmark model without the simplifying assumptions of Section 3 or 4, the following eleven parameters need to be calibrated: the elasticity of substitution among differentiated goods η; the share of intermediate input in the production of the adoption good γ; the intermediate input elasticity of the modern and the traditional technology νm and νt ; the productivity levels of the modern and the traditional technology Am and At ; the Pareto tail parameter of the firm productivity distribution ζ; the entry and the adoption costs κe and κa ; and finally the degree of idiosyncratic distortions ξ and the budget-balancing scale parameter τ . Six of the eleven parameters are assumed to be the same for the United States and India: η, γ, νm , νt , Am , and ζ. Four of these six are fixed outside of the model. We maintain the normalization of Am = 1. We set η = 3, which is considered to be on the lower side, as discussed in Hsieh and Klenow (2009).11 As in Section 4, we set γ = 1, i.e., the adoption good production uses intermediate goods but not labor. In addition, we set νt = 0, so that labor is the only input of the traditional technology. This last assumption maximizes the difference between the two technologies’ intermediate input elasticity, νm and νt . Our choices of η, γ, and νt make amplification and multiplicity more probable as explained in Section 3, but the conclusions from our quantitative analysis do not rest on these assumptions. In Section 6.3, we show the result with γ = 0—i.e., labor is the only input of the adoption good production—and with νt = νm . Appendix B reports a sensitivity analysis with different values of η and νt . One of the remaining common parameters, νm , is then calibrated to match the 11 In our simple example of Section 3.1, the necessary condition for amplification and multiplicity was η < 2. However, as explained in Section 3.3, with intermediate input in the production of the differentiated goods, the restriction on η is relaxed. 22 intermediate input share in the United States data, yielding νm = 0.70.12 The other, the tail parameter of the productivity distribution, is calibrated to match the tail of the establishment size distribution for the United States in the Census Bureau’s 2007 Business Dynamics Statistics (BDS), giving ζ = 2.42 under the assumption of no distortion (ξ = 0 and τ = 1). We now describe how we calibrate the five parameters that differ between the United States and India: two distortion parameters (ξ, τ ) and three technology parameters (At , κe , κa ). Following the procedure in Section 4, we calibrate these country-specific parameters from their establishment size distribution only. In particular, we do not use any information on the level of income or productivity in either country. Since there is a priori no tight relationship between a country’s establishment size distribution and its income level, it is an open question how the model-predicted income gap between the United States and India will measure up to the data. Since we assume that the United States has no distortions, the United States calibration has ξ = 0 and τ = 1. For India, given the common tail parameter of the productivity distribution ζ calibrated to the United States data, we calibrate its degree of distortions ξ to match the tail of the establishment size distribution, utilizing ∂ log(1 − G(l)) ζ =− , η (1 − ξ) − 1 ∂ log l (26) which is the distortion-augmented version of equation (24). The calibration of the other three country-specific parameters, At , κe , and κa , closely follows the procedure in Section 4: They are chosen to match many features of the empirical establishment size distribution (the log-log relationship) for each country. The empirical moments are chosen to capture the nonlinearities of the loglog relationship in Figure 2. For the United States we use eight points from the empirical log-log relationship and 26 points for India. Figure 3 is the log-log plots of the establishment size distribution from the calibrated model for the United States and India, together with the empirical log12 The intermediate input share in the United States in 2007 was 0.46, calculated from the BEA input-output tables. Because we assume that the traditional firms use no intermediate input (νt = 0), the intermediate input share of the modern firms has to be 0.47 in order for the share in the entire η as in (23), we obtain νm = 0.70. economy to be 0.46. Multiplying 0.47 by η−1 23 United States India Note: The source of the U.S. data is the 2007 Business Dynamics Statistics. For the data for India, we combine the 2003 National Sample Survey and the 2005 Economic Census. See Buera et al. (2020) for details. Figure 3: Establishment Size Distribution: Model and Data log relationship. The United States data comes from the 2007 BDS. To produce the figure for India, we use the size distribution of establishments for the entire economy in India constructed by Buera et al. (2020), who combine data from the Fifth Economic Census in 2005 and the 2003 Survey of Land and Livestock Holdings carried out in the fifty-ninth round of the National Sample Survey (NSS). The Census has comprehensive information for all entrepreneurial units, excluding agriculture. The NSS provides information on employment by productive units in agriculture. In order to obtain an accurate establishment size distribution for the entire economy in India, it is crucial to account for its agricultural sector, which accounted for 57 percent of the total employment in 2004.13 As shown in the left panel, the model calibrated to the United States (solid line) generates a flat region that is the size gap between firms using the traditional technology and those using the modern technology, in order to match the concavity of the log-log relationship in the data (circles) for small establishments. The vertical location of the flat region shows that roughly half of all establishments use the modern technology. The calibration for India in the right panel also shows both the traditional and the modern technology in use in equilibrium, separated by a flat region (solid line). However, a much larger fraction of firms uses the 13 By comparison, the agricultural employment share was only 1.4 percent in 2007 in the United States. 24 Parameter U.S. Elasticity of substitution, η Intermediate aggregate share in adoption good production, γ Productivity distribution Pareto tail parameter, ζ Modern technology productivity, Am Modern technology intermediate input elasticity νm Traditional technology intermediate input elasticity, νt Entry cost, κe Traditional technology, At Adoption cost, κa Degree of distortions, ξ Distortion scale parameter, τ India 3 1 2.42 1 0.70 0 0.50 0.43 16 0 1 0.50 0.07 272 0.19 2.14 Table I: Calibrated Parameters traditional technology in India, as evidenced by the fact that less than 1 percent (0.25 percent, to be precise) of establishments are to the right of the flat region in the calibrated economy. The calibrated model captures the conspicuously flat region over intermediate establishment sizes in the data for India (dots): a missing middle.14 In Table I we report the calibrated parameters. Some remarks are in order. First, the United States and India have the same entry cost κe , which is identified from the size of the smallest establishment using (25). Because the smallest establishment is of the same size in both countries (one employee) and we assume that η and νt are the same for both countries, so is κe . However, this does not mean that the entry rate of firms is the same in the two countries. In fact, as shown in Table II, the fraction of firms that enter in India is three times that in the United States. Second, the traditional technology parameter At for the United States is six times that for India. Since both countries have the same productivity level of the modern technology Am by assumption, the technology gap between the modern and the traditional technology is six times as high in India. Third, the cost of adoption is seventeen times higher in India. The cost of adoption in India must be higher in order to rationalize the minuscule fraction of firms adopting the modern technology in spite of the enormous productivity gains from doing so.15 When measured in units 14 This does not contradict Hsieh and Olken (2014), who do not find a missing middle in the manufacturing sector in India. Buera et al. (2020) find a missing middle when studying the broad cross-section of establishments, and within narrow sectors, e.g., the agricultural sector. 15 The high adoption cost can be viewed as standing in for other inhibitors of technology adoption that are not explicitly modeled in our theory, such as the shortage of skilled labor necessary for using the modern technology, financial constraints, and bureaucratic or anti-competitive barriers to 25 Gross domestic product per capita Consumption per capita Average establishment size Fraction of entrants Fraction of entrants that adopt Am Employment share of Am firms Value added share of Am firms U.S. India 4.34 3.92 19.0 0.05 0.50 0.96 0.98 0.66 0.54 5.7 0.17 0.003 0.58 0.81 Table II: Statistics from the Calibrated Economy of labor (P κa , since γ = 1), the cost of adoption in India is over eighty times higher, because the (endogenous) price of the intermediate aggregate is five times higher in India. Finally, the calibration for India exhibits significant idiosyncratic distortions, as given by ξ = 0.19. The tail of the establishment size distribution in India is thinner, and hence the log-log relationship in the right tail is steeper. Equation (26) pins down ξ.16 The implied firm-level revenue taxes and subsidies are not extreme. As we show in Figure 12 in the appendix, the firm at the top 10−4 percentile of the active firm productivity distribution is taxed at about 40 percent, while the maximum subsidy is 80 percent for the least productive firm with z = ze . At face value, some of the calibrated parameters in Table I seem contradictory to the fraction of entrants reported in Table II: The United States exhibits less firm entry than India in spite of the substantially lower cost of adopting the modern technology and the significantly better traditional technology. However, as explained in Section 3, when more firms adopt the modern technology, they crowd out less productive firms through the negative competition effect as well as through higher wages. That is, the general equilibrium effects from more firms adopting the modern technology are responsible for the lower entry rate in the United States. In spite of the lower entry rate in the United States, the significantly higher rates of modern technology adoption means that the United States GDP per capita is nearly seven times that of India. This last result is a success for the model. Even though the calibration is based on the difference in the establishment size distribution between the United States and adoption. 16 A back-of-the-envelope calculation of the ξ for India from Hsieh and Klenow (2014, p.1059) gives 0.4. One possible interpretation is that our model elements amplify the impact of distortions so that a lower degree of idiosyncratic distortions can match the data for India. 26 India and does not use any information on the income or productivity gap between the two, the model generates a huge income gap. In the Penn World Tables, the GDP per worker of India is 6 percent of the United States level in 2005 while in the model it is 15 percent. That is, the model accounts for 73 percent of the United States-India income gap.17 6 Quantitative Exploration In this section, we explore quantitatively the role of the various model elements— mechanisms and parameter values—in amplifying the impact of distortions, generating coordination failures, and ultimately accounting for the United StatesIndia income gap. We also discuss what a Big Push in distorted economies is. 6.1 Multiple Equilibria and the United States-India Income Difference The calibrated United States economy has a unique equilibrium, but the economy in India features multiple equilibria. We show this by examining the net gains from adopting the modern technology for the marginal adopter with productivity za o πm (za ) − πto (za ) − P κa , which must be 0 in an equilibrium. Figure 4 shows this object for the United States and India. As we vary za , the price of the intermediate aggregate and the wage adjust to clear markets. For the United States (solid line), the net gains are monotonically decreasing in the fraction of adopters (and hence increasing in the productivity of the marginal adopter za ) and intersect the zero line once. This intersection is the unique, stable equilibrium. For India (dashed line), the net gains cross the zero line three times, twice from above and once from below. The leftmost intersection is the stable, low adoption or “bad” equilibrium, and the rightmost one is the stable, high adoption or “good” equilibrium. The one in the middle is unstable. Our calibration selects the good equilibrium to 17 We use the output-side real GDP at chained PPPs and the number of persons engaged from the Penn World Tables 9.0. The income gap in the data is a factor of 16.7 and in our model 6.7. Since 16.7 ≈ 6.7 × 2.5, the model explains 6.7/(6.7 + 2.5) × 100 ≈ 73 percent of the actual income gap. 27 Note: The figure shows, for the U.S. and India, the gains for the marginal adopter from operating the modern technology, net of adoption costs. Figure 4: Net Gain of the Marginal Adopter, United States and India match the data for India. That is, despite equilibrium multiplicity, coordination failures do not explain the observed income difference between the United States and India. As we show in Section 6.2, if firms in India were to fail to coordinate and get trapped in the bad equilibrium, India’s GDP would further shrink by a factor of 4. That is, the United States-India income gap could have been a factor of 28 rather than 7 in Table II. If coordination failures do not account for the income difference between the United States and India, then what does? The two countries in our model have different productivity of the traditional technology At , adoption costs κa , and distortions ξ, all identified only from their establishment size distribution.18 In Table III, we calculate the contribution of each of these elements to the United States-India gap in per capita consumption. To do so, we compute the hypothetical aggregate consumption of the United States by replacing one of the parameters with its value in the calibration for India, holding all others constant. This result is in the first column, where we replace κa , ξ, and At one by one. In the second column, we do the reverse: Starting from the calibration for India, we replace one of the parameters with 18 The entry cost is also country-specific, but the calibrated κe ’s for the United States and India coincide. As ξ changes, τ adjusts to balance the budget. 28 U.S. w/ India Parameters India w/ U.S. Parameters Benchmark 1.0 0.14 Adoption cost, κa Degree of distortions, ξ Traditional technology, At 0.37 0.41 1.03 0.71 0.34 0.19 Table III: Explaining Consumption Difference its value in the United States calibration. All per capita consumption is normalized by the United States level in the benchmark calibration. The first row shows that the model generates a factor of 7 difference between the United States and India consumption (=1/0.14).19 Starting from the United States calibration, we see that the adoption cost difference has the largest impact: Giving the United States the high adoption cost of India shrinks the United States consumption by a factor of 2.7 (=1/0.37). The role of idiosyncratic distortions is of a similar magnitude: Introducing idiosyncratic distortions of the proportions of India (ξ = 0.19) to the United States economy reduces the consumption by a factor of 2.5 (=1/0.41). The last row of the first column shows that, if we replace the traditional technology productivity At of the United States with the lower value from India, the United States consumption actually rises modestly. This is because the very low At leads to more adoption of the modern technology.20 The same set of counterfactual exercises for India in the second column leads to similar conclusions, although now adoption costs play a more important role. Giving India the much lower adoption cost of the United States while holding all other parameters constant results in a five-fold increase in consumption, which is much larger than the factor of 2.7 in the first column: The rise in adoption caused by the lower adoption costs represents a larger increase in productivity when the traditional technology is less productive as in India. Eliminating idiosyncratic distortions in India raises consumption by a factor of 2.4 (=0.34/0.14), which is nearly identical to the result in the first column (2.5). Finally, replacing the traditional technology At with the higher United States value has a modest positive effect on consumption in India. The higher At nearly doubles the number of firms but further discourages the adoption of the modern technology. 19 The gap in consumption is slightly larger than the gap in output, because entry costs and adoption costs are a larger fraction of the output in India than in the United States. 20 In this case, the number of entrants is nearly halved, but all the entrants adopt the modern technology. 29 Note: The figure shows, for the U.S. (cross) and India (dot), the combination of ξ and At for which multiple equilibria exist. The larger cross and dot correspond to the calibrated U.S. and India, respectively. Figure 5: Region of Multiple Equilibria, (ξ, At ) Space To summarize, the model nearly replicates the large income gap between the United States and India in the data, without directly targeting the income or productivity level of either country. Coordination failures turn out not to be part of the story despite the existence of multiple equilibria, and adoption costs and distortions explain most of the income gap. 6.2 Multiplicity, Amplification and Distortions In this section, we further explore the role of adoption costs κa , relative technology productivity Am /At , and idiosyncratic distortions ξ. We first identify the set of these three parameter values that generates multiple equilibria, holding fixed the other parameters as calibrated. Second, we show how per capita income and the average size of firms change with the idiosyncratic distortion ξ. This exercise showcases the potentially huge effect of distortions with or without equilibrium multiplicity. Region of Multiplicity Figure 5 shows, for a low adoption cost economy (the United States, cross) and a high adoption cost economy (India, dot), the combination 30 of the distortion parameter ξ and the traditional productivity At that generates multiple equilibria. (Recall that Am for both is normalized to 1.) We hold all other parameters fixed at their respective calibrated values, except that we adjust τ so that the budget balances. The larger cross and dot represent the calibrated United States and India economies, respectively. We see that India is in the region of multiplicity but the United States is not. Multiple equilibria arise for economies with both high degrees of distortions and unproductive traditional technology, toward the lower right corner of the figure. One interesting result is that, holding fixed the productivity of the traditional technology At , as we increase idiosyncratic distortions ξ (moving horizontally), we enter and then exit the region of multiplicity. To the right of the region, the only equilibrium is the one with nearly no adoption. Similarly, holding fixed the degree of idiosyncratic distortions, as we lower the productivity of the traditional technology At (moving downward), we enter and then exit the region of multiplicity, although it is hard to see this for the high adoption cost case (India, dot). The unique equilibrium with At close to 0 has a small number of entrants, nearly all of whom adopt the modern technology: With a useless traditional technology, entry also implies adopting the modern technology, which effectively raises the cost of entry and results in few entrants. Finally, the region of multiple equilibria is smaller for the high adoption cost economy. In this case, the model features a unique equilibrium with few adopters in most of the (ξ, At ) space. However, multiplicity can occur for smaller degrees of idiosyncratic distortions (as low as ξ = 0.17) than in the low adoption cost economy. Nonlinear Impact of Distortions We now explore the role of distortions in generating large differences in income levels with or without multiple equilibria. We start with the United States and the calibration for India, and vary the degree of distortions ξ, holding fixed the other parameters at their respective calibrated values, except that τ adjusts to balance the budget. In addition, since there is no multiplicity for any ξ in the United States calibration, we also consider a modified United States case that has the lower traditional technology productivity At of India. In Figure 6 we show the equilibrium consumption per capita as we vary the degree of distortions ξ for the United States (left panel) and India (right panel). The consumption per capita in the vertical axis (log scale) is normalized by the per capita consumption in the United States calibration. 31 United States India Note: Equilibrium consumption per capita as ξ goes from 0 to 0.5. Consumption is normalized by its level in the U.S. calibration and in log scale. The dotted line in the left panel is the no-multiplicity result of the United States. For the modified U.S. and the India cases, the solid lines are the high adoption equilibrium (low adoption threshold za ) and the dashed lines are the low adoption equilibrium (high za ). Figure 6: Distortions and Consumption per Capita For the United States, the equilibrium is unique for any value of ξ (dotted line). There are two notable features. First, the impact of distortions is large, reducing consumption by nearly 90 percent for large values of ξ. Second, for intermediate values of ξ, small changes in the degree of distortions have a highly nonlinear effect on consumption. That is, even without multiplicity, distortions can have an amplified impact. We discuss these features more rigorously in Section 6.3. We now turn to the modified United States case (i.e., with India’s At ) in the left panel and the India case in the right panel, both of which exhibit multiplicity. When the distortions are small, the equilibrium is unique in both countries. The solid line is the per capita consumption in the equilibrium with a higher fraction of adopters, or the good equilibrium. As we increase distortions, a second equilibrium, one with a small fraction of adopters emerges (dashed line). This is the bad equilibrium. For India, both equilibria exist over a short interval of intermediate values of ξ, above which only the bad equilibrium survives. Tracing either the good (solid line) or the bad (dashed line) equilibrium, idiosyncratic distortions have moderate to large effects on consumption. The effect is even larger, however, since distortions can make the economy jump between the two lines. Near the boundaries of the region of multiplicity, the effect of distortions are extremely disproportionate. Once the economy enters the multiplicity region from 32 United States India Note: Average firm size (the number of employees, in log scale) as ξ goes from 0 to 0.5. The dotted line in the left panel is the no-multiplicity result of the U.S. For the modified U.S. and the India cases, the solid lines are the high adoption equilibrium (low adoption threshold za ), and the dashed lines are the low adoption equilibrium (high za ). Figure 7: Distortions and Average Firm Size left, coordination failures can send the economy to the bad equilibrium. On the other hand, even without better coordination, a small reduction in distortions can push the economy from the bad equilibrium region to the unique good equilibrium region, which discontinuously increases consumption. This happens around ξ = 0.3 in the modified United States case and around ξ = 0.2 in the India case. Our calibrated India economy is in the good equilibrium with ξ = 0.19 near the end of the solid line. If ξ were to rise past the narrow interval of multiplicity, its per capita consumption will shrink by a factor of 5, without coordination failures playing any role. These results highlight the potentially disproportionate gains from reducing idiosyncratic distortions. Multiplicity is an extreme form of amplification. However, as the no-multiplicity United States case shows (dotted line), even without multiplicity, our model elements that are responsible for complementarity in adoption decisions amplify the impact of distortions to a magnitude unattainable in conventional models, a statement we make precise in Sections 6.3 and 6.5. Figure 7 illustrates the corresponding effect of distortions on the average size of firms, measured by the number of employees. The vertical axis is in log scale. For the modified United States and the India cases, the solid lines trace the average size of firms in the good equilibrium. In this equilibrium, there are fewer entrants but many of them adopt the modern technology, resulting in large firms. As distortions get bigger, there is more entry but less adoption, bringing down the average firm 33 size, which is more pronounced in the high adoption cost economy (right panel). The dashed lines, which appear when distortions are high enough, are the average firm size in the bad equilibrium. In this equilibrium, the number of entrants is large, but very few adopt the modern technology, which implies that firms are small on average, in line with equation (22).21 6.3 Unpacking the Mechanisms Our model combines several elements whose importance in explaining cross-country income differences has been studied in isolation in the literature. In this section, we illustrate the role of each element, in comparison with the findings in the literature. In Section 6.3.1, we examine how the model elements amplify the impact of idiosyncratic distortions. In Section 6.3.2, we analyze which elements are responsible for the large income gap between the United States and India that our calibrated model generates. 6.3.1 Nonlinear Impact of Idiosyncratic Distortions We start with the basic model in the distortion literature that abstracts from technology choices and input-output linkages, as in Restuccia and Rogerson (2008) for example. Next, we consider a case with a technology choice but without intermediate input or roundabout production, which is our rendition of Bento and Restuccia (2017). We then introduce input-output linkages in the form of roundabout production but remove the technology choice. This is our adaptation of Jones (2011). The fourth one has a technology choice and roundabout production as in our benchmark model, but with the modification that the adoption costs are in units of labor only instead of goods. Finally, we consider a case with technology choices, roundabout production, adoption costs in goods, except that both the modern and the traditional technologies have the same intermediate input intensity (νt = νm ), unlike in our benchmark with νt νm . For each model, we recalibrate the parameters to the same set of target moments as in our United States benchmark (Section 5) and calculate the effect of idiosyncratic 21 The modified United States case uses India’s At , which is smaller than its value in the United States calibration. As a result, the modified United States with no distortion (ξ = 0) has more adopters and a larger average firm size than the benchmark calibration (dotted line). The direct negative effect of the traditional technology productivity on the average firm size can be seen in equation (22). 34 Consumption Adoption Note: The figure presents the equilibrium consumption per capita and fraction of adopters in the U.S. as ξ goes from 0 to 0.5. Consumption is normalized by the consumption in the recalibrated, no-distortion U.S. economy and is in log scale. Figure 8: Unpacking the Mechanisms distortions. Below we present the results for the United States calibration of each model, which does not feature multiple equilibria for any degree of distortions ξ. The left panel of Figure 8 shows the effect of distortions (ξ) on consumption per capita, and the right panel shows the effect on the fraction of adopters. The solid line reproduces the effect of distortions in our benchmark economy (the dotted line in the left panel of Figure 6). It confirms that idiosyncratic distortions have a large negative effect on aggregate consumption, which is particularly pronounced around ξ = 0.2, close to its value in the calibration for India (ξ = 0.19). Consumption is down by 60 percent, and the fraction of adopters collapses to nearly 0 at ξ = 0.2. The first comparison model we consider is the basic model in the distortion literature that abstracts from intermediate inputs and technology adoption (νt = νm = 0 and At = Am = 0.69, with recalibration), shown by the dashed line. This specification should be considered the polar opposite of our benchmark model. For this model, consumption falls almost linearly with the distortion parameter ξ in the semi-log scale and by much less than in the baseline model. At ξ = 0.2, consumption goes down by less than 20 percent from its no-distortion level. Even with ξ = 0.5, the loss in consumption is only 30 percent. Next, the dotted line introduces technology adoption to the basic model but without intermediate input. Consistent with the literature, for example, Bento and Restuccia (2017), introducing the technology adoption by itself makes the 35 effect of distortions on consumption only marginally bigger and only at extreme degrees of distortions (ξ near 0.5): The dotted line and the dashed line are nearly indistinguishable in the left panel, and the reduction in consumption at ξ = 0.2 is again less than 20 percent. The dashed line with triangles instead adds roundabout production (intermediate input) to the basic model, but with no technology choice. Roundabout production more than doubles the effect of distortions on consumption, which decreases by almost 40 percent as ξ goes to 0.2. However, the effect here is nearly linear in the semi-log scale with respect to ξ and are still significantly smaller than those in our benchmark. The solid line with squares is a modified benchmark. It has the technology choice and roundabout production, but the adoption costs are in units of labor only, i.e., γ = 0 instead of γ = 1. We see that the effect of distortions on consumption is smaller than, yet comparable to, that in the benchmark, except for intermediate values of ξ between 0.2 and 0.3. The same is true for the impact of distortions on the fraction of adopters in the right panel. At ξ = 0.2, consumption is about 50 percent lower than in the no-distortion case (compared to 60 percent in the benchmark), and about 20 percent of active firms adopt the modern technology (compared to nearly 0 in the benchmark). The difference between the γ = 0 and the γ = 1 cases shows the quantitative relevance of the feedback effect of adoption on the price of the adoption goods, as discussed in Section 3.1. Finally, the solid line with circles is the modified benchmark case where the two technologies have the same intermediate input intensities, νt = νm = 0.69, instead of νt νm . (We reinstate γ = 1.) The effect of distortions on consumption and technology adoption is more measured than in the benchmark until ξ becomes large enough (ξ > 0.35). This highlights another feedback mechanism operating in the benchmark model: As more firms adopt, the lower is the price of the intermediate goods relative to labor, and therefore the higher the profitability of the modern technology that uses the intermediate input more intensively. Because this feedback mechanism is absent in this modified model, the negative effect of distortions on adoption and consumption is smaller than in the benchmark. At ξ = 0.2, consumption is 40 percent lower than in the no-distortion case, and nearly a quarter of active firms adopt the modern technology. On the other hand, when distortions are large enough that the fraction of adopters approaches 0 (ξ > 0.35), the negative effect on consumption is considerably larger than in the benchmark that has νt = 0. This 36 At κa Pa κa GDP p.c. Ratio Case 1: ν = 0, no adoption U.S. India 1 1 0 0 0 0 1.89 1.62 1 0.86 Case 2: νt = νm = 0, γ = 0 U.S. India 0.54 0.28 2.17 321 2.17 321 1.64 0.83 1 0.51 Case 3: ν = 0.69, no adoption U.S. India 1 1 0 0 0 0 5.33 3.37 1 0.63 Case 4: Benchmark U.S. India 0.43 0.07 15.9 271 9.35 810 4.34 0.66 1 0.15 Case 5: γ = 0 U.S. India 0.43 0.06 8.74 1123 8.74 1123 3.33 0.34 1 0.11 Case 6: νt = νm = 0.69 U.S. India 0.54 0.29 12.1 272 7.00 1034 4.15 0.49 1 0.12 Note: The price of the adoption good is Pa = P γ w1−γ . With γ = 0, the price of the adoption good is Pa = w = 1 while with γ = 1 it is Pa = P . Table IV: Alternative Model Results is because the dearth of adopters makes the intermediate input expensive, but the traditional technology in this modified model is still dependent on the intermediate input (νt = 0.69), reducing its effective productivity. Overall, the analysis in this section emphasizes the interaction among our model elements that is more than simply additive. Technology adoption, roundabout production, and the nature of the adoption cost (i.e., labor or goods) jointly explain the large, nonlinear effect of idiosyncratic distortions even in the absence of multiplicity or coordination failures. 6.3.2 Income Gap between the United States and India Our benchmark model calibration generated a large income gap between the United States and India, a factor of nearly 7, accounted for by the high adoption cost κa and the high degree of idiosyncratic distortions ξ in India (Table III). Since other models in the literature rarely generate such a large income gap, it is natural to ask which model elements of ours are responsible for this result.22 In Section 6.3.1, we calibrated various comparison models to the United States data with ξ = 0 and calculated the impact of higher degrees of distortions ξ. In this 22 Francesco Caselli suggested the exercise in this section, for which we are grateful. 37 section, we calibrate each of the comparison models separately to the United States and the India data, following the procedure in Section 5. We report the resulting GDP per capita of the two economies in Table IV, which also shows the calibrated productivity of the traditional technology At , the adoption cost parameter κa , and the adoption cost in units of labor Pa κa . One thing to note is that the degree of distortions ξ is assumed to be 0 for the United States and is identified from the right tail of the establishment size distribution for India. Since the model elements that vary across the comparison models do not affect the right tail of the size distribution, across all these calibrations the ξ for India remains at 0.19. We start with the basic model in the distortion literature that has no intermediate input nor technology adoption (ν = 0, At = Am and κa = 0; case 1 in Table IV). For this model, the only parameters we can use to match the size distribution in either economy is the entry cost κe , the Pareto tail parameter ζ, and the distortion parameter ξ. The United States right tail pins down ζ and the India right tail ξ, given the assumptions of a common ζ and ξ = 0 for the United States. Unsurprisingly, this simple model has a hard time matching the size distribution of either economy. The result is that the GDP per capita of India is only 14 percent lower than that of the United States, a magnitude comparable to the corresponding model result in Figure 8 (dashed line, ξ = 0.19). The second comparison model has a technology choice with labor-only adoption costs but still has no intermediate input (case 2 in Table IV). To match the establishment size distribution, the adoption costs become vastly different between the two economies. However, the gap in the traditional technology productivity At becomes smaller than in the benchmark model (case 4 in the table). The resulting GDP per capita gap is now larger: India’s is half that of the United States. This gap is larger than what we see in Figure 8 (dotted line) but much smaller than what our benchmark model generates. The third comparison model has intermediate input but no technology adoption (case 3 in the table). Like the first comparison model, without the technology choice and the related parameters, this model cannot closely match the establishment size distributions either. However, the linkage in the form of the roundabout production generates a meaningful GDP per capita gap between the two economies: India’s is nearly 40 percent smaller than that of the United States, which is nearly identical to the effect of ξ shown in Figure 8 (dashed line with triangles). 38 When the technology choice and the linkages through intermediate input uses are both incorporated, as in our benchmark, the model generates a much larger GDP per capita gap between the two countries. The exact size of the gap can vary depending on the nature of the adoption costs (goods vs. labor, case 4 vs. 5 in the table) and the intermediate input intensity of the two technologies (case 6), but what does not change is the insight that these model elements interact and generate a cross-country GDP per capita gap larger than the sum of their respective individual effects. 6.4 Big Push “Big Push” is the name Rosenstein-Rodan (1961) gave to the idea that a minimum scale of investment is necessary for economic development. The rationales are indivisibilities in the production function, especially social overhead capital and complementarities across sectors. Under these conditions, individual firms may not find it profitable to industrialize alone, even though all firms are better off industrializing together. Murphy et al. (1989) note that government investments in infrastructure do not automatically solve the coordination problem. In fact, if unaccompanied by firms’ coordinated decision to industrialize and utilize the infrastructure, the modern infrastructure becomes a classic “white elephant.” In this regard, the role of the government is to promote a coordinated, collective decision. If all we need is the coordination of firms’ decisions so that they all become better off, why do so many countries still remain unindustrialized and poor? Our framework helps address this question in two ways. First, the heterogeneity across firms implies that not all firms are better off in the good equilibrium. As discussed in our explanation of Figure 7, many firms that would be active (and make profits) in the bad equilibrium are inactive (and make no profit) in the good equilibrium. Although we have not specified preferences or welfare criteria, it is easy to see that the presence of losers as well as winners can validate the explanations that vested interests block the adoption of better technologies (Olson, 1982; Parente and Prescott, 1999). Second, our framework introduces another dimension to the notion of Big Push, beyond the coordination over multiple equilibria. Reforms that reduce idiosyncratic distortions are necessary elements of successful development policies. As Section 6.2 showed, the effects of reducing distortions get amplified in our framework with or 39 United States India Note: The elasticity of aggregate consumption to adoption subsidy for the U.S. and India calibration, as the distortion parameter ξ goes from 0 to 0.5. For the case for India, the solid line is the high adoption (low adoption threshold za ) equilibrium, and the dashed line is the low adoption (high za ) equilibrium. The dotted lines are for the simple model without intermediate input in production and with labor only adoption costs. Figure 9: Elasticity of Aggregate Consumption to Adoption Subsidy without multiplicity, which we call Big Push in distorted economies. In this view, the role of the government is to reduce distortions, identifying and exploiting the Big Push region where the returns to economic reforms are discontinuously high. 6.5 Big Push and Industrial Policy The view of economists and policymakers on industrial policy has evolved over time. In the early years, government planning, public investment, and protectionist trade policies were the dominant development strategy, but the results were more often than not disappointing (Krueger, 1997). The mounting evidence of “government failures” ushered in the wave of market-fundamentalist liberalization of the 1990s, with equally, if not more, disappointing results (World Bank, 2005). Renewed thinking on industrial policy emphasizes governments’ coordination of innovation and technology adoption (e.g., Rodrik, 2004), the very elements central to our framework. Here we calculate the effect of such a policy: subsidies for technology adoption. In addition, we solve the problem of a constrained planner who chooses the optimal technology adoption taking as given the distortions and the set of active firms. The industrial policy we implement subsidizes a fraction s of the cost of adopting the modern technology, financed by a lump-sum tax on consumers. 40 United States India Note: The aggregate consumption in the constrained planner allocation for the U.S. and India (dashed line). Consumption is normalized by the equilibrium consumption in the undistorted U.S. economy and is in log scale. Figure 10: Constrained Planner Allocation We show the elasticity of aggregate consumption to the subsidy for both the United States and the India calibrations, as we vary the degree of distortions ξ. For comparison, we also compute the elasticity from a simpler model without the roundabout production (νm = νt = 0) and with labor only adoption costs (γ = 0), which corresponds to the dotted lines in Figure 8 or case 2 in Table IV. Figure 9 presents two noteworthy results. First, in our benchmark model with the feedback effects, the elasticity of aggregate consumption to the subsidy is high in the Big Push region, especially in the multiplicity region for India, but relatively low outside of it. This can potentially explain why some industrial policies are more successful than others: It is a matter of whether the economy is in or outside the Big Push region. Second, the elasticity is uniformly low in the version with no feedback effect and there is no Big Push region, implying that complementarity and amplification are necessary conditions for successful industrial policy. We now calculate the optimal adoption subsidy s, by solving the problem of a constrained planner, who chooses the adoption threshold za taking as given the distortions and the set of active firms.23 The aggregate consumption from this constrained planner allocation is shown with dashed lines in Figure 10, while the solid lines reproduce the equilibrium outcomes from Figure 6. Consistent with the 23 We also worked out the case in which the planner chooses both the entry and the adoption thresholds. The entry margin is found to play a minor role, except at high degrees of distortions. 41 elasticity results above, the largest gains from influencing firms’ adoption decisions are in the Big Push region, where the dashed and the solid lines are farthest apart. Furthermore, the India case attests to the power of coordination achieved by the policy: Although the adoption subsidy has small effects on consumption for ξ greater than 0.2 in Figure 9, the planner can generate massive consumption gains by coordinating the economy away from the low adoption equilibrium to the high adoption equilibrium, as long as the degree of distortions is not too high. Nevertheless, the figure shows that the gains from reducing distortions tend to be larger than the gains from optimal adoption subsidies, and that adoption subsidies are not very effective at the low and the high end of the degree of distortions. In other words, not only do the idiosyncratic distortions determine the effectiveness of the industrial policy subsidizing technology adoption, but are also an important source of underdevelopment themselves. 7 Concluding Remarks This paper provides answers to the following three questions: (i) Can economic development be explained solely by coordination failures? (ii) Can economic development be a story of coordination failures and distortions? and (iii) Are there large nonlinear effects of distortions and policies even without multiplicity, which can explain the huge income differences across countries? We find that the United States calibration gives a unique equilibrium and that the calibrated India economy is in the multiplicity region but is in the good equilibrium, which says no to the first question. We find that small changes in idiosyncratic distortions can move the economy in and out of the region of multiplicity, resulting in discontinuously large changes in the aggregate output. More important, even without multiplicity, the feedback channels in our model creates Big Push regions, where small changes in distortions and policies have disproportionate effects. The answer to the second and the third questions is in the affirmative. A promising avenue for future research is the exploration of an asymmetric inputoutput structure of production—for example, a multisector extension, in which sectors differ in adoption costs and forward/backward linkages. We conjecture that this extension will feature clusters of amplification and multiplicity. Another is a dynamic extension of the model where only a subset of firms make entry and adoption decisions 42 each period. In this extension, coordination failures may show up as multiple steady states and history dependence. Whether policies that subsidize adoption or reforms that reduce idiosyncratic distortions can move the economy from bad to good steady states is an open question. References Bento, P. and Restuccia, D. (2017). Misallocation, Establishment Size, and Productivity. American Economic Journal: Macroeconomics, 9(3):267–303. Bhattacharya, D., Guner, N., and Ventura, G. (2013). Distortions, Endogenous Managerial Skills and Productivity Differences. Review of Economic Dynamics, 16(1):11–25. Blaum, J., Lelarge, C., and Peters, M. (2018). The Gains from Input Trade with Heterogeneous Importers. American Economic Journal: Macroeconomics, 10(4):77–127. Buera, F., Fattal Jaef, R. N., Laski, R., and Trachter, N. (2020). The Missing Middle in a Broad Cross-Section of Indian Establishments. Unpublished manuscript. Buera, F. J., Kaboski, J. P., and Shin, Y. (2011). Finance and development: A tale of two sectors. American Economic Review, 101(5):1964–2002. Bustos, P. (2011). Trade Liberalization, Exports, and Technology Upgrading: Evidence on the Impact of MERCOSUR on Argentinian Firms. American Economic Review, 101(1):304–340. Caucutt, E. M. and Kumar, K. B. (2008). Africa: Is aid an answer? B.E. Journal of Macroeconomics, 8(1). Chenery, H. B., Robinson, S., and Syrquin, M. (1986). Industrialization and Growth: A Comparative Study. Oxford University Press. Ciccone, A. (2002). Input Chains and Industrialization. Review of Economic Studies, 69(3):565–587. Cole, H. L., Greenwood, J., and Sanchez, J. M. (2016). Why Doesn’t Technology Flow From Rich to Poor Countries? Econometrica, 84(4):1477–1521. Cooper, R. and John, A. (1988). Coordinating Coordination Failures in Keynesian Models. Quarterly Journal of Economics, 103(3):441–463. Crouzet, N., Gupta, A., and Mezzanotti, F. (2020). Shocks and Technology Adoption: Evidence from Electronic Payment Systems. Unpublished manuscript. 43 Davis, D. R. and Weinstein, D. E. (2002). Bones, Bombs, and Break Points: The Geography of Economic Activity. American Economic Review, 92(5):1269–1289. Davis, D. R. and Weinstein, D. E. (2008). A Search for Multiple Equilibria in Urban Industrial Structure. Journal of Regional Science, 48(1):29–65. Graham, B. S. and Temple, J. R. W. (2006). Rich Nations, Poor Nations: How Much Can Multiple Equilibria Explain? Journal of Economic Growth, 11(1):5–41. Hirschman, A. O. (1958). The Strategy of Economic Development. Yale University Press. Holmes, T. J. and Stevens, J. J. (2014). An Alternative Theory of the Plant Size Distribution, with Geography and Intra- and International Trade. Journal of Political Economy, 122(2):369–421. Hopenhayn, H. A. (1992). Entry, Exit, and Firm Dynamics in Long Run Equilibrium. Econometrica, 60(5):1127–1150. Hopenhayn, H. A. (2014). Firms, Misallocation, and Aggregate Productivity: A Review. Annual Review of Economics, 6(1):735–770. Hsieh, C.-T. and Klenow, P. J. (2009). Misallocation and Manufacturing TFP in China and India. Quarterly Journal of Economics, 124(4):1403–1448. Hsieh, C.-T. and Klenow, P. J. (2014). The Life Cycle of Plants in India and Mexico. The Quarterly Journal of Economics, 129(3):1035–1084. Hsieh, C.-T. and Olken, B. A. (2014). The Missing “Missing Middle”. Journal of Economic Perspectives, 28(3):89–108. Jones, C. I. (2011). Intermediate Goods and Weak Links in the Theory of Economic Development. American Economic Journal: Macroeconomics, 3(2):1–28. Jovanovic, B. (1989). Observable Implications of Models with Multiple Equilibria. Econometrica, 57(6):1431–1437. Kaplan, G. and Menzio, G. (2016). Shopping Externalities and Self-Fulfilling Unemployment Fluctuations. Journal of Political Economy, 124(3):771–825. Kim, M., Lee, M., and Shin, Y. (2020). The Plant-Level View of an Industrial Policy: The Korean Heavy Industry Drive of 1973. Unpublished manuscript. Kline, P. and Moretti, E. (2014). Local Economic Development, Agglomeration Economies, and the Big Push: 100 Years of Evidence from the Tennessee Valley Authority. Quarterly Journal of Economics, 129(1):275–331. Krueger, A. O. (1997). Trade Policy and Economic Development: How We Learn. American Economic Review, 87(1):1–22. 44 Krugman, P. (1992). Toward a Counter-Counterrevolution in Development Theory. World Bank Economic Review, 6(S1):15–38. Lane, N. (2019). Manufacturing Revolutions: Industrial Policy and Industrialization in South Korea. Technical report, Monash University. Matsuyama, K. (1991). Increasing Returns, Industrialization, and Indeterminacy of Equilibrium. Quarterly Journal of Economics, 106(2):617–650. Matsuyama, K. (1995). Complementarities and Cumulative Processes in Models of Monopolistic Competition. Journal of Economic Literature, 33(2):701–729. Melitz, M. J. (2003). The Impact of Trade on Intra-Industry Reallocations and Aggregate Industry Productivity. Econometrica, 71(6):1695–1725. Murphy, K. M., Shleifer, A., and Vishny, R. W. (1989). Industrialization and the Big Push. Journal of Political Economy, 97(5):1003–1026. Okuno-Fujiwara, M. (1988). Interdependence of Industries, Coordination Failure and Strategic Promotion of an Industry. Journal of International Economics, 25(12):25–43. Olson, M. (1982). The Rise and Decline of Nations: Economic Growth, Stagflation, and Social Rigidities. Yale University Press. Owens, R. E., Rossi-Hansberg, E., and Sarte, P.-D. G. (2018). Rethinking Detroit. Opportunity and Inclusive Growth Institute Working Papers 11, Federal Reserve Bank of Minneapolis. Parente, S. L. and Prescott, E. C. (1999). Monopoly Rights: A Barrier to Riches. American Economic Review, 89(5):1216–1233. Redding, S. J., Sturm, D. M., and Wolf, N. (2011). History and Industry Location: Evidence from German Airports. Review of Economics and Statistics, 93(3):814– 831. Restuccia, D. and Rogerson, R. (2008). Policy Distortions and Aggregate Productivity with Heterogeneous Plants. Review of Economic Dynamics, 11(4):707–720. Restuccia, D. and Rogerson, R. (2017). The Causes and Costs of Misallocation. Journal of Economic Perspectives, 31(3):151–74. Rodrı́guez-Clare, A. (1996). The Division of Labor and Economic Development. Journal of Development Economics, 49(1):3–32. Rodrik, D. (1996). Coordination Failures and Government Policy: A Model with Applications to East Asia and Eastern Europe. Journal of International Economics, 40(1-2):1–22. 45 Rodrik, D. (2004). Industrial Policy for the Twenty-First Century. Discussion Papers 4767, CEPR. Rosenstein-Rodan, P. N. (1943). Problems of Industrialisation of Eastern and SouthEastern Europe. Economic Journal, 53(210/211):202–211. Rosenstein-Rodan, P. N. (1961). Notes on the Theory of the ‘Big Push’. In Ellis, H. S., editor, Economic Development for Latin America, International Economic Association Series, pages 57–81. Palgrave Macmillan, London. Schaal, E. and Taschereau-Dumouchel, M. (2019). Coordinating Business Cycles. Technical report, Centre de Recerca en Economia Internacional. Tybout, J. R. (2000). Manufacturing Firms in Developing Countries: How Well Do They Do, and Why? Journal of Economic Literature, 38(1):11–44. Valentinyi, A., Herrendorf, B., and Waldmann, R. (2000). Ruling Out Multiplicity and Indeterminacy: The Role of Heterogeneity. Review of Economic Studies, 67:295–307. World Bank (2005). Economic Growth in the 1990s: Learning from a Decade of Reform. The World Bank. Yeaple, S. R. (2005). A Simple Model of Firm Heterogeneity, International Trade, and Wages. Journal of International Economics, 65(1):1–20. 46 Online Appendix A Equilibrium Conditions In Section 2, we note that the equilibrium conditions can be represented by three equations in three unknowns. They are κe = Aη−1 m Aη−1 t w (νm −νt )(η−1) P −1 κa ze za η(1−ξ)−1 , η−1 Z w νt (η−1) η−1 za (η−1)(1−ξ) z dF (z) At P ze Z ∞ w νm (η−1) η−1 (η−1)(1−ξ) + Am z dF (z) , P za γ P 1−γ L − (1 − F (z )) κ − (1 − F (z )) κa P νt (1−η) e e a w γ 1 zeη(1−ξ)−1 , Aη−1 κe = t η(1−ξ)−1 η−1 w Z w η−1 P ( Pw )γ = (η − 1) τ η where Z B η(1−ξ)−1 Z P νt (1−η) za η(1−ξ)−1 z dF (z) ≡ (1 − w ze νm (1−η) Z ∞ η−1 P + (1 − νm ) Am z η(1−ξ)−1 dF (z) . w za νt ) Aη−1 t Sensitivity Analysis Because the literature lacks precise estimates of the elasticity of substitution across intermediate goods η and the intermediate input elasticity for the traditional technology νt , we fixed η = 3 and νt = 0 outside of the model. While η = 3 falls within the standard range in the literature, there is no available estimate of νt , beyond the fact that it is smaller than the modern technology elasticity νm (Chenery et al., 1986; Blaum et al., 2018; Kim et al., 2020). In this section, we explore the role of η and νt . As we vary η or νt , we recalibrate the model and re-do the exercises that produced Figure 6. We consider two values of η, 2.5 and 4, one on either side of the benchmark value η = 3. We also consider two values for νt : one where νt = 0.35 is larger than the benchmark value of zero but smaller than νm , and the other where νt and νm are both 0.69, which is the highest value possible for νt given the restriction 47 νt ≤ νm and the overall intermediate input share. Figure 11 shows consumption per capita as we vary the idiosyncratic distortion parameter ξ. The two top panels are the cases with different η’s for the United States and India. The two bottom panels are for alternative νt ’s. In all cases, consumption is normalized by the United States consumption in the equilibrium with no distortions. United States, alternative η India, alternative η United States, alternative νt India, alternative νt Note: Equilibrium consumption per capita of the United States and India as the distortion parameter ξ goes from 0 to 0.5. Consumption per capita is normalized by the U.S. consumption (no distortion) and in log scale. Figure 11: Sensitivity of Consumption and Equilibrium Multiplicity to η and νt The top panels of Figure 11 show that a smaller elasticity of substitution η increases the consumption difference between good and bad equilibria and widens the set of the distortion parameter ξ that generates multiple equilibria. This is consistent with the analysis in Section 3.1: A lower η makes goods less substitutable and firms’ 48 adoption decisions more complementary. In addition, the income gap between the United States and India is larger with a smaller η. We draw the conclusion that a small elasticity of substitution across goods are conducive to explaining the income differences across countries, with or without multiplicity. The bottom panels show that a lower intermediate input elasticity of the traditional technology νt has two effects. On the one hand, a lower νt enlarges the set of ξ’s generating multiple equilibria. This is consistent with the discussion in Section 3.3. On the other hand, a low νt compresses the consumption gap between good and bad equilibria. Intuitively, when νt is small, it is less costly to use the traditional technology in a world where few firms adopt the modern technology and the intermediate aggregate is expensive. The two effects run in opposite directions when it comes to explaining cross-country income differences. Another robust result in the plots for India is that the model features a very nonlinear effect of distortions, either through or independently of equilibrium multiplicity. C Idiosyncratic Distortion We show the magnitude of idiosyncratic distortions in the calibration fro India with ξ = 0.19. We model distortions as effective taxes and subsidies on firms’ revenue, and the tax rate is 1 − τ z −ξ . The solid line is the cumulative distribution of tax rates across firms in the calibration for India, while the dashed line is the same object but for the re-computed United States economy using India’s ξ. For a given tax rate on the vertical axis, we show the fraction of firms with higher tax rates on the horizontal axis in log scale. In both cases, a small fraction of firms pays taxes and subsidizes the rest of the economy. In India, the most productive firms (those at the top 10−5 percentile of the active firm productivity distribution) face tax rates of about 45 percent, and nearly half the firms receive subsidies of at least 60 percent of their revenue. For the United States case with India’s ξ, the whole tax schedule shifts up for two reasons. First, the active firms in the United States are more productive because the entry threshold ze is higher than in India. As a result, holding constant the τ̄ of India, the United States firms face a higher tax schedule. Second, in the United States equilibrium with India’s distortions, τ̄ is higher. The top United States firms face higher taxes than top firms in India—a tax rate of almost 60 percent—and 49 Note: The figure shows the cumulative distribution of taxes across firms. The solid line is for the calibration for India. The dashed line corresponds to the recomputed U.S. economy using India’s ξ. Figure 12: Idiosyncratic Distortions across Firms half the firms receive subsidies of at least 30 percent of their revenue. Overall, we conclude that the magnitude of taxes to large firms and subsidies to small firms is not implausible, relative to the numbers in the misallocation literature. 50