The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.
High-Skill Migration, Multinational Companies, and the Location of Economic Activity WP 19-20 Nicolas Morales Federal Reserve Bank of Richmond High-Skill Migration, Multinational Companies, and the Location of Economic Activity∗ Nicolas Morales† Working Paper No. 19-20 December 2019 Most recent version here Abstract This paper examines the relationship between high-skill immigration and multinational activity. I assemble a novel firm-level dataset on high-skill visa applications and show that there is a large home-bias effect. Foreign multinational enterprises (MNEs) in the US tend to hire more migrant workers from their home countries compared to US firms. To quantify the general equilibrium implications for production and welfare, I build and estimate a quantitative model that includes trade, MNE production, and the migration decisions of high-skill workers. I use an instrumental variables approach to show that the relationship between immigration and MNEs proposed by the model holds in the data. The model is then used to run two counterfactual exercises. The first, evaluates the implications of a more restrictive immigration policy in the US. I find that MNEs play a significant role in how immigration affects the location of production and welfare. In the second counterfactual exercise, I increase the barriers to MNE production to calculate the welfare gains generated by MNEs. I show that a model not incorporating migration would overestimate the MNE welfare gains for high-skill workers by 35% and underestimate welfare gains for low-skill workers by 8%. JEL: F16, F22, F23, J61 Keywords: High-skill immigration, H-1B visas, Multinational companies, IT sector ∗ I would like to thank John Bound, Andrei Levchenko, Jagadeesh Sivadasan and Sebastian Sotelo for their guidance and advice. I would also like to thank Vanessa Alviarez, Dominick Bartelme, Charlie Brown, Javier Cravino, Jingting Fan, Catalina Franco, Amelia Hawkins, Gaurav Khanna, Eunhee Lee, Pierre-Daniel Sarte, Mel Stephens, Gonzalo Vazquez-Bare and seminar participants at the U. Michigan, Midwest Trade (Vanderbilt), LACEA, OECD, Santa Cruz, Richmond Fed, Dallas Fed, McGill, Pompeu Fabra, New Faces in Trade (Penn State), SITE (Stanford) and Mid-Atlantic Trade (Duke) for helpful comments and suggestions. I am grateful to the Alfred P. Sloan Foundation and the NBER Fellowship on High-Skill Immigration for generous research support. The views expressed are those of the author and do not necessarily reflect those of the Federal Reserve Bank of Richmond or the Board of Governors. † Federal Reserve Bank of Richmond, nicolas.morales@rich.frb.org, sites.google.com/view/nicolasmorales 1 Introduction In recent years, particularly in the United States, policies have been proposed that would curtail high-skill immigration into the country. For example, by rescinding work visas for the spouses of high-skill immigrants or increasing the required lower bound for wages of new high-skill immigrant workers, both of which aim to reduce total high-skill immigration to the US.1 These policies rely on the argument that immigration reduces employment opportunities for natives whose skills are on par with the immigrants. On the other hand, those who support high-skill immigration argue that such policies are likely to decrease the level of economic activity in the US and lead both US companies and foreign multinational enterprises (MNEs) in high-skill industries, such as IT, to move their operations elsewhere. While both of these arguments are frequently made, there is little quantitative evidence of the extent to which restrictions on high-skill immigration would indeed cause high-skill industries to relocate outside of the US. In this paper, I quantify the impact that US restrictions to high-skill immigration would have on welfare, MNE activity, and total production in high-skill industries. To establish a link between MNEs and high-skill migration, I present a novel finding that foreign MNEs are more dependent on immigrant labor from their home countries than US companies. I build and estimate a quantitative model with multiple industries and countries that incorporates highskill migration, trade, and MNE activity, and I validate the model using an instrumental variables approach. I use this model to run two main counterfactual exercises. First, I study the effects of restricting immigration into the US on welfare, production and MNE activity. Second, in order to quantify the welfare gains created by MNEs, I increase the barriers to MNE production and calculate the importance of incorporating migration for estimating the welfare gains from MNEs. As a first step, I assemble a novel firm-level dataset that relates the nationality of each high-skill migrant hired in the US to the source country of the parent company of the firm. To construct this dataset, I use the universe of H-1B visas granted between 2001 and 2014 and L-1 visas granted between 2012 and 2014, obtained through a Freedom of Information request (FOIA) to the United States Citizenship and Immigration Services.2 The data include information on wages, worker nationalities, and characteristics of the sponsoring firm such as company name and location. I match this data by name and location to the corporate databases of Orbis and D&B Hoovers to get information on the ownership structure and industry of each firm. The link between the source country of the parent company and the origin of the immigrant workers 1 Such policies are the recent comments to rescind the H4 EAD work permits for H-1B spouses in mid-2018 and the “Protect and Grow American Jobs Act” to impose a higher lower bound on H-1B wages introduced in late 2017. 2 H-1B and L-1 visas are the main visa categories through which college-educated immigrants can get work authorization in the US. The L-1 program focuses on intra-firm transfers while the H-1B is open to everyone. More details on these programs are given in Section 2. 1 has been missing from previous studies and is key to understanding the relationship between MNEs and high-skill worker migration. I document a series of facts that relate immigration to MNE activity in the US. First, I show that foreign MNEs in the US have a large “home bias”, where on average, they hire 153% more foreign workers from their source country when compared with the number of workers from that source country hired by US companies and other foreign MNEs. There is heterogeneity in the magnitude of the home bias across countries, and this effect is consistently large. Such home bias could be explained by a demand channel where firms need source-country immigrants to produce, or a supply channel where source-country immigrants find it easier to migrate if working for an MNE from their origin country. I show MNEs pay lower wages to their sourcecountry immigrants, suggesting the supply channel is the predominant explanation for the observed home-bias. Finally, I show that higher home bias is positively correlated with higher visa applications, farther distance to the US, and being a non-English speaking country. Guided by these facts, I build a quantitative model that accounts for several channels through which immigration affects production. The production side of the model allows for trade and MNE activity across multiple industries, similar to the work of Ramondo and Rodriguez-Clare (2013) and Alviarez (2018). For instance, if German producers want to sell goods to the US, they will choose the production location that allows them to sell the goods at the lowest price. The model allows them to choose among producing the goods in Germany and shipping them to the US, paying a trade cost; or setting a plant in the US and selling their product domestically, paying an MNE cost; or setting a plant in a third country and selling to the US, paying both trade and MNE costs. Such decisions are at the core of why a company relocates its production when migration restrictions are imposed; their marginal cost is increased, causing them to move. The labor supply side of the model focuses on the decisions of college-educated workers in each country who choose which country to migrate to, industry to work in, and source technology to work with. For example, if a worker is employed by a company whose parent is headquartered in the US, he or she works with US source technology. Workers draw idiosyncratic productivities to work in each country-industry-source triplet, and they sort endogenously across triplets as explained in Roy (1951).3 If they choose to migrate, workers have to pay a non pecuniary migration cost. In the model, immigrants affect firm-level production in two ways. First, as suggested by Peri and Sparber (2011), I allow for imperfect substitution between immigrants and natives in the production function. Second, I allow for workers from different countries of origin to have different comparative advantages across industries. The link between MNE and migration appears through three separate channels. From the labor supply side, the migration cost can 3 The supply-side of the model is related to the literature that combines Roy (1951) and Eaton and Kortum (2002). Some recent examples using similar labor supply models are Lagakos and Waugh (2013), Hsieh et al. (2019), Lee (2019), Bryan and Morten (2018), Fan (2019), Tombe and Zhu (2019), and Liu (2017) 2 be lower if workers migrate to work at a company whose source technology is the same as that in the worker’s home country. From the labor demand side, foreign MNEs treat workers from their source-country as imperfect substitutes for domestic and other foreign workers; therefore, they treat source-country workers as distinct inputs for production. Additionally, I allow for the possibility of source-country immigrants lowering MNE costs by providing information or networks that facilitate MNE activity. As a validation for the model, I show well identified, reduced-form evidence that corroborates the positive link between own-country high-skill immigration and MNE employment. Using variation across years, industries, and source countries I construct a shift-share instrumental variable for the number of immigrants from each country working for different industries in the US. I interact the initial share of immigrants from the source-country working in an industry with the total number of immigrants from that country working for US companies in other industries. I show that a 1% increase in source-country immigrants working for a given industry in the US increases total MNE employment by firms from that source country by 0.70%-1.94%. The reduced-form evidence also corroborates that high-skill immigration, as opposed to low-skill immigration, has a predominant effect on MNE activity. I show that in order to measure the changes in welfare and production between the observed equilibrium and a counterfactual equilibrium, I only need six elasticities and data on observed migration shares, trade shares, MNE shares, and labor expenditure shares. I use the approach proposed by Dekle et al. (2008) and re-write the equilibrium in proportional changes from the observed equilibrium to a counterfactual equilibrium. This move allows me to significantly reduce the number of parameters to be estimated in order to determine the magnitude of endogenous responses of the model to any given exogenous shock, such as an increase in cost of migration. I use my novel dataset to estimate structurally three of the elasticities that are not available in the literature: the labor supply elasticity, the elasticity of substitution between source-country workers and other foreign workers, and the elasticity of MNE costs to immigration. I use an instrumental variables approach based on the trade shocks literature to construct a demand shifter and estimate the elasticity of labor supply. For the elasticity of substitution between foreign high-skill units of labor, I use the immigrant shift-share instrument described above to identify the slope of the relative demand for source-country and other foreign workers. Finally, I estimate the elasticity of MNE costs to immigration through indirect inference, by matching the estimated response of MNE employment to immigration. I use this estimated model to run two main counterfactual exercises. In the first, I increase the costs of immigration into the US from all other countries to reduce the stock of inbound highskill immigrants by 10%, consistent with a 0.3% decrease in total US workforce. The decrease of high-skill immigrants would cause US production in high-skill intensive industries such as IT and high-tech manufacturing to decrease by 0.53% and 1.70% respectively. This decrease would be largely driven by foreign MNEs who respond disproportionately to the migration 3 restrictions. Other countries share in production is expected to increase in response, with the IT sector in India increasing by 0.63% and in Canada by 0.16%. Real wages for US low-skill workers would decrease by 0.35%. High-skill workers are complements in production to lowskill workers such that the decrease in immigration of high-skill workers lowers the demand for low-skill workers and decreases their wages. The increase in labor costs caused by immigration restrictions would also increase prices for US consumers, adding to the negative effect on the real wages of US workers. On the other hand, US high-skill workers would experience a gain of 0.09% in their real wages driven by an increase in the market wages caused by the lower competition from immigrants. Overall, the real wages of US workers decrease by 0.22% when immigration is restricted, which in dollar terms would account for a total long-term loss of $8 billion USD for the US economy. In the second counterfactual exercise, I increase the barriers to MNE production in order to calculate the welfare gains from MNE activity. Foreign MNEs bring more efficient technologies that lower the costs of production domestically and improve efficiency. Canonical papers in the MNE literature such as Ramondo and Rodriguez-Clare (2013) and Tintelnot (2017) have focused on quantifying the welfare gains of going from MNE “autarky,” where MNE costs are prohibitive and MNE flows are zero, to the observed equilibrium in which MNE flows are positive. I use my quantitative model to show that going from MNE autarky to the observed MNE flows would increase welfare for high- and low-skill native workers by 1.16% and 1.42%, respectively. A model that does not incorporate migration would overstate the welfare gains for high-skill workers by 35% and understate the gains for low-skill workers by 8.15% since it would not account for the negative impact of immigration on high-skill natives nor the positive impact for low-skill workers. This result shows that the link between MNEs and immigration significantly affects the distributional welfare gains predicted by canonical MNE models that do not incorporate migration. To my knowledge, this is the first paper that quantifies the impact of high-skill immigration on the welfare of workers and the location of production by taking into account the specific channel of multinational activity. The United States immigration policies concerning high-skill workers, particularly through the H-1B program, have received significant attention in recent years. On one hand, high-skill immigrants are found to increase innovation (Hunt and Gauthier-Loiselle, 2010; Kerr and Lincoln, 2010; Peri et al., 2015b) and hence increase total factor productivity in the US. On the other, its empirical estimates on native employment and wages are mixed. While some papers find small to negligible consequences for the employment of native workers (Peri et al., 2015a), others find significant crowd-out effects (Doran et al., 2015). I contribute to the empirical literature by estimating a rich quantitative model to calculate the positive and negative consequences of immigration into the US. More broadly, several papers have used general equilibrium models to understand how highand low-skill immigration affects wages and employment of native workers. Among others, 4 Docquier et al. (2014), Bound, Khanna, and Morales (2018), and Burstein et al. (2018) look at the effects of immigration for native workers with different skills and occupations by focusing on the consequences for the recipient country and ignoring the implications of migration for the rest of the world. A second set of papers go beyond that and use multi-country models to study the consequences of migration in both receiving and sending countries. Such a global view on migration requires us to incorporate, to some extent, the possibility that production will relocate as a response to changes in immigration policy (Caliendo et al., 2018; Desmet et al., 2018; di Giovanni et al., 2015; Iranzo and Peri, 2009; Khanna and Morales, 2018). This paper contributes to this literature by including the channel of multinational production, which is key to understanding the effects that firms decisions to relocate production due to immigration policies have on welfare and productivity. A closely related strand of literature has used a mostly reduced-form approach to establish a link between immigration and trade (Gould, 1994; Hiller, 2013), immigration and FDI activity (Burchardi et al., 2019; Cuadros et al., 2019; Glennon, 2018; Javorcik et al., 2011; Wang, 2014; Yeaple, 2018), and the interrelations between migration, trade, and FDI (Aubry et al., 2018). I contribute to this literature by providing new facts on these links as well as building and estimating a quantitative model that allows me to properly quantify the aggregate implications of immigration for MNEs production and for welfare. As an additional contribution, I provide new evidence on the distributional welfare gains of MNE production. Many of the most notable papers in the multinational production literature have focused on quantifying the welfare gains of MNE production by incorporating, among other factors, the interrelations between MNE production and trade, intermediate inputs, innovation, and comparative advantage (Alviarez, 2018; Arkolakis et al., 2018; Head and Mayer, 2018; Ramondo and Rodriguez-Clare, 2013; Tintelnot, 2017). My paper is the first to show how the baseline results found in the literature might be expected to change if we were to incorporate the channel of migration, which would significantly affect the distributional welfare gains of MNE production. Finally, I contribute to the literature on the role of MNEs for the transfer of knowledge and formation of cross-country teams (Antras et al., 2006; Gumpert, 2018; Keller and Yeaple, 2013; Mion et al., 2018). My paper proposes international migration as an additional channel for knowledge transfer, where MNEs have a specific productivity effect from hiring workers from their source country. 2 Context and Data High-skill immigration into the US is possible through two main visa programs: the H-1B and the L-1. The H-1B program started in the early 1990s and was created as a pathway through which firms could hire temporary high-skilled workers in “specialty occupations” for a period 5 of three years with the option to renew it for three more. The main feature of the program is that the number of new visas awarded per year is capped at 65,000 visas with an additional 20,000 for those who have a post-graduate degree awarded by a US institution. If the number of applications exceeds the cap, then a lottery takes place to award the visas. Universities and non-profit organizations are exempt from the cap. The visa program recognizes a dual intent, in which the employees can obtain a green card after their H-1B expires. The L-1 program is lesser known than the H-1B and represents around 10% of total H-1Bs awarded. The total number of L-1 visas is not capped and the program is targeted at MNE companies, since it requires the sponsored employee to have worked at an affiliate of the employer for at least 1 year in a period of 3 years prior to admission to the US. L-1 visas are valid for up to 5 to 7 years and are also dual intent, where employees can get sponsored for a green card after being L-1 holders. Further details on the L-1 and H-1B programs are discussed by Yeaple (2018). For this project, I submitted a Freedom of Information Act (FOIA) request for the universe of I-129 forms for H-1B submitted between 2001 and 2014 and L-1 visas submitted between 2012 and 2014. The I-129 form needs to be filed by the employer to the United States Citizen and Immigration Services (USCIS) once the visa was approved by the Department of Labor and after the visa application went through the lottery in the case of the H-1B. The novelty of the dataset is that it contains individual information including the employer’s name, start and end dates for which the visa is valid, occupation, country of birth, and wages. Country of birth is a key variable needed for the subsequent analysis to establish the relation between MNE and immigration. The dataset also includes information on whether petitions were filed for new employment, a renewal of previously approved employment, or a change in the terms of employment. Such information has an advantage over the H-1B data posted by the Department of Labor where all types of petitions are pooled together and includes petitions that did not win the lottery. I combine the FOIA dataset with corporate information from Orbis, DnB Hoovers, and Uniworld, to get insight into the ownership structure of the employers and determine the country where the Global Ultimate Owner of the company is headquartered. This link is fundamental to my analysis as it will reveal the source technology that foreign workers are using when migrating to the US. The corporate datasets also contain useful information such as industry indicators for the affiliate and the parent company as well as data on employment and revenues. Appendix A explains how I constructed the FOIA dataset and provides details on the matching process with the corporate datasets. 3 Stylized facts In this Section, I use the visa data to present some facts that help shed light on the link between high-skill immigration and MNE activity and that motivate the model in Section 4. In a first stage, I show that there is a strong link between MNEs and high-skill immigration captured by 6 a “home bias” measure that indicates foreign MNEs from a given source country are more likely to hire immigrants from that country than US companies and MNEs from other countries. In a second stage, I show that foreign MNEs in the US pay their own-country immigrants lower wages, suggesting a predominant labor supply mechanism for the observed home bias. Finally, I present evidence that home bias is larger for countries that are farther away from the US and whose native language is not English. As shown in Appendix A, Tables 13 and 14, the US high-skill immigration system is highly skewed toward IT workers coming from India. US and Indian companies are also the main applicants for H-1B and L-1 visas. However, as noted in the facts presented in this Section as well as the quantitative results throughout this paper, high-skilled migration plays a significant role on the activity of all foreign MNEs that operate in the US as well as across high-skill industries other than IT. Foreign MNE companies have a “home bias” toward workers from their source country Foreign MNE companies have a “home bias” towards recruiting workers from their source country when compared to US companies. This is relevant since we should expect foreign companies to respond more to a migration policy change than American companies, which in turn has further implications for changes to the industrial structure and welfare in the US. To find support for this in the H-1B and L-1 data, I collapse the high-skill visa data to the origin (o) - source (s) - industry level (k) and proceed to run a regression as in equation 1. For example, one observation of the regression is German H-1B and L-1 recipients working for Japanese automotive companies located in the US. Ln(Nk,o,s ) = γ0 + X γs 1(origin = s) + δk,o + δk,s + k,o,s (1) s The key coefficient of interest is γs , which measures how much more likely it is that a company from source s will hire someone from o = s relative to o 6= s when compared to all other companies from other source countries. δk,o is an industry-origin fixed effect that captures whether migrants from origin o have a specific comparative advantage on industry k. δk,s is a source-industry fixed effect that captures any comparative advantage of a company from source country s that is operating in industry k. Equation 1 measures the home bias within industry and compares firms from s with all other firms with s0 6= s. The results of the home-bias coefficient γs can be found in Figure 1. The home bias is large for most countries in the sample, and there is significant heterogeneity across source countries. For example, Indian companies are shown to be 122% more likely to recruit workers from India than other countries, relative to non-Indian companies. 7 Figure 1: Estimated coefficient (γs ) on sourcing regression by country (H-1B+L-1) Appendix B explores further how the regression results change when including source-origin pairs with 0 value and running the regression at the source-origin level. The finding of home bias is very robust to these specifications. The results are also robust when only focusing on H-1B visas (excluding L-1 visas), showing that the documented “home bias” is not just a mechanical relationship driven by intra-firm transfers. The observed home bias can be explained by either demand or supply mechanisms. On the demand side, we can think of source-country immigrants as having a specific productivity for the source-country firm. For example, they might facilitate communication between the US affiliates and the parent company because of language or cultural proximity. On the supply side, it is possible that workers in the source country find it easier to migrate and work for a source-country company abroad. Perhaps workers are more likely to find foreign jobs because of networks in the home country or employers face a lower screening cost for source-country experience and education credentials. For now, I will not take a stand on whether the home bias is driven by workers being more likely to find a job at a company headquartered in their origin country or by foreign companies having a greater need for workers from their source country. However, I look at wage data and other covariates to see which mechanism might be more relevant to explain the home bias. In the model and subsequent estimation in Sections 4 and 6, I explicitly separate these two channels. Foreign MNE companies pay lower wages to immigrants from their source country 8 As a second exercise, I run equation 1 but using log average wages for each s, k, o as the dependent variable as in equation 2: Ln(w̄k,o,s ) = γ0 + X βs 1(origin = s) + δk,o + δk,s + k,o,s (2) s Coefficients βs can be interpreted as the average wage difference between source-country immigrants and other immigrants for an MNE from country s when compared to companies not from s. If the demand mechanism was the only explanation for the home bias, we would expect source-country workers to have a wage premium over other immigrants. Instead, if the supply mechanism were the sole channel, we would expect to see a wage penalty for source-country workers. As shown in Figure 2, most MNEs show a wage penalty for source-country workers relative to other immigrants. For example, Korean firms in the US pay 27.4% less to their Korean workers than to other immigrants when compared to non-Korean firms. Figure 2: Estimated coefficient (γs ) on wage regression by country (H-1B) While coefficients βs contain both supply and demand mechanisms, the negative pattern suggests that the predominant force in explaining the large degree of home-bias might be a supplyside mechanism. If immigrants find it easier to find jobs at source-country companies, the marginal immigrant working for a source-country company will have a lower ability and hence receive a lower wage than the marginal immigrant working for a non-source-country company. Sources of home bias: Firm size, distance and common language. 9 To further explore the mechanisms driving the observed home bias, I look at whether firm size and other observables explain part of the variation in home bias across source countries. As a first step, I estimate a version of equation 1 where instead of collapsing the data at the industry-source-nationality level, I collapse it at the firm-nationality level. The insight of this specification is to check the role of large firms on driving the results observed in figure 1, as when aggregating at the source-industry level, firms with more visa applications will tend to drive the results. As shown in Figure 3, the magnitude of the home-bias is still large and positive for most countries. However, the magnitudes are quite smaller than those shown in Figure 1, indicating that MNEs who apply for a higher number of visas exhibit a larger home bias. Figure 3: Estimated coefficient (γs ) on firm-level employment (H-1B+L-1) As a second step, I look at simple correlations between country-industry measures of home bias and observables to understand what other factors might drive the observed heterogeneity. I construct the home bias measure at the source-country and industry level by adding industrycountry pair interactions to equation 1 and using the estimated coefficients as the home-bias measure. As shown in Table 1, source-country measures seem to drive more of the heterogeneity than industry-specific measures. Countries that are farther away from the US and do not share a common language present higher degrees of home bias, which suggests that home bias is larger for countries where communication between the parent and the affiliate might be more costly. Industries in which the source country has a larger share of US imports tend to have lower home bias. Countries with larger market shares in the US are likely to have long-lasting trade relationships with the US, and the employment of home-country workers might be less crucial for production. 10 Table 1: Correlation between home-bias and observables Source country characteristics (s) GDP per worker at s Common language Free trade agreement Distance -0.21 -0.52 -0.26 0.56 Industry characteristics (k) Share of college grads in k Average college grads wage in k Employment share in US -0.16 -0.06 -0.05 Source-Industry characteristics (k) Industry GDP at s US Employment MNEs from s in k US Employment growth MNEs from s in k Share of US imports from s in k Share of non-US imports from s in k 0.12 0.05 0.09 -0.34 -0.01 Correlation coefficient between industry-source measure of home-bias and observables. s source country, k industry. Home-bias measure at the source-industry level calculated by taking the coefficients from a regression as in equation 1 that includes interactions between origin=source dummies and industry dummies. All covariates are measured at year 2014. I highlight in bold correlations over 0.3. 4 Model To quantify the general equilibrium implications of changes to high-skill immigration policy in the US and guided by the facts in Section 3, I build a quantitative model to disentangle the different mechanisms through which immigration affects production, welfare, and MNE activity. The model consists of two main parts: a labor market for high-skill workers described in Section 4.1 and a product market that includes trade and MNE activity described in Section 4.2. The model is static and consists of O countries. Production can be carried out by local companies or by foreign companies that set up an affiliate outside of their home country. 4.1 Labor Market and Migration Choices Each country o is endowed with a number of low-skill (L̄o ) and high-skill (N̄o ) workers. Lowskill workers are a homogeneous group who cannot migrate and receive wage wo . On the other hand, high-skill workers have heterogeneous abilities and are able to choose the location `, industry k, and source technology s they want to work with. Source technology refers to the 11 country where the company they work for is headquartered. At the beginning of the period i,o each worker i at origin o takes an ability draw ηk,`,s to work at each triplet z = k, `, s from a Frechet distribution as shown in equation 3: i,o F (ηk,`,s ) = exp − Z X 1 1−ρ Ãk,o ηzi,o κ̃ − 1−ρ !1−ρ (3) z=1 The shape parameter of the distribution κ̃ is common across origin countries and governs the dispersion of abilities for each individual. Lower values of κ̃ imply that individuals are likely to have very different abilities across triplets k, `, s. ρ represents the correlation across ability draws. In the extreme case of ρ = 1, individuals will have the same ability across all triplets κ̃ of s, k, and `. As in (Bryan and Morten, 2018), it is useful to rewrite κ = 1−ρ such that κ is a combination of ability dispersion and the correlation parameters. As will be shown later, the parameter κ is also the elasticity of labor supply, since it determines how much labor supply choices respond to changes in wages or migration costs. 1 1−ρ , determines the average ability level of each origin in each The scale parameter, Ak,o = Ãk,o industry. This allows for workers in a given country to have a comparative advantage at specific industries. This setup is related to the EK-Roy models of comparative advantage, which is a combination of the Ricardian model of productivities in Eaton and Kortum (2002) and the selection model proposed by Roy (1951). Such a setup has been used to model individual choices of occupations and industries (Hsieh et al., 2019; Lagakos and Waugh, 2013; Lee, 2019), as well as both for internal (Bryan and Morten, 2018; Fan, 2019; Tombe and Zhu, 2019) and international migration (Liu, 2017). Each high-skilled worker, indexed by i, chooses the triplet k, `, s that maximizes their utility w i,o is the real as in equation 4. ηk,`,s is the ability draw for individual i in triplet k, `, s, k,`,s P` o wage per effective unit paid in triplet k, `, s and εk,`,s is a mean one log normally distributed random term that captures random shocks that make workers from o more productive at triplet k, `, s.4 φo,`,s ≥ 1 is a non-pecuniary migration cost that is paid when migrating from origin o to location ` and source technology s. Having the migration cost depend on s is the first component of the home bias discussed in Section 3, since workers from a given origin can have a lower cost when working for an MNE of a specific source technology. As φo,`,s is non-pecuniary, i,o o the wage that individuals actually receive in the labor market is Wk,`,s = ηk`,s × wk,`,s . max k,`,s i,o i,o {Uk,`,s } = ηk,`,s × εok,`,s × 4 wk,`,s 1 × P` φo,`,s (4) The assumption that Ak,o only depends on origin and industry is done for convenience in the subsequent estimation. However it would be possible to work with Ao,k,`,s = Ak,o εok,`,s and the estimation results would be identical. 12 4.2 Production, Trade, and MNE activity I lay out the consumer problem in two stages. First, individuals take ability draws and choose a triplet k, `, s as explained in Section 4.1. Second, conditional on their choice and the wage they receive, they maximize their consumption utility as an individual living in ` who has Cobb-Douglas preferences over K industries as in equation 5 γ k,` U` = ΠK k=1 Qk,` (5) Each Qk,` can be written as a continuum of varieties indexed by j, and aggregated CES as in equation 6: Z Qk,` = j qk,` σ−1 σ σ σ−1 (6) dj j Each variety qk,`,s is produced using a Cobb-Douglas aggregate of intermediate inputs from each industry K and a composite of low- and high-skilled labor as in equation 7. qk,` = k,` K Y γ`,k,k0 Q`,k,k 0 α−1 l ψk,` lk,`α α−1 + h ψk,` hk,`α P α α−1 (1− k0 γ`,k,k0 ) (7) k0 =1 α represents the elasticity of substitution between low- (lk,` ) and high-skill (hk,` ) units of labor and γ`,k,k0 is the expenditure share for industry k in country ` on intermediates from industry k 0 . Each producer has an idiosyncratic productivity jk,` . While I omit index j in equations 7 and 8, both equations are at the producer level. I assume high-skill labor hk,` is a composite of effective units from the domestic country hdk,` , source country hsk,` , and other foreign countries hfk,` . That is, if the producer uses a source technology in a location ` 6= s then the aggregate hk,` can be written as in equation 8: 5 λ λ−1 hk,`,s ι λ−1 ι−1 λ ι−1 λ−1 ι−1 ι d sf s f f s d λ ι + ψk,`,s ψk,`,s hk,`,s + ψk,`,s hk,`,s = ψk,`,s hk,`,s | {z } | {z } | {z } local workers source country other foreign | {z } (8) all foreign The parameter λ governs the substitution between effective units of the domestic country and foreign effective units. Parameter ι governs the substitution between source country and other 5 If a company operates in ` = s, then the source and domestic inputs are the same and the only relevant substitution is between natives and foreign. 13 foreign workers. Having foreign and native workers be imperfect substitutes is consistent with the findings of Peri and Sparber (2011), who find that immigrants tend to specialize in different tasks than natives. At the same time, having source-country workers be an imperfect substitute for other foreign workers and natives is consistent with the knowledge transfer literature such as Keller and Yeaple (2013), who find that affiliates of US MNEs can use intermediate inputs from the parent country to transfer knowledge from parent to affiliate. This is where the demand side of the home-bias discussed in Section 3 appears, since foreign MNEs will have a specific value for migrants from their source country. f sf d s Share parameters ψk,`,s , ψk,`,s , ψk,`,s , ψk,`,s vary across source countries and industries. Differences in this parameters capture why some source-industry pairs might be more intensive on immigrants or source-country immigrants than others. 4.2.1 International trade and MNE Up to this point I have been taking the existence of MNE companies as a given. To close the model, I clarify how location decisions of MNEs are made. This setup is a multi-industry extension of the MNE production model proposed by Ramondo and Rodriguez-Clare (2013), which is an extension of the Ricardian trade model in Eaton and Kortum (2002). Multi-sector Ricardian MNE models have been developed by Alviarez (2018) and Arkolakis et al. (2018), among others. Producers of each variety j in source country s take a productivity draw jk,`,s to produce variety j in each possible location `. Such productivity is drawn from a Frechet distribution as in equation 9: F (jk,`,s ) = exp − L X Tk,s −θ j` ! (9) `=1 Once again, the shape parameter θ governs the productivity dispersion across production locations for a given producer. If θ is low, then there are large gains to MNE production, as a producer might have low productivity in their source country but high productivity at some alternative location. A producer of variety j, in industry k, with source technology s who chooses to locate production at location ` and sell their products to destination country n would charge a price as in equation 10: pj,k s,`,n = k k cks,` τ`,n δs,` jk,s,` (10) The price increases with the marginal cost of production cks,` . Marginal cost depends on both 14 the location of production ` and the source technology s since, as presented in Section 4.1, foreign workers have different costs of migration for domestic and foreign MNEs, which implies that an MNE from source s located in ` has access to a specific labor pool and pays a different wage per effective unit of labor than companies from other source countries. The locationsource specific productivity jk,s,` decreases the price, as more efficient producers generate more output for a given combination of inputs. If a producer located in ` wants to sell to destination k ) where part of the good gets lost in transit n 6= `, then they incur in an iceberg trade cost (τ`,n from ` to n. Alternatively, if a company decides to serve market n by setting up an affiliate k in n = `, then if s 6= ` the company incurs in an iceberg MNE cost (δs,`=n ), which represents the share of the goods that gets lost when adapting technology s to location `. A third option is for a company from s to locate in ` 6= s and sell goods to n 6= s, `, in which case it would k k ). Consumers end up buying each variety from the ) and MNE costs (δs,` pay both trade (τ`,n j,k cheapest producer such that: mins,` {ps,`,n }. k As a final feature, I will allow for the MNE cost δs,` to depend on the number of immigrants from s who work in location `, industry k. The rationale behind this modeling assumption is that immigrant diasporas in foreign countries might provide information and business connections k that facilitate FDI. The iceberg MNE cost δs,` can be written as in equation 11: k k o=s δs,` = δ̄s,` Nk,` ν (11) k o=s Where δ̄s,` is a baseline iceberg cost independent on the number of migrants and Nk,` is the number of migrants from o = s that work in `, k. The relevance of this mechanism is determined o=s by parameter ν.6 I assume that firms consider the stock of immigrants Nk,` as exogenous such that the effect on the MNE costs can be considered as a spillover or externality that is not internalized in the profit maximization process. 4.3 Equilibrium The equilibrium in this model can be defined as a set of prices, wages, and labor allocations such that: high-skill workers optimally choose the triplet k, `, s to work for, consumers in each location ` buy goods from the cheapest producer, labor markets clear, and trade is balanced. Since both individual abilities and producer productivities are drawn from Frechet distributions, it is possible to derive tractable, closed-form solutions for migration shares, trade shares, and MNE shares. 6 Incorporating immigrants in a gravity framework through the FDI costs has been a common strategy in the literature. See for example Cuadros et al. (2019) and Aubry et al. (2018), among others. All of my qualitative results still hold if we consider ν = 0. I incorporate this feature to help match the magnitude of the MNE employment elasticities to own-country migration estimated in Section 5. 15 The fraction of workers from origin o who choose to migrate to location ` and work for industry k with source technology s can be written as in equation 12: mig πo,k,`,s Ao,k =P `0 ,s0 ,k0 Ao,k0 wk,`,s o εk,`,s P` κ φ−κ o,`,s κ wi0 ,`0 ,s0 o εk0 ,`0 ,s0 P` 0 φ−κ o0 ,`0 ,s0 (12) Equation 12 implies that the probability of migration from origin o to triplet k, `, s depends on the comparative of origin o in industry k (Ao,k ), the real wage per effective unit in advantage wk,`,s , the migration cost from o to `, s (φo,`,s ), the random origin-specific term triplet k, `, s P` o εk,`,s , and a combination of these terms for all other triplets, captured by the denominator in equation 12. Consumers choose the pair `, s from which to buy each variety within each industry. Given the properties of the Frechet distribution, it is possible to write the share of goods bought from pair `, s by consumers in n as in equation 13: k −θ k (τ`,n ) T̃` trade πk,`,n =P k −θ k `0 (τ`0 ,n ) T̃`0 (13) The trade share depends on the bilateral trade cost between production location ` and destination country n, as well as on the effective technology parameter in location `: T̃`k = P k k k −θ . T̃`k is a combination of the fundamental technologies Tsk of source couns Ts c`,s × δ`,s tries operating in ` and the marginal cost for a producer with source s to operate in `. The overall marginal cost is a combination of the marginal cost of production ck`,s and the MNE k . iceberg cost δ`,s Finally, it is possible to write the share of production in ` in industry k that is done by MNEs from country s as in equation 14: mne πk,s,` =P k Tsk ck`,s × δ`,s s0 −θ k Tsk0 ck`,s0 × δ`,s 0 −θ (14) Appendix C shows the complete equilibrium equations including trade balance, labor, and product market clearing conditions and the cost functions. To solve for the equilibrium in the model, I use the approach suggested by Dekle et al. (2008) and solve the model in proportional changes. This method, also called the exact hat-algebra method, allows me to re-write the equilibrium equations as changes between the real and the 0 counterfactual scenarios. That is, I can rewrite each variable x as x̂ = xx where x is the variable under the real scenario and x0 is the value of the variable under the counterfactual. A key advantage of this method is that it allows me to understand more transparently how an 16 exogenous change in, for example, migration costs to the US φ̂o,U S,s > 1, affect other endogenous variables of the model. I rewrite all equilibrium equations in proportional changes in Appendix C.1. As an example, it is possible to re-write the migration share (equation 12) as in equation 15: mig π̂o,k,`,s mig 0 π = mig = P π Âo,k k0 ,`0 ,s Âo,k0 ŵk,`,s o ε̂k,`,s P̂` κ φ̂o,`,s −κ κ ŵk0 ,`0 ,s0 o mig ε̂ φ̂o0 ,`0 ,s0 −κ πo,k,`,s k0 ,`0 ,s0 P̂ 0 (15) ` As shown by equation 15, this approach allows me to classify each object of the equilibrium into four categories: Endogenous variables such as ŵk,`,s , P̂` , fundamental parameters such as mig . κ, exogenous parameters such as φ̂o,`,s , ε̂ok,`,s and Âo,k , and data on observed allocations πo,k,`,s The model includes many exogenous parameters such as migration costs φo,`,s , trade costs τk,`,n , MNE costs δk,s,` , fundamental technologies Ts,k , worker comparative advantages Ak,o and labor shares ψk,`,s but they are assumed to stay constant between the real and the counterfactual such that x̂ = 1. The counterfactual scenario involves changing just some of the exogenous parameters and evaluating how the endogenous variables respond. This strategy helps me avoid having to calibrate all parameters and just focus on six key elasticities that govern the responses of the endogenous variables: κ the elasticity of migration and labor supply, λ the elasticity of substitution between high-skill domestic and foreign effective units of labor, ι the elasticity of substitution between source-country and other foreign workers, α the elasticity of substitution between college and non-college workers, ν the elasticity of MNE costs on immigration and θ the trade and MNE elasticity. Those elasticities together with data on observed allocations are enough to compute the changes in the endogenous variables of the model. While I also need data on the observed migration, trade shares, MNE shares, and labor allocations, I do not need to take a stand on any other parameters of the model, which greatly reduces the number of parameters to be estimated. 5 Reduced Form Evidence Before discussing the estimation procedure, I will show the relationship between high-skill immigration and MNE activity proposed by the model is supported by the data. The predictions of the model can be summarized by using equation 14. First, I take the ratio of the share of US production in industry k by MNEs from source-country s relative to the share of US production in k by US firms. After taking logs and reorganizing terms, we get to equation 16. I add time subscripts t since I will use time variation in the subsequent estimation and remove ` subscripts since I am only looking at production in the US. 17 Ln mne πk,s,t mne πk,U S,t ! = Ln k k cks,t δs,t Ts,t TUk S,t −θ ! −θ ckU S,t k k → Ln(Xk,s,t ) = Ln(Ts,t ) − θLn(cks,t δs,t ) + Φk,t (16) Where Xk,s,t is the production of an MNE from s operating in the US in industry k, time t. −θ Φk,t = TUk S,t ckU S,t is an industry-time fixed effect that summarizes the US specific variables that are common to all s 6= U S in k, t. By inspecting equation 5, there are two channels through which an influx of immigrants from s would affect MNE production by s in the US. First, more immigrants from s will decrease the cost of hiring source-country workers by lowering s wages wk,s,t and expanding the market share of MNEs from s. Second, immigrants from s can k lower the MNE cost δs,t , further expanding s operations in the US. To check whether the data corroborate this positive relationship, I want to estimate equation 17. 7 Ln(Employments,k,t ) = Φk,t + Φs,k + βLn(N Immigrantso=s s,k,t ) + s,k,t (17) Following structural equation 16, I control for industry-time fixed effects (Φk,t ). I also include source-industry fixed effects (Φs,k ) to capture the time-invariant component of comparative advantage where MNEs from certain s historically specialize in specific industries. Estimating equation 16 by OLS would yield biased estimates of β, and the direction of the bias is unknown. On one hand, the error term contains time-varying shocks to comparative advantage that can simultaneously increase total employment and number of immigrants. This would cause β to be upward biased. On the other hand, the error term also contains time-varying shocks to s k . MNEs might demand fewer source-country immigrants cost parameters such as ψk,s,t or δ̄s,t as they increase their US operations since their US workforce learns how to communicate with the parent company over time. This would lead to a negative correlation between employment and immigrants, biasing β downwards. I proceed to construct a supply-push instrument in the spirit of Card and Lemieux (2001). As shown in equation 18, I use the share of immigrants from o = s who work in industry k in the US in year 2001 (the earliest in the H-1B data) and interact it with the number of immigrants from s that come to the US to work for US companies in industries other than k. The identification assumption is that time-varying shocks to comparative advantage or production function of MNEs are independent from the instrument. Equation 18 aims to capture changes in labor supply, such as changes in migration policy in the US or preference shocks of migrants, that are uncorrelated with demand-level shocks that affect MNE employment in the US. 7 I use total employment instead of revenues because the data coverage at BEA has fewer censored values when looking at employment. I combine yearly total US employment data by source-country and industry from the BEA with number of yearly visas by source-country and industry from my H-1B data. Details on the dataset construction can be found on Appendix D. 18 Shift Shares,k,t o=s N Immigrantsk,2001 0 = o=s × (N Immigrants from o = s, s = U S, k 6= k, t) N Immigrants2001 (18) Table 2 shows the estimates for β under OLS and 2SLS. Looking at column 2, an increase of 1% of immigrants from o = s working for s, k, t causes an increase of 1.71% in total employment for MNEs from s operating in industry k, time t. Column 3 excludes Indian companies from the analysis and shows that the effects are still positive albeit smaller than with the full sample. Indian companies have been the poster child for MNE activity driven by immigration, with US companies bringing migrants from India during the 1990s and Indian tech companies starting operations in the US in the late 2000s predominantly using migrant labor from India. It is reassuring that the results still hold even when excluding the Indian case. Table 2: Dependent Variable: Log US Employment of MNEs from s in k, t. (1) Ln (Imm from o = s in s,k,t) N obs Estimation 1st stage F-stat Sample (2) 0.07 1.71*** (0.055) (0.503) 616 OLS All 616 2SLS 14.4 All (3) 0.70* (0.347) 575 2SLS 13.2 Excluding India ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. Industry-Source-Time level regression. Years 2002-2014 included. Standard errors clustered at the source-country level. Industry-time and Source-Industry fixed effects included. Last column excludes observations for Indian MNEs. I proceed to discuss the validity of the shift-share instrument by discussing potential concerns with the time-varying component and the initial share component of the instrument. First, the instrument could be invalid if the time-varying component is correlated with other shocks happening at the source-country level that affect MNE employment growth in the US. An example could be trade agreements between s and the US that increase both migration from s in all industries and MNE activity from s. To control for this possibility, in Appendix Table 17, I show the results are robust to controlling for a source-time fixed effect instead of an industry-source fixed effect. If I add both industry-source and time-source, the estimates are still robust, although the second stage loses power given the reduced variation allowed by saturating the regression with fixed effects. As additional checks, I show that the estimates are robust to adding time-varying controls that are likely to be correlated with unobserved source-time shocks that affect MNE activity from s such as import share from s, total exports 19 of s, and total GDP from s. Second, I discuss the validity of the initial share component of the instrument. Following Goldsmith-Pinkham et al. (2019), it is important to establish that the initial share is not correlated with other observables that also predict the growth of the second stage outcome, the MNE employment by country s in industry k in the US. To test for such correlation, I run a regression of the initial share and the observed MNE employment growth on several observables as shown in Appendix Table 18. The industry share of exports from country s and the distance between s and the US are the main covariates that predict MNE growth from s in the US. These covariates are not significantly correlated with the initial immigrant share used in the instrument, which reinforces the validity of the shift-share approach. Finally, a third concern is that there might be a mechanical correlation between employment and immigration numbers, since immigrants also count as employees in total employment. For nonIndian companies, the number of source-country immigrants is sufficiently small to not make a difference on the estimates if we exclude the number of immigrants from total employment figures. On the other hand, Indian companies do predominantly employ Indian immigrants as their employees, so this mechanical relationship becomes relevant for India. Hence, it is important to keep in mind that part of the Indian disproportionate response when compared to other countries is likely driven by a direct increase in total employment. As a second set of results, I consider a broader definition of immigrants that can impact MNE activity. If the reason that an influx of immigrants from the source country expand MNE activity is because of networks that facilitate production in the US, we could think that not only immigrants employed by MNEs from s could contribute to this effect. Instead I consider as an explanatory variable all immigrants from o = s who work in industry k as a potential driver of MNE activity as shown in equation 19. I use the instrument in equation 18 as well, since the exogeneity and relevance arguments of the supply-push instrument also hold in this case. Ln(Employments,k,t ) = Φk,t + Φs,k + βLn(N Immigrantso=s k,t ) + s,k,t (19) Table 3 shows that on average, a 1% increase in the number of immigrants from s working for industry k, increases MNE employment from source-country s, industry k by 1.34%. Once again, excluding India gives smaller but consistent results. The new explanatory variable does not require information on the source-country of the MNE, just origin and industry of the workers. This allows me to estimate the same regression using the American Community Survey (ACS), which represents the stock of immigrant workers in the US instead of the flow (given by the H-1B data).8 The advantages of validating the estimates with the ACS are 8 The construction of the instrument with ACS data is slightly different than in equation 18. I calculate the initial share using the 1990 sample and include in the regressions observations between 2000-14. 20 twofold. First, columns 3 and 4 of Table 3 show the estimates for the stock are very similar to those using the flow. Second, it allows me to run the same regression using only non-college graduates. The estimates in column 5 show that an influx of non-college graduates from the source country does not significantly affect MNE activity. Such findings reinforce the decision of focusing exclusively on the immigration of college graduates when studying the link between immigration and MNE activity. 9 Table 3: Dependent Variable: Log US Employment of MNEs from s in k, t. Broader definition of Immigrants. (1) (2) (3) (4) (5) 0.84** (0.35) 1.94*** (0.68) 1.28 (1.06) 0.13 (0.65) 1097 152.0 1047 8.8 2730 19.0 2635 17.0 2510 7.0 H-1B All H-1B no India ACS college ACS - college, no India ACS Non-college Ln (Imm from o = s in k,t) 1.34*** (0.21) N obs 1st stage F-stat Sample ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. 2SLS estimates. Industry-Source-Time level regression. Years 2002-2014 for H-1B, 2000-2014 for ACS. Standard errors clustered at the source-country level. Industry-time and Source-Industry fixed effects included. Columns 2 and 4 exclude India. Column 5 uses only non-college graduate immigrants. The explanatory variable Ln (Imm from o = s in k,t) considers all immigrants from s working in the US for industry k (not just those working for s). 9 This is also consistent with the findings of Cho (2018), who shows Korean MNEs disproportionately hire Korean migrants only for managerial occupations, which are high-skill intensive. 21 6 Estimation In this Section, I proceed to describe the estimation strategies for the six key elasticities in the model. Labor supply parameter: κ. Production function elasticities: α, λ, and ι. MNE cost spillover: ν and the trade elasticity θ. While the trade elasticity θ is an important parameter, it has been estimated in several papers in the literature and is not the key contribution of this paper. Thus, I just use the value of θ = 4 as estimated by Simonovska and Waugh (2014). I will use the H-1B data to estimate κ and ι, set λ and α according to values estimated in the literature. Finally, I will calibrate ν to match the reduced-form results from Section 5 as explained below. 6.1 Labor supply elasticity κ I use an instrumental variable approach that exploits “trade shocks” across source countries and industries to estimate labor supply elasticity κ. As defined in Section 4.1, κ, has two main interpretations. First, it governs the dispersion of productivities, with higher values of κ implying either lower dispersion between draws (high κ̃) or high correlation among the draws (high ρ). Second, it can be interpreted as the labor supply elasticity, as it captures the response of relative migration flows and relative labor supply to changes in relative wages and migration costs. Following Bryan and Morten (2018), and using properties of the Frechet distribution, it is possible to write the conditional expectation of abilities as in equation 20. Γ̄ is the Gamma 1 . Equation 20 implies that as the share of workers from o that function evaluated in 1 − κ(1−ρ) chooses triplet z = {k, `, s} increases, the average ability of those choosing z decreases. 1 1 mig − κ κ E ηzi,o |i chooses z = Ao,k πo,z Γ̄ (20) A similar logic can be used for calculating the average wages that workers choosing z receive. x o i,o x Suppose individual i chooses z and gets a wage: wagei,o z = wz εz ηz , where wz is the equilibrium wage per effective unit paid to those who choose triplet z and the superscript x indicates whether the workers are hired by an MNE with s = o or if they are hired just as other foreign workers. εoz is a mean one log normally distributed random term that captures random shocks that make workers from o more productive at triplet z. Using the results in equation 20, it is possible to calculate average wages as in equation 21. 1 x o mig − κ Γ̄ E wagei,o z |i chooses z = wz εz πo,z (21) By taking logs and reorganizing terms, we get estimating equation 22. Using the US H-1B data, it would be possible to estimate equation 22 by using data on average wages and employment 22 at the industry-source-orign-time level and controlling for k-s-t-x fixed effects and o-k-t fixed effects.10 1 1 Ln wageoz,t = Ln (wzx ) − Ln (No,z,t ) + (Ln (Ao,k,t ) Ln (No,t )) + Ln εoz,t | {z } κ {z } | {z } |κ k-s-t-x FE o-k-t FE (22) error term Estimating equation 22 by OLS would yield biased estimates of − κ1 . The random term εoz,t positively affects average wages as well as the number of immigrants from o choosing z, No,z,t , biasing the estimate of − κ1 upwards. To identify this parameter, it is possible to construct k demand shocks that capture changes in the production comparative advantage (Ts,t ) of source o country s in industry k that are independent of time-specific productivity shocks εz,t experienced by origin o immigrants. I draw from the literature on trade shocks started by Autor et al. (2013) and construct a shift-share instrument that interacts the share of workers from o that choose triplet z in 2001 with the share of imports from non-US countries in industry k that come from country s.11 Shift Sharez,o,t = mig πo,z,2001 × Exports of k from s to ` 6= U S in t Exports of k to ` 6= U S in t (23) As shown in Table 4, the 2SLS estimates are consistent with the direction of the bias of OLS. The estimated value of κ is 6.17. As defined before, κ is the convolution of the true dispersion parameter κ̃ and the correlation among draws ρ. In Appendix E.1 I explain how it is possible to use the observed dispersion in wages to separately identify κ̃ and ρ. I estimate κ̃ = 2.08 and ρ = 0.66. While in a very different context, such estimates are consistent with Hsieh et al. (2019) who use κ̃ = 2 and Bryan and Morten (2018) who find a κ̃ = 2.7 and a somewhat larger correlation of 0.9. I also show that if I solely use the dispersion in wages to estimate κ, I get an estimate of κ = 8.28, which I will use to bound the results in the robustness Section.12 10 Since I will use time variation for estimation, I add a time subscript t. I drop Γ̄ for exposition purposes. x stands for a dummy variable that takes the value of 1 if s = o and 0 if s 6= o. Since the data are only for the US, N mig I do not consider the ` subscript, which is common for all observations. I use the property that πo,z = No,z,t o,t 11 Ideally, I would use MNE flows from s-k to other countries to construct the comparative advantage shocks. However, information is somewhat limited for non-US MNE flows for sufficiently disaggregated industry groups k and countries. In the model Ts,t represents comparative advantage in k for both trade and MNE, such that trade flows from s to other countries should also capture comparative advantage shocks. 12 Using the observed wage dispersion has been used in the EK-Roy literature to estimate the supply elasticity such as in Lagakos and Waugh (2013), Hsieh et al. (2019), and Lee (2019). 23 Table 4: Estimates of equation 22 o Ln Nk,s,t N obs Implied κ 1st stage F-stat OLS 2SLS -0.031*** (0.0068) -0.162*** (0.053) 2534 32.26 2534 6.17 20.0 ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. o-k-t FE and k-s-t-x FE included. Years 2002 to 2014 used for estimation. To minimize measurement error in average wages I pooled years in pairs (02+03, 04+05,...) and cells with fewer than 5 visa petitions were dropped. Standard errors clustered at the year, industry, and source country. 6.2 Production function parameters α, λ, and ι I set the elasticity of substitution between college and non-college workers, α = 1.7, based on an average of different papers that estimate that parameter such as Katz and Murphy (1992), Card and Lemieux (2001), and Goldin and Katz (2007). The aggregate elasticity using α = 1.7 is indistinguishable from 1.7. For the elasticity between effective units of high-skill domestic and foreign labor λ, I set λ = 13.1 to match the aggregate elasticity of 12.6 as estimated by Ottaviano and Peri (2012) for college graduates. Burstein et al. (2018) also find a high elasticity of substitution of 10 for between domestic and foreign workers. I proceed to estimate ι, the elasticity of substitution between source country and other foreign effective units. It is possible to rework the first-order conditions of the components in equation 8 to get to equation 24: Ln wage billsz,t wage billfz,t ! = (1 − ι)Ln s wz,t f wz,t ! + ιLn s ψz,t f ψz,t ! (24) Equation 8 implies that for an MNE from source s, the ratio of the wage bill spent on sourcecountry workers relative to the wage bill spent on other foreign workers is a function of the ratio of effective wage paid to source-country workers relative to the effective wage paid to other foreign workers.13 If one were torun this regression by OLS, two main issues would arise. First, the effective wage ratio Ln s wz,t f wz,t is not observed in the data, as these are wages paid per effective unit. Second, even if the ratio of effective wages was observed, unobserved productivity shocks would likely bias the coefficient upward, as we would be confounding supply 13 Once again I add time-subscript t since I will use multiple years of data for estimation. Also, z stands for a given triplet k, `, s. 24 and demand. I proceed to estimate this parameter in two steps. In the first step, I use the estimated value of κ and data on average wages and employment to back out the implied ratio of effective wages in equilibrium. In a second step, once I have the explanatory variable, I use an instrumental variables approach to identify ι. In Appendix E.2, I explain how it is possible to use equation 21 and the estimated value of ws κ = 6.17 to back out the implied effective wage ratio wz,t f . z,t I estimate equation 24 by using the foreign MNEs in my H-1B data and run a firm-level regression, using the log ratio of the wage bills of source and foreign workers as the dependent variable, and the log ratio ofthe effective wages estimated in equation 47 as an explanatory variable. The term Ln s ψz,t f ψz,t is considered part of the error term. I also add time-industry fixed effects to control for time-specific industry shocks. Since the error term includes the ψs preference for source workers relative to other foreign workers ψz,t f , I need an instrument to z,t consistently estimate equation 24. The instrument should shift supply but be uncorrelated with unobserved demand shocks in order to identify the demand parameter 1 − ι. I propose two instruments that use very different sources of variation to estimate ι. First, I use the shift-share instrument proposed in Section 5, equation 18. The shift-share instrument captures the supply push of immigrants from country s into the US that is independent of time-specific demand shocks experienced by companies from s in the US and is negatively correlated with the ratio of effective wages. Second, I use the log GDP per worker in country s as a proxy for average wages in country s. This is a valid instrument because the wage in the origin country is one of the main predictors of migration flows, as shown by Grogger and Hanson (2011) and Docquier et al. (2014), thus a change in the wage level in the origin country is a good predictor of the migration cost. The migration cost is directly related to the supply curve but is not correlated with demand shocks in the US that affect the ratio of effective wages between source and other foreign workers, which makes it a good instrument for the relative effective wages in the US. The OLS and 2SLS results of equation 24 can be found in Table 5 and results are consistent with what we would expect. OLS results are upward biased, since they predict a ι lower than one and not significant. When instrumenting for the effective wages, the estimated ι is 2.84 using the shift-share instrument and ι = 6.84 using the log of the GDP per worker. I will use ι = 2.84 as my baseline value and show robustness with ι = 6.84. 25 Table 5: Estimating equation for ι Log wage effective units ratio N obs Implied ι 1st stage F-stat OLS 2SLS 2SLS -0.0014 (0.201) -1.84*** (0.656) -5.84** (2.58) 1,750 1.00 1,750 2.84 48.67 1,750 6.84 25.71 ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. Controlling for time-industry fixed effects. Standard errors bootstrapped with 250 repetitions and clustered at the time-source level. Indian companies are removed from the sample as the small number of non-Indian workers they hire distort the relative wage ratios needed for estimation. 6.3 Spillover parameter ν Finally, I use indirect inference to estimate the MNE cost spillover parameter ν. When the number of workers from s in industry k, country ` increases by 1%, the MNE iceberg cost for companies from s in `, k decreases by −ν%. According to the estimates in Table 3, a 1% increase in the number of immigrants from s in k increases MNE employment by 1.34%. Such number decreases to 0.84% when excluding Indian companies. I calibrate two separate ν, one for India and another for all other source countries such that it matches the average MNE employment response of 1.34% and 0.84%, respectively. This yields νin = 1.25 and νs = 0.27. 6.4 Implementation To implement the model I need to make some simplifications, since data on the stock of native and immigrant workers across countries and industries is less detailed for non-US countries. First, I assume the world is composed of six regions: United States, Canada, Western Europe, India, China-Taiwan, and the Rest of the World (RoW). I also assume there are only three industries: Professional and Technical Services, which mainly includes the IT sector and consulting; high-skill intensive manufacturing, which includes Chemicals, Machinery, Computer, Electronic, Electrical Equipment and Transportation manufacturing; and a third sector that includes everything else in the economy. I separate industries this way to focus on the implications for industries that have a high dependence on high-skill migration and where MNEs in the US are predominantly concentrated. I also impose additional restrictions on MNE production and migration. All sectors engage in international trade and hire domestic and foreign workers, but I only allow for MNE activity in IT and high-skill manufacturing sectors. I restrict migration decisions such that workers cannot migrate to India, China-Taiwan, or RoW unless they were born there. This captures 26 a salient feature of the data where the main receiving countries for high-skill migrants are the US, Canada, and countries in Western Europe. I set θ = 4, α = 1.7, κ = 6.17, λ = 13.1, ι = 2.84, νin = 1.25 and νs = 0.27 consistent with the baseline parameters estimated in Section 6. Finally, the estimation of the model requires me to use data on observed trade shares by industry, MNE shares by industry, migration shares from each origin o to each triplet k, `, s, and skill shares for domestic, source country, and other foreign workers for each triplet k, `, s. In Appendix F I explain how I construct the dataset to run the counterfactual exercises. While the data on migration and skill shares for the US can be constructed using a combination of the ACS and my H-1B dataset, the data availability for migration in Canada and Europe is limited. In Appendix F, I also explain how I impute some of the data for those regions using the US data together with additional datasets on global migration and industry employment. Finally, to calculate the equilibrium I need to impose a normalization. I follow Allen et al. (2018) and impose that World output stays constant as in equation 25. This normalization implies that the output results should be interpreted as how the endogenous variables change as a share of total World output. Xus + Xin + Xca + Xeu + Xch + Xoth = X̄ 7 (25) Counterfactual exercises In this Section, I use the model to run two main counterfactual exercises that help quantify the link between high-skill migration, MNE activity, and the location of production. As my model is expressed in changes between the observed equilibrium and the counterfactual equilibrium, I can feed a given change to the model and calculate how the endogenous variables such as output and welfare respond to such change. As a first exercise, I introduce the shock of increasing the migration cost to the US to evaluate the long-term implications of a more restrictive highskill immigration policy. In a second counterfactual, I introduce the shock of increasing MNE barriers and use the model to understand how does modeling migration affects the quantification of the welfare gains of MNE production. 7.1 Counterfactual 1: The implications of a more restrictive migration policy As a first counterfactual exercise, I study how the location of high-skill industries and welfare would change in the long term if the US implements a more restrictive migration policy. To 27 facilitate the interpretation of the quantitative results, I will change the immigration cost from every country to the US such that it reduces the total stock of high-skill immigrants by 10%. A 10% decrease is consistent with a 0.95% decrease in the college-graduate workforce in the US and a 0.3% decrease in total US workforce. Despite the apparent rationing feature of the US immigration system, the Frechet assumption captures well the positive selection of immigrants which it is crucial to characterize the US immigration system. Given abilities being distributed Frechet, as the migration cost increases the total number of immigrants decreases and those who still migrate to the US will be positively selected following the intuition of equation 20. For the H-1B program, the number of visas is rationed through a lottery and those who win the lottery might not necessarily be of higher ability than those who lose the lottery. However, I will argue that in the medium run, the US immigration system is well characterized with a positive selection feature as in this model. First, employers pay a fee for applying for H-1Bs, indicating that those sponsored for an H-1B are positively selected among all origin-country college graduates. Second, if a worker with sufficiently higher ability loses the lottery, there are alternative strategies to come to the US, such as getting an L-1, getting sponsored for a green card, or finding a job at a non-lotterysubject company. Third, even if the worker loses the lottery, employers can apply for a new visa again in subsequent years. Finally, workers who receive graduate degrees from US institutions have a higher likelihood of getting accepted in the lottery. Overall, such features suggest the US high-skill immigration is selective and if the number of immigrants were to be reduced, those still migrating would be positively selected. As a first set of results, Table 6 summarizes how the increase in migration costs to the US affects the total revenues generated by each sector-country pair relative to World output. I present the results for the baseline case and the case where there is no MNE cost spillover (ν = 0) for comparison. High-skill industries in the US decrease their output more than the residual sector. Production in all other regions increases as a result of US migration restrictions. The IT and professional services sector would grow by 0.63% in India and 0.16% in Canada, while the high-skill manufacturing sector would also grow the most in India (0.43%) and Canada (0.96%). These results reaffirm the notion that a restriction to high-skill migration will predominantly affect high-skill industries and total economic activity in the US is expected to decrease as a result of such policies. The spillover amplifies the production response and results are qualitatively similar except in Canada, where MNEs in manufacturing account for 73% of production, so their increase in production is much larger when there is a spillover. 28 Table 6: Change in production by country and industry baseline US India Western Europe Canada China-Taiwan Rest of the World no spillover IT and Prof. High-Skill IT and Prof. High-Skill Other Other Services Manufacturing Services Manufacturing -0.53% -1.70% -0.40% -0.39% -0.43% -0.34% 0.63% 0.11% 0.16% 0.14% 0.10% 0.43% 0.24% 0.96% 0.24% 0.27% 0.20% 0.09% 0.01% 0.12% 0.07% 0.60% 0.08% 0.16% 0.10% 0.07% 0.29% 0.06% 0.09% 0.11% 0.06% 0.20% 0.07% 0.10% 0.09% 0.06% Percent changes in production output from increasing migration cost such that the total stock of migrants decreases by 10%. Change relative to World output. Foreign MNEs in the US disproportionately contribute to such output decline relative to their size because of their greater intensity in migrant labor. While this is true for both with and without the spillover, results are much larger for the case with the spillover by construction. As shown in Table 7, both in high-skill manufacturing and IT, foreign MNEs operating in the US experience an output drop larger than US-based companies. The contrast is particularly big for Indian and Chinese IT firms in the US, whose output would drop by 29.55% and 9.22%, respectively. While foreign MNEs are more intensive in foreign workers than American companies, they also have a particular dependence on foreign workers from their own source country. It makes sense then that companies from countries where labor is cheaper are the one who have the biggest hit. 29 Table 7: Production of MNEs in the US by source country baseline no spillover IT and Prof. High-Skill IT and Prof. High-Skill Services Manufacturing Services Manufacturing US -0.03% 0.97% -0.36% -0.40% India Western Europe Canada China-Taiwan -29.55% -9.14% -8.57% -9.22% -26.32% -8.00% -7.40% -7.83% -3.06% -0.70% -0.70% -1.20% -0.93% -0.51% -0.51% -0.79% Percent changes in revenues from increasing migration cost such that the total stock of migrants decreases by 10%. Change relative to World output. Even without the spillover effect, MNEs disproportionately drive the drop in production. Foreign MNEs account for 4.8% of production in the US IT sector, but account for 11.1% of the total drop in US IT output caused by the migration restriction. Similarly, in the High-Tech manufacturing sector, foreign MNEs account for 29.2% of production but are responsible for 38.0% of the drop in revenues. MNEs account for almost 100% of the drop in production once we include the spillover effect, as they become even more sensitive to reductions in immigration. Appendix G shows the decomposition of the domestic and foreign MNE contributions in each US industry. While the drop in production is a relevant channel through which migration restrictions affect real wages for US natives, there are some workers who gain from such restrictions. As shown in Table 8, high-skill workers would experience an increase of 0.09% in their real wages due to the migration restriction. When there are fewer migrants, firms substitute the missing foreign workers with natives pushing up the US native wage. Low-skill workers on the other hand would see their real wages decreased by 0.35% given their complementarity with high-skill workers. Aggregating across skill types, total real wages for US workers would decrease by 0.22% when migration is restricted. Real wage is calculated as the average wage for each group divided by the price index. A restriction in migration affects real wages predominantly through changes in wages as shown in column 2 of Table 8. 30 Table 8: Change in Real Wages and Compensating Variation Real Wages Wages Compensating Variation (percent change) (percent change) (per native worker) High-skill natives Low-skill natives Total US natives 0.09% -0.35% -0.22% 0.11% -0.33% -0.20% -60 108 57 Percent changes from increasing migration cost such that the total stock of migrants decreases by 10%. Real wages are calculated as average wage divided by the price index. Compensating variation is the dollar value each worker would need to be compensated to leave their utility the same after restricting immigration Finally, to put these numbers into context I calculate the compensating variation for low and high-skill workers. The compensating variation is the amount of income that workers need to be compensated in the counterfactual to hold their utility levels as in the real. Each low-skill worker in the US would need to be compensated by $108 each year while the gains for each high-skill worker amounts to $60. Overall, restricting high-skill immigration by 10% would cause each US worker to lose on average $57 per year once the economy reaches the steady state. The total loss for US natives would amount to almost $8 billion per year. 7.1.1 Mechanisms and Robustness The baseline results presented in Section 7.1 are a product of multiple mechanisms incorporated to the quantitative model that link migration to production and welfare. In this Section, I proceed to disentangle each mechanism to show how are they driving baseline results. First, as noted in the production results above, the spillover of immigrants on MNE costs significantly amplifies the magnitude of the results. In Table 9, I compare how the results affect real wages for different values of ν. With no spillover, qualitative results hold, but the effect of restricting immigration drops from -0.22% to -0.13%. If we assume a common, large spillover such as ν = 0.9, restricting immigration would generate losses for both low- and high-skill workers as the exit of high-productivity MNEs would reduce labor demand and increase prices in the US, harming all workers. 31 Table 9: Changes in the spillover parameter νin = νs = 0 νin = 1.25, νs = 0.27 νin = νs = 0.9 (no spillover) (baseline) (same spillover) High-skill natives Low-skill natives Total US natives 0.17% -0.26% -0.13% 0.09% -0.35% -0.22% -0.03% -0.48% -0.34% Percent changes from increasing migration cost such that the total stock of migrants decreases by 10%. Real wages are calculated as average wage divided by the price index. A large part of the literature on the effects of immigration has used implicitly or explicitly closed-economy models (Bound et al., 2018; Burstein et al., 2018; Docquier et al., 2014). One of the contributions of this paper is incorporating both trade and MNE activity as channels through which production relocates as a consequence of restricting immigration. In Table 10, I compare the welfare effects of the baseline model with alternative models that remove trade and MNE activity to understand how they drive the baseline result. Column 3 compares the baseline and the no-spillover model with a model that does not include MNE production. Such model is equivalent to a multi-country, multi-industry Eaton and Kortum (2002) model that allows for migration. The data used assume all companies producing in the US are domestic companies, so their intensity in hiring migrants is the one observed for US companies. The model without MNE production understates the real wage losses by 45% (-0.22% vs -0.12%) since it no longer accounts for the role of immigration in bringing more productive foreign companies’ which lower the price index in the US and increase real wages. When comparing it to the no spillover model, the model with no MNEs understates real wage losses by 6% (-0.13% vs -0.12%). In this case, the MNE channel is not as big when looking at the aggregate effects since MNEs account for a fairly small share of total production in highskill industries and the estimated elasticity of substitution between natives and foreign workers is high. However, as shown in Section 7.2, the migration channel does have a large impact in the welfare gains that stem from allowing MNE production regardless of the spillover. Column 4 looks at an alternative model where MNE activity is allowed but trade costs are prohibitive such that there is no trade. Restricting immigration generates a larger real wage loss under the model with no trade and also lowers the gains for high skill workers. When no trade is allowed, production does not relocate and consumers have to buy goods produced in the US, which without immigration become more expensive than when trade is allowed. The overall welfare loss under no trade is 15.7% higher than in the baseline model. 32 Table 10: Understanding mechanisms - Trade and MNE Baseline High-skill natives 0.09% Low-skill natives -0.35% Total US natives -0.22% no spillover No MNE No Trade 0.17% -0.26% -0.13% 0.17% -0.24% -0.12% 0.06% -0.39% -0.25% Percent changes from increasing migration cost such that the total stock of migrants decreases by 10%. Column 1: Baseline results. Column 2: Model with no spillover. Column 3: Model with no multinational activity. Column 4: Model with no international trade. As a second set of mechanisms, I look into how the results for real wages change for different values of the key elasticities. Appendix G shows that very low values of λ or high values of α could lead high-skill workers to lose from restricting immigration. However, total real wage losses have a similar magnitude among plausible values for the elasticities. 7.2 Counterfactual 2: Migration and the welfare gains of MNE production Restrictions to migration have big consequences on the activity of MNEs in the receiving country. To understand the aggregate implications of such result, I explore how the welfare gains from MNE activity are affected by incorporating migration into the model. A vast literature in international economics has used quantitative models to measure the welfare gains from trade by looking at the change in welfare when going from autarky, where trade costs are assumed to be very large such that trade is prohibitive, to the observed trade flows in equilibrium. Similarly, for MNEs, the welfare gains from MNE production are the welfare change when going from an equilibrium where MNE costs are very large (MNE autarky) to an equilibrium where MNE flows are as in the data. Given my estimated model, a contribution of this paper is to show that incorporating high-skill migration as an additional mechanism into a quantitative MNE model has significant implications for the distributional welfare gains across workers with different skills. k A sufficiently large change in the MNE costs δ̂s,` is fed into the model such that MNE flows go from the observed values in equilibrium to 0.14 By calculating how welfare changes between an “MNE autarky” situation and the observed equilibrium, we can calculate the gains from MNE production. As shown in the first column of Table 11, both low- and high-skill workers benefit from MNE production in high-skill industries. Such finding is intuitive since MNEs that move to the US bring new and more efficient technologies to produce some varieties domestically, 14 k Note that the spillover does not play a role anymore, since I am exogenously changing MNE cost δ̂s,` . 33 lowering prices and increasing overall production and welfare. A second finding shown in column 1 is that high-skill migration to the US would increase by 8.69% when allowing for MNE production, reinforcing the idea that MNEs have a larger intensity for migrants. Column 2, shows how the gains from MNE change when we consider a model with no migration closer to those used in the literature of MNE production. The model with no migration assumes the high-skill labor supply of each country is not mobile across countries but still allows for reallocation across sectors. The data used in this alternative model just considers the total high-skill workers in each country in the observed equilibrium, treating all of them as native workers. As shown in column 2, the total welfare effects of MNE production remain almost unchanged in the model with no migration when compared to the baseline. The model with no migration overestimates the welfare gains of MNE production by only 3.17%. Interestingly, the channel of migration does matter to quantify the distributional gains of MNE activity between low- and high-skill workers. A model with no migration would overestimate the gains from MNE production by 34.92% while underestimating the gains for low-skill workers by 8.15%. When we allow for MNE production, high-skill MNEs bring better technologies that improve welfare but at the same time increase the number of high-skill migrants. Since highskill migrants compete directly with native high-skill workers, they lower the equilibrium wages, which offsets the gains from MNEs. Low-skill workers, on the contrary, are complements to the high-skill migrants who join the country when MNEs are allowed. Therefore, migration contributes an additional gain toward welfare created by MNE production. Table 11: Welfare gains from MNE production Baseline No migration Relative to baseline High-skill natives Low-skill natives Total US natives 1.16% 1.42% 1.34% 1.56% 1.30% 1.38% 34.92% -8.15% 3.17% Migrants in US 8.69% 0.00% Percent changes in welfare of going from MNE autarky to the observed equilibrium. MNE autarky is the case k where MNE iceberg costs δs,` are very high such that MNE is prohibitive. Column 3 shows the welfare change in the no-migration setting relative to the welfare change in the baseline model with migration. The results in Table 11 hold when looking at the MNE gains for migrant-receiving regions such as Europe and Canada as shown in Appendix H. For migrant-sending countries such as China and India, the direction of the results with and without migration are the opposite. In the model with migration, allowing for MNEs increases the demand for migrants in developed 34 countries, taking away high-skill workers predominantly from India, China, and the Rest of the World. This increases the positive impact for high-skill workers who stay as they face lower competition from the migrants that leave. The model with no migration would therefore understate the MNE gains for high-skill workers in developing countries and overstate the gains for low-skill workers in developing countries. Appendix H also shows these results are robust for different values of the elasticities. 8 Discussion The results presented in this paper have useful implications for immigration policy in the United States. A reduction of 10% in the stock of migrants would cause a total loss of 8 billion USD for the US economy, driven by a $10 billion loss for low skill workers and a $3 billion gain for high-skill workers. The interrelation between MNEs activities and immigration is crucial to consider when designing policies that aim to attract FDI into the country since restrictions in immigration will be likely to mitigate the inflows of MNE activity. While this paper focuses on high-skill migration, an important policy question is how the results would change if I were to incorporate low-skill migration as well. Restricting lowskill immigration is expected to have effects that mirror the results in this paper, improving welfare for low-skill natives and reducing welfare for high-skill natives. The net welfare resulting from the restriction of both types of migration is hard to predict a priori without data on low skill immigration and the appropriate elasticities of supply and substitution, which might be different for low-skill workers. The effects of immigration on MNE activity on the other hand, are expected to be different for low- and high-skill immigration. As shown in Section 5, the entry of MNEs is likely to have a stronger link to high-skill immigration than low-skill immigration. The findings of this project open the door to future research on the relationship between MNE activity and immigration. A natural first next step would be to study the dynamic implications resulting from the transfer of migrants within a firm as a vehicle for knowledge diffusion and technology transfer. The use of dynamic models to understand how MNEs adjust to a shock in migration policy could help improve our understanding of the frictions MNEs face in transferring technology across countries. Second, the feature of home bias uncovered in this paper raises questions about the underlying reasons behind this empirical pattern. Future work might delve deeper into the decisions of MNEs to hire immigrant workers, and such hiring relates to the use of other production factors such as intra-firm intermediate inputs and investment in new technologies. 35 References Allen, T., C. Arkolakis, and Y. Takahashi (2018). Universal Gravity. Journal of Political Economy. Forthcoming. Alviarez, V. (2018). Multinational Production and Comparative Advantage. Journal of International Economics. Forthcoming. Antras, P., L. Garicano, and E. Rossi-Hansberg (2006). Offshoring in a Knowledge Economy. Quarterly Journal of Economics 121 (1), 31–77. Arkolakis, C., N. Ramondo, A. Rodriguez-Clare, and S. Yeaple (2018). Innovation and Production in the Global Economy. American Economic Review . Forthcoming. Aubry, A., H. Rapoport, and A. Reshef (2018). Migration, FDI and the Margins of Trade. Working paper . Autor, D., D. Dorn, and G. Hanson (2013, October). The China Syndrome: Local Labor Market Effects of Import Competition in the United States. American Economic Review 103 (6), 2121–2168. Bound, J., G. Khanna, and N. Morales (2018, May). Understanding the Economic Impact of the H-1B Program on the US. NBER book ”High-Skilled Migration to the United States and its Economic Consequences”, 109 – 175. Gordon H. Hanson, William R. Kerr, and Sarah Turner, editors. Brucker, H., S. Capuano, and A. Marfouk (2013). Education, gender and international migration: insights from a panel-dataset 1980-2010. mimeo. Bryan, G. and M. Morten (2018). The Aggregate Productivity Effects of Internal Migration: Evidence from Indonesia. Journal of Political Economy. Forthcoming. Burchardi, K., T. Chaney, and T. Hassan (2019, July). Migrants, Ancestors and Foreign Investments. Review of Economic Studies 86 (4), 1448–86. Burstein, A., G. Hanson, L. Tian, and J. Vogel (2018). Tradability and the Labor-Market Impact of Immigration: Theory and Evidence from the U.S. Working paper . Burstein, A., E. Morales, and J. Vogel (2018). Changes in between group inequality: Computers, occupations, and international trade. American Economic Journal: Macroeconomics. Forthcoming. Caliendo, L., L. Opromolla, F. Parro, and A. Sforza (2018). Goods and Factor Market Integration: A Quantitative Assessment of the EU Enlargement. Working Paper . 36 Card, D. and T. Lemieux (2001, May). Can Falling Supply Explain the Rising Return to College for Younger Men? A Cohort-Based Analysis. The Quarterly Journal of Economics 116 (2), 705–746. Cho, J. (2018, July). Knowledge transfer to foreign affiliates of multinationals through expatriation. Journal of International Economics 113, 106–117. Cuadros, A., J. Martin-Montaner, and J. Paniagua (2019). Migration and FDI: The role of job skills. International Review of Economics and Finance 59, 318–333. Dekle, R., J. Eaton, and S. Kortum (2008). Global Rebalancing with Gravity: Measuring the Burden of Adjustment. IMF Staff Papers 55, 511–540. Desmet, K., D. K. Nagy, and E. Rossi-Hansberg (2018, June). The Geography of Development. Journal of Political Economy 126 (3), 903–983. di Giovanni, J., A. Levchenko, and F. Ortega (2015, February). A Global View of Cross-Border Migration. Journal of the European Economic Association 13 (1), 168–202. Docquier, F., C. Ozden, and G. Peri (2014, September). The Labour Market Effects of Immigration and Emigration in OECD Countries. The Economic Journal 124, 1106–1145. Docquier, F., G. Peri, and I. Ruyssen (2014). The Cross-country Determinants of Potential and Actual Migration. International Migration Review 48 (09), S37–S99. Doran, K., A. Gelber, and A. Isen (2015). The Effect of High-Skilled Immigration Policy on Firms: Evidence from H-1B Visa Lotteries. Working Paper . NBER Working Paper 20668. Eaton, J. and S. Kortum (2002). Technology, Geography and Trade. Econometrica 70, 1741– 1779. Fan, J. (2019, July). Internal Geography, Labor Mobility, and the Distributional Impacts of Trade. American Economic Journal: Macroeconomics 11 (3). Glennon, B. (2018). How Do Restrictions on High-Skilled Immigration Affect Offshoring? Evidence from the H-1B Program. Working paper . Goldin, C. and L. F. Katz (2007). Long-Run Changes in the Wage Structure: Narrowing, Widening, Polarizing. Brookings Papers on Economic Activity (2), 135–65. Goldsmith-Pinkham, P., I. Sorkin, and H. Swift (2019). Bartik Instruments: What, When, Why, and How. Working Paper. Gould, D. (1994, May). Immigrant links to the home country: Empirical implications for U.S. bilateral trade flows. Review of Economics and Statistics 76 (2), 302–316. 37 Grogger, J. and G. Hanson (2011). Income maximization and the selection and sorting of international migrants. Journal of Development Economics 95, 42–57. Gumpert, A. (2018). The organization of knowledge in multinational firms. Journal of the European Economic Association. Forthcoming. Head, K. and T. Mayer (2018). Brands in motion: How frictions shape multinational production. American Economic Review . Hiller, S. (2013). Does immigrant employment matter for export sales? Evidence from Denmark. Review of World Economics / Weltwirtschaftliches Archiv 149 (2), 369–394. Hsieh, C.-T., E. Hurst, C. Jones, and P. Klenow (2019). The Allocation of Talent and U.S. Economic Growth. Econometrica. Forthcoming. Hunt, J. and M. Gauthier-Loiselle (2010). How Much Does Immigration Boost Innovation? American Economic Journal: Macroeconomics 2:2 . Iranzo, S. and G. Peri (2009). Migration and trade: theory with an application to the EasternWestern European integration. Journal of International Economics 79, 1–19. Javorcik, B. S., C. Ozden, M. Spatareaun, and C. Neagu (2011, March). Migrant networks and foreign direct investment. Journal of Development Economics 94 (2), 231–241. Katz, L. F. and K. M. Murphy (1992, February). Changes in Relative Wages, 1963-1987: Supply and Demand Factors. The Quarterly Journal of Economics 107 (1), 35–78. Keller, W. and S. Yeaple (2013). view 103 (4), 1414–1444. The Gravity of Knowledge. American Economic Re- Kerr, W. and W. Lincoln (2010). The Supply Side of Innovation: H-1B Visa Reforms and U.S. Ethnic Invention. Journal of Labor Economics 28(3), 473–508. Khanna, G. and N. Morales (2018). The IT Boom and Other Unintended Consequences of Chasing the American Dream. Working Paper . Lagakos, D. and M. E. Waugh (2013, April). Selection, Agriculture, and Cross-Country Productivity Differences. The American Economic Review 103 (2), 948–980. Lee, E. (2019). Trade, Inequality, and the Endogenous Sorting of Heterogeneous Workers. Working paper . Liu, C. (2017). Quantifying the Impacts of a Skill-Based US Immigration Reform. Working paper . 38 Mion, G., L. Opromolla, and A. Sforza (2018). The Diffusion of Knowledge via Managers Mobility. Working Paper . Ottaviano, G. I. P. and G. Peri (2012, February). Rethinking the Effect of Immigration on Wages. Journal of the European Economic Association 10 (1), 152–197. Peri, G., K. Shih, and C. Sparber (2015a). Foreign and Native Skilled Workers: What Can We Learn from H-1B Lotteries? Working paper . NBER Working Paper No. 21175. Peri, G., K. Shih, and C. Sparber (2015b, July). Foreign STEM Workers and Native Wages and Employment in U.S. Cities. Journal of Labor Economics 33 (S1), S225–S255. Peri, G. and C. Sparber (2011). Highly-Educated Immigrants and Native Occupational Choice. Industrial Relations 50:3 . Ramondo, N. and A. Rodriguez-Clare (2013, April). Trade, Multinational Production, and the Gains from Openness. Journal of Political Economy 121 (2), 273–322. Roy, A. D. (1951). Some Thoughts on the Distribution of Earnings. Oxford economic papers 3, 135–146. Santos Silva, J. M. C. and S. Tenreyro (2006, November). The Log of Gravity. The Review of Economics and Statistics 88 (4), 641–658. Simonovska, I. and M. Waugh (2014, January). The elasticity of trade: Estimates and evidence. Journal of International Economics 92 (1), 34–50. Tintelnot, F. (2017). Global Production with Export Platforms. Quarterly Journal of Economics 132 (1), 157–209. Tombe, T. and X. Zhu (2019). Trade, Migration, and Productivity: A Quantitative Analysis of China. American Economic Review . Forthcoming. Wang, C.-K. (2014). Bilateral Migration and Multinationals: On the Welfare Effects of Firm and Labor Mobility. Working paper . Yeaple, S. (2018, May). The Innovation Activities of Multinational Enterprises and the Demand for Skilled-Worker, Non-Immigrant Visas. NBER book ”High-Skilled Migration to the United States and its Economic Consequences”, 41 – 69. Gordon H. Hanson, William R. Kerr, and Sarah Turner, editors. 39 A H-1B and L-1 visa dataset construction A key contribution of this paper is to use a novel dataset on high-skill visas in the US that allows me to link demand for foreign high-skill labor to MNE activity. In this Section, I describe how such dataset was constructed. As a first step, I submitted a Freedom of Information Act (FOIA) request to the United States Citizenship and Immigration Services (USCIS) for the universe of forms I-129 approved between 2001 and 2014 for H-1B visas and between 2012 to 2014 for L-1 visas. The I-129 is submitted to USCIS after the lottery takes place in the case of the H-1Bs, so one attractive feature of these data is that it include only those migrants who effectively end up coming to the US. The dataset obtained through FOIA included for each approved visa the name of the firm, location, place of work, wage, occupation, start and end date of employment, and origin country as main covariates. It also includes the basis for classification of the visas indicating whether the I-129 was filed for new employment, change of employer, renewal, amendment, or other purposes. As visas are valid for 3 years but can be renewed for an additional 3 for the H-1B, a new I-129 is needed for such renewal. For years 2012 to 2014, the data provided by USCIS had a total of 933,838 forms associated to H-1Bs and 126,964 associated to L-1 for the relevant period. Wage data were not available for L-1 visas, but all other covariates were complete. In a second step, I proceed to match the FOIA database with the corporate database Orbis, to find two key pieces of information: the industry and the country of incorporation of the Global Ultimate Owner (GUO) of the firm that hired the migrant worker in the US. The GUO is the “individual or entity at the top of the corporate ownership structure” who owns the affiliate for more than 50% and its not being majority owned by any other company worldwide. The information from Orbis is complemented by additional corporate ownership information from D&B Hoovers and Uniworld to serve as a quality check for some cases where Orbis data are incomplete. The FOIA data and Orbis do not have a common identifier that allows me to easily match observations between datasets. Orbis has the advantage of having its own statistical matching tool that allows taking the Name and City provided by the FOIA data and finding the the firm record in Orbis for a firm that matches those characteristics. While the Orbis matching algorithm does a good job finding the relevant companies, many records are not matched because of the FOIA record, including some variant of the firm name not recognized. These observations have to be dealt with mostly by hand, which makes this process very time consuming. To narrow the sample of companies that need to be matched, I proceed to limit the sample in two main ways. First, I limit the search to all employers listed in the FOIA data who have submitted at least 10 petitions over the period between 2001-2014. As shown in Table 12, those with fewer than 10 petitions account for 21% of the total H-1B petitions and 44.6% of L-1 petitions. As a second step, within those employers with more than 10 petitions, I exclude from the matching employers in the education, hospitals, or government sectors since MNEs are 40 generally not present in these industries. Such employers account for 8.7% of the total H-1B petitions and 0% of L-1 petitions. Finally, a very small group of employers are not found in Orbis who account for 3.2% of H-1B petitions and 0.6% of L-1 petitions. This leaves us with a match rate of 66.9% for H-1Bs and 54.8% for L-1 petitions. The FOIA-Orbis dataset is used to show the stylized facts presented in Section 3 and to impute the data for MNE companies labor share between source and foreign workers needed to estimate the model but no aggregates are calculated using these data, which makes the lower match rate not a substantive matter in the quantitative exercise. Table 13 presents the distribution of visa petitions by worker nationality and source country (GUO of company applying for the visa). Table 14 presents the distribution by industry. Table 12: Sample matched to Orbis H-1B L-1 Count Share Count Share Total petitions 933838 100.0% 126964 100.0% Matched to Orbis 624777 66.9% 69519 54.8% Hospitals, Colleges and Government 81390 Fewer than 10 petitions 2012-14 197515 Other, not matched 30156 8.7% 21.2% 3.2% 0 56648 797 0% 44.6% 0.6% Not matched to Orbis Counts include all petitions between 2012 and 2014, including all basis for classification. “Fewer than 10 petitions” includes petitions filed by firms who submitted fewer than 10 petitions in the 2012-2014 period. 41 Table 13: Distribution across source countries and nationalities MNE Source Country Worker nationality Country H-1B L-1 H-1B L-1 Australia Canada China Finland France Germany India Ireland Israel Japan Korea Netherlands Spain Sweden Switzerland Taiwan United Kingdom Other 0.1% 0.5% 0.2% 0.2% 2.2% 0.8% 21.4% 2.5% 0.0% 1.2% 0.3% 0.5% 0.0% 0.2% 0.6% 0.3% 1.1% 1.5% 0.1% 4.1% 0.8% 0.3% 3.1% 2.3% 26.5% 0.9% 0.1% 2.1% 0.4% 0.5% 0.2% 0.3% 1.2% 0.0% 2.2% 4.4% 0.2% 1.7% 5.6% 0.0% 0.4% 0.3% 81.9% 0.1% 0.2% 0.3% 0.7% 0.1% 0.1% 0.1% 0.0% 0.4% 0.6% 7.2% 1.0% 17.0% 5.7% 0.2% 2.2% 2.4% 32.6% 0.9% 1.0% 2.4% 1.9% 1.0% 1.2% 0.6% 0.4% 0.3% 7.3% 21.8% United States 66.7% 50.4% - - Total 100% 100% 100% 100% 42 Table 14: Distribution by industry H-1B L-1 0.80% 6.43% 0.22% 0.13% 0.86% 0.13% 0.54% 1.62% 2.82% 1.37% 0.17% 1.06% 0.58% 1.64% 4.87% 5.20% 74.40% 0.07% 6.24% 1.86% 64.71% 0.06% Retail Wholesale Other 1.96% 0.74% 3.64% 0.69% 0.97% 16.22% Total 100.0% 100.0% Manufacturing Chemicals Computer and Electronics Electrical Equipment Food Machinery Metals Trasportation Equipment Services Finance and Insurance Information Professional, Scientific and Technical Real Estate Other B Empirical facts details The first fact in Section 3 shows that there is a strong home-bias effect, where foreign MNEs hire more migrant workers from their source country s than other companies not from s. In this Section, I present additional results that confirm the result holds under different specifications. For expositional simplicity, I present the robustness results with a pooled regression as in equation 26, that presents the average home-bias effect across different source countries. Ln(Nk,o,s ) = γ0 + γ 1(origin = source) + δk,o + δk,s + k,o,s (26) Where Ln(Nk,o,s ) is the log number of migrants in k, o, s. δk,o is an industry-origin fixed effect and δk,s is a source-industry fixed effect. The key coefficient of interest is γ, which measures how much more likely it is that a company from source s will hire someone from o = s relative to o 6= s when compared to all other companies from other source countries. As in the disaggregated regression in Section 3, the magnitude of the home bias is large. As shown in Table 15, controlling for worker and source-country fixed effects is relevant to isolate 43 the home-bias effect. Interpreting the result shown in the fifth column, companies from country s in industry k are 153% more likely to hire someone from country s than any other country when compared to companies not from s. Table 15: Alternative fixed effects for sourcing regression (1) 1(source = origin) N obs Sample (2) (3) (4) (5) -0.162 0.107 0.894*** 1.275*** 1.53*** (0.313) (0.160) (0.120) (0.171) (0.224) 1,928 H-1B Origin FE Source FE Industry FE Source-Industry, Origin-Industry FE 1,928 H-1B 1,928 H-1B 1,928 H-1B x x x x x x 1,928 H-1B x ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. All regressions control for year fixed effects. All visa petitions filed in 2012, 2013, and 2014. Standard errors clustered at the source-origin level. Table 16 shows that the result holds under several alternative specifications. Column 1 shows the results are very similar when only considering visa petitions for new employment (excluding renewals and job changes). Column 2 shows these results are robust to running the regression at the origin-source level by aggregating the data across all industries. The aggregate home-bias is a bit larger when not considering industries, which is in line with having a positive correlation between origin-industry and source-industry comparative advantages. Column 3 shows the results are robust to including source-origin pairs for which the data shows 0 observations. To handle zero values, I estimate the parameters using a Poison Pseudo Maximum Likelihood (PPML) to include zero observations as suggested by Santos Silva and Tenreyro (2006). Finally, columns 4 and 5 look at the differences in home bias by L-1 and H-1B visas. Surprisingly, the estimated home-bias is much smaller when looking at L-1 visas. However, since the H-1B program is much larger, the aggregate effects are dominated by the H-1B. The estimation in Section 6 uses data on wages, which are only available for H-1Bs so it is reassuring that L-1 visas are not driving the observed results. In Figure 4, I plot the disaggregated regression across source countries where we can see that there is a positive home-bias for both H-1B and L-1, but the effects are larger when looking at the H-1B. 44 Table 16: Alternative specifications for sourcing regression 1(source = origin) N obs Sample (1) (2) (3) (4) (5) 1.397*** (0.211) 2.227*** (0.240) 2.619*** (0.172) 0.311** (0.104) 1.078*** (0.166) 972 No industries Including zeros 1,162 1,928 L-1 H-1B+L1 1,928 664 Only new No industries employment visas ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. All regressions control for year FE, source-industry FE, and origin-industry FE. All visa petitions filed in 2012, 2013, and 2014. Standard errors clustered at the source-origin level. In Figure 4, I plot the disaggregated regression across source countries where we can see that there is a positive home-bias for both H-1B and L-1, but the effects are larger when looking at the H-1B. Figure 4: Estimated coefficient (γs ) on sourcing regression by country (H-1B vs. L-1) (a) H-1B (b) L-1 45 C Equilibrium details The equilibrium of the model can be characterized by the following set of equations: 1. MNE shares - one for each s-k-` triplet mne πk,s,` =P k Tsk ck`,s × δ`,s s0 −θ k Tsk0 ck`,s0 × δ`,s 0 (27) −θ 2. Effective technology in country ` - one for each k-` pair: T̃`k = X k Tsk ck`,s × δ`,s −θ (28) s 3. Trade shares - one for each k-`-n triplet. trade πk,`,n k −θ k (τ`,n ) T̃` =P k −θ k `0 (τ`0 ,n ) T̃`0 (29) 4. Domestic price index - one for each k-n pair !− θ1 Pk,n X k −θ k = Γ̄ (τ`,n ) T̃` (30) ` Where Γ̄ = Γ 1−σ+θ θ 5. Unit cost in country ` industry k, source technology s ck`,s = γ̄ K Y γ 0 1−α l h α h Pk0`,k,k (ψk,` )α wL,` + (ψk,` ) (ck,`,s )1−α ,` P 1 1−α (1− k0 γ`,k,k0 ) (31) k0 =1 Where γ̄ is a constant that depends on γ`,k,k0 . wL,` is the low-skill labor wage in country `, which is the same across industries and source technologies in ` given free mobility of low-skill labor. chk,`,s is the high-skill labor unit cost, which is different for each triplet k, `, s given that high-skill workers have different abilities for each triplet, which makes companies in each triplet face a different labor pool of effective units, hence a different high-skill labor cost. Firms employ domestic d, source-country s, and other foreign f effective units of high-skill labor. If a company is located in their source country, source and native effective units are perfect substitutes. 46 1 1−λ fs s d d chk,`,s = (ψk,`,s )λ (wk,`,s )1−λ + (ψk,`,s )λ (cfk,`,s )1−λ s cfk,`,s (32) 1 f f 1−ι 1−ι ι s ι s 1−ι = (ψk,`,s ) (wk,`,s ) + (ψk,`,s ) (wk,`,s ) (33) 6. Share of non-college (ΘLk,`,s ), college (ΘLk,`,s ) - one for each k-`-s triplet. ΘLk,`,s = 1−α l (ψk,` )α wL,` ΘH k,`,s = 1−α h α h l ) (ck,`,s )1−α )α wL,` + (ψk,` (ψk,` h α h ) (ck,`,s )1−α (ψk,` 1−α h α h l ) (ck,`,s )1−α )α wL,` + (ψk,` (ψk,` (34) 7. Share of native (Θdk,`,s ), source (Θsk,`,s ), other foreign (Θfk,`,s ) expenditure - one for each k-`-s triplet. Θdk,`,s d d (ψk,`,s )λ (wk,`,s )1−λ =P x0 x0 λ 1−λ x0 (ψk,`,s ) (wk,`,s ) for x’={d,sf} Θxk,`,s x x (ψk,`,s )ι (wk,`,s )1−ι =P x0 x0 ι 1−ι x0 (ψk,`,s ) (wk,`,s ) for x,x’={s,f} (35) 8. Demand for low-skill (L), native (d), source (s), other foreign (f ) workers - one for each k-`-s triplet. Where I`,k are the revenues for industry k in country `. wL,` Lk,`,s = (1 − X mne γ`,k,k0 )ΘLk,`,s πk,s,` I`,k (36) k0 x wx,` Hk,`,s = (1 − X x mne γ`,k,k0 )ΘH k,`,s Θk,`,s πk,s,` I`,k with x = d, s, f (37) k0 9. Trade balance - Budget constraint - one for each `. I`,k is the revenues gained in ` industry k, Xn is the total labor income in country n, L̄` total low skill labor supply. I`,k = X trade πk,`,n γk,n Xn (38) n Xn = wL,` L̄` + X x x wk,`,s Hk,`,s k,`,s,x 10. Low-skill market clearing - one for each ` 47 with x = d, s, f (39) X wL,` Lk,`,s = wL,` L̄` (40) k,s 11. Migration shares - one for each o-k-`-s group mig πo,k,`,s Ao,k =P `0 ,s0 ,k0 Ao,k0 wk,`,s o εk,`,s P` κ φ−κ o,`,s κ wk0 ,`0 ,s0 o εk0 ,`0 ,s0 P`0 (41) φ−κ o0 ,`0 ,s0 12. High-skill market clearing, native (d), source (s), other foreign (f ) - one for each k-`-s triplet. No is the total number of workers born in o. 1 d d wk,`,s Hk,`,s = κ−1 mig s κ N Aκ Γ wk,`,s εsk,`,s (πo,k,`6 s k,s =o,s=o ) X = 1 1 κ(1 − ρ) 1 1− κ(1 − ρ) κ−1 f mig κ N Aκ Γ wk,`,s εok,`,s (πo,k,`6 o k,o =o,s6=o ) o6={`,s} C.1 1− 1 s s wk,`,s Hk,`,s s s wk,`,s Hk,`,s = κ−1 mig d κ Γ wk,`,s εdk,`,s (πo,k,`=o,s=o ) κ N` Ak,` (42) 1 1− κ(1 − ρ) (43) (44) Writing the equilibrium in proportional changes Following Dekle et al. (2008), I re-write all equilibrium equations in proportional changes. That 0 is, I can rewrite each variable x as x̂ = xx where x is the variable under the real scenario and x0 is the value of the variable under the counterfactual. In the remainder of this Section, I show how this approach allows me to distinguish 4 components needed to estimate the model: parameters needed for estimation, endogenous variables, parameters not needed for estimation, and data. I use the color scheme together with the equilibrium equations to clearly see how the different components affect the estimation of the model. Equations 36, 37, 42, and 43 are multiplicative so I omit them in the analysis below to focus on the ones that require data to be calculated. 1. MNE shares / Effective technology in country ` mne π̂k,s,` =P s0 k ĉk`,s × δ̂`,s −θ k mne ĉk`,s0 × δ̂`,s 0 πk,s,` −θ 2. Trade shares/ Domestic price index 48 ; −θ X k mne T̃ˆ`k = T̂sk ĉk`,s × δ̂`,s πk,s,` s k −θ ˆ k ) T̃` (τ̂`,n =P (τ̂ k )−θ T̃ˆk π trade trade π̂k,`,n `0 `0 ,n ; P̂k,n = X k −θ ˆ k trade (τ̂`,n ) T̃` πk,`,n !− θ1 ` `0 k,`,n 3. Unit cost / high-skill unit cost ĉk`,s = K Y γ 0 P̂k0`,k,k ,` P 1 1−α (1− k0 γ`,k,k0 ) l α 1−α L h α h 1−α H (ψ̂k,` ) ŵL,` Θk,`,s + (ψ̂k,` ) (ĉk,`,s ) Θk,`,s k0 =1 ĉhk,`,s = d d )1−λ Θdk,`,s )λ (ŵk,`,s (ψ̂k,`,s + fs s s (ψ̂k,`,s )λ (ĉfk,`,s )1−λ Θfk,`,s 1 1−λ 1 1−ι s f f s s ĉfk,`,s = (ψ̂k,`,s )ι (ŵk,`,s )1−ι Θsk,`,s + (ψ̂k,`,s )ι (ŵk,`,s )1−ι Θfk,`,s 4. Trade balance / Budget constraint (with x = d, s, f ) Iˆ`,k = X trade π̂k,n,` X̂n n ˆ X̂` = ŵL,` L̄ ` wL,` L̄` X | {z` } + X k,`,s,x trade πk,n,` γk,n Xn P trade π γk,n Xn | n k,n,` {z } ; Share sold to n: Λk,n,` x x wk,`,s Hk,`,s x x ŵk,`,s Ĥk,`,s X` | {z } High-skill share Λx k,`,s Low-skill share ΛL ` 5. Low-skill market clearing / Migration share X k,s wL,` Lk,`,s ˆ ŵL,` L̂k,`,s P = ŵL,` L̄ ` k,s wL,` Lk,`,s | {z } mig π̂o,k,`,s ; κ ŵ o φ̂o,`,s −κ ε̂ Âo,k k,`,s k,`,s P̂` κ =P ŵk0 ,`0 ,s0 o mig ε̂k0 ,`0 ,s0 φ̂o0 ,`0 ,s0 −κ πo,k,`,s `0 ,s0 ,k0 Âo,k0 P̂ 0 ` Low-skill share ΛL k,`,s 6. Other-foreign market clearing s s ŵk,`,s Ĥk,`,s = X o6=`,s f mig ŵk,`,s ε̂ok,`,s (π̂o,k,`6 =o,s6=o ) κ−1 κ 1 κ N̂o Âk,o P | 49 f mig wk,`,s εok,`,s (πo,k,`6 =o,s6=o ) o6={`,s} κ−1 κ 1 κ No Ak,o f mig wk,`,s εok,`,s (πo,k,`6 =o,s6=o ) {z Share o in `, s, k: Λok,`,s κ−1 κ 1 κ No Ak,o } The equations above imply that the change in the endogenous variables can be computed as long as I have estimates of the 6 key elasticities (θ, α, λ, ι, ν, and κ), the Cobb-Douglas share on intermediate inputs γ`,k,k0 and data on the following equilibrium allocations: Trade mig trade mne shares (πk,`,n ); MNE shares (πk,s,` ); Migration shares (πo,k,`,s ); Share of wage bill spent in low-skill (ΘLk,`,s ), and high-skill (ΘH k,`,s ), for each k, `, s; Share of high-skill wage bill spent on d s natives (Θk,`,s ) source workers (Θk,`,s ) and other foreign (Θfk,`,s ); Share of low-skill in total labor income (ΛL` ); Share of high-skill type x in s, `, k in total labor income (Λxk,`,s ); Share of low-skill employed in k, `, s (ΛLk,`,s ); Share of wage bill of k, `, s on migrants from o 6= {`, s} (Λok,`,s ) and production shares (Λk,n,` ). I explain how the dataset is constructed in Appendix F. One of the advantages of the exact hat-algebra procedure is that several parameters do not change between the real and the counterfactual so they do not need to be explicitly solved for. k k ), ), producer comparative advantage (Tsk ), trade costs (τ`,n These parameters are MNE costs (δ`,s f fs l h d s production function labor shares (ψk,` , ψk,` , ψk,`,s , ψk,`,s , ψk,`,s , ψk,`,s ), Total low-skill (L̄` ) and high-skill (N` ) labor born in `, individual ability comparative advantage (Ao,k ), origin-specific productivity (εok,`,s ), and the migration costs (φo,`,s ). The hat-algebra approach makes it easier to calculate the counterfactuals. For example, the counterfactuals computed in Sections 7.2 and 7.1 will compute how the equilibrium changes after an exogenous change of the MNE cost k in all countries δ`,s or the migration cost to the US φo,us,s . D Details of reduced-form exercise The dataset to run the regressions in Section 5 is constructed by merging publicly available data on MNE employment from the BEA with the H-1B dataset described in Section 2. I exclude L-1 visas from the analysis since L-1 data are only available for 2012 to 2014. I group both datasets into 26 source countries and 13 industry groups. The BEA data comes from a survey administered by the BEA to foreign MNEs who have operations in the US. The dataset is called ”Comprehensive Data on the Activities of U.S. Affiliates” and I use the survey for majority-owned affiliates. I use ”Employment of Affiliates, Country of UBO by Industry of Affiliate,” which provides US employment at the source-country level for 42 countries and 14 industries. As shown in Table 17, column 1 shows that the estimates are larger but similar in magnitude when including industry-source pairs with zero visa petitons. Results are robust to controlling for source-time fixed effects as shown in columns 2 and 3. Columns 4-6 show that adding source-time covariates does not affect the estimates of the immigrants coefficient in the second stage. 50 Table 17: Dependent Variable: Log US Employment of MNEs from s in k, t (1) Ln (Imm from o = s in s,k,t) 1.94*** (0.46) Share of US imports from s in k, t (2) (3) (4) 0.80*** 1.21 1.68*** (0.118) (0.737) (0.496) 5.25** (1.95) Total GDP at s, t (5) (6) 1.79*** (0.577) 1.62*** (0.489) -0.22* (0.12) Total Exports from s, t to US N obs 1st stage F-stat -7.63 (9.78) 1167 28.6 Industry-Source FE Source-Time FE 616 343.0 616 6.3 616 14.6 616 14.4 616 15.8 x x x x x x x ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. 2SLS estimates. Industry-Source-Time level regression. Years 2002-2014 included. Standard errors clustered at the source-country level. Industry-time FE included in all regressions. Column 1 adds 1 to each observation to avoid dropping zeros. Table 18: Correlation between observables and MNE employment growth Industry GDP share in s Industry exports share in s Share of MNEs from s in US in non-k Share of US imports from s in k Share of non-US imports from s in k Common language s and US Regional Trade agreement s and US Distance s and US N Growth MNE employment Initial Shares 3.81 (3.63) -3.47** (1.56) -1.52 (1.18) 0.47 (1.79) 1.29 (2.86) 0.19 (0.19) 0.26 (0.44) 0.11*** (0.04) 1.25** (0.50) 0.33 (0.21) 0.22 (0.16) 0.00 (0.25) -0.42 (0.39) 0.01 (0.03) 0.02 (0.06) 0.006 (0.005) 126 126 ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. 2SLS estimates. Shares and covariates at 2001. Log growth in MNE employment from country s in industry k between 2014 and 2004. Distance expressed in thousands of population weighted miles. Results are robust to regressing each covariate separately with the dependent variables. 51 E E.1 Estimation details Estimating κ using the wage dispersion The purpose of this Section is twofold. First, I will use variation in the observed wage dispersion for high-skill immigrants to estimate κ as a way of validating the estimate in Section 6.1, which uses a very different source of variation. Similar approaches have been used in the EK-Roy literature to estimate the supply elasticity such as in Lagakos and Waugh (2013), Hsieh et al. (2019) and Lee (2019). While this approach relies on the distributional assumptions for the ability draws, the Frechet distribution has been shown to provide a good approximation of the observed wage distribution (Burstein et al., 2018). Second, I will use the observed wage dispersion and the estimate of κ from Section 6.1 to separate the dispersion parameter κ and the correlation ρ. I will start by ignoring ρ and assuming κ = κ̃. Before proceeding to the estimation I will present two results based on the Frechet properties. Proposition 1 If productivity draws η are distributed Frechet with shape parameter κ, the i,o i,o observed market wages paid to employees Wk,`,s = ηk,`,s wk,`,s are also distributed Frechet with parameter κ. Proposition 2 If a random variable W is distributed Frechet with shape parameter κ, then the coefficient of variation can be written as: 2 Γ 1 − κ2 σ = 2 − 1 µ Γ 1 − κ1 Where Γ is the Gamma function. Proposition 1 indicates that observed market wages are also distributed Frechet with shape parameter κ, which means that the parameter κ is related to the dispersion of observed wages, conditional on individuals choosing the triplet k, `, s. This proposition indicates that the observed dispersion of wages can be used to make inference on the value of κ̃. Proposition 2 gives a useful expression to implement the estimation, as it says that the ratio of the observed variance of wages to the square of the mean of observed wages has a parametric relationship with κ. Based on the results of the propositions above, I can use the H-1B data on wages to calculate the variance and mean wages for each group of workers with origin o who migrate to the US to work in industry k with source technology s. I construct the empirical moments as in equation 45 and estimate the parameter κ̃ by GMM, choosing a value of κ̃ that minimizes the distance between the empirical moments and the moments from proposition 2. 52 o V ar(Wk,`,s ) 2 o W k,`,s ! Γ 1− 2 κ = 2 − 1 Γ 1 − κ1 (45) I present the baseline results using the H-1B data in Column (1) of Table 19. An alternative strategy is to use the estimate of κ = 6.17 from Section 6.1 and equation 45 by replacing κ by κ(1 − ρ). Table 19 compares the different approaches and shows that, overall, both yield similar values of κ even if the underlying assumptions for estimation are very different. Table 19: Estimates for κ using dispersion of wages κ κ̃ Implied ρ Only using dispersion Using trade shock and dispersion 8.28*** (0.138) 8.28*** (0.138) 0 6.17*** (2.012) 2.08*** (0.00016) 0.66 2,534 2,534 ∗p < 0.1, ∗ ∗ p < 0.05, ∗ ∗ ∗p < 0.01. Estimates by GMM using H-1B data on wages by country of origin, industry, and source technology. E.2 Estimating effective wage ratio To estimate equation 24 it is necessary to have information on the ratio of effective wages. Using the the properties of the Frechet distribution, we can write the observed average wages for each group as in equation 46: −1 1 x κ κ wageoz,t = wz,t πo,z,t Ak,o,t Γ̄εoz,t (46) Where wageoz,t is the average wage for those from origin o that migrate to triplet z = k, `, s at x time t, conditional on choosing z. wz,t is the equilibrium wage per effective unit paid to those who choose triplet z and the superscript x indicates whether the workers are hired by an MNE No with s = o or if they are hired just as other foreign workers. πo,z,t = Nz,to is the fraction of t workers from o, who migrate to z, and Ak,o,t is the comparative advantage of workers from o in industry k. Finally, Γ̄ is the Gamma function. By taking the ratio between wageo=s and wageo6z=s , taking logs and rearranging terms, it is z possible to get to equation 47: 53 Ln | wagesz,t wageoz,t 1 + Ln κ {z Data s Nz,t o Nz,t = Ln } | s wz,t f wz,t ! o εz,t 1 1 + Ln(Ns,t Ak,s,t ) − Ln(No,t Ak,o,t ) + Ln o κ εz,t |κ {z } {z } Origin-Industry-Time FE | {z } Source-Industry-Time FE Error term (47) Equation 47 shows that it is possible to run a regression at the source-origin-industry level o6=s using the H-1B data for average wages (wageo=s z,t and wagez,t ) and number of employees by o s ) together with the estimated value of κ, and regress a combination of those , Nz,t group (Nz,t variables on a set of source-industry and origin-industry fixed effects. Once those fixedeffects are estimated, it is possible to back out the log ratio of equilibrium effective wages Ln s wz,t f wz,t , which is our object of interest. F Dataset for counterfactual This Section describes how the dataset needed to compute the model is constructed. The description is based on the simplifications explained in Section 6.4 and the data needed as outlined in appedix C.1. I construct the database for 6 regions of the World. US, Canada, India, China-Taiwan, Western Europe (including Austria, Belgium, Denmark, Finland, France, Germany, Greece, Ireland, Italy, Luxembourg, Netherlands, Norway, Portugal, Spain, Sweden, Switzerland, and the United Kingdom) and the Rest of the World that includes a set of 41 countries that have available production data in the OECD. Industries are grouped into 3 categories using NAICS 2007 as the basis for classification: “IT and Professional Services” includes NAICS 51 (Information) and NAICS 54 (Professional Scientific and Technical Services); “HighTech Manufacturing” includes NAICS 325 (Chemicals), 333 (Machinery), 334 (Computer and Electronic), 335 (Electrical Equipment, Appliance and Components), and 336 (Transportation Equipment). All other industries are grouped into “Other.” trade ), production shares (Λk,n,` ), and intermediate input shares Trade Shares (πk,`,n (γ`,k,k0 ): Trade and production shares are computed using the Trade in Value Added database of the OECD. I use gross exports, gross imports, and output data for 2011. For “IT and Professional Services” I use the OECD industries “C64 - Post and telecommunications,” “C72 - Computer and related activities,” and “C73T74 - R&D and other business activities.” For “High-tech manufacturing” I use the OECD industries “C24 - Chemicals”, “C29 - Machinery and Equipment”, “C30-C31 - Computer, Electronic and optical equipment; Electrical machinery and apparatus,” and “C34-C35 - Transportation equipment.” All other industries are classified as “Other.” For intermediate input expenditure shares, I use the World Input-Output tables (WIOT) for year 2012. 54 mne MNE shares (πk,s,` ): to compute the MNE shares, I need the revenues of MNEs by industry and source country in the US, India, Western Europe, Chin-Taiwan, and Canada. The main source used is the BEA surveys of “US Direct Investment Abroad” for revenues of US companies abroad and the “Foreign Direct Investment in the United States” for revenues of non-US companies with subsidiaries in the US. I use the revenues reported for majority-owned affiliates in the NAICS sectors described above for 2012. While the BEA provides sufficient information for MNE activity involving the US, it does not provide revenues between the non-US regions by industry. To compute the non-US MNE revenues that are missing, I use the revenues reported by Orbis in 2012 for each source-destination-industry triplet. As shown by Alviarez (2018) Orbis provides a good approximation of MNE revenues by source and industry when compared to other aggregate datasets such as the OECD. mig Migration shares and labor allocations (πo,k,`,s ): migrant and native counts by origin country and industry in the US are taken from the 2012 American Community Survey (ACS) and for Canada from IPUMS International for 2011. For Europe, not all countries have micro data available so I use the surveys for France, Ireland, and Spain in IPUMS International to calculate the distribution of migrants across industries. Total migrant counts for Europe are taken from the IAB brain-drain data (Brucker et al., 2013). A key piece of information that is not available in any survey is whether workers are employed by a domestic or foreign company. To impute such data, I use the FOIA dataset on H-1B and L-1 to back out the proportion of native, source-country, and other foreign workers in the US by MNE source. As a first step, I compute the total ratio of foreign workers employed by firms from source s relative to US firms using the FOIA data. Second, from the BEA data used to calculate MNE shares, I calculate the relative size of MNEs with source technology s in industry k relative the size of US firms in industry k. These two ratios allow me to back out the likelihood of firms from s to employ foreign workers relative to US firms. I then use the FOIA data to calculate how many source vs other foreign workers are employed by non-US MNEs in each industry. Since the FOIA data are just for the US, I impute the ratio of foreign to native college graduates for Europe and Canada. The ratios of Canadian firms in the US are used for US firms abroad. The results are robust to alternative imputation methods. Industry employment of high- and low-skill workers in India is taken from IPUMS International for 2009. China-Taiwan and the Rest of the World total high- and low-skill worker counts are taken from International Labour Organization LABORSTA database. The ratio of low- to high-skill employment within industry is imputed using the values for India, and the total employment by industry is taken from the OECD. The distribution across source technologies in India and China-Taiwan is imputed using the MNE shares in those countries. f d s L x L o Labor expenditure shares (ΘLk,`,s , ΘH k,`,s , Θk,`,s , Θk,`,s , Θk,`,s , Λ` , Λk,`,s , Λk,`,s , Λk,`,s ): A final piece of data needed is several shares of labor expenditure for different skill groups across countries, industries, and source technologies. The labor allocations data described 55 above compute counts of workers so wage data are needed to map counts into expenditure shares. For the US the ACS is used to compute the average wages for workers across skill types, origin countries and industries. Such average wages together with the labor counts are used to compute the expenditures. A similar process is used for Canada and India using wage data from the IPUMS International surveys for each country. Individual wage data for Europe, China-Taiwan, and the Rest of the World are not available at the industry-skill level so I use the high-skill to low-skill wage premium in Canada to impute wages in Europe and the skill premium in India to impute wages in China-Taiwan and RoW. G Counterfactual 1: A more restrictive migration policy In this Section, I present additional results from restricting migration into the US. As noted in Table 6, foreign MNEs in the US respond more in terms of revenues than US companies. Table 20 decomposes the contribution to total output drop in the US by source country. Foreign MNEs in the IT sector are more intensive in migrants so their contribution to total output drop is of 11.7%, while they only account for 4.8% of production in IT. Table 20: Contribution of MNEs to output drop Percent change in Revenues IT and Prof.Services High-Skill Manufacturing Spillover No spillover Total US Foreign Share of Foreign MNEs in output drop -0.53% -0.03% -0.50% -0.36% -0.35% -0.04% 94.2% 11.1% Total US Foreign Share of Foreign MNEs in output drop -1.70% 0.68% -2.39% -0.40% -0.28% -0.15% 100% 38.0% Share of US production 95.16% 4.84% 70.84% 29.16% Percent changes from increasing migration cost such that the total stock of migrants decreases by 10%. Columns 1,2: contribution to output drop. Column 3: Share of production in the US by MNE source. 56 In Table 21, columns 2 and 3, I compute the model using lower values of λ = 7 and λ = 2, which is closer to a model where immigrants and natives are less substitutable. The model with low λ significantly changes the wage effects for high-skill native, who can even lose from restricting immigration when λ is sufficiently low. However, as we are looking at all college graduates, we would expect the elasticity to be on the higher end. In a second test, I compute the model with α = 3 to understand how results would change under a model where low and high skill workers are closer to perfect substitutes. Interestingly, as shown in column 4, the welfare effects generated by migration would be somewhat muted when working with a higher elasticity of substitution. The model with α = 3 would make high-skill workers lose from restricting immigration, as firms find it easier to substitute low for high-skill labor when highskill labor becomes more expensive. Negative effects for low-skill natives are reduced. In column 5, I compute the model by changing the trade elasticity to use the value suggested by Eaton and Kortum (2002) of θ = 8.28. The trade elasticity controls the dispersion of the producer productivities across countries. Higher values of θ would make productivities of producing in each country to be more concentrated, such that foreign companies and MNEs would have similar productivities. As shown in Table 21, columns 6, real wage losses are lower with the model with the high θ as the exit of MNEs generates a lower productivity loss. Using the upper bound for ι = 6.84 does not change the results but mainly because of the nature of the counterfactual. Since I am increasing the migration costs from all origins by the same magnitude, and all countries face the same elasticity of supply κ, firms reduce their number of source and other foreign units by the same amount. If migration policy would specifically target some origins, ι would have a larger presence in the quantitative results. Finally, I try κ = 8.28 as estimated using the wage dispersion method in Section E.1. A higher κ implies that abilities are more concentrated and high-skill workers are more sensitive to changes in the wage. When immigration is restricted, wages go up and high-skill workers relocate more when abilities are more concentrated. As such, their real wages increase more than in the baseline. Low-skill workers on the other hand lose more, since firms find it easier to find natives to replace the immigrants. Table 21: Understanding mechanisms - elasticities Baseline High-skill natives Low-skill natives Total US natives 0.09% -0.35% -0.22% λ=7 λ=2 α=3 0.04% -0.25% -0.01% -0.36% -0.36% -0.25% -0.24% -0.33% -0.17% θ = 8.28 ι = 6.84 κ = 8.28 0.11% -0.33% -0.20% 0.09% -0.35% -0.22% 0.10% -0.37% -0.23% Percent changes from increasing migration cost such that the total stock of migrants decreases by 10%. Column 1: Baseline results λ = 12.89, α = 1.7 and θ = 4. Column 2-3: same as baseline but λ = 30. Column 4-5: same as baseline but α = 5 . Column 6-7: same as baseline but θ = 8.28. Column “Relative to baseline” shows the percent change between the welfare change in the alternative model relative to the baseline model. 57 H Counterfactual 2: Welfare gains of MNE production The results for MNE welfare gains with and without migration presented in Section 7.2 only looked at the gains for the US. Table 23 shows the welfare gains of MNE for the baseline model with migration and an alternative model without migration. The bias on the welfare gains of the model without migration goes in opposite directions depending if the country is a migrant-receiving or migrant-sending country. The results for Europe and Canada are equivalent to those in the US; the model with no migration overestimates the welfare gains for high-skill workers and underestimates the effects for low-skill workers. When MNE is allowed, MNE companies in migrant receiving countries push up the demand for migrants, lowering the wages for native high-skill workers and increasing the wage for native low-skill workers due to complementarity. The mirror image of such results is experienced by sending countries such as India and China-Taiwan. Allowing for MNE activity increases the demand for high-skill migrants in US, Canada, and Western Europe, decreasing the number of high-skill workers in India and China-Taiwan. Such a decrease raises wages for high-skill and lowers wages for low-skill, and such effects are not captured by the model that does not include migration. Table 22: Robustness to key elasticities α=3 λ=7 Baseline No migration Baseline θ = 8.28 κ = 8.5 No migration Baseline No migration Baseline No migration High-skill natives Low-skill natives Total US natives 1.21% 1.32% 1.28% 1.58% 1.25% 1.35% 1.18% 1.41% 1.34% 1.56% 1.30% 1.38% 0.61% 0.75% 0.71% 0.96% 0.71% 0.78% 1.04% 1.40% 1.29% 1.47% 1.26% 1.32% Migrants in US 9.06% 0.00% 7.87% 0.00% 5.85% 0.00% 9.32% 0.00% Percent changes in welfare of going from MNE autarky to the observed equilibrium. MNE autarky is the case k where MNE iceberg costs δs,` are very high such that MNE is prohibitive. 58 Table 23: Welfare gains of MNE production by country Baseline No migration US High-Skill Low-Skill All Western Europe High-Skill Low-Skill All Canada High-Skill Low-Skill All India High-Skill Low-Skill All China-Taiwan High-Skill Low-Skill All 1.16% 1.42% 1.34% 1.71% 1.09% 1.26% 1.00% 6.97% 5.54% 0.98% 0.15% 0.21% 0.49% 0.25% 0.26% 1.56% 1.30% 1.38% 2.17% 0.91% 1.24% 5.39% 5.28% 5.31% 0.55% 0.27% 0.29% 0.15% 0.36% 0.35% Relative to baseline 35% -8% 3% 27% -17% -1% 437% -24% -4% -44% 84% 37% -69% 48% 31% Percent changes in welfare of going from MNE autarky to the observed equilibrium. MNE autarky is the case k where MNE iceberg costs δs,` are very high such that MNE is prohibitive. Column 3 shows the welfare change in the no migration setting relative to the welfare change in the baseline model with migration. 59