View original document

The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.

Cognitive Hubs and Spatial
Redistribution

WP 19-16

Esteban Rossi-Hansberg
Princeton University
Pierre-Daniel Sarte
Federal Reserve Bank of Richmond
Felipe Schwartzman
Federal Reserve Bank of Richmond

Cognitive Hubs and Spatial Redistribution∗
Esteban Rossi-Hansberg
Princeton University

Pierre-Daniel Sarte
Federal Reserve Bank of Richmond

Felipe Schwartzman
Federal Reserve Bank of Richmond
September 25, 2019
Working Paper No. 19-16
Abstract
In the U.S., cognitive non-routine (CNR) occupations associated with higher wages
are disproportionately represented in larger cities. To study the allocation of workers across cities, we propose and quantify a spatial equilibrium model with multiple
industries that employ CNR and alternative (non-CNR) occupations. Productivity is
city-industry-occupation specific and partly determined by externalities across local
workers. We estimate that the productivity of CNR workers in a city depends significantly on both its share of CNR workers and total employment. Together with
heterogeneous preferences for locations, these externalities imply equilibrium allocations that are not efficient. An optimal policy that benefits workers equally across
occupations incentivizes the formation of cognitive hubs, leading to larger fractions of
CNR workers in some of today’s largest cities. At the same time, these cities become
smaller to mitigate congestion effects while cities that are initially small increase in size.
Large and small cities end up expanding industries in which they already concentrate,
while medium-size cities tend to diversify across industries. The optimal allocation thus
features transfers to non-CNR workers who move from large to small cities consistent
with the implied change in the industrial composition landscape. Finally, we show that
the optimal policy reinforces equilibrium trends observed since 1980. However, these
trends were in part driven by low growth in real-estate productivity in CNR-abundant
cities that reduced welfare.

∗

Preliminary draft. erossi@princeton.edu, pierre.sarte@rich.frb.org, felipe.schwartzman@rich.frb.org. We
thank Mike Finnegan, Daniel Ober-Reynolds, and Jackson Evert for excellent research assistance. We also
thank David Albouy for useful comments. The views expressed here are those of the authors and do not
reflect those of the Federal Reserve Bank of Richmond or the Federal Reserve System.

1

1

Introduction
“Most of what we know we learn from other people (...) most of it we get for
free.” Robert E. Lucas Jr.

Workers capable of doing the complicated cognitive non-routine tasks required in a modern economy are scarce. Acquiring the expertise to work as a doctor, manager, lawyer,
computer scientist or researcher requires many years of schooling, sustained effort, and individual ability. These workers are a valuable input and so their allocation across industries
and locations is important for overall efficiency and welfare in an economy. The marginal
productivity of a worker depends on the local productivity of the industry where she works,
but also on the set of workers that work in the same city. Larger cities with a large fraction of workers in cognitive non-routine (CNR) occupations offer learning and collaboration
opportunities that enhance the productivity of other workers. However, the abundance of
CNR workers also lowers their marginal product, particularly in industries that are less intensive in these occupations. The interaction of these forces in equilibrium determines the
spatial polarization of workers and, relatedly, the spatial specialization of industries. Can
the economy allocate scarce CNR workers in a way that improves the lives of all workers?
Our aim is to study the allocation of occupations and industries across cities in the U.S. and
to characterize the optimal spatial allocation and the policies to implement it.1
The need for optimal spatial policy is the direct implication of the presence of urban
externalities. Externalities that enhance the productivity of workers in larger cities have
been discussed, analyzed, and measured at least since Marshall (1920).2 It is natural to
hypothesize that these production externalities depend on the occupational composition of a
city. After all, CNR occupations require more interactions between knowledgeable workers.
As we show in detail in the next section, the patterns of occupational polarization and
wages across space in the U.S. suggest that this is indeed the case. First, in the absence
of technological differences or externalities across locations, decreasing returns to workers in
an occupation imply that relative CNR to non-CNR wages should decline with the share of
CNR workers. We find a large positive relationship even after controlling for a number of
observable worker characteristics.3 Why are CNR workers then making relatively more in
locations where they are abundant? A possible answer is that these locations specialize in
industries intensive in these occupations. The evidence, however, suggests that firms in CNR
abundant cities are even more intensive in CNR workers than suggested by their industrial
make up. What makes demand for these workers so high in these cities? Our take, and
an explanation that reconciles these various facts, is that the abundance of CNR workers
itself makes them more productive: a local occupation-specific externality. Estimating the
strength of these externalities is a central part of our analysis.
The detailed quantitative assessment of optimal spatial policies we propose requires a
number of contributions. These fall along four main dimensions.
1

Figure 20 in the Appendix shows CNR shares across U.S. cities.
See Duranton and Puga (2004) for a review of the literature on externalities in cities.
3
General city amenities that would be equally appealing to both occupations are unlikely to be a deciding
factor in attracting the best workers to abundant cities. Indeed, we show that while real wages of CNR
workers increase with a city’s CNR intensity, real wages of non-CNR workers do not. Therefore, amenities
alone are not driving the pattern of wages across space. For an alternative view, see Couture et al. (2018).
2

2

First, we develop a spatial equilibrium model with multiple industries and occupations as
well as occupation-specific externalities. Multiple industries, costly trade, and input-output
linkages are all key features of the environment since the demand for different occupations
depends on the occupational intensity of the specific industries in each location. The framework also features externalities that are allowed to depend on the share of workers in CNR
occupations and the total workforce in the city. Finally, it includes heterogeneous preferences
for locations that act as a form of migration costs.
Second to arrive at an optimal spatial policy requires that we derive efficient allocations
in this setup. We choose to study the efficient allocation that benefit both occupations
equally. Implementing this allocation requires particular transfers between locations and
occupations, which we characterize.
Third, the details of optimal transfers require that we quantify the model and estimate the
parameters that determine the endogenous component of city-industry-occupation specific
productivity. Thus, we first recover productivity across locations, industries, and occupation such that the equilibrium of our model matches observed data. We then parameterize
the relationship between productivity and the occupational composition and size of cities
and estimate this equation using an instrumental variables approach. As proposed in the
empirical literature (e.g. Card (2001) and Moretti (2004a)), we use past migration flows of
particular immigrant groups and the location of land-grant colleges as instruments. We also
present results using model-implied instrumental variables. Our strategy yields a robust set
of results comparable to the existing literature though, as an improvement, estimated here
directly from productivity measures recovered from the general equilibrium framework we
lay out. Our findings imply that the productivity of CNR and non-CNR workers depends
similarly on city size. In addition, the productivity of CNR workers depends strongly and
significantly on the share of CNR workers. We find less evidence that the productivity of
non-CNR workers depends on the composition of occupations.
Finally, informed by these findings, we compute the optimal allocation and discuss its implementation using particular policy tools. We also highlight important quantitative aspects
of this allocation through various counterfactual exercises.
Our findings propose a new approach to spatial policy. They indicate that the spatial
allocation of workers and industries may be improved by reducing the size of large CNR
intensive cities while, at the same time, increasing their fraction of CNR workers. These
“cognitive hubs” take advantage of scarce CNR workers in the economy by clustering them
to maximize externalities. We find that in equilibrium, the social value of CNR workers is
82% larger than their private value. However, the industrial make up of cities, as well as
their location, does impose limits on the creation of cognitive hubs. Some large cities, such
as Miami or Las Vegas, remain non-CNR abundant since they are particularly productive in
industries where CNR workers are employed less intensively. Cognitive hubs end up scattered
geographically around the country to minimize transport cost with the cities with which they
trade the most.
In order to increase the share of CNR workers while alleviating congestion in larger
cities, the optimal policy prescribes a re-allocation of non-CNR workers to smaller cities
with lower CNR shares. The end result is that under the optimal policy, the smallest cities
grow in size by playing to their strengths and expanding industries in which a large share
of their employment already resides. The corresponding growth of smaller cities also makes
3

it possible for them to sustain more employment in non-tradable industries such as retail,
accommodation, and other services. Hence, contrary to some previous literature and much
of the public discourse, the economics of the problem suggest that, with the appropriate
transfers, small industrial cities in the U.S. should attract non-CNR workers and not try to
become the next San Jose. The concentration of CNR workers in a few “cognitive hubs”
should be encouraged, not scorned. Everyone can benefit from using CNR workers in the
most productive way possible.
Naturally, implementing the optimal allocation requires a number of transfers and taxes
that depend on the location and occupation of an agent. As Fajgelbaum and Gaubert
(2018) discuss, spatial efficiency requires a flat wage tax on all individuals to correct for the
differences in the marginal utility of consumption generated by heterogeneous preferences
for location. In addition to this tax, in both our frameworks and theirs, implementing the
optimal allocation requires a set of transfers. These transfers insure that non-CNR workers
benefit equally from the optimal policy so that occupational inequality is mitigated despite
the creation of cognitive hubs. In our analysis, the base transfers to non-CNR workers
amount to $18,126 (in 2013 dollars) while CNR workers, who earn substantially more, end
up paying a base transfer of $16,856. One interpretation of this base transfer is that of
a “universal basic income” paid to all non-CNR workers. CNR workers still need to be
incentivized to move to CNR intensive cities, and non-CNR workers to move to non-CNR
intensive cities. Therefore, occupation and city-specific transfers are positively correlated
with city size for CNR workers and negatively correlated with city size and the CNR share
for non-CNR workers. Their exact value depends on the particular location and industrial
composition of each city. Ultimately, the policy amounts to a subsidy to non-CNR workers
to move to smaller cities with low CNR shares, and incentives to CNR workers to form even
more intensive “cognitive hubs” in today’s largest cities.
Perhaps surprisingly, a comparison of the current spatial equilibrium to that in 1980 reveals that the spatial allocation of workers has approached that implied by the optimal policy
(with current fundamentals). Specifically, since the 1980s, CNR workers have become not
only more abundant nationally but also increasingly concentrated in CNR intensive hubs,
many of which are large cities. This formation of cognitive hubs has taken place in parallel
with a well documented increase in wage inequality across space and occupations. Our quantitative framework implies that both processes were linked through local occupation-specific
externalities. Our analysis indicates that absent these spillovers, the spatial polarization of
workers would have been greatly mitigated, and the welfare gains received by CNR workers
smaller than those of non-CNR workers since CNR workers became more abundant nationally over that time period.
The analysis also makes clear that not all forces pushing towards the spatial polarization
of workers are necessarily welfare-enhancing. This is the case, for instance, of housing regulations captured here through changes in the productivity of the real estate sector. The cost
of these regulations has been emphasized by, among others, Glaeser and Gyourko (2018),
Herkenhoff et al. (2018) and Hsieh and Moretti (2019). Relatively low real-estate productivity growth in CNR-intensive cities since 1980 has increased housing prices and led to more
polarized CNR hubs. These changes have thus brought the spatial distribution of occupations closer to that resembling the optimal allocation. In this case, however, since this more
spatially polarized distribution of workers resulted from reductions in measured real estate
4

productivity in larger cities, “the cognitive hubs” led to declines in welfare.
Relationship to the Literature A substantial literature has pointed to increasing spatial
concentration of skilled workers (Berry and Glaeser (2005), Diamond (2016), and Giannone
(2017)), as well as increasing wage inequality across space and within cities (Baum-Snow
and Pavan (2013), and Autor (2019)), with the skill premium increasing the most in large
cities. Our paper speaks to the optimal policy reaction to those trends.
We focus on production externalities as a key driving force behind those spatial patterns.
The estimation of those externalities is a central theme in urban and spatial economics.
A robust finding is the existence of a relationship between city size and productivity (see
Melo et al. (2009) for a meta analysis). While we allow for such agglomeration externalities,
our main focus is on externalities tied to the occupational composition of the city. This
is compatible with empirical evidence by Ellison et al. (2010) that industries with similar
occupational make-up tend to be spatially proximate. Given the correlation between occupational types and skill levels, our findings of strong spillovers stemming from the occupational
composition of cities mirrors findings by Moretti (2004a; 2004b) regarding the local external
effects of human capital.
There has been ample research on the extent of spatial misallocation in the U.S. economy
and the degree to which it corresponds to heterogeneity in taxation policy (or its local
incidence), zoning laws, or other unspecified sources of distortions. Examples of papers in
that vein are Albouy (2009), Desmet and Rossi-Hansberg (2013), Ossa (2015), Fajgelbaum
et al. (2018), Colas and Hutchinson (2017), Hsieh and Moretti (2019) and, most recently
Herkenhoff et al. (2018).
Our paper sheds light on place-based policies in that it highlights the optimal endogenous
expansion of different industries in different locations. A summary of the related literature
can be found in Neumark and Simpson (2015). Rather than evaluating exogenous policies,
we derive the optimal allocation in a quantitative spatial model with local externalities.
Our derivation of optimal policy thus generalizes that of Fajgelbaum and Gaubert (2018)
in an environment with input-output linkages and where trade is differentially costly across
industries. Two other recent papers that discuss the optimal distribution of city sizes are
Eeckhout and Guner (2015) and Albouy et al. (2019).
We integrate industrial, occupational and spatial heterogeneity in a single coherent framework. Other recent work that has emphasized the joint distribution of industrial and skill
composition within the U.S. are Hendricks (2011) and Brinkman (2014). As in Caliendo
et al. (2017), we allow for trade costs, thus capturing an explicitly spatial dimension, but
add to that framework by also allowing for occupational heterogeneity and local production
externalities. Finally, on a more technical note, our paper adds to the rapidly expanding
‘quantitative spatial economics’ literature that uses general equilibrium models to address
issues related to international, regional and urban economics. Redding and Rossi-Hansberg
(2017) provide a review of the main ingredients in these models.
The rest of the paper is organized as follows. Section 2 presents stylized facts that constitute prima-facie evidence for the presence of externalities among CNR workers within cities.
Section 3 presents our multi-industry spatial model with occupation specific externalities
within cities. Section 4 quantifies the model, including our estimation of the externality

5

parameters. It also discusses the role of externalities in the equilibrium allocation. Section 5
presents the optimal allocation as well as the resulting transfers and their implementation.
Section 6 provides a decomposition of the impact of fundamental changes in the national
CNR employment share and in technology across sectors and cities between 1980 and the
recent data. Section 7 concludes. We relegate many of the model’s details, additional robustness exercises and counterfactuals to the Appendix.

2

A Motivation for Occupation-Specific Externalities

The main question under consideration is whether there is a role for policy in altering the
observed spatial polarization of employment and, if so, what are its features? We now
provide some basic facts regarding the joint spatial distribution of wages and employment
for workers in different occupations that point to the existence of important occupational
externalities. Those facts constitute prima-facie evidence that the optimal policy may in fact
involve reinforcing existing patterns, with the appropriate transfers, rather than attenuating
them.
We separate workers from 2011 to 2015 in two large occupational groups: those that are
intensive in cognitive non-routine (CNR) tasks and the others (non-CNR).4 We calculate
the average residual wages of workers in each occupation and each city after controlling for
observable worker socio-economic characteristics.5 This classification builds on the observation by Acemoglu and Autor (2011) that one can best understand wage inequality trends
through such a task based approach.
Figure 1 shows that across U.S. cities, wages of workers employed in Cognitive NonRoutine (CNR) occupations, relative to those of workers in other (non-CNR) occupations,
increase with the corresponding share of CNR workers in total employment. This suggests
that differences in relative wages across cities are, to a large degree, driven by differences in
relative demand for CNR workers.6 The size of the scatter-plot markers captures city size.
They indicate that large cities appear to also be CNR intensive.
Focusing on CNR workers, the top panel of Figure 2 indeed shows that real wages of CNR
workers increase with the intensity of CNR employment across cities. Moreover, some of the
high real wage cities include places like San Francisco and New York that on average may
provide higher amenities to CNR workers (see Diamond (2016)). In those cities, therefore,
labor demand forces are seemingly pronounced enough to more than make up for the labor
supply inducing effects of local amenities, such as the variety of retail and entertainment
options. If workers differ in their preferences for where to live, the real wages depicted in
the top panel of Figure 2 reflect the compensating differential to the marginal CNR worker
in a given city.
4

Specifically, we follow Jaimovich and Siu (2018), and define CNR occupations to include occupations
with SOC-2 classifications 11 to 29 and non-CNR occupations to include SOC-2 classifications 35 to 55.
5
We include as control variables education, potential experience, race, gender, English proficiency, number
of years in the U.S., marital status, having had a child in the last year, citizenship status, and veteran status.
6
In particular, suppose that technologies were similar across cities, and that the share of CNR workers
were driven by the supply of those workers. Then, with decreasing marginal returns to worker type, increases
in the relative supply of CNR workers would lower their relative wages.

6

Log ratio of CNR to non-CNR wages

Slope: 0.41
Correlation: 0.4
0.2
0.1
0
-0.1
-0.2
-0.3
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Figure 1: Occupational employment share and wage premium
See text for details on definitions of the wage premium and occupation classification. Log
wage premium is depicted as a deviation from employment weighted mean. Each observation
refers to a CBSA. Marker sizes are proportional to total employment.

Differences in the relative demand for CNR workers across cities can arise for several
reasons. First, differences in relative demand for CNR workers may arise from exogenous (or
historically determined) differences in industrial composition or regulations. Suppose that
the industry make up of a city, n, is the main determinant of its demand for CNR workers
relative
to other types. Then, its wage bill share for CNR workers would be (approximately)
P CN R,j
σnj , where δ CN R,j is the national wage bill share of CNR workers in industry j,
jδ
and σnj is the wage bill of industry j as a share of that city’s total wage bill. Figure 3
compares the wage-bill shares of CNR workers implied by the different industrial composition
of U.S. cities relative to those observed in the data. The black line is a 45 degree line. The
observed wage bill shares differ from, and in fact increase more than one-for-one with, those
implied by differences in industrial mix alone, thus ruling out industrial composition as a
sole determinant of labor demand across cities.7
Differences in the relative demand for CNR workers across cities can also arise endogenously if more productive workers within occupational types sort themselves into particular
cities. Baum-Snow and Pavan (2013) indeed argue that observable worker characteristics
are an important determinant of the city size wage premium. However, the fact that relative wages in Figure 1 are computed from residuals after controlling for observable worker
characteristics suggests that sorting along these characteristics is not the only driving force
7

The figure also rules out the production technology for different industries being well characterized by
Cobb-Douglas (i.e. the elasticity of substitution across worker types is likely not equal to 1).

7

0.4

Slope: 0.47
Correlation: 0.27

Log real CNR wages

0.3
0.2
0.1
0
-0.1
-0.2
-0.3
-0.4
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Slope: 0.06
Correlation: 0.04

Log real non-CNR wages

0.4
0.3
0.2
0.1
0
-0.1
-0.2
-0.3
-0.4
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Figure 2: Occupational employment share and real wages
See text for details on definitions of the wage premium and occupation classification. Real
wages are calculated using consumption price indices obtained from the model quantification
(See Section 4). Log real wages are depicted as a deviation from employment weighted mean.
Each observation refers to a CBSA. Marker sizes are proportional to total employment.

8

underlying that figure. Thus, if sorting is nevertheless part of an explanation driving the
positive relationship between the wage premia of CNR workers and the employment share
of those workers, it must be taking place along dimensions that are not easily observed.
However, assuming that differences in amenities are experienced in similar ways by CNR
and non-CNR workers, high productivity non-CNR workers would then also sort themselves
into cities with a high share of CNR workers. The bottom panel of Figure 2 suggests that
this is not, in fact, the case.8
Finally, differences in relative demand for CNR workers may be explained by endogenous differences in productivity, even when not from sorting, if these differences arise from
production externalities that predominantly affect CNR workers. First, to the extent that
production externalities also increase with the concentration of CNR workers in a given city,
it is then naturally the case that the demand for CNR workers would increase with the share
of employment in CNR occupations, as suggested by Figure 1. Second, if production externalities mainly enhance the productivity of CNR workers, then real wages of CNR workers
would increase with the share of CNR employment within cities, as in the top panel of Figure
2, but no such effect would be expected among non-CNR workers, as suggested by the bottom panel of Figure 2. Third, and most importantly, Figure 3 shows that observed wage bill
shares of CNR workers increase more than one-for-one with those implied by differences in
industrial composition alone. This observation would be expected in an environment where
production externalities intensify the implications of industrial mix. Specifically, CNR workers will concentrate, all else equal, in cities whose industrial composition is tilted towards
industries intensive in CNR workers. In the presence of production externalities, therefore,
this concentration would lead to increases in the productivity of CNR workers. If the elasticity of substitution between worker types is higher than 1, one would then expect higher
wage shares for CNR workers in those cities relative to those given by industrial composition
alone.

3

A Quantitative Spatial Model with Multiple Industries and Occupations

The economy has N cities and J sectors. We denote a particular city by n ∈ {1, ..., N }
(or n0 ) and a particular sector by j ∈ {1, ..., J} (or j 0 ). Individuals are endowed with an
occupational type and cannot switch types. There are K occupational types, denoted by
k ∈ {1, ..., K} (or k 0 ), with aggregate number of workers Lk per type (total employment in
occupation k aggregated across industries and cities). Firms in all cities use multiple types
of labor but in potentially different proportions depending on the industry and the city.
Aggregate regional land and structures in region n are denoted by Hn . Labor of all types
moves freely across regions and sectors, while structures are region-specific. Some sectors
are tradable while others are not.
Quantities in the economy may be associated with industries, cities, or occupations.
For notational convenience, we denote aggregates across a given dimension by omitting
8
The small relevance of sorting to explain differences in wages across cities has in fact been recently
verified in empirical work by Baum-Snow and Pavan (2011) and Roca and Puga (2017).

9

0.8
0.75
Slope: 1.89
Correlation: 0.85

CNR wage bill share (data)

0.7
0.65
0.6
0.55
0.5
0.45
0.4
0.35
0.3
0.3

0.35

0.4

0.45

0.5

0.55

0.6

0.65

0.7

0.75

0.8

CNR wage bill share (predicted)
Figure 3: Occupational wage bill share: predicted vs actual
See text for details on definitions of the wages and occupation classification. The predicted
wage bill-shares are obtained by assuming that within-industry wage bill shares were equal
to national averages (see text for details). Each observation refers to a CBSA. Marker sizes
are proportional to total employment.

kj
the corresponding index. Thus, for example,
P kjLn is the number of workers employed in
k
occupation k in industry j in city n, Ln = j Ln represents the number of workers employed
P
P
in occupation k in city n, Lk = n Lkn represents all workers in occupation k, and L = k Lk
is simply total employment.

3.1

Individuals

Workers in each location n ∈ {1, ..., N }, are endowed with labor of type k ∈ {1, ..., K}, and
order consumption baskets according to Cobb-Douglas preferences with shares αj over their
consumption of final domestic goods, Cnkj :

10

Cnk =

Y

Cnkj

αj

,

j

where Cnk is a consumption aggregator. Consumption goods consumed in city n areP
purchased
at prices Pnj in sectors j ∈ {1, ..., J}. Utility is homogeneous of degree one, so that j αj = 1.
Workers supply one unit of labor inelastically. The income of a worker of type k residing
in city n is
Ink = wnk + χk ,

(1)

where wnk is the wage earned by a worker in occupation k in city n. The term χk represents
the return per household on a national portfolio of land and structures from all cities,
P
0
0
k
k
n0 rn Hn
χ =b
,
k
L
where rn is the rental rate on land and structures in that city, and bk denotes the share
of the national portfolio accruing to workers of occupational type k. In what follows, we
assume that bk is determined such that different worker types receive a share of the national
portfolio proportional to their share of wages in the total wage bill, so
P k k
w L
k
b = P n nk0 n k0 .
k0 ,n0 wn0 Ln0
Agents of a given occupational type differ in how much they value living in different
cities. These differences are summarized by a vector a = {a1 , a2 , ..., aN }, with each entry an
scaling the utility value that an individual receives from living in city n. We associate the
elements an with the particular way in which different workers experience the amenities of
given cities. Conditional on living in city n, the problem of an agent employed in occupation
k and characterized by amenity vector a is
Y
X
αj
vnk (a) ≡ max an Akn
Cnkj (a) , subject to
Pnj Cnkj (a) = Ink ,
J
kj
{Cn (a)}j=1
j
j
where Akn denotes an exogenous component of city-specific utility common to all individuals
of type k living in city n. All workers in a given occupation living in a given city will choose
the same consumption basket. It follows that Cnkj (a) = Cnkj for all a.
Agents move freely across cities. The value of locating in a particular city n for an
individual employed in occupation k, with idiosyncratic preference vector a, is
an Akn Ink
= an Akn Cnk .
Pn
In equilibrium, workers move to the location where they receive the highest utility so that
vnk (a) =

v k (a) = max vnk (a) ,
n

11

where v k (a) now denotes the equilibrium utility of an individual in occupation k with amenity
vector a. We assume that an is drawn from a Fréchet distribution independently across cities.
We denote by Ψ the joint cdf for the elements of a across workers in occupation k, so that
(
)
X
Ψ (a) = exp −
(an )−ν ,
n

where the shape parameter ν reflects the extent of preference heterogeneity across workers
employed in occupation k. Higher values of ν imply less heterogeneity, with all workers
ordering cities in the same way when ν → ∞. The assumption of a Fréchet distribution for
idiosyncratic amenity parameters implies closed form expressions for the fraction of workers
in each city:



k k ν
C
A
n
n
 Lk .
(2)
Lkn = Pr vnk (a) > max
vnk0 (a) = P
k
k ν
n0 6=n
A
C
0
0
0
n n
n

3.2

Firms

There are two types of firms: those producing intermediate goods and those producing final
goods. There is a continuum of varieties of intermediate goods which are aggregated into
a finite number of final goods corresponding to J sectors. Varieties of intermediate goods
are characterized by the sector in which they are produced, and by a vector of city-specific
productivity parameters, z = {z1 , z2 , ..., zN }, with each element zn scaling the productivity
of firms in city n producing that variety.
Final goods are sold in the city where they are produced. Varieties of intermediate goods
are traded across cities. Because of transportation costs, the price earned by intermediate
goods producers need not be the same as the price paid by final goods producers. Intermediate goods producers operating in city n, sector j, producing a variety indexed by z, produce
a quantity, qnj (z), for which they earn a price pjn (z). Final goods producers operating in city
n, sector j, purchase a quantity Qjn (z) of the variety of intermediate goods indexed by z.
3.2.1

Intermediate Goods

Idiosyncratic productivity draws, z, arise from a Fréchet distribution with shape parameter
θ. Draws are independent across goods, sectors, and regions. Specifically, if we let Φ be the
joint cdf of variety-specific productivity parameters across firms in industry j, then
(
)
X
(zn )−θ .
Φ (z) = exp −
n

Production of intermediate goods a variety indexed by z, in city n, and industry j, takes
place using the technology,

j

qnj (z) = zn Hnj (z)βn

"
X

kj
λkj
n Ln (z)

 −1


j
j γn

# −1
(1−βn
)

Y

j0

k

12

0

j0 j

Mnj j (z)γn

(3)

0

where γnj j ≥ 0 is the share of sector j input expenditures spent on materials from sector
j 0 in city n, γnj ≥ 0 is the share of value added in gross output in sector j, and βnj is
the share of land and structures
in value added in that sector. The production function is
PJ
0
constant returns to scale, j 0 =1 γnj j = 1 − γnj . The variable λkj
n denotes a labor augmenting
productivity component that is city, industry, and occupation specific. We denote by Hnj (z)
the quantity of structures used by a firm producing a variety z in industry j operating in
0
city n, by Mnj j (z) the quantity of material goods this firm uses from sector j 0 , and by Lkj
n (z)
the workers of type k it employs.
Importantly, given the evidence presented in Section 2, we allow λkj
n to reflect externalities
that depend on the composition of the labor force. In particular, we let
kj
λkj
n = λn (Ln ),

where Ln = {L1n , ...., LK
n } summarizes the occupational make up of the labor force in city n.
3.2.2

Final Goods

A final goods firm operating in industry j in city n produces the quantity Qjn according to
the technology,
η
 "
 η−1
# η−1
Z X
η
Qjn = 
Qjnn0 (z)
,
dΦ (z)

n0

where Qjnn0 (z) represents its use of intermediate goods of variety z produced in city n0 .
One unit of any intermediate good in sector j shipped from region n0 to region n requires
producing κjnn0 ≥ 1 units in the origin n0 . Therefore, producers of final goods in each sector
solve
XZ j j
j j
κnn0 pn0 (z)Qjnn0 (z) dΦ (z) ,
max
Pn Q n −
j
Qnn0 (z)

n0

subject to Qjnn0 (z) ≥ 0, where Pnj is the price of the final good in sector j, city n. Intermediate
goods in non-tradable sectors cannot be shipped between cities.
Final goods firms purchase intermediate goods from the location in which the acquisition
cost, including transportation costs, is the least. Denote by Xnj the total expenditures on final
goods j by city n, which must equal of the value of final goods in that sector, Xnj = Pnj Qjn .
Because of zero profits in the final goods sector, total expenditures on intermediate goods
in a given sector are then also equal to the cost of inputs used in that sector. Following the
usual Eaton and Kortum (2002) derivations, given a final good j produced in city n, the
share of intermediate inputs imported from location n0 is

j
πnn
0

 j j −θ
κnn0 xn0
= P
 j
−θ ,
N
j
κ
x
00
00
00
nn
n
n =1

where
13

j

j γn
1−βn
!"
#
j0 j




j
J  j 0 γn
 r βn
X wk 1− 1−  Y
P
1
n
n
n
xjn =
j
j
kj
j0j


1 − βn k
λn
 βn
 j 0 =1 γn

(4)

is a cost index associated with the production of varieties in sector j, city n. In quantifying
j
the model, we also allow for two non-tradable sectors for which πnn
= 1.

3.3

Market Clearing Conditions

Within each city n, the number of workers employed in occupation k must equal the number
of those workers who choose to live in that city. Put alternatively,
Z
XZ
kj
Ln (z)dΦ(z) = ζnk (a)dΨ(a), ∀ n = 1, ..., N , k = 1, ..., K.
(5)
j

where ζnk (a) ∈ {0, 1} denotes the location choice of households as a function of their type.
Market clearing for land and structures in each region imply that
XZ
(6)
Hnj (z) dΦ (z) = Hn , n = 1, ..., N.
j

Final goods market clearing implies that
X
k

Lkn Cnkj

+

XZ

0

Mnjj (z) dΦ (z) = Qjn .

(7)

j0

Finally, intermediate goods market clearing requires that
X j
qnj (z) =
κn0 n Qjn0 n (z).

(8)

n0

4

Quantifying the Model

In the model we have laid out, any quantitative statement about efficient allocations will
necessarily depend on the parameterization of occupation-specific externalities. Estimating
these externalities in turn requires that we take the model to the data and recover productivity in different occupations, sectors, and cities such that the equilibrium of the model
matches these data. Observations used in the model inversion are thus matched by construction. However, other predictions of the model with respect to recovered productivity or
tradable goods prices do not have easily observable counterparts. In fact, we show that the
properties of productivity and prices delivered by the model are comparable to those found
in different recent studies, effectively tying these studies within a single general equilibrium
framework.
The model is mapped into 22 industries and the two large occupational groups (cognitive
non-routine and others) emphasized in Section 2. Of the 22 industries, two are non-tradable,
14

meaning that all local output is also used locally. The two non-tradable sectors include realestate services, which is the single user of land in each city, and a composite sector comprising
retail, construction, and utilities. Tradable industries include 10 manufacturing sectors and
10 service sectors. In quantifying the model, we focus on the period 2011 to 2015.
The set of parameters needed to quantify our framework fall into two broad types: i) parameters that are constant across cities (but may vary across occupations and/or industries)
and ii) parameters that vary at a more granular level and require using all of the model’s
equations (i.e. by way of model inversion) to match data that vary across cities, industries,
and occupations.
0
To obtain an initial calibration for the share parameters γnj , γnjj and αj , we use an average
of the 2011 to 2015 BEA Use Tables, each adjusted by the same year’s total gross output,
where we assume that tradable sectors have γnj = γ j that are constant across cities and
0
similarly for γnjj ’s.9
We adopt the convention that all land and structures are managed by firms in the real
estate sector that then sell their services to other sectors. Accordingly, for all sectors other
than real estate, we reassign the gross operating surplus remaining after deducting equipment
investment to purchases from the real estate sector. These surpluses are in turn added to
the gross operating surplus of real estate.10 This convention implies that the share of land
and structures, βnj , in the production of all sectors other than real estate is equal to zero.
We set θ, the Fréchet parameter governing trade elasticities, to 10 or well within the
range of estimates of trade elasticities in the literature, between 3 to 17 (see Footnote 44 in
Caliendo and Parro (2015), as well as Head and Mayer (2014), section 4.2 for comprehensive
summaries of estimates). While estimates of θ have been carried out at various levels of
disaggregation, these can vary somewhat widely for a given sector or commodity across
studies.11 For our purposes, this uncertainty is further compounded by the fact that trade
elasticities that are relevant for trade between countries may not be appropriate for trade
between regions or cities.
As mentioned, we assume that two of the sectors (“real estate,” as well “retail, construction, and utilities”) are non-tradable, so that their transportation costs are treated as infinite.
For the tradable sectors, we follow Anderson et al. (2014) and assume that trade costs inj
crease with distance. Specifically, in order to ship one unit of good to city n, κjnn0 = (dnn0 )t
units of the good need to be produced in city n0 , with dnn0 the distance between city n
and city n0 in miles.12 The parameter tj is industry specific. For commodities, we directly
estimate tj from the Commodity Flow Survey synthetic microdata using standard gravity
regressions based on model trade-shares. In the tradable services, we use the values obtained
by Anderson et al. (2014) using Canadian data.
9

Since the model does not allow for foreign trade, we adjust the Use Table by deducting purchases from
international producers from the input purchases and, for accounting consistency, from the definition of gross
output for the sector.
10
One can verify that those reassignments do not affect aggregate operational surplus (net of equipment
investment), aggregate labor compensation, and aggregate value added (net of equipment investment).
11
For example, while Caliendo and Parro (2015) estimate an elasticity of 7.99 for Basic Metals and 4.75
for Chemicals, Feenstra et al. (2018) estimate a elasticities of, respectively, 1.16 and 1.46 for those two
categories.
12
We assume that within city distance is equal to 20 miles.

15

Ciccone and Peri (2005) summarize estimates for the elasticity of substitution between
skilled and unskilled labor in the literature as ranging between 1.36 and 2. Card (2001)
estimates the elasticity of substitution between occupations to be closer to 10. We adopt
 = 2 as a benchmark. Finally, we set ν so that the average elasticity of employment with
respect to real wages in our model matches the estimate of 1.36 as in Table A.11, column 4,
of Fajgelbaum et al. (2018). This implies ν = 2.02.13
Given the parameters above, we use data on wages by occupation and location (wnk ), as
well as data on employment by occupation, industry and location (Lkj
n ), to obtain equilibrium
k
kj
values of productivity and amenities, λn and An respectively. Data pertaining to wnK , and
Lkj
n
P
k0 j is available from the American Community Survey (ACS). The ACS also allows us
k0

Ln

to adjust wages for individual characteristics so that our data captures P
city wage premia
0
for each occupation. The Census provides measures of total employment, k0 Lkn j , from the
County Business Patterns (CBP) that better match BEA industry-level counts. We combine
total employment from the CBP with ACS data on employment shares to obtain Lkj
n . The
k
kj
exact procedure that yields λn and An by way of model inversion is described in detail in
Appendix B.
Table 1 compares the relationships between wages, employment, and employment composition across different cities highlighted in previous work relative to the data used in our
model inversion. The first three rows of the table show regression coefficients of log wages
for CNR workers, non-CNR workers, and the CNR wage premium, on different measures of
city employment and employment composition. The subsequent rows show similar regression
coefficients obtained in previous literature. The data we use implies relationships that are
consistent with those in other work. In particular, all wages increase with city size, more
so for skilled workers. A similar relationship holds for wages and city composition where
proportionally more skilled cities exhibit higher wages for all workers, more so for skilled
workers.14

4.1

Model-Implied Productivity, Amenities, and Prices

There exists a large literature that has estimated and studied the role of agglomeration externalities. Much of this work has relied on a production function approach using measures
of output and factor inputs to estimate Total Factor Productivity (TFP), or using labor
productivity more directly, in exploring how productivity depends on the scale of city employment or its skill composition. There is also a literature that has sought to understand
how tradable goods prices vary with city size. We now show that our model-consistent TFP
measures and tradable prices compare favorably with previous empirical work in both literatures but within a single general equilibrium framework. Moreover, the same framework
13
Here, ν is somewhat larger than the value obatained by Fajgelbaum et al. (2018). This reflects the fact
that ν is the elasticity of labor supply with respect to consumption rather than wages. Because Fajgelbaum
et al. (2018) abstract from non-wage income, they estimate values for the analogous parameter in their model
of between 0.75 and 2.25 depending on identification assumptions.
14
An exception is Moretti (2004a) who finds no statistically significant differences in the way that wages
of college educated workers and non-college educated workers vary with employment composition across
cities. Our findings, however, rely on a more recent time period where other work has found an increasingly
pronounced relationship between skill and city size (see Baum-Snow and Pavan (2013)).

16

can then be used directly to provide a quantitative assessment of optimal spatial policy.
Table 1: Wages, Employment, and City Composition
LCNR
n
LnCNR
n

ln(Ln )




ln wnnCNR
 CNR 
n
ln wwnCNR

0.070
(0.003)
0.049
(0.002)
0.021
(0.001)

0.332
(0.021)
0.218
(0.017)
0.114
(0.008)

Moretti HS1

—

—

Moretti Some College

—

—

Moretti College +

—

—

0.0455
(0.0080)

—

ln wnCNR

n

Roca & Puga wage log wage constant2
Diamond log college wage3

—

Diamond log non-college wage4

—

Baum-Snow et al. log wage, 2005-20075
Baum-Snow et al. log wage ratio6

0.065
(< 0.01)
0.029
(< 0.003)

ln



Dependent Variable

0.26
(0.11)
0.18
(0.01)



LCNR
n
Ln

1.474
(0.089)
0.973
(0.072)
0.501
(0.036)
0.85
(0.06)
0.86
(0.06)
0.74
(0.06)
—
—
—

—

—

—

—

1. Moretti (2004a) "Estimate the social return to higher education: evidence from longitudinal and
repeated cross-sectional data", Table 5.
2. Roca and Puga (2017) "Learning by Working in Big Cities", Table 1.
3. Diamond (2016) "The Determinants and Welfare Implications of US Workers’ Diverging Location
Choices by Skill: 1980-2000", Figure 4.
4. Diamond (2016), Figure 3
5. Baum-Snow et al. (2018) "Why Has Urban Inequality Increased?", Table 1. Standard error reported
as less than 0.01.
6. Baum-Snow et al. (2018), Table 2. Standard error reported as less than 0.003.

4.1.1

Tradable Goods Prices and City Size

Recent work by Handbury and Weinstein (2014), using Nielsen home-scanned data on tradable goods bought in grocery stores, highlights that tradable consumer prices decrease with
city size. Prior to that study, the consensus view, based on more aggregated prices, was
that such prices instead increased with city size.15 Given the detailed nature of Nielsen
home-scanned prices, Handbury and Weinstein (2014) are able to control for product buyer
15

In principle, given that rents generally increase with city-size, tradable consumer prices might indeed
follow the same pattern to the degree that they are partially influenced by local rents as an input cost.

17

and retailer heterogeneity in a way that is not easily achieved with more aggregate prices.
Allowing for those controls reduces the elasticity of tradable goods bought in grocery stores
with respect to city size to zero. When Handbury and Weinstein (2014) further adjust local
price indices to reflect differences in the number of varieties of goods available in different
cities, they find that the price of tradable goods bought in grocery stores actually decreases
with city size, with an elasticity equal to −0.011 (Table 6). In addition, when calculating
this elasticity after purging the effect of local rents on retail costs, they obtain −0.017 (Table
9 in Working Paper version). In the presence of trade costs, such a declining relationship
indeed emerges when larger cities are generally more productive. It can hold for certain categories of goods even if the local scarcity of land means that the price of real estate services
is higher, thus driving up the general price level, in larger cities.
Table 2 below summarizes the relationship between prices obtained in our model inversion
and city size. Similar to Handbury and Weinstein (2014), our general equilibrium framework
reveals a decreasing relationship between prices and city size and, in fact, across all tradable
sectors with an average elasticity of −0.012. In the Food and Beverage sector, our model
inversion reveals an elasticity of −0.010, virtually identical to that Handbury and Weinstein
(2014) for grocery products. Remarkably, our finding arises without direct observation of
prices. Instead they follow from supply and demand relationships within a structural trade
model where cities produce different goods and where trade across regions is costly. When
informed by the data described above, our model then implies that large cities are generally
more productive thus yielding smaller prices for tradable goods.
4.1.2

Amenities

We now turn to the occupation-specific amenities implied by the model inversion, Akn . The
relationship between relative amenities for CNR and non-CNR workers against the size and
composition of cities is depicted in Figure 4. Our findings conform to Diamond (2016) in
that cities with more CNR workers are also relatively more amenable to those same workers.
At the same time, larger cities are relatively more amenable to CNR workers helping account
for the concentration of CNR workers in large cities.
Diamond (2016) provides evidence for a causal impact of local population composition
on amenities. In Appendix E, we show the effect of filtering out the component of amenities
that is endogenous to the local labor composition.16 While suppressing those endogenous
effects eliminates the positive relationship between the CNR share and relative amenities,
the relationship between relative residual amenities and city size becomes stronger. Intuitively, given the estimates in Diamond (2016), large non-CNR populations generate larger
congestion effects on CNR workers than on non-CNR workers. The bottom line, therefore, is
that our findings below regarding the optimality of concentrating CNR workers, computed
without endogenous amenities, are if anything conservative.
16

Here, we use the parameterization that Fajgelbaum and Gaubert (2018) obtain based on the estimates
by Diamond (2016).

18

Slope: 2.02
Correlation: 0.96

0.4

0.2

0

-0.2

-0.4

-0.6
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Slope: 0.05
Correlation: 0.5
0.4

0.2

0

-0.2

-0.4

-0.6
9

10

11

12

13

14

15

16

log city employment
Figure 4: Relative amenities and city size and composition
Ratio of occupational-specific amenity parameters for each city obtained from the model
quantification against employment share of CNR workers and log total employment. Each
observation refers to a CBSA. Marker sizes are proportional to total city employment.

19

4.1.3

Total Factor Productivity

A substantive literature in urban economics has addressed the relationship between productivity and city size (i.e. “agglomeration economies”), as well as that between productivity
and employment composition. Baseline estimates of real Total Factor Productivity (TFP)
typically rely on Cobb-Douglas production functions that allow for different types of labor
to enter separately. Within the context of our model, we follow Caliendo et al. (2017) and
express measured TFP as

ln T F Pnj

P
= ln

n0

X
X 0
πn0 n Xnj 0
0
j j
j
j
j
kj
kj
−
γ
β
ln
H
−
γ
(1
−
β
)
δ
ln
L
−
γnj j ln Mnj j , (9)
n n
n
n
n
n
j
Pn
k
j0

where δ kj is the share of occupation k wages in sector j’s wage bill. In the model we have
laid out, and up to a first-order approximation (abstracting from selection effects induced
by trade), we have that for tradable sectors,
X
ln T F Pnj '
δ kj γ j ln λkj
n ,
k

where λkj
n

γ j

may thus be interpreted as the component of TFP in sector j and city n
γ j
associated with occupation k.17 In the remainder of the paper, we let Tnkj = λkj
.
n
Table 3 below shows the city in which TFP recovered from the model is highest by
industry. The results largely conform to intuition. Productivity in Computers and Electronic
Equipment is highest in San Jose, CA; Honolulu stands out for Accommodation; Anchorage
for Oil, Chemicals and Nonmetallic Minerals; and Seattle for Motor Vehicles (which includes
aircrafts). It is also interesting to note that the two largest cities in the country are also top
cities in several sectors, with New York dominating in most service sectors while Los Angeles
stands out in several manufacturing sectors.
Table 4 compares estimates of productivity elasticities with respect to city size and employment composition obtained from our model inversion to those found in previous work.
In particular, we report elasticities with respect to city size from the meta analysis carried
out in Melo et al. (2009). As reported in Table 4, all results point to a positive relationship
between TFP and city size. Moreover, our findings fall within the range of reduced form
estimates found in the literature, with the possible exception of services. However, as in
previous literature, the elasticity of (tradable) services productivity with respect to city size
is substantially larger than that of manufacturing. To the extent that regional prices are
not readily available, the relationship between TFP and city size estimated in some of the
existing literature captures variations in nominal TFP, that is ln T F Pnj + ln Pnj . In other
words, while our model inversion produces measures of Pnj , the absence of local price data
can otherwise bias downward empirically estimated elasticities of TFP with respect to city
size. Indeed, our findings indicate that elasticities of real TFP with respect to city-size are
somewhat larger than those of nominal TFP.
17

See Appendix B for the details of this derivation. For non-tradable sectors, the city-specific share
parameters make it challenging to compare this term across-cities. Furthermore, our data does not allow us
to separate the productivity of the real-estate sector from the stock of housing.

20

Table 2: Elasticities of Final Goods Prices, Pnj , w.r.t. Ln
Sector
Food and Beverage
Textiles
Wood, Paper, and Printing
Oil, Chemicals, and Nonmetallic Minerals
Metals
Machinery
Computer and Electronic
Electrical Equipment
Motor Vehicles (Air, Cars, and Rail)
Furniture and Fixtures
Miscellaneous Manufacturing
Wholesale Trade
Transportation and Storage
Professional and Business Services
Other
Communication
Finance and Insurance
Education
Health
Accommodation
Real Estate
Retail, Construction and Utilities
Average
Tradable Average
Manufacturing Average
Tradable Services Average

Elasticity
−0.010
−0.022
−0.011
−0.016
−0.014
−0.005
−0.013
−0.003
−0.008
−0.009
−0.013
−0.007
−0.007
−0.018
−0.011
−0.002
−0.011
−0.024
−0.030
−0.012
0.132
0.047
−0.003
−0.012
−0.011
−0.013

We also compare our TFP regressions coefficients with respect to the share of CNR
workers to those estimated by Moretti (2004b) using a panel of firms (we use the CNR share
of employment, whereas he uses the college educated share of employment). Again, our TFP
measures are consistent with semi-elasticities that are of the same sign and comparable in
magnitude to those in Moretti (2004b). As before, the regression coefficients become larger
when deflated by the model-consistent regional price index.
Table 5 shows the coefficients in Table 4 for tradable sectors disaggregated by industry.
We find that the positive relationship between TFP and city size holds uniformly across
all tradable sectors. In addition, we find that the semi-elasticity of TFP in Computer and
Electronics with respect to employment composition across cities is more than twice as large
as the average for manufacturing, replicating the finding by Moretti (2004b) for high-tech
sectors.

4.2

Estimating Production Externalities by Worker Type

So far, we have described the equilibrium levels of ocupation-specific productivity consistent
γ j
with observed data on wages and sectoral employment, Tnkj ≡ λkj
. Having obtained
n
these productivity measures through the model inversion, we now turn to estimating their

21

Table 3: City with Top TFP for each Industry
Industry
Food and Beverage
Textiles
Wood, Paper, and Printing
Oil, Chemicals, and Nonmetallic Minerals
Metals
Machinery
Computer and Electronic
Electrical Equipment
Motor Vehicles (Air, Cars, and Rail)
Furniture and Fixtures
Miscellaneous Manufacturing
Wholesale Trade
Transportation and Storage
Professional and Business Services
Other
Communication
Finance and Insurance
Education
Health
Accommodation

MSA
Los Angeles, CA
Los Angeles, CA
Minneapolis, MN
Anchorage, AK
Los Angeles, CA
Houston, TX
San Jose, CA
Los Angeles, CA
Seattle, WA
Los Angeles, CA
Los Angeles, CA
New York, NY
New York, NY
San Francisco, CA
Los Angeles, CA
New York, NY
New York, NY
New York, NY
New York, NY
Honolulu, HI

Table 4: Elasticities of TFP with respect to City Size and Employment Composition
R
LCN
n
Ln

ln (Ln )

Average1
Manufacturing Average
Tradable Services Average

Real

Nominal

Real

Nominal

0.039
0.028
0.052

0.027
0.017
0.038

0.794
0.574
1.062

0.616
0.451
0.819

0.031
(0.099)
0.040
(0.095)
0.148
(0.148)

Melo et. al. Economy2
Melo et. al. Manufacturing
Melo et. al. Services
Moretti College Share
(Manufacturing)

3

—

—
—
—
0.846
(0.102)

1. Excludes non-tradables
2. Melo et al. (2009) "A Meta-analysis of estimates of urban agglomeration economies", Table 2. "By
type of response variable" and "By industry group."
3. Moretti (2004b) "Workers’ Education, Spillovers, and Productivity: Evidence from Plant-Level Production Functions", Table 2. College share in other industries, Cobb-Douglass production, 1992.

relationship to the scale and population composition of cities.18
18

The empirical exercise focuses on tradable sectors, for which our model generates measures of productivity separate from local housing supply.

22

Table 5: Sectoral Elasticities of TFP with respect to city size and employment composition
R
LCN
n
Ln

ln(Ln )

Food and Beverage
Textiles
Wood, Paper, and Printing
Oil, Chemicals, and Nonmetalic Minerals
Metals
Machinery
Computer and Electronic
Electrical Equipment
Motor Vehicles (Air, Cars, and Rail)
Furniture and Fixtures
Miscellaneous Manufacturing
Wholesale Trade
Transportation and Storage
Professional and Business Services
Other
Communication
Finance and Insurance
Real Estate
Education
Health
Accommodation

Real

Nominal

0.024
0.029
0.024
0.047
0.024
0.018
0.056
0.015
0.023
0.017
0.032
0.044
0.031
0.053
0.057
0.040
0.062
0.755
0.075
0.065
0.038

0.014
0.007
0.012
0.031
0.010
0.014
0.043
0.013
0.014
0.008
0.020
0.037
0.025
0.035
0.046
0.037
0.051
0.887
0.051
0.035
0.027

Real Nominal
0.427
0.276
0.576
0.935
0.489
0.424
1.458
0.398
0.532
0.168
0.626
0.906
0.595
1.135
1.063
0.878
1.353
9.681
1.624
1.421
0.588

0.311
0.246
0.285
0.795
0.259
0.359
1.203
0.348
0.384
0.277
0.491
0.811
0.492
0.817
0.936
0.829
1.133
12.279
1.091
0.776
0.484

We assume that occupational spillovers have the same labor augmenting effect across
sectors. Hence, we let
 k
Ln
kj
R,k j
ln Tn = τ γ ln
+ τ L,k γ j ln (Ln ) + ln Tbnkj ,
(10)
Ln
where Tbnkj is an exogenously determined component of technology. In turn, this term is given
by
ln Tbnkj = ak0 + akZ Znj + dkj + ukj
n ,
where Zn is a vector of observable city/industry characteristics, dkj denotes a set of industry
dummies, and ukj
n captures unobserved city-specific sources of natural advantages in the
production of sector j goods with workers of type k.
Equation (10) allows for an agglomeration effect that depends on city size, through τ L,k ,
and an additional effect related to the share of each worker type, through τ R,k . The elasticity
of productivity with respect to the agglomeration of a given type k is19


k
∂ ln λkj
Lkn
n
R,k
L,k Ln
=
τ
1
−
+
τ
.
(11)
∂ ln Lkn
Ln
Ln
19

See Glaeser and Gottlieb (2008) for a discussion of the marginal implications those elasticities.

23

Therefore, when τ L,k < τ R,k , individuals in a given occupation, k, have a larger marginal
effect on that occupation in cities where those individuals are less represented. However,
there are also cross-occupational effects. Specifically, for k 6= k 0 , we have that
0

 k
∂ ln λkj
n
R,k
L,k Ln
,
=
−
τ
−
τ
∂ ln Lkn0
Ln

(12)

which implies negative cross-occupational externalities when τ L,k < τ R,k . This congestion
effect increases with the share of workers in alternative occupations.
The first column of Table 6 reports the coefficients from a simple OLS regression where
we allow for two-way clustered standard errors by city and industry. These coefficients
are positive and significant. They indicate that individual productivity is enhanced by the
presence of other workers of the same occupational group. The coefficients also indicate the
presence of congestion effects since cross-occupational externalities are negative. These OLS
estimates, however, are potentially biased since workers of a given type may choose to live
in cities where they are relatively most productive. This would induce a correlation between
the exogenous component of worker productivity, Tbnkj , and the share of each type of worker
in a given city. Moreover, the estimates might be biased if there are omitted variables which
are correlated with both Tbnkj and the occupational ratio.
To help address the omitted variable bias, Table 6 explores the effects of adding various
controls to our basic OLS regression. Column 2 includes dummies for 9 census divisions
interacted with industry dummies.20 These should absorb many of the geographical and
historical components that may jointly determine amenities and productivity in different
places. Column 3 introduces the share of manufacturing workers in 1920 as a control.21
This aims to extract long standing factors that may influence the industrial composition
in individual places. Column 4 introduces geographic amenities constructed by the United
States Department of Agriculture (USDA) that include measures of climate, topography and
water area.22 These controls allow for the possibility that the same geographic characteristics that may lead workers to choose certain cities may also influence their productivity.
Finally, column 5 adds controls for demographic characteristics of different cities, including
racial composition, gender split, the fraction of immigrant population, and age structure.23
Together, these controls help narrow down the identification of the externality coefficients to
the extent that more productive cities attract individuals of certain demographic make-up.
The point estimates of the coefficients on CNR workers change only slightly with the
controls, while they increase the effect of labor market composition on non-CNR workers.
These controls help extract exogenous sources of productivity variation that affect individual
location decision. However, any residual variation in productivity may still be correlated with
20

They are 1. New England, 2. Mid-Atlantic, 3. East North Central, 4. West North Central, 5. South
Atlantic, 6. East South Central, 8. Mountain and 9. Pacific
21
Just as with our labor force variables of interest, this and other controls are likewise interacted with the
value added shares γnj .
22
Geographic controls include average temperature for January and July, hours of sunlight in January,
humidity in July from 1941 to 1970, variation in topography, and percent of water area.
23
Demographic controls are, by city, the percent female, black, hispanic, and percent in the age bins 16-25
and 26-65 (observations related to the younger than 16 population are dropped from the sample, and the
age bin 66+ is omitted from the regression).

24

population levels and composition. In order to further account for those residual effects, we
adopt an instrumental variable strategy, drawing on the existing empirical literature for
candidate sources of exogenous variation. The key difference here is that we seek to explain
productivity measures extracted from a structural model directly.
Table 6: OLS Estimates
(1)
VARIABLES
k

γnj log( LLnn )
γnj log(Ln )

(2)

(3)

CNR

non-CNR

CNR

non-CNR

CNR

0.822***
(0.13)
0.422***
(0.05)

0.636***
(0.23)
0.345***
(0.04)

0.817***
(0.12)
0.419***
(0.05)

0.673***
(0.20)
0.343***
(0.04)

0.842***
(0.12)
0.409***
(0.05)
0.0217
(0.03)
0.0741***
(0.01)
-0.0816***
(0.03)
-0.0225
(0.02)
-0.0272*
(0.01)
0.0372***
(0.01)

X

X

7,640
0.599

7,640
0.764

Jan. Temp
Jan. Hrs Sun
July Temp
July Humid
Topology
% Water Area
% 1920 Mfg Workers

(4)
non-CNR

CNR

0.689***
0.838***
(0.22)
(0.12)
0.345***
0.410***
(0.04)
(0.05)
-0.0553**
0.0135
(0.03)
(0.03)
0.0479*** 0.0720***
(0.01)
(0.01)
-0.0265
-0.0756***
(0.02)
(0.02)
-0.0156
-0.0163
(0.03)
(0.02)
-0.0244**
-0.0259*
(0.01)
(0.01)
0.00338
0.0380***
(0.01)
(0.01)
-0.00587
(0.01)

% female
% Black
% Hispanic
% Age 16-25
% Age 26-65

Industry FE
Census Division FE
Observations
R-squared

4.2.1

X
X
X
X
X
X
X
X
7,640
7,640
7,560
7,560
0.649
0.797
0.659
0.804
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1

X
X
7,460
0.661

(5)
non-CNR

CNR

non-CNR

0.667***
0.900***
0.681***
(0.21)
(0.12)
(0.22)
0.343***
0.388***
0.322***
(0.04)
(0.05)
(0.04)
-0.0583**
0.00705
-0.0464**
(0.02)
(0.03)
(0.02)
0.0490*** 0.0658*** 0.0573***
(0.01)
(0.02)
(0.02)
-0.0209
-0.0705*** -0.0398*
(0.02)
(0.02)
(0.02)
-0.0127
-0.000596
-0.0260
(0.03)
(0.02)
(0.02)
-0.0237** -0.0348***
-0.0174
(0.01)
(0.01)
(0.01)
0.00370
0.0349***
0.00766
(0.01)
(0.01)
(0.01)
0.00816
-0.00195
0.00607
(0.01)
(0.01)
(0.01)
-0.0497*** -0.0505***
(0.01)
(0.01)
0.00174
0.0417***
(0.02)
(0.01)
0.0272*
0.00474
(0.02)
(0.02)
0.00567
0.0177
(0.03)
(0.04)
0.0260
0.0351
(0.03)
(0.03)
X
X
7,460
0.808

X
X
7,459
0.663

X
X
7,459
0.811

Instrumenting for Employment Levels and Composition

In order to isolate the residual simultaneity between exogenous productivity variation and
labor allocation, we resort to variants of instruments proposed in the literature. Specifically,
we follow Ciccone and Hall (1996) and use population a century ago to capture historical
determinants of current population. We also follow Card (2001) and Moretti (2004a), and
use variation in early immigrant population and the presence of land-grant colleges to capture historical determinants of skill composition across cities. A detailed discussion of the
particular instruments is provided in Appendix B.
Table 7 shows the estimation results with instrumental variables and all the controls. The
first column repeats the OLS results in the last column of Table 6, and the second column
25

shows the corresponding two-stage-least-squares estimates. Those are similar to the OLS
estimates, being well within one standard error from one another. To evaluate the strength
of the instrumental variables, we follow the procedure in Sanderson and Windmeijer (2016)
to obtain separate first-stage F statistics for each of the endogenous variables.24 Since the
F-statistics are below the value of 10 recommended by Staiger et al. (1997), the estimates
may have some bias and incorrect standard errors. The literature on IVs then recommends
the use of limited information maximum likelihood (LIML) estimators. The third column
of Table 7 carries out the estimation using a continuously updated GMM estimator (GMMCUE), similar to a limited information maximum likelihood estimator but which allows for
clustered and heteroskedastic standard errors. The Stock and Yogo (2005) critical values in
the LIML model for a p-value of 5% to be 10% or better is 6.46, at or close to our obtained
values.
To verify that our empirical strategy indeed identifies external effects, we carry out the
same regressions on data generated by a counterfactual allocation in which we set τ R,k =
τ L,k = 0 for all occupations k. The results are presented in Table 10 in the Appendix.
They confirm that the OLS estimates for the effect of occupation shares on productivity are
biased downward (the coefficients are now negative), whereas the estimates for the external
productivity effects related to population exhibit little bias. More importantly, the exercise
also shows that our IV’s successfully eliminate most of those biases, especially so in the
GMM-CUE estimates.
As a final measure of robustness, we carry out an estimation exercise using IV’s implied
by the model. Recall that, in our framework, the size and composition of population in different locations is determined simultaneously by local productivity, amenities, input-output
linkages, and trade costs. Thus, we construct a counterfactual allocation where, for each
industry and occupation, we set productivity to be fully exogenous and equal to the averages, across cities, of the productivity parameters, Tnkj . We then use the counterfactual
employment shares and totals implied by that exercise as instruments. The results, presented
in Table 11 in the Appendix, confirm the main findings. Namely, the effect of population
composition is larger than that of city size, and the compositional effect is larger for CNR
workers than for non-CNR’s.

4.3

The Role of Externalities in Spatial Occupational Polarization

We adopt the GMM-CUE coefficients in the last column of Table 7 as our benchmark.
These coefficients imply that cross-occupational externalities are negative for CNR workers
in that τ L,CN R is significantly smaller than τ R,CN R . Hence, non-CNR workers create negative
congestion effects for CNR workers. In contrast, for non-CNR workers, the difference between
τ R,nCN R and τ L,nCN R is not significant indicating no clear evidence of congestion effects from
CNR workers to non-CNR workers.25
The externality effects coming from the local occupational composition are also clearly
substantial. They imply, all else equal, that moving from Winston-Salem, NC with a share
24

This follows largely the intuition laid out by Angrist and Pischke (2008), that requires strong IVs to
predict the two endogenous variables independently from one another.
25
The p-value when testing the hypothesis that τ R,k − τ L,k is positive is equal to 0.017 for the case of
CNRs and equal to 0.34 for the case of non-CNRs.

26

Table 7: Instrumental Variables Estimate

VARIABLES
k

γnj log( LLnn )
γnj log(Ln )

Observations
K.P. F
S.W.F. Lkn Share
S.W.F. Ln

(1)
OLS
CNR
non-CNR
0.900***
(0.12)
0.388***
(0.05)

0.681***
(0.22)
0.322***
(0.04)

7,459

7,459

(2)
2SLS
CNR
non-CNR
1.237***
(0.38)
0.331***
(0.06)

0.258
(0.50)
0.286***
(0.04)

7,459
7,459
4.180
5.748
6.429
8.964
6.384
9.040
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1

(3)
CUE
CNR
non-CNR
1.338***
(0.38)
0.342***
(0.06)

0.795
(0.50)
0.354***
(0.04)

7,459
4.180
6.429
6.384

7,459
5.748
8.964
9.040

of CNR employment of 36 percent, corresponding to approximately the 75th percentile of
the distribution of CNR shares, to Austin, TX, with share of 42 percent, closer to the 94th
percentile, increases Tnkj for CNR workers by approximately 7 percent and reduces that of
non-CNR workers by close to 2.5 percent. Agglomeration externalities are similarly important. Moving from a city near the 75th percentile, such as Trenton, NJ, with approximately
180 thousand workers, to one near the 87th percentile such as Rochester, NY, with approximately 430 thousand, would imply a gain of close to 10% for both types of workers.
In Section 2 we conjectured that occupational externalities account for salient patterns
in the data related to the polarization of occupations and wage inequality across cities.
Given the model, the recovered TFP measure obtained through its inversion, as well as the
externalities in TFP we estimated, we now verify this basic intuition.
Counter-factual equilibrium allocations generated in the absence of externalities (i.e.,
τ R,k = τ L,k = 0 for all k) are presented in Figures 5 and 6. Figure 5 shows that absent
externalities, the relationship between the share of CNR workers and the wage premium
indeed becomes negative, indicating that the relative abundance of CNR workers now decreases their relative compensation. Furthermore, Figure 6 shows that without externalities,
the equilibrium wage bill share increases less than one-for-one, as opposed to more than
one-for-one, with the wage bill share predicted by the industrial composition across cities.
To see why, note that absent production externalities induced by employment size and composition, productivity is pinned down exogenously. Therefore, CNR workers in a city that
has a comparative advantage in the production of CNR intensive goods will generally earn
higher wages. Firms in that city will consequently substitute CNR workers for non-CNR
workers and, given an elasticity of substitution between occupations greater than 1, see a
reduction in its CNR wage share.
These exercises point to the patterns identified in Section 2 as being effectively driven
by occupation-specific elasticities. Given the significance of these externalities, the optimal
and equilibrium allocations differ. This in turn creates a role for optimal spatial policy to
which we turn next.

27

Log ratio of CNR to non-CNR wages

Slope: -3.94
Correlation: -0.75

0.5
0.4
0.3
0.2
0.1
0
-0.1
-0.2
-0.3
0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

0.6

CNR share of city employment
Figure 5: Occupational share and wage premium - no externalities
Counterfactual values obtained from assuming no externalities (τ R,k = τ L,k = 0), while
keeping the exogenous part of productivity as originally quantified.

5
5.1

Optimal Allocation
The Planner’s Problem

We now describe the optimal allocation and the policies that implement it. The planner’s
problem takes as given that workers in different occupations can freely choose in which city
to live. Under this assumption, the expected utility of a worker of type k is given by
vk = Γ



ν−1
ν

 X

! ν1

k ν

Akn Cn

,

n

so that, given welfare weights for each occupation, φk , the utilitarian planner solves
W=

X
k

φk Γ



 X
N
ν−1
ν

! ν1

k ν

Akn Cn

Lk .

(13)

n=1

The planner maximizes the expression in (13) subject to the availability of labor in each city
and occupation (2), the constraints on the use of labor in each occupation and city (5), the
constraints on the use of land and structures (6), the resource constraints associated with
final goods in each city and sector (7), the resource constraints associated with intermediate
goods across all varieties z in each industry j and city n (8), and the constraints that
household consumption of different goods and input flows be non-negative.
28

Figure 6: Occupational wage bill share: predicted vs equilibrium - no externalities
Counterfactual values obtained from assuming no externalities (τ R,k = τ L,k = 0), while
keeping the exogenous part of productivity as originally quantified.

The key difference between the optimal and equilibrium allocations stems from a wedge
between the private and social marginal products of labor. Lemma 1 characterizes this
wedge.
Lemma 1. Let ∆kn denote the wedge between the private and the social marginal value of a
worker in occupation k in city n. Then
∆kn

=

k0 j
k 0 Ln
wn k
Ln
k0 ,j

X

0

∂ ln λkn j (Ln )
.
∂ ln Lkn

(14)

This expression for the wedge points to the distortions that the planner is seeking to
29

correct. To focus ideas, consider our case with two occupations, k and k 0 , where spillover
elasticities are the same in all sectors. In that case we have that
∆kn

=

∂
wnk

0

0

k
k
ln λkn (Ln )
k0 Ln ∂ ln λn (Ln )
+
w
.
n
∂ ln Lkn
Lkn ∂ ln Lkn

The estimated externality parameters in Table 7 imply, using equations (11) and (12),
that occupation k’s own elasticity, ∂ ln λkn (Ln ) /∂ ln Lkn , is positive while the cross-elasticity,
0
∂ ln λkn (Ln ) /∂ ln Lkn , is negative. It follows that the wedge for any given occupation k increases with its wage and decreases with the wage of occupation k 0 . Hence, the planner
would like to increase the concentration of workers of a given occupation in places where
those workers are most productive, and in places where workers in other occupations are less
0
productive. The latter effect is stronger for CNRs than non-CNRs since ∂ ln λkn (Ln ) /∂ ln Lkn
is substantially larger when k = CNR. The end result is an increase in spatial polarization.
Given these wedges, the optimal policy is then most intuitively framed in terms of a set of
taxes and subsidies that incentivize workers to move to cities where their spillovers are larger.
Put another way, the planner internalizes the wedge between the private and social marginal
productivity of workers. At the same time, a utilitarian planner also attempts to balance
gains between different type of workers. Proposition 1 provides an exact characterization of
this spatial policy.
Proposition 1. If the planner’s problem is globally concave, the optimal allocation can be
achieved by a set of taxes and transfers such that
Pn Cnk = (1 − tkL )(wnk + ∆kn ) + χk + Rk ,
where tkL =

1
,
1+ν

and Rk is such that
φk v k Lk =

X

Pn Cnk Lkn .

n

The proposition generalizes Proposition 2 in Fajgelbaum and Gaubert (2018) to a multiindustry environment. Observe that industry superscripts, j, do not appear in Proposition 1.
In other words, the proposition establishes that the formulations in Fajgelbaum and Gaubert
(2018) may still be used when the environment is extended to include different industries
even when these are interdependent through input-output linkages.26 Of course, that is not
to say that the optimal allocation does not depend on local industry-specific productivity
(since they determine local employment and wages in both occupations), or that the optimal
allocation has no implications for the composition of industries across space. We show below
that under the optimal policy, large and small cities expand industries in which a large share
of their employment already resides, intensive in CNR and non-CNR workers respectively,
while medium-size cities generally diversify across industries.
Proposition 1 tells us that the planner’s solution for household consumption differs from
that implied by their budget constraint in two ways. First, the planner’s solution depends on
26

One can show that when spillover elasticities are industry-specific, the formulas in Proposition 1 no
longer hold exactly. In that case, employment shares of different industries in different occupations are
sufficient to capture the role of industrial composition, independently of the details of input-output linkages.

30

CNR worker spillovers

20000
10000
0
-10000
-20000
0.2

0.3

0.4

0.5

CNR worker spillovers

CNR share in data
ρ = 0.005 , ρ̃ = 0.323

CNR worker spillovers
(deviation from mean)
Population

< -11200

[ -5580 , 0 )

[ 5580 , 11200 )

[ -11200 , -5580 )

[ 0 , 5580 )

>= 11200

12,000

3,909,000

20000
10000
0
-10000
-20000
10

12

14

16

Log population in data
ρ = 0.402 , ρ̃ = 0.706

7,805,000

R
(equilibrium values)
Figure 7: ∆CN
n
R
captures the wedge between the social and the private marginal product of labor
∆CN
n
for CNR workers, as described in Lemma 1. Figure depicts deviation from employment
weighted average (US$ 58,273). Each marker in the map refers to a CBSA. Marker sizes
are proportional to total employment in each city. ρ and ρ̃ are unweighted and population
weighted correlations respectively.

the social marginal product of labor, given by wnk + ∆kn , rather than its private counterpart.
Second, in the planner’s solution, consumption increases less than one-for-one with the (social) marginal product of labor. This second element is optimal because, given heterogeneity
in preferences for locations, households that choose to live in lower wage cities do so because
their marginal utility of consumption is higher in those cities.27

5.2

The Value of Social Wedges Across Cities and Occupations

The wedge between the social and private marginal product of labor, ∆kn , may be calculated for each city and occupation using equation 14. Figures 7 and 8 show the deviations
of those wedges from their (employment weighted) means for CNR and non-CNR workers,
respectively. The average wedge for CNR workers is itself fairly large, at $58,273 dollars per
worker, or 82 percent of the mean CNR wage. The average wedge for non-CNR workers is
more modest and negative, at -$8,345. The wedge of non-CNR workers is negative because
their presence in a given city is associated with a reduction in its share of CNR workers which
then lowers the productivity of those workers. On average, non-CNR workers generate a net
congestion effect. Together, these values imply an average gain of $66,619 from switching a
non-CNR for a CNR worker. This large value is the result of the relative scarcity of CNR
workers. Implying that using them productively makes a substantive difference. Further27
Fajgelbaum and Gaubert (2018) show that the heterogeneity in preferences induces the same optimal
tax as an isoelastic negative spillover in amenities.

31

Population

< -7720

[ -3860 , 0 )

[ 3860 , 7720 )

[ -7720 , -3860 )

[ 0 , 3860 )

>= 7720

12,000

3,909,000

Non-CNR worker spillovers Non-CNR worker spillovers

Non-CNR worker spillovers
(deviation from mean)

20000
10000
0
-10000
-20000
0.2

0.3

0.4

0.5

CNR share in data
ρ = -0.934 , ρ̃ = -0.941
20000
10000
0
-10000
-20000
10

12

14

16

Log population in data
ρ = -0.63 , ρ̃ = -0.652

7,805,000

R
(equilibrium values)
Figure 8: ∆nCN
n
R
∆nCN
captures the wedge between the social and the private marginal product of labor
n
for non-CNR workers, as described in Lemma 1. Figure depicts deviation from employment
weighted average (US$ -8,345). Each marker in the map refers to a CBSA. Marker sizes
are proportional to total employment in each city. ρ and ρ̃ are unweighted and population
weighted correlations respectively.

more, this large gain indicates that education and migration policies that create and attract
CNR workers can potentially have high social value. Here, however, we take the supply of
CNR and non-CNR workers as given.
R
We find a strong positive correlation between ∆CN
the wedge between the social and
n
private value of CNR workers, and city size (0.40). In contrast, the correlation between
this wedge and the CNR share across cities is close to zero (0.005). These findings indicate
that, given the concavity of external effects and the fact that there are diminishing returns
to each factor, high CNR cities already exploit CNR externalities to a large degree in the
decentralized equilibrium. However, the correlation with the CNR share becomes positive
(0.32) when weighted by city size. Thus, there nevertheless remain gains to be exploited in
larger, CNR intensive, cities. Externalities from CNR workers appear to be particularly large
in New York, Houston, and cities in California and and much less pronounced in Florida
and, more broadly, in the South and Mid-West (except, modestly, in Chicago).
R
The overall patterns for ∆non−CN
, the wedge between the social and private value of
n
non-CNR workers, are more pronounced in Figure 8. In particular, there exists a clear
negative relationship between the wedge of non-CNR workers and both city size and the
CNR share across cities. The social value of non-CNR workers relative to their private value
is positive in many smaller cities. This wedge is also relatively large in some larger cities such
as in Florida, Las Vegas, and Phoenix. These findings indicate that an optimal allocation
would encourage or incentivize non-CNR workers to move to smaller non-CNR abundant

32

cities. It is in those cities where they can make their largest contributions.

5.3

Quantifying the Optimal Allocation

Percentage change in
CNR pop.

In computing the optimal allocation, we set Pareto weights such that gains under the planner’s solution are proportionately equal for both types of workers. Figures 9 and 10 show
the percentage change in employment in the optimal allocation relative to the equilibrium
allocation for CNR and non-CNR workers respectively. The results show that it is generally optimal for CNR workers to move to larger cities and for non-CNR workers to move to
smaller cities, thereby exacerbating the spatial polarization of occupations. This increased
spatial polarization follows from the spillover coefficient estimates in Section 4.2 which underscore that both types of workers (but particularly those in CNR occupations) become
more productive when clustered with other workers of their own type.
As Figure 9 shows, increases in CNR workers under the optimal allocation are particularly
large in cities like New York, San Francisco or San Jose, where the wedge between social and
private marginal products of labor for CNR workers is especially large. These cities, together
with other large cities including Chicago, Dallas, and Los Angeles, which are somewhat less
CNR intensive, become cognitive hubs under the optimal allocation. More generally, the
optimal policy creates cognitive hubs in larger cities that are already CNR abundant under

0.4
0.2
0.0
-0.2
-0.4
0.2

0.3

0.4

0.5

Percentage change in
CNR pop.

CNR share in data
ρ = 0.485 , ρ̃ = 0.632

Percentage change in
CNR pop.
Population

< -0.206

[ -0.103 , 0 )

[ 0.103 , 0.206 )

[ -0.206 , -0.103 )

[ 0 , 0.103 )

>= 0.206

12,000

3,909,000

0.4
0.2
0.0
-0.2
-0.4
10

12

7,805,000

R
Figure 9: LCN
(percentage change from data equilibrium)
n

Percentage change in employment of CNR workers between equilibrium and optimal values.
Each marker in the map refers to a CBSA. Marker sizes are proportional to total equilibrium
employment in each city. ρ and ρ̃ are unweighted and population weighted correlations
respectively.

33

14

16

Log population in data
ρ = 0.65 , ρ̃ = 0.837

Percentage change in
Non-CNR pop.

the decentralized equilibrium. Given that trade is costly, cities that gain CNR workers
are somewhat uniformly distributed in space according to overall economic activity. They
constitute cognitive hubs in that they absorb CNR workers and are now surrounded by
smaller cities with more non-CNR workers.
Figure 10 illustrates that while the planner generally chooses to incentivize non-CNR
workers to move from large cities, a few large cities do nevertheless become more non-CNR
abundant under the optimal allocation. This is the case for cities such as Miami, Las Vegas,
Phoenix, and San Antonio where non-CNR workers have, in the decentralized equilibrium,
a social marginal product that is larger than their private marginal product. These cities
become new non-CNR centers. They specialize in non-CNR intensive industries, such as
accommodation and retail, and grow in size since the inflow of non-CNR workers is larger
than the exodus of CNR workers specified by the optimal allocation.
While the share of CNR workers increases in large cities under the optimal allocation,
the top panel of Figure 11 also shows that these cities lose in overall population while
smaller cities increase in size. The same pattern holds for cities with large and small CNR
shares in the bottom panel of 11. New cognitive hubs, therefore, emerge along side growing
small and non-CNR abundant cities. Put another way, the city size distribution evens out
under the optimal allocation. This feature recognizes that while the productivity of CNR
workers increases with the number of those workers, congestion also increases with city size.

0.5
0.0
-0.5
-1.0
0.2

0.3

0.4

0.5

Percentage change in
Non-CNR pop.

CNR share in data
ρ = -0.68 , ρ̃ = -0.785

Percentage change in
Non-CNR pop.
Population

< -0.361

[ -0.181 , 0 )

[ 0.181 , 0.361 )

[ -0.361 , -0.181 )

[ 0 , 0.181 )

>= 0.361

12,000

3,909,000

0.5
0.0
-0.5
-1.0
10

12

7,805,000

R
Figure 10: LnCN
(percentage change from data equilibrium)
n

Percentage change in employment of non-CNR workers between equilibrium and optimal
values. Each marker in the map refers to a CBSA. Marker sizes are proportional to total
equilibrium employment in each city. ρ and ρ̃ are unweighted and population weighted
correlations respectively.

34

14

16

Log population in data
ρ = -0.657 , ρ̃ = -0.719

Percentage change in population from
data equilibrium to optimal allocation

0.4

Slope: -0.05
Correlation: -0.69

0.3
0.2
0.1
0
-0.1
-0.2
-0.3
-0.4
9

10

11

12

13

14

15

16

Percentage change in population from
data equilibrium to optimal allocation

Log population in data

Slope: -1.31
Correlation: -0.86

0.5
0.4
0.3
0.2
0.1
0
-0.1
-0.2
-0.3
-0.4
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Figure 11: City size changes between data and optimal allocation
Each observation refers to a CBSA. Marker sizes are proportional to total employment.

In particular, as discussed above, non-CNR workers generate negative congestion effects
on the productivity of CNRs. Furthermore, heterogeneous location preferences imply that
35

attracting the marginal CNR worker to a given city becomes increasingly difficult.
Along with cities becoming more even in size under the optimal allocation, we observe
that both small and large cities generally increase their degree of industrial specialization,
while medium-size cities tend to move towards greater industrial diversification. Figure 12
highlights this pattern using changes in the Gini coefficient associated with the distribution
of wage bill shares across industries. The figure illustrates how these changes vary with the
share of CNR employment and depicts a U-shaped pattern. In the efficient allocation, cities
with low and high CNR worker shares become more specialized. In contrast, cities with
intermediate CNR shares exhibit zero or negative changes in Gini coefficients, indicating
no change or greater industrial diversification in those locations. This finding emerges because concentrating occupations is more valuable in cities that are particularly productive
in industries intensive in a specific occupation. The planner’s solution, therefore, prescribes
further expanding industries intensive in either CNR or non-CNR occupations in cities that
have a more extreme skill mix. Moreover, these cities tend to be at either end of the size distribution so that the U-shaped relationship shown in Figure 12 also holds, though somewhat
attenuated, with respect to population (see Figure 24 in the Appendix).
As examples of how efficient allocations change the industrial composition landscape,
Figure 13 highlights two cities at either end of the CNR share distribution. At one end,
San Jose, CA, with close to 1 million workers, sees its share of CNR workers increase from

Change in Gini coefficient from
data equilibrium to optimal allocation

0.15
0.1
0.05
0
-0.05
-0.1
-0.15
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Figure 12: Changes in the Gini coefficient between the data and optimal allocation.
Each observation refers to a CBSA. Marker sizes are proportional to total employment. The
solid-black line is a cubic fit on the data. The Gini is constructed using the Lorenz curves
depicting within city wage bill and industry rank.

36

50.8% to 91.2%. This change reflects an increase in industrial specialization, summarized
by a 0.11 change in the Gini coefficient and seen as an outward shift in its Lorenz curve
in the left panel of Figure 13. It captures in part an increase of 19 percentage points
in the employment share of San Jose’s top industry, Professional and Business Services,
and a 15 percentage point increase in that industry’s wage bill share. At the other end,
Harrisonburg, VA, with only 50,106 workers, sees instead its share of non-CNR workers
increase from 72.9% to 84.1%. This change stems from the planner emphasizing employment

San Jose, CA

100

80

Share of wage bill

80

60

40

20

60

40

20

0

0
0

20

40

60

80

100

0

20

Industry Rank

40

60

80

Industry Rank
South Bend, IN

100

80

Share of wage bill

Share of wage bill

Harrisonburg, VA

100

60

40

20

0
0

20

40

60

80

100

Industry Rank
Figure 13: Shift in the Lorenz curve between the data and optimal allocation.
Each marker refers to an industry, where the dashed gray line shows the Lorenz curve for
the data equilibrium and the blue line is for the optimal allocation.

37

100

in the industries in which Harrisonburg’s non-CNR workers are already intensively employed;
hence, the Gini coefficient increases by 0.05. Under the optimal allocation, the employment
share in Harrisonburg’s top industry, Non-Tradables (Retail, Construction, and Utilities),
increases by 2.0 percentage points while its wage bill sees a 3.2 percentage point rise.
In the middle of the city size distribution, changes in the Gini coefficient associated with
wage bill shares across industries are generally close to zero or negative. In other words,
many of those medium-size cities either stay with or diversify their industrial mix. The
bottom panel of Figure 13, for example, shows an inward shift of the Lorenz curve for South
Bend, IN. South Bend, a city of around 121,220 workers, sees its Gini coefficient fall by
around 0.05 as its employment spreads out across more industries. The wage bill share of
its initial top industry, namely Health, declines by 3.8 percentage points, while the wage bill
share of its new top industry, Non-Tradables, increases by 1.9 percentage points.
5.3.1

Taxes and Transfers: Implementing the Optimal Allocation

Implementing the optimal allocation involves transfers specific to each occupation and city.
Including the transfers, total consumption is equal to total income in each city. These
transfers serve several functions. First, they incentivize agents to move as described above.
Namely, they incentivize CNR workers to move to cognitive hubs and non-CNR workers to
move to smaller towns. Second, they guarantee that relative welfare gains are the same
across occupations and locations. Thus, the planner compensates non-CNR workers for
moving to smaller and less productive or amenable cities by implementing transfers from
larger to smaller cities. These transfers in turn are mostly financed by CNR workers in
larger cities. Note, however, that since CNR workers gain from the policy as well, they do
not mind making the transfers. The net flow of resources received or paid by cities (i.e the
trade balance) is shown in Figure P
14, calculated as
between
local nominal per

Pthekdifference
k k
k
P
C
L
−
w
L
−
r
H
capita consumption and output,
/L
.
n n
n
k n n n
k n n
As expected, the trade balance of cognitive hubs is large and negative. Cities such as
San Francisco and San Jose that are relatively large and very CNR intensive, specializing,
respectively, in professional and business services and computer and electronics, send net
payments of as much as $40,000 per resident, while some of the smaller cities, such as
Jacksonville, NC, specializing in accommodation and retail receive net transfers of close to
$20,000. These net transfers in some of the smaller cities amount to a form of basic income
paid to all non-CNR workers in small cities and financed (in net) by agents living in cognitive
hubs. Again, there are some exceptions as some relatively large cities such as Las Vegas end
up receiving net transfers since they become even larger centers of non-CNR employment.
The pattern of spillover coefficients we estimated, together with our model, implies that
the optimal allocation exacerbates the extent of labor market polarization across space.
However, this pattern does not reveal why labor markets are already as polarized as they
are in the decentralized equilibrium. As indicated in Figure 4, larger cities are relatively
more amenable to CNR workers, a pattern that survives even if we exclude the endogenous
component of amenities highlighted by Diamond (2016). In Appendix E, we show that even
with that component excluded, the planner chooses to create cognitive hubs by concentrating
CNR workers in large cities.
The overall gains in welfare from implementing the optimal allocation amount to 0.72%
38

Trade balance

25000
0
-25000
-50000
-75000
0.2

0.3

0.4

0.5

CNR share in data
ρ = -0.432 , ρ̃ = -0.687
Trade balance

25000
0
-25000
-50000
-75000

Trade balance
(per capita)
Population

< -17600

[ -8810 , 0 )

[ 8810 , 17600 )

[ -17600 , -8810 )

[ 0 , 8810 )

>= 17600

12,000

3,909,000

10

12

14

16

Log population in data
ρ = -0.451 , ρ̃ = -0.613

7,805,000

Figure 14: Trade balance per capita
Trade balance is defined as the difference
P allocation between the value conP in the optimal
sumed and value added in each city ( k Pn Cnk Lkn − k wnk Lkn − rn Hn ). Trade balance per
capita are given by those values divided by Ln . Each marker in the map refers to a CBSA.
Marker sizes are proportional to total equilibrium employment in each city. ρ and ρ̃ are
unweighted and population weighted correlations respectively.

of GDP for workers in both occupations. These gains are almost double those found in
recent work by Bartelme et al. (2019) at 0.4% of GDP in a study of optimal industrial policy
in a multi-country trade model. One reason why gains are not even larger in our setup is
that the observed equilibrium allocation is already fairly polarized. In fact, in Section 6 we
try to account for what has lead to this fairly polarized state starting from the prevailing
conditions in 1980.
To achieve equal welfare gains across occupations, the optimal transfer scheme has two
components. One that incentivizes agents to go to the ‘right’ location and is related to
differences in ∆kn across locations. The other is a fixed transfer by occupation (Rk ). This
fixed transfer guarantees that all workers obtain equal gains from moving to the planner’s
allocation. This fixed transfer amounts to a negative transfer (a payment) of -$16,856 for
CNR workers and a positive transfer of $18,126 to all non-CNR workers. The latter can be
implemented as a universal basic income that is paid by CNR workers. This transfer may
then be considered as the redistribution that CNR workers are willing to accept to form the
cognitive hubs where they can thrive.
The optimal transfers also involve a component that incentivizes CNR worker to move to
large, CNR abundant cities and for non-CNR workers to move to smaller cities with smaller
shares of CNR workers. This is achieved by giving large incentives to non-CNR workers
to move out of large, and more markedly, CNR abundant cities. Due to externalities that
are occupation-specific, this reallocation yields larger CNR productivity increases in CNR
abundant cities, which attracts CNR workers to these cities and eliminates the need for large
39

CNR net transfers

10000
0
-10000
-20000
0.2

0.3

0.4

0.5

CNR net transfers

CNR share in data
ρ = -0.474 , ρ̃ = -0.653

CNR net transfers
(per capita deviation
from mean)
Population

< -4950

[ -2470 , 0 )

[ 2470 , 4950 )

[ -4950 , -2470 )

[ 0 , 2470 )

>= 4950

12,000

3,909,000

10000
0
-10000
-20000
10

12

14

16

Log population in data
ρ = -0.02 , ρ̃ = -0.125

7,805,000

Figure 15: Optimal transfers to CNR workers (per CNR worker)
Optimal transfers per CNR workers are defined as the difference in the optimal allocation
between the value consumed and the income they would receive given optimal wages and
rents but absent the transfers (Pn CnCN R − wnCN R − χk ). Figure depicts deviation from
employment weighted average (US$ -4,345). Each marker in the map refers to a CBSA.
Marker sizes are proportional to total equilibrium employment in each city. ρ and ρ̃ are
unweighted and population weighted correlations respectively.

net transfers to those workers. In fact, in a handful of the most CNR abundant cities, this
effect is so strong that the planner prefers to balance it with negative transfers to avoid
congestion.
The optimal transfers are, of course, related to the ∆kn wedges described in Figures 7 and
8, and further depicted in Figures 15 (for CNR workers) and 16 (for non-CNR workers).28
In the median city, after incentives are taken into account, CNR workers contribute $3,087.
This number in part reflects the fact that CNR workers are socially valuable (recall that
the wedge between the social and private value of CNR workers is as much as US$ 58,273).
Note that CNR workers do not need to be particularly incentivized to stay in the large,
CNR abundant, cities. In fact, as Figure 15 shows, transfers decrease slightly with CNR
share and city size. The increases in productivity, and therefore wages, that result from the
enhanced externalities in cognitive hubs is sufficient to attract these workers. CNR workers
are net-recipients of transfers in only 6% of locations but net payers in 94% of locations. On
the whole, payments from CNR workers range on net from US$704 (in the 10th percentile
city) to US$5,299 (in the 90th percentile).29
Once incentive-based transfers are taken into account, non-CNR workers in the median
1
ν
More specifically, total transfers are equal to Pn Cnk − wnk − χk = 1+ν
∆kn − 1+ν
wnk + Rk .
Weighing by population net transfers to or from CNR workers range from a net contribution of close
to $690 (10th percentile) to a contribution of more than $10,000 (90th percentile), with the median net
contribution being close to $2,927.
28

29

40

Non-CNR net transfers

20000
0
-20000
-40000
-60000
0.2

0.3

0.4

0.5

Non-CNR net transfers
(per capita deviation
from mean)
Population

< -14400

[ -7210 , 0 )

[ 7210 , 14400 )

[ -14400 , -7210 )

[ 0 , 7210 )

>= 14400

12,000

3,909,000

Non-CNR net transfers

CNR share in data
ρ = -0.653 , ρ̃ = -0.749
20000
0
-20000
-40000
-60000
10

12

14

16

Log population in data
ρ = -0.571 , ρ̃ = -0.605

7,805,000

Figure 16: Optimal transfers to non-CNR workers (per non-CNR worker)
Optimal transfers per non-CNR workers are defined as the difference in the optimal allocation
between the value consumed and value added in each city attributed to non-CNR workers
(Pn CnnCN R − wnnCN R − χk ). Figure depicts deviation from employment weighted average
(US$ 2,632). Each marker in the map refers to a CBSA. Marker sizes are proportional to
total equilibrium employment in each city. ρ and ρ̃ are unweighted and population weighted
correlations respectively.

city receive a net transfer of $9,657, ranging from $2,538 (in the 10th percentile city) to
as much as $11,786 (in the 90th percentile city). Note that non-CNR workers have to be
incentivized to move to smaller cities. However, because cognitive hubs offer a high wage to
non-CNR workers (since those are the most productive cities while also ending up with fewer
non-CNR workers), non-CNR workers in cognitive hubs pay a transfer as well to discourage
other non-CNR workers from joining them. This accounts for the wide range in non-CNR
transfers and reduces the average net burden on CNR workers.30

6

The Formation of Cognitive Hubs after 1980

The US economy has evolved towards the formation of cognitive hubs at least since the
1980s. Quantifying our model to 1980 data yields a set of fundamental characteristics of
the economy that allows us to study this phenomenon in detail. In order to compare the
spatial structure of the economy in 1980 to that in 2015, we want to abstract from aggregate technology trends. Thus, we first build a ‘Baseline’ economy that adds only aggregate
30
If we weight by population, transfers and contributions to non-CNR workers range from a net contribution of about $7,081 to net receipts of $10,609 (90th percentile), with a median net receipt of about
$4,806.

41

changes in technology, population, and input shares to the 1980 economy.31 The Baseline
economy does not include any location-specific change in productivities across industries and
occupations or in amenities. It also does not include changes in aggregate CNR shares of
population. We study the role of these components by adding them gradually. As shown
in Figure 17, in this Baseline 1980 economy, US population is concentrated in cities where
the CNR share of employment was close to average. In the 2011-15 data, however, there is
greater dispersion around the (now larger) average. Figure 17 further shows the distribution
of CNR workers implied by the 2011-2015 planner’s solution calculated above. Together,
the histograms imply that the increasing concentration of CNR workers was in the direction
implied by today’s optimal allocation.
70%

Baseline (1980)
2011-2015
Optimal (2011-2015)

Share of total population

60%
50%
40%
30%
20%
10%
0%
-0.2

-0.1

0

0.1

0.2

0.3

0.4

CNR share (deviation from average)
Figure 17: Distribution of population by CNR share in city of employment
Share of population in cities with different ratio of CNR workers to total population. Bins
refer to deviation from population weighted average.

Given the move towards cognitive hubs over time, our model allows us to examine the
forces underlying this evolution and obtain a quantitative assessment of their welfare relevance. We do this with a series of counter-factual exercises that clarify the importance of
different forces in driving national trends. Thanks to the structural model, we can also do
the same decomposition in the absence of externalities. Those exercises then provide us with
a measure of the relevance of local spillovers for observed spatial trends. Table 8 shows how
such a decomposition affects the welfare of CNR and non-CNR workers. The columns depict
welfare levels relative to the Baseline 1980 economy for each occupation. The lower panel
repeats the exercises for a world without externalities (i.e., where we set the externality
elasticity parameters to zero).
31

For details on the construction of this Baseline economy and other counterfactuals, see Appendix F.

42

Table 8: Welfare Comparison, Relative to Baseline

Full Model
1. 1980 parameters
2. (1) + current input shares
3. (2) + national trends in technology and population
4. (3) + national skill biased technical change (Baseline)

CNR

non-CNR

CNR-tonon-CNR
Ratio

0.417
0.443
0.927
1.000

0.572
0.515
1.081
1.000

0.730
0.860
0.857
1.000

ratio of CNR to non-CNR welfare in Baseline
5.
6.
7.
8.

(4)
(5)
(6)
(7)

+
+
+
+

change
change
change
change

in
in
in
in

occ. shares in employment
local technology (ex real estate)
real estate productivity
amenities (2011-15 parameters)

1.906
1.051
1.072
1.049
1.046

1.062
1.083
1.064
1.065

0.990
0.991
0.986
0.982

9. Optimal Allocation

1.053

1.072

0.982

10. 2011-15 parameters minus change in real estate productivity
11. Optimal Allocation with parameters in (10)

1.059
1.072

1.074
1.087

0.987
0.987

0.356
0.493
0.936
1.000

0.468
0.556
1.055
1.000

0.761
0.887
0.887
1.000

Model without externalities
12. 1980 parameters
13. (12) + current input shares
14. (13) + national trends in technology and population
15. (14) + national skill biased technical change (Baseline)
ratio of CNR to non-CNR welfare in Baseline
16.
17.
18.
19.

(15)
(16)
(17)
(18)

+
+
+
+

change
change
change
change

in
in
in
in

occ. shares in employment
local technology (ex real estate)
real estate productivity
amenities (2011-15 parameters)

3.953
0.859
0.854
0.864
0.864

1.109
1.116
1.130
1.130

0.775
0.766
0.765
0.765

20. Optimal Allocation

0.866

1.133

0.765

21. 2011-15 parameters minus change in real estate productivity
22. Optimal Allocation with parameters in (21)

0.849
0.851

1.108
1.111

0.766
0.766

The top row of the table shows that welfare in 1980 was lower for both groups and in all
scenarios as one would expect given underlying technological trends. The second row shows
the effects of changing einput shares. It is well documented that CNR intensive industries
have become a larger part of the US economy, leading to relative gains for CNR workers
as compared to non-CNR workers. The next step (rows 3 and 14) brings total population
and average technology in each city/industry to 2011-15 levels while keeping the relative
productivity of CNR and non-CNR workers at 1980 levels. Relative to the baseline (rows
4 and 15), CNR workers are worse off and non-CNR workers better off. The difference is
accounted for skill-biased technical change. Note that externalities amplify the effect of skill
43

Log change in exog.
productivity excluding real
estate (deviation from mean)
Population

< -0.416

[ -0.208 , 0 )

[ 0.208 , 0.416 )

[ -0.416 , -0.208 )

[ 0 , 0.208 )

>= 0.416

4,000

5,080,000

Log change in exog. prod.
exclud. real estate

Log change in exog. prod.
exclud. real estate

biased technical change.
Rows 5 and 15 change the composition of employment to 2011-15 levels, with more CNR
workers and fewer non-CNR workers. Here, externalities play their largest role. Absent
externalities, the model would imply significant losses for CNR workers, as they become
more abundant, whereas standard neo-classical arguments would imply a reduction in their
relative wage. In effect, absent externalities, CNR workers would end up with welfare 14%
below the baseline counterfactual, while non-CNR worker’s welfare would grow by 11%. In
contrast, with externalities, both CNR and non-CNR workers end up gaining about the same
as occupations become more polarized across space.
Rows 6 and 16 add exogenous changes to local technology of non-real estate sectors
over and above what is implied by average national trends. It captures, for example, the fact
that computer and electronics output became particularly more productive in San Jose while
finance became particularly more productive in New York. As shown in Figure 18 those gains
were larger in cities that had high CNR shares in 1980. These location-specific technological
changes interact with externalities to increase the welfare of both types of workers.
Rows 7 and 17 add changes in the productivity of the real estate sector. This exercise
encompasses the effects of two different underlying processes. On the one hand, real estate
productivity increased more in fast growing cities, as the stock of housing increased in order
to accommodate rising populations. On the other hand, as it has been increasingly recognized (Glaeser and Gyourko (2018), Hsieh and Moretti (2019) and Herkenhoff et al. (2018)),
housing regulations have impeded development in some very productive areas. The table
1.0
0.5
0.0
-0.5
-1.0
0.2

0.3

0.4

CNR share in 1980
ρ = 0.057 , ρ̃ = 0.39
1.0
0.5
0.0
-0.5
-1.0
10

12

14

10,156,000

Figure 18: Change in the exogenous part of technology (all non-real estate sectors)
Changes are relative to baseline counterfactual, averaged across all sectors except real estate.
Each observation refers to a CBSA. Marker sizes are proportional to total city employment.
Averages are taken with value added weights. ρ and ρ̃ are unweighted and population
weighted correlations respectively.

44

16

Log population in 1980
ρ = -0.089 , ρ̃ = 0.343

Log change in exog.
productivity for real estate
(deviation from mean)
Population

< -0.998

[ -0.499 , 0 )

[ 0.499 , 0.998 )

[ -0.998 , -0.499 )

[ 0 , 0.499 )

>= 0.998

4,000

5,080,000

Log change in exog. prod.
for real estate

Log change in exog. prod.
for real estate

shows that the net effect of those two forces would have been positive in the absence of externalities, as housing development may have accommodated increasing population in high
growing locations. At the same time, their effect is negative once externalities are accounted
for since, as shown in Figure 19, housing productivity also lagged behind in CNR intensive
cities.

2
1
0
-1
-2
0.2

0.3

0.4

CNR share in 1980
ρ = -0.172 , ρ̃ = -0.372
2
1
0
-1
-2
10

12

14

16

Log population in 1980
ρ = -0.233 , ρ̃ = -0.514

10,156,000

Figure 19: Change in the exogenous component of technology in real estate.
Changes are relative to baseline counterfactual. ρ and ρ̃ are unweighted and population
weighted correlations respectively.

Finally, Rows 8 and 18 add the changes in amenities. In particular, Row 8 corresponds to
the 2011-15 equilibrium allocation. Changes in the spatial distribution of amenities appear
to add little to total welfare. That said, in considering the results presented in Table 8, it
is important to bear in mind that the particular sequence in which we added the changes
between 1980 and 2011-15 can have an effect on our results. We chose to present a sequence
that is intuitive to us, but the main findings highlighted above are robust to other sequences.
We calculate optimal allocation under two scenarios. The first scenario, depicted in Rows
9 and 20, corresponds to that depicted in Section 5 above when externalities are included.
The second scenario assesses the role of housing policy in impeding optimal policy. In
particular, we calculate the optimal policy under the assumption that housing productivity
was distributed as in 1980 (the corresponding equilibrium counterfactual welfare is presented
in Rows 10 and 21 and the corresponding optimal allocation in Rows 11 and 22). We find
that the increment in welfare is more than 50% larger in that scenario than when starting
from the actual equilibrium. In other words, the optimal policy is less effective in some of
45

the cognitive hubs owing to observed changes in housing supply restrictions .

7

Conclusion

Our aim in this paper has been to understand the extent to which workers are misallocated
in space and the policies that might improve observed allocations. The main culprit of
spatial misallocation is the existence of large occupation-specific externalities combined with
potential distortions due to land use regulations. Our quantitative spatial model allowed
us to measure occupation-specific local productivity by industry which, together with a
relatively standard instrumental variable approach, led us to estimate these externalities for
CNR and non-CNR occupations.
Our estimates suggest that both CNR and non-CNR workers become more productive in
large cities, but CNR productivity improves particularly when CNR workers are surrounded
by other CNR workers. These estimates, together with estimated local amenities by occupation, exogenous productivity differences across industries and locations, and the full set
of input-output linkages and transport costs in the U.S. economy, determine the current
allocation of economic activity. We find that an optimal spatial policy can improve on this
allocation for both occupations by 0.72%. Housing and optimal transfer policies reinforce
each other. Hence, combining them (by reverting the spatial distribution of real estate productivity to that of 1980) leads to welfare gains of close to 2.5% for CNR workers and 2%
for non CNR’s.
Since the 80’s the U.S. economy has experienced increased skill and occupational polarization across space. Large cities increasingly have more highly educated CNR workers that
earn more. In contrast, many medium and small cities have suffered an exodus of skilled
workers and experienced persistent population declines. These trends, amplified by local externalities, were also associated with a rise in income inequality between occupations. This
growing gap between top and medium and small-sized cities has motivated policymakers and
city governments to advocate policies to attract CNR workers to smaller towns in order to
reverse their fortunes. Our analysis shows that, given appropriate transfers, these efforts
would be counterproductive.
Our analysis underscores that while CNR workers are extremely useful, they are also
scarce. Furthermore, their productivity is tremendously enhanced by living with other CNR
workers. So attracting them to smaller towns with more mixed populations represents a
waste of resources. CNR workers are too valuable for society to be used in this way. A
better policy is to reinforce existing trends and let them concentrate in cognitive hubs while
incentivizing non-CNR workers to move and help smaller cities grow. Of course, some nonCNR workers will always be needed in those hubs because of imperfect substitutability of
occupations in production. The result is smaller, more CNR intensive, cognitive hubs in
some of today’s largest cities. We show that the resulting migration of non-CNR workers
that allows small towns to grow may be implemented with a baseline transfer to non-CNR
workers, reminiscent of a universal basic income, and a set of occupation-location specific
transfers. Overall, CNR workers transfer resources to non-CNR workers to generate equal
welfare gains.
Our findings suggest that efforts to stop the spatial polarization of occupations are mis46

guided. In fact, encouraging it further can yield benefits for everyone when accompanied
by the necessary transfers. Implementing these transfers, however, is critical. Otherwise,
cognitive hubs might use other indirect means of pushing out non-CNR workers such as, for
example, housing supply constraints, zoning restrictions, or a lack of investment in transportation networks to aid commuting. Such efforts can generate occupational polarization
across space without Pareto gains for all workers. Implementing the necessary transfers
would not only help avoid those inefficient policies and benefit CNR workers, but it would
also improve the welfare of non-CNR workers and the many small and medium sized cities
where they would end up living, working, and producing.
Our analysis abstracts from the role that spatial polarization might have on human
capital formation. In principle, the migration of CNR workers towards cognitive hubs may
be detrimental to smaller cities in a setting where the learning technology also features
meaningful externalities from CNR workers. At the same time, however, the transfers that
CNR workers are able and willing to make to non-CNR workers, given the productivity gains
they experience from living in cognitive hubs, might naturally be invested in education and
other training in the smaller cities. These transfers, if directed properly, have the potential
to ameliorate, or even reverse, the conceivably negative effects of spatial polarization on
human capital.

47

References
Acemoglu, D. and D. Autor (2011). Skills, tasks and technologies: Implications for employment and earnings. In Handbook of labor economics, Volume 4, pp. 1043–1171. Elsevier.
2
Albouy, D. (2009). The unequal geographic burden of federal taxation. Journal of Political
Economy 117 (4), 635–667. 1
Albouy, D., K. Behrens, F. Robert-Nicoud, and N. Seegert (2019). The optimal distribution
of population across cities. Journal of Urban Economics 110, 102–113. 1
Altonji, J. G., T. E. Elder, and C. R. Taber (2005). Selection on observed and unobserved variables: Assessing the effectiveness of catholic schools. Journal of political economy 113 (1), 151–184. B.3
Anderson, J. E., C. A. Milot, and Y. V. Yotov (2014). How much does geography deflect
services trade? canadian answers. International Economic Review 55 (3), 791–818. 4, B.1,
9, ??
Angrist, J. D. and J.-S. Pischke (2008). Mostly harmless econometrics: An empiricist’s
companion. Princeton university press. 24
Autor, D. H. (2019). Work of the past, work of the future. In Richard T. Ely Lecture given
at Annual Meeting of American Economics Association, Atlanta, GA, January, Volume 4.
1
Bartelme, D., A. Costinot, D. Donaldson, and R.-C. Andres (2019). The textbook case for
industrial policy: Theory meets data. Technical report, CEPR Discussion Papers. 5.3.1
Baum-Snow, N., M. Freedman, and R. Pavan (2018). Why has urban inequality increased?
American Economic Journal: Applied Economics 10 (4), 1–42. 1
Baum-Snow, N. and R. Pavan (2011). Understanding the city size wage gap. The Review of
economic studies 79 (1), 88–127. 8
Baum-Snow, N. and R. Pavan (2013). Inequality and city size. Review of Economics and
Statistics 95 (5), 1535–1548. 1, 2, 14
Berry, C. R. and E. L. Glaeser (2005). The divergence of human capital levels across cities.
Papers in regional science 84 (3), 407–444. 1
Brinkman, J. (2014). The supply and demand of skilled workers in cities and the role of
industry composition. 1
Caliendo, L. and F. Parro (2015). Estimates of the trade and welfare effects of nafta. The
Review of Economic Studies 82 (1), 1–44. 4, 11

48

Caliendo, L., F. Parro, E. Rossi-Hansberg, and P.-D. Sarte (2017). The impact of regional and
sectoral productivity changes on the us economy. The Review of economic studies 85 (4),
2042–2096. 1, 4.1.3, A.2.3, A.6
Card, D. (2001). Immigrant inflows, native outflows, and the local labor market impacts of
higher immigration. Journal of Labor Economics 19 (1), 22–64. 1, 4, 4.2.1, B.3, B.3
Ciccone, A. and R. E. Hall (1996). Productivity and the density of economic activity. The
American Economic Review 86 (1), 54. 4.2.1, B.3, B.3
Ciccone, A. and G. Peri (2005). Long-run substitutability between more and less educated
workers: evidence from us states, 1950–1990. Review of Economics and statistics 87 (4),
652–663. 4
Colas, M. and K. Hutchinson (2017). Heterogeneous workers and federal income taxes in a
spatial equilibrium. Technical report. 1
Couture, V., C. Gaubert, J. Handbury, and E. Hurst (2018). Income growth and the distributional effects of urban spatial sorting. Technical report, Working Paper University of
Chicago Booth. 3
Desmet, K. and E. Rossi-Hansberg (2013). Urban accounting and welfare. American Economic Review 103 (6), 2296–2327. 1
Diamond, R. (2016). The determinants and welfare implications of us workers’ diverging
location choices by skill: 1980-2000. American Economic Review 106 (3), 479–524. 1, 2,
1, 4.1.2, 16, 5.3.1, E
Duranton, G. and D. Puga (2004). Micro-foundations of urban agglomeration economies. In
Handbook of regional and urban economics, Volume 4, pp. 2063–2117. Elsevier. 2
Eaton, J. and S. Kortum (2002). Technology, geography, and trade. Econometrica 70 (5),
1741–1779. 3.2.2, A.2.3
Eeckhout, J. and N. Guner (2015). Optimal spatial taxation: Are big cities too small? 1
Ellison, G., E. L. Glaeser, and W. R. Kerr (2010). What causes industry agglomeration?
evidence from coagglomeration patterns. American Economic Review 100 (3), 1195–1213.
1
Fajgelbaum, P. and C. Gaubert (2018). Optimal spatial policies, geography and sorting.
Technical report, National Bureau of Economic Research. 1, 1, 16, 5.1, 27, E, 21
Fajgelbaum, P. D., E. Morales, J. C. Suárez Serrato, and O. Zidar (2018). State taxes and
spatial misallocation. The Review of Economic Studies 86 (1), 333–376. 1, 4, 13
Feenstra, R. C., P. Luck, M. Obstfeld, and K. N. Russ (2018). In search of the armington
elasticity. Review of Economics and Statistics 100 (1), 135–150. 11
Giannone, E. (2017). Skilled-biased technical change and regional convergence. 1
49

Glaeser, E. and J. Gyourko (2018, February). The economic implications of housing supply.
Journal of Economic Perspectives 32 (1), 3–30. 1, 6
Glaeser, E. L. and J. D. Gottlieb (2008). The economics of place-making policies. Brookings
Papers on Economic Activity 2008 (1), 155–253. 19
Handbury, J. and D. E. Weinstein (2014). Goods prices and availability in cities. The Review
of Economic Studies 82 (1), 258–296. 4.1.1
Head, K. and T. Mayer (2014). Gravity equations: Workhorse, toolkit, and cookbook. In
Handbook of international economics, Volume 4, pp. 131–195. Elsevier. 4
Hendricks, L. (2011).
view 52 (1), 1–32. 1

The skill composition of us cities.

International Economic Re-

Herkenhoff, K. F., L. E. Ohanian, and E. C. Prescott (2018). Tarnishing the golden and
empire states: Land-use restrictions and the us economic slowdown. Journal of Monetary
Economics 93, 89–109. 1, 1, 6
Hsieh, C.-T. and E. Moretti (2019). Housing constraints and spatial misallocation. American
Economic Journal: Macroeconomics. 1, 1, 6
Jaimovich, N. and H. E. Siu (2018). The trend is the cycle: Job polarization and jobless
recoveries. The Review of Economics and Statistics. 4
Marshall, A. (1920). Principles of Economics (Revised ed.). London: Macmillan; reprinted
by Prometheus Books. 1
Melo, P. C., D. J. Graham, and R. B. Noland (2009). A meta-analysis of estimates of urban
agglomeration economies. Regional science and urban Economics 39 (3), 332–342. 1, 4.1.3,
4
Moretti, E. (2004a). Estimating the social return to higher education: evidence from longitudinal and repeated cross-sectional data. Journal of econometrics 121 (1-2), 175–212. 1,
1, 14, 1, 4.2.1, B.3, B.3
Moretti, E. (2004b). Workers’ education, spillovers, and productivity: evidence from plantlevel production functions. American Economic Review 94 (3), 656–690. 1, 4.1.3, 4
Neal, D. (1997). The effects of catholic secondary schooling on educational achievement.
Journal of Labor Economics 15 (1, Part 1), 98–123. B.3
Neumark, D. and H. Simpson (2015). Place-based policies. In Handbook of regional and
urban economics, Volume 5, pp. 1197–1287. Elsevier. 1
Ossa, R. (2015). A quantitative analysis of subsidy competition in the us. Technical report,
National Bureau of Economic Research. 1
Redding, S. J. and E. Rossi-Hansberg (2017). Quantitative spatial economics. Annual Review
of Economics 9, 21–58. 1
50

Roca, J. D. L. and D. Puga (2017). Learning by working in big cities. The Review of
Economic Studies 84 (1), 106–142. 8, 1
Sanderson, E. and F. Windmeijer (2016). A weak instrument f-test in linear iv models with
multiple endogenous variables. Journal of Econometrics 190 (2), 212–221. 4.2.1
Staiger, D., J. H. Stock, et al. (1997). Instrumental variables regression with weak instruments. Econometrica 65 (3), 557–586. 4.2.1
Stock, J. H. and M. Yogo (2005). Testing for weak instruments in linear iv regression. Identification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg,
80. 4.2.1

51

Appendix
A
A.1

Model Details
Household Decisions

In a given occupation, all households living in the same city choose the same consumption
basket. It follows that Cnkj (a) = Cnkj for all a. Moreover, the demand for good j by workers
in occupation k living in city n is
Cnkj = αj
where Pn =

QJ

j=1



Pnj
αj

 αj

Pn k
Cn
Pnj

(15)

is the ideal price index in city n.

Agents move freely across cities. The value, vnk (a), of locating in a particular city n for
an individual employed in occupation k, with idiosyncratic preference vector a is
an Akn Ink
= an Akn Cnk .
Pn
In equilibrium, workers move to the location where they receive the highest utility so that
vnk (a) =

v k (a) = max vnk (a) ,
n

k

where v (a) now denotes the equilibrium utility of an individual in occupation k with amenity
vector a. We assume that an is drawn from a Fréchet distribution. Draws are independent
across cities. We denote by Ψ the joint cdf for the elements of a across workers in a given
occupation, with
(
)
X
−ν
Ψ (a) = exp −
(an )
,
n

where the shape parameter ν reflects the extent of preference heterogeneity across workers.
Higher values of ν imply less heterogeneity, with all workers ordering cities in the same way
when ν → ∞.
Assuming that workers of different types can freely move between cities, the average
utility of a worker of type k is given by
! ν1
 X


ν−1
ν
Akn Cnk
,
(16)
vk = Γ
ν
n
where Γ(.) is the Gamma function.
Combining this equation with equation (2) describing labor supply yields an expression
relating the value of each occupational type to consumption and employment in particular
locations:
k

v =



Lkn
Lk

− ν1

52

Akn Cnk .

A.2
A.2.1

Firms
Intermediate Goods Producers

Cost minimization implies that input demand satisfies:
rn Hnj (z)
= γnj βnj ,
xjn (z)qnj (z)
 k 1−
wn
k kj

wn Ln (z)
λkj
= P n 0 1− γnj 1 − βnj ,
j
j
k
wn
xn (z)qn (z)
0
k0

0
0
Pnj Mnj j (z)
xjn (z)qnj (z)

(17)

(18)

λkn j

0

= γnj j ,

(19)

where xjn (z) is the Lagrange multiplier which in this case reflects the unit cost of production.
We can solve for xjn (z) by substituting optimal factor choices into the production function,
j

j γn
1−βn
"
#




j
J 
1− 1− 
γnj0 j
Y
xjn
Bnj  rnβn X wnk
j
j0
xn (z) ≡
Pn
=
j
kj

zn
zn 
λ
n

 Zn
0
k
j =1

(20)

where xjn is a city and industry specific unit cost index such that
"
#
j
h
 0 −γnj0 j
Y
j
j iγn


−γnj
β
−1
−β
n
Bnj = 1 − βnj n
βnj
γnj j
γnj
.
j0

Given constant returns to scale and competitive intermediate goods markets, a firm produces
positive but finite amounts of a variety only if its price is equal to its unit production cost,
pjn (z) = xjn (z) =

xjn
.
zn

(21)

A.2.2

Final Goods
P
Let Qjn (z) = n0 Qjnn0 (z) denote the total amount of intermediate goods of variety z purchased from different cities by a final goods producer in city n, sector j. Given that intermediate goods of a given variety produced in different cities are perfect substitutes, final goods
producers purchase varieties only from cities that offer the lowest unit cost,
(
Qjn (z) if κjnn0 pjn0 (z) < minn00 6=n0 κjnn00 pjn00 (z)
j
Qnn0 (z) =
,
0
otherwise
where we abstract from the case where κjnn0 pjn0 (z) = minn00 6=n0 κjnn00 pjn00 (z) since, given the
distributional assumption on z, this event only occurs on a set of measure zero.

53

Denote by Pnj (z) the unit cost paid by a final good producer in city n and sector j for a
particular variety whose vector of productivity draws is z. Given that final goods firms only
purchase intermediate goods from the lowest cost supplier,
(
)
j
j

κ
x
0
0
nn n
Pnj (z) = min
κjnn0 pjn0 (z) = min
.
(22)
n0
n0
zn0
For non-tradable intermediate goods, firms must buy those goods locally, so that if j is
non-tradable,
xjn
.
(23)
zn
Then, the demand function for intermediate goods of variety z in industry j and city n is
given by
Pnj (z) =

Qjn


(z) =

Pnj (z)
P̃nj

−η

Qjn ,

(24)

where P̃nj the ideal cost index for final goods produced in sector j in city n,
P̃nj

Z
=

Pnj

1−η

(z)

1
 1−η

dΦ (z)

.

(25)

Since the production function for final goods is constant returns to scale, and the market
for final goods is competitive, a final goods firm produces positive but finite quantities of a
final good if its price is equal to its cost index, that is if Pnj = P̃nj .
A.2.3

Derivation of Prices

We follow Eaton and Kortum (2002) in solving for the distribution of prices. Given this
distribution and zero profits for final goods producers, when sector j is tradable, the price
of final goods in sector j in region n solves
Z

j 1−η
Pn
= Pnj (z)1−η dΦ (z) dz,
which is the expected value of the random variable Pnj (z)1−η .
κj

xj

j
nn0 n0
Let Pnn
denote the unit cost of a variety indexed by z produced in city n0
0 (z) =
z n0
and sold in n. Following the steps described in Caliendo et al. (2017), we have that


 j
j
θ
−ωnn
0p
Pr Pnn
0 (z) ≤ p = 1 − e
 j j −θj
j
. The price of variety z in city n and industry j, Pnj (z), is the
where ωnn
κnn0 xn0
0 =
j
minimum across Pnn
0 (z). Its cdf is,


j θ
Pr Pnj (z) ≤ p = 1 − e−Ωn p ,
54

P
P  j j −θ j
j
(Ωn does not depend on n0 because we are integratwhere Ωjn = n0 ωnn
0 =
n0 κnn0 xn0
ing out the city dimension).
Let FPnj (p) denote the cdf of Pnj (z), Pr [Pnj (z) ≤ p]. Then, its associated pdf, denoted
j

θ

fPnj (p), is Ωjn θpθ−1 e−Ωn p . . As in Caliendo et al. (2017), we have that
1

Pnj = Γ (ξ) 1−η Ωjn

− θ1

,

where Γ (ξ) is the Gamma function evaluated at ξ = 1 + 1−η
. The price of goods in tradable
θ
sector j may then also be expressed as
"
Pnj = Γ (ξ)

1
1−η

N
X


κjnn0 xjn0

−θ

#− θ1
.

n0 =1

In a given non-tradable sector j, κjnn0 = ∞ ∀n0 6= n, so that the equation reduces to
1

Pnj = Γ (ξ) 1−η xjn .
A.2.4

Trade Shares

Let Xnj denote total expenditures on final goods j by city n, which must equal of the value
of final goods in that sector, Xnj = Pnj Qjn . Recall that because of zero profits in the final
goods sector, total expenditures on intermediate goods in Ra given sector are then also equal
j
to
cost of inputs used in that sector, so that Pnj Qjn = Pnj (z) Qjn (z) dΦ (z). Let Xnn
0 =
R the
j
j
j
κnn0 pn0 (z)Qnn0 (z)dΦ(z) denote the value spent by city n on intermediate goods of sector j
j
produced in city n0 . Further, let πnn
0 denote the share of city n’s expenditures on sector j
goods purchased from region n0 . Then,
j
πnn
0

j
Xnn
0
=
j .
Xn

Observe that, since there is a continuum of varieties of intermediate goods, the fraction of
goods that firms in city n purchase from firms in city n0 is given by


 j
j
j
Pnn00 (z) .
π̃nn0 ≡ Pr Pnn0 (z) ≤ min
00
0
n 6=n

Following the steps described in Caliendo et al. (2017), we have that

j
π̃nn
=
0

j
ωnn
0
j
Ωn

 j j −θ
κnn0 xn0
= P
 j
−θ
N
j
n00 =1 κnn00 xn00
j
j
We can verify that π̃nn
0 = πnn0 , that is, the share of goods that firms in city n purchase
from city n0 is equal to the share of the value of goods produced in city n0 in the bundle

55

purchased by firms in city n (see Eaton and Kortum (2002), Footnote 17). Observe also that
η
PN  j j −θ
−θ
= (Pnj ) Γ (ξ) 1−η . Therefore, we may alternatively write the trade share
n0 =1 κnn0 xn0
j
πnn
0 as
j
πnn
0

#−θ
"
1
j
Xnn
κjnn0 xjn0 Γ (ξ) 1−η
0
=
=
Xnj
Pnj

j
In non-tradable sectors, πnn
= 1.

A.3

Market Clearing and Aggregation at the Industry and City
Level

R kj
Ln (z)dΦ(z), the labor market
Given the labor supply equation (2) and the definition Lkj
n =
clearing equation (5) may be rewritten as

k k ν
X
C
A
n
n
k
Lkj
 k , ∀ n = 1, ..., N , k = 1, ..., K.
n = L P
k
k ν
A
C
0
0
0
j
n n
n
R j
j
Given the definition Hn = Hn (z)dΦ(z), the market clearing equation for structures in each
city (6) may be rewritten as
X
Hnj = Hn , n = 1, ..., N.
j

Given our definition of total final expenditures, Xnj = Pnj Qjn , and the demand function for
consumption goods of sector j (15), the market clearing condition for final goods in each
city n and sector j (7) may be expressed in terms of sectoral and city aggregates,
X
X

0
Lkn αj Pn Cnk + Pnj
Mnjj = Xnj .
j0

k

Finally, given that πnj 0 n Xnj 0 = Xnj 0 n = pjn (z)κjn0 n Qjn0 n (z)dΦ(z), the market clearing condition
for intermediate inputs (8) may be rewritten in terms of sectoral city aggregates as
Z
X j
πn0 n Xnj 0 ,
=
pjn (z)qnj (z)dΦ(z)
n0
{z
}
|
R

Total value of intermediate goods produced in city n

j
j
n0 πn0 n Xn0

P

where
is the total value of expenditures across all cities spent on intermediate
goods produced in city n.
We can use this last aggregation relationship to obtain aggregate factor input demand
equations as follows,

56



wnk Lkj
= γnj 1 − βnj P
n
K

k
wn
kj
λn



k0 =1

rn Hnj = γnj βnj

X

1−
X
1−
k0

wn
0

λkn j


πnj 0 n Xnj 0 ,

n0


πnj 0 n Xnj 0 ,

n0
jj 0

j0

Pn Mn

j0j

= γn

X


πnj 0 n Xnj 0 .

n0

Finally, combining these factor demand equations yields the aggregate production function,

X
n0

A.4

πnj 0 n Xnj 0

=

xjn

X


 1
kj 1− 
λkj
L
n n

j

! −1
)
(1−βn

γnj
β j
Hnj n 

Y

0
Mnj j

γnj0 j

.

j0

k

Definition of Equilibrium

Equilibrium for this system of cities is given by a set of final goods prices Pnj , wages in
different occupations, wnk , rental rates, rn , intermediate goods prices paid by final goods
producers, Pnj (z), intermediate goods prices received by intermediate goods producers, pjn (z),
consumption choices, Cnkj , intermediate input choices, Qjn (z), intermediate input production,
0
qnj (z), demand for materials, Mnjj (z), labor demand, Lkj
n (z), demand for structures, Hn (z),
k
and location decisions, ζn (a), such that:
i) Workers choose consumption of each final good optimally, as implied by equation (15)
Q  Pnj αj
P j kj
k
k
and the budget constraint, j Pn Cn = Pn Cn = In , where Pn = j αj
and Ink is given
by equation (1).
ii) Workers choose optimally where to live as implied by equation (2).
iii) Intermediate input producers choose their demand for materials, labor and structures
optimally (as implied by factor demand equations (17), (18) and (19)), and produce positive
but finite amounts only if (21) holds, where xjn in that equation is given by (20).
iv) Final goods producers choose the origin of intermediate inputs optimally, implying
that a producer incity n and industry j imports a variety z from city n0 if and only if
κjnn0 pjn (z) = minn00 κjnn00 pjn00 (z) . The price that they pay for intermediate goods satisfies
(22) if the good is tradable and (23) if it is non-tradable.
v) Final goods producers choose their intermediate input use optimally according to (24)
and produce positive but finite amounts only if (25) holds.
vi) Market clearing conditions for employment (equation 5), land and structures (equation
6), final goods (equation 7), and intermediate goods (equation 8) hold.

A.5

Aggregate Equilibrium

At the aggregate level, equilibrium is given by values for the prices Pn , Pnj , xjn , rn , wnk ,
j
j
j0j
aggregate quantities Cnk , Lkj
, expenditures, Xnj , and expenditure shares, πnn
0,
n , Hn , Mn
that satisfy the following equations
57

X

 X j jj 0
0
j
k
Pn Mn = Xnj (N J eqs.)
α
P
C
Lkj
n n +
n

(26)

j0

k,j 0

Lkn

=

X

Lkj
n

=P

 Lk
k
k ν
An0 Cn0

n0

j

X

ν

Akn Cnk

(N K eqs.)

(27)

Hnj = Hn (N eqs.)

(28)

j

Pn =

Y  P j α
j

Pn Cnk

=

wnk

n
αj

P

0

+ Pn

k
wn
λkj
(L
n)
n

wnk Lkj
n = P 
k0

(N eqs.)

rn0 Hn0
0

k0 ,n0 ,j



j

Lkn0j

(29)

(N K eqs.)

(30)

1−

k0
wn
k0 j
λn (Ln )

j
j
1− γn 1 − βn

rn Hnj = γnj βnj

X

X

πnj 0 n Xnj 0 (N KJ eqs.)

(31)

n0

πnj 0 n Xnj 0 (N J eqs.)

(32)

πnj 0 n Xnj 0

(33)

n0
0

0

0

Pnj Mnj j = γnj j

X


N J 2 eqs.

n0

Pnj =




1

Γ (ξ) 1−η

Γ (ξ)
X

1
1−η

P 
 − θ1
j
j −θ
κ
x
nn0 n0
n0
xjn

if j is tradable

(N J eqs.)

(34)

if j is non-tradable

πnj 0 n Xnj 0

n0


=

xjn

X


 1
kj 1− 
λkj
n (Ln )Ln

j

! −1
(1−βn
)

γnj
β j
Hnj n 

Y

0
Mnj j

γnj0 j

(N J eqs.)

(35)

j0

k

j
πnn
0

 j j −θ
κ 0x 0
= P nn n −θ (N 2 J eqs.)
j
j
n00 κnn00 xn00

(36)

This system of equations comprises 2N + 2N K + 4N J + N KJ + N J 2 + N 2 J equations
in the same number of unknowns.
By substituting equation (33) into equation (26), adding over all industries (j) and all
cities (n) and rearranging, we arrive at the National Accounting identity stating that aggregate value added is equal to aggregate consumption expenditures in the economy,
58

X

k
Lkj
n Pn C n =

X

γnj Xnj

(37)

n,j

n,k,j

At the same time, multiplying both sides of equation (30) by Lkn , adding over city (n) and
occupation (k), and substituting out wnk and rn Hn using equations (28), (31) and (32), yields
the same national accounting identity. The fact that we can arrive at that same identity
by manipulating different sets of equations implies that there is one redundant equation in
the system, leading to one too many unknowns relative to the number of equations. The
presence of a redundant equation is a feature of Walrasian systems. In order to pin down
the price level, therefore, we need to amend the system with an additional equation defining
the numeraire. Specifically, we set :
X
ωn ln(Pn ) = ln(P̄ ),
(38)
n,j

where ωn are a set of weights. When computing counterfactuals,
P k k we set those weights to be
proportional to local nominal consumption: ωn ∝ Pn k Cn Ln . Finally, observe that if we
substitute the factor demand equations (31), (32), (33) into (35), we obtain the expression
for the unit cost index,

xjn

=

Bnj





j
rnβn

j
j γn
1−βn
"K 
#

J 
X wk 1− 1−  Y




A.6

k=1

n
kj
λn




0
Pnj

γnj0 j

.

(39)

j 0 =1

TFP accounting

In the text, we define TFP in equation (9) as
!
ln T F Pnj = ln

X

πnj 0 n Xnj 0 −ln Pnj −γnj βnj ln Hnj −γnj (1−βnj )

X

n0

δ kj ln Lkj
n −

X

0

0

γnj j d ln Mnj j ,

j0

k

where
δ

kj

P
=P

n0

wnk 0 Lkj
n0
0

wnk 0 Lkn0j
0

n,k0

is the national share of occupation k in the wage bill across all occupations.
From equation (35,)
X

πnj 0 n Xnj 0

=

xjn

"

X


 −1
 −1
kj kj

λn Ln

n0

1−βnj

 j
j βn

Hn

#γnj

Y

0
Mnj j

γnj0 j

j0
− θ1

j

j
Also, recall that we can write Pxnj = κj1 (πnn
) , which picks up the role of selection effects
n
nn
on P
productivity (see Caliendo et al. (2017)). We take a first order log-linear approximation
of n0 πnj 0 n Xnj 0 around national averages to obtain

59

!
d ln

X

πnj 0 n Xnj 0

 −1
X

λkj Lkj 
1
j
j
j
kj
'
− d ln πnn + γn (1 − βn )
d ln Lkj
−1
P
n + d ln λn
θ
kj kj 
k
k0 (λ L )
X 0
0
+ γnj βnj d ln Hnj +
γnj j d ln Mnj j ,
d ln Pnj

n0

j0
−1

−1

kj

(λkj
n Ln )
−1 . From manipulating equation
P
kj kj

k0 (λn Ln )

(λkj Lkj ) 
where P
−1 is the national average of

kj kj
k 0 (λ L )
(31), we can verify that:

 −1
kj

λkj
wnk Lkj
n Ln
n
=P
0 k0 j
k
P  kj kj  −1

k 0 wn Ln
k0 λn Ln
If we log-linearize around a national weighted average across cities, where we weight
individual cities by their wage bill, we have that
λkj Lkj
P

k0

 −1


(λkj Lkj )

−1


=

wnk Lkj
n

X
P
n

P
k0 j

k0
k0 wn Ln

×P

0

0

k0

wnk Lkn j
0

k0 j

k
n0 ,k0 wn0 Ln0

P
=P

n

wnk Lkj
n
0

k kj
n0 ,k0 wn0 Ln0
0

= δ kj ,

so that
!
d ln

X
n0

πnj 0 n Xnj 0

X

1
j
kj
' d ln Pnj − d ln πnn
δ kj d ln Lkj
+ γnj (1 − βnj )
n + d ln λn
θ
k
X 0
j j
j
jj
j0j
+ γn βn d ln Hn +
γn d ln Mn
j0

Comparing to the expression for TFP, it follows that, up to a first order approximation,
X

1
j
+ γnj (1 − βnj )
δ kj d ln λkj
d ln T F Pnj ' − d ln πnn
n
θ
k
which, abstracting from selection effects, reduces to
d ln T F Pnj = γnj (1 − βnj )

X

δ kj d ln λkj
n



k

(1−βnj )γnj
β j γ j
Defining Tnkj ≡ λkj
Hnkj n n it follows that for tradable sectors (in which case
n
βnj = 0 and γnj = γ j for all n),
X
d ln T F Pnj =
δ kj d ln Tnkj
k

60

or
ln T F Pnj =

X

δ kj ln Tnkj + constant independent of n

k

For the purposes of comparing T F Pnj across space, we can omit that constant.

B

Quantifying the Model and Model Inversion

We now provide additional detail on how we quantify the model. The set of parameters
needed to quantify our framework fall into broadly two types: i) parameters that are constant across cities (but may vary across occupations and/or industries) and that are directly
available from statistical agencies, or that may be chosen to match national or citywide
averages, and ii) parameters that vary at a more granular level and require using all of the
model’s equations (i.e. by way of model inversion) to match data that vary across cities,
industries, and occupations.

B.1

Details on Parameters That Are Constant Across Cities
0

Input use shares in gross output (γnj , γnjj ) : To obtain an initial calibration for these
share parameters, we use an average of the 2011 to 2015 BEA Use Tables, each adjusted by
the same P
year’s total grossP
output. The Use Table divides the value of the output in each
sector j, n0 ,n πnj 0 n Xnj 0 = n Xnj , into the value of input purchases from other sectors j 0 ,
P
P
P
j0
j0j
wnk Lkj
, operational surplus, n rn Hnj , and taxes on
n
n,j 0 Pn Mn , labor compensation,
n,k
P
production and imports, − n sjn Xnj ,
X
X 0 0
X
X
X
Xnj =
Pnj Mnj j +
wnk Lkj
rn Hnj −
sjn Xnj .
(40)
n +
n

n,j 0

n

n,k

n

Input purchases from other sectors are separated into purchases from domestic producers and
purchases from international producers. Since the model does not allow for foreign trade, we
adjust the Use Table by deducting purchases from international producers from the input
purchases and, for accounting consistency, from the definition of gross output for the sector.
As in Caliendo et al. (2018), for all sectors, we augment material purchases to include
the purchases of equipment. Specifically, we subtract from the operational surplus of each
sector 17 percent of their value added and then add the same value back to materials.32 This
17 percent value is estimated by Greenwood et al. (1997) as the equipment share in output.
We then pro-rate the equipment share of value added to different materials in proportion to
their use within each sector.
We interpret the remaining part of the gross operational surplus in a given sector as
compensation for services provided by real estate. We adopt the convention that all land
and structures are managed by firms in the real estate sector, which then sell their services
to other sectors. Accordingly, for all sectors other than real estate, we reassign the gross
32

When gross operation surplus amounts to less than 17 percent of value added, the entire operational
surplus is deducted.

61

operating surplus remaining, after deducting equipment investment, to purchases from the
real estate sector. These surpluses are in turn added to the gross operating surplus of real
estate.
It follows that, for all sectors j other than real estate,
X
rn Hnj = 0.
n

and in each of those sectors,
Pnreal estate Mnreal estate,j = Purchases from real estate by j
+Operational Surplus of j
−Equipment Investment by j.
In contrast, in the real estate sector,
X

rn Hnreal estate = Total Operational Surplus across all j

n

−Total Equipment Investment across all j.
One can verify that those reassignments do not affect aggregate operational surplus (net
of equipment investment), aggregate labor compensation, and aggregate value added (net of
equipment investment).
We assume that tradable sectors have a γnj = γ j , constant across cities and similarly for
jj 0
γn ’s. The two non-tradable sectors have city specific parameters. Given these adjustments
to the Use Table, the share parameters for tradable sectors follow immediately,
P
P
P j0 j0j
j
k kj
P M
n rn Hn
n,k wn Ln +
j
j0j
γ =
, γ = P n n jn j.
(41)
P
j
j
n (1 + sn )Xn
n (1 + sn )Xn
Furthermore, we have that, for the non-tradable sectors,
P k kj
j
j
k wn Ln + rn Hn
γn =
,
(1 + sjn )Xnj

(42)

where sjn is an ad-valorem subsidy for city n, sector j, which we introduce to account for
the fact that part of the sectoral value added calculated by the BEA is in fact paid out in
indirect taxes. Finally, since we do not observe use of materials by individual sectors in each
city, we assume that, the proportions of materials used in each city by nontradable sectors if
0

fixed at the national averages

j j
γn
j
1−γn

is the same for all cities and satisfies equals
P j0 j0j
0
P M
γnj j
j0j
= P n n j0 nj0j
j = γ̂
1 − γn
n,j 0 Pn Mn
0

P
P

0

n

0

Pnj Mnj j

n,j 0

0

0

Pnj Mnj j

The calibration of γnj and, therefore of γnj j , will require choosing additional parameters
as described below but consistent with the above equations.
62

Trade costs We assign trade costs to be a log-linear function of distance, that is,
j

κjnn0 = (dnn0 )t

where κn,n0 is the amount of goods that need to be shipped from location n0 in order for one
unit of the good to be available in location n, and dn,n0 is the distance (in miles) between
the two locations.
From equation (36) we can write
j
j
log(πnn
0 ) = −θt log(dnn0 ) + cn + cn0

(43)

where cn and cn0 are n and n0 location-specific factors
cn0 = −θ log(xjn0 )
and
X j
−θ
cn = − log
κnn00 xjn00

!

n00

We assign trade costs to industries using three different methods. First, we assign two
industries (retail, construction and utilities, and real estate) to be non-tradable. Second,
we use the estimates in Table 1 of Anderson et al. (2014) to obtain gravity coefficients for
services. Third, we rely on equation (43) to obtain the gravity coefficients. In order to do this,
we use the 2012 Commodity Flow Survey Public Use Microdata File. We add up shipment
values by industry, origin and destination and then, for each industry, we regress the log of
those averages on log of average shipment distance between each origin and destination.
The gravity coefficients used are summarized in Table 9, below.

B.2

Model Inversion for the Granular Parameters

From the ACS, we obtain data pertaining to wnk , and

Lkj
n
k0 j .
k0 Ln

P

The spatial distribution of

CNR shares (Lkn /Ln ) is depicted in Figure 20 below.
Census County Business Patterns
P The
k0 j
(CBP) provide us measures of total employment k0 Ln that better match BEA industrylevel counts, which we combine with the ACS data to obtain Lkj
n . From the BEA, we obtain
regional price parity (RPP) indices for each city, disaggregated into goods, services and rents.
As explained below, we use the level of rents and the relative price of goods and services,
providing us with 2(N − 1) additional restrictions (we deduct 2 since prices in any given city
are only defined relative to those in other cities). Furthermore, we can use the BEA Use
Tables to calculate the national share of income from land and structures in the production
of real estate, providing us with one additional equation. Lastly, as we explain in more detail
below, we can apply J normalizations to each sector.
The data plus normalizations above impose N K + N KJ + 2N + J − 1 independent restrictions that allows us to solve for N K values for amenity parameters, Akn , N KJ scaling

63

Table 9: Estimated Gravity coefficients (−θtj )
Industry
Retail, Construction and Utilities
Food and Beverage
Textiles
Wood, Paper, and Printing
Oil, Chemicals, and Nonmetalic Minerals
Metals
Machinery
Computer and Electric
Electrical Equipment
Motor Vehicles (Air, Cars, and Rail)
Furniture and Fixtures
Miscellaneous Manufacturing
Wholesale Trade
Transportation and Storage
Professional and Business Services
Other
Communication
Finance and Insurance
Real Estate
Education
Health
Accomodation

Gravity Coefficient
-∞
-1.24
-0.88
-1.36
-1.32
-1.20
-0.81
-0.77
-0.64
-0.90
-1.18
-0.83
-0.563
- 0.617
-0.928
-0.724
-0.297
- 0.678
-∞
-1.01
-1.42
-0.927

Source
Non-tradable
own estimate
own estimate
own estimate
own estimate
own estimate
own estimate
own estimate
own estimate
own estimate
own estimate
Anderson et al.
Anderson et al.
Anderson et al.
Anderson et al.
Anderson et al.
Anderson et al.
Non-tradable
Anderson et al.
Anderson et al.
Anderson et al.

(2014)
(2014)
(2014)
(2014)
(2014)
(2014)
(2014)
(2014)
(2014)

See text for own estimation details. Coefficients from Anderson et al. (2014) are extracted from Table 1

γnj (1−βnj )
γj βj
, (N − 1) shares of non-residential strucfactors in production, Tnkj ≡ (Hnj ) n n λkj
n
tures in value added in the real estate sector, βnreal estate , (N − 1) shares of value added in
non-tradable output, and J − 1 independent values for consumption share parameters, αj .33
The steps below describe the model inversion.
1. Computing consumption shares, αj . We first add up equation (26) across n and
j
j
j. We then use the factor

 demand equations (31) and (32) to obtain γn Xn =
P
P
0
0
j
k kj
, and the national accounting identity (37) to substitute
n rn Hn +
n,k wn Ln
out Xnj ’s and Cnk ’s from the aggregated equation (26):
!
!
X 0 0
X
X
X X
X
X
0
0
0
Pnj Mnj j .
wnk Lkj
wnk Lkj
+
Pnj Mnjj =
rn Hnj +
αj
rn Hnj +
n +
n
j0

n

n,k

j0

n

n,k

j0

P
j
jj 0
The ACS does not provide data on rn Hnj ,
or their sum across cities. While
j 0 Pn Mn
the BEA provides data on sectoral aggregates, those cover the whole country as opposed to
33

Furthermore, additional restrictions imposed on αj and β j , specifically that αj ∈ [0, 1], and β j ∈ [0, 1],
imply some overidentifying restrictions.

64

CNR share of employment
in data (deviation from mean)
Population

< -0.115

[ -0.0573 , 0 )

[ 0.0573 , 0.115 )

[ -0.115 , -0.0573 )

[ 0 , 0.0573 )

>= 0.115

12,000

3,909,000

7,805,000

Figure 20: CNR share from 2011-2015
Each marker in the map refers to a CBSA. Marker sizes are proportional to total equilibrium
employment in each city.

only MSA’s. Thus, we rely instead on ratios,

j
n rn Hn
kj
k
n,k wn Ln

P
P

and
P

P

0

n,j 0

P

Pnj Mnjj

n,k

k Lkj
wn
n

0

obtained from the

Use Tables, which we can then combine with data on n,k wnk Lkj
n and the above equation.
Specifically,
!
P
P
j0
X
X n,j 0 Pnj 0 Mnjj 0
r
H
0
0
n
n
αj
wnk Lkj
wnk Lkj
P
P n k kj 0 + 1 +
n
n
kj
k
n,k wn Ln
n,k wn Ln
j 0 ,n,k
n,j 0 ,k
!
P
P
j0
j0j
j
X
n,j 0 Pn Mn
k kj
n rn Hn
=
wn Ln P
+1 ,
0 + P
0
k0 k j
k0 k j
n,k0 wn Ln
n,k wn Ln
n,k
or
X

αj

 X k kj 0 jj 0
0
j
+
1
+
wn Ln %M
wnk Lkj
%
n
H
j 0 ,n,k

n,j 0 ,k

=

X



j0j



wnk Lkj
%jH + %M + 1 ,
n

n,k
0

where %jH and %jMj denote, respectively, the ratio of national aggregate rental income and
the ratio of national aggregate material inputs usage from sector j 0 to national aggregate
65

wage income in sector j which are consistent with the Use Tables. The J equations above
j
j
can be solved for J values
P ofj α . One can verify that any value of α obtained from thosej
equations will satisfy
j α = 1. One complicating factor is that in each sector j, α
must live in [0, 1]. However, because of measurement inconsistencies between ACS and BEA
data, the procedure generates negative values of αj in three out of 22 sectors. One of those
sectors (“Oil, Chemicals, and Nonmetallic Minerals”) indeed has much of its employment
located outside of urban areas. We use information from the Use Tables to calibrate αj in
that sector, setting it equal to 5.57 percent. The other two sectors (“Wood, Paper, and
Printing”, and “Metals”) are to a large degree producers of inputs for other industries, so
that we set their consumption shares
to 0. To ensure that all equations hold while satisfying
j0j
those restrictions, we allow %M ’s to deviate somewhat from those obtained from the Use
0
Tables. This requires adjusting γ j and γ j j for the tradable sectors, since those satisfy
γj =

1+%jH
P 0 0 ,
j
1+%H + j %jMj

0

0

γj j =

%jMj
P 0 0 .
j
1+%H + j %jMj

2. Expressing gross output and rental income from each sector and city as functions of
share parameters and wage bills. Using the labor demand equations (31), we obtain
X

wnk Lkj
n j .
j
1 − βn γn

P

πnj 0 n Xnj 0

=

n0

k

(44)

j
In the non-tradable sectors, πnn
= 1 and πnj 0 n = 0 for n0 6= n so that

Xnj =

wnk Lkj
n j .
j
1 − βn γn

P

k

For all sectors other than real estate, we have that βnj = 0, so that rn H jn = 0. For the
real estate sector, we have from the first-order conditions in that sector that
rn Hnreal estate =

βnreal estate X k k,real estate
w L
.
1 − βnreal estate k n n

Since real estate services are the only sector with positive rental income, this is also
equal to the total rental income in each city.
3. Computing the shares of land and structures in value added for the real estate sector,
βnreal estate . We use equations (30) to substitute out Pn Cnk in equations (26). We then
apply the relationships from equation (44) to substitute out gross output in (31) to
(33), and use the resulting equations to substitute out factor demands in (26). Given
that in the non-tradable sectors (“real estate” and “retail, construction, and utilities”),
expenditures are equal to gross output, this implies that, for j ∈ {“real estate”,“retail,
construction, and utilities”}, we have that

66

P k kj
1
k wn Ln
j
1 − βn
γnj
X
= αj
wnk Lkn
k

+ αj

X Lk

n k

Lk

k

+

X

b

n0

1−

0
γnj

j0

j0

βnreal estate X k0 k0 ,real estate
w 0L 0
1 − βnreal estate k0 n n

X

j0

γn (1 − βn )

γ̂ jj

0

X

!

0

wnk Lkj
n ,

k

where we are using the fact that βnj = 0 for all sectors other than real estate. Given
that we have two non-tradable sectors, this is a system of 2N equations, in N values
for γnj and N values of βnreal estate .
4. Computing individual values for nominal expenditures, Xnj , in tradable sectors. We use
equations (30) to substitute out Pn Cnk in equations (26). We then apply the relationships from equation (44) to substitute out gross output in (31) to (33), and use the
resulting equations to substitute out factor demands in (26). In the tradable sectors,
this gives us

Xnj = αj

X

wnk Lkn

k

X Lk

βnreal estate X k0 k0 j 0
w 0L 0
1 − βnreal estate k0 n n

X

!

n k
b
k
L
k
n0
P
k kj 0
X 0
jj
k wn L

n 0
γn
+
j0
1 − βn γnj
j0

+ αj

Given values for βnj from the previous step, values for Xnj are then immediately determined from the data.
5. Computing relative cost indices for tradable goods, x̃jn . For N (J − 2) tradable sectors
(all but “real estate,” as well as “retail, construction, and utilities”), we now solve for
j
(N − 1)(J − 2) values of the cost index, P xnxj , for each j ∈ {1, ..., J} from the system
n0

of (N − 1)(J − 2) independent equations,

n0

N
X

wnk Lkj
n

=
πnj 0 n xj Xnj 0 ,
j
j
1 − βn γn n0 =1

P

k

where xj = {xj1 , ..., xjN } is the vector of unit production costs. This system comprises
only (N − 1)(J − 2) independent equations since, for each j, adding up the right-handside and left-hand-side over n gives the same result on both sides irrespective of xj . At
67

the same time πnj 0 n (xj ) is homogeneous of degree 0 in xj for each j in equation (36),
j
so that we can still solve for the ratio, x̃jn ≡ P xnxj . 34
n0

n0

6. Computing relative tradable consumer prices, Penj , in every sector and city. Substituting
x̃jn from the previous step into equation(34) and rearranging, we have that for the
tradable sectors,

Pnj = Γ (ξ)

1
1−η

X j
−θ
κnn0 x̃jn

!− θ1

n0

×

X

xjn0 ,

n0

which gives a system of N (J − 2) equations. We can thus determine

1

ΞjP ≡ Γ (ξ) 1−η

j
n0 xn0
Pj

P


X
=
$nj

X j
−θ
κnn0 x̃jn

!− θ1 −1


n0

P
for each j by imposingP n $nj Pnj = P j , where $nj are model-consistent expenditure
weights given by Xnj / n0 Xnj 0 obtained in step 4. We may then then obtain for all
tradable j 0 s
!− θ1
j
X


P
−θ
κjnn0 x̃jn
.
P̃nj ≡ nj = ΞjP
P
0
n
Note that data on P j is only available in changes from a base period. Thus, we define
the base period to be 2011-2015, our benchmark period, and set P j = 1 in all sectors
in that period. For the remainder of the analysis, therefore, P̃nj = Pnj .
7. Computing non-tradable consumer prices. In the non-tradable sectors, we have that
1
Pnj = Γ (ξ) 1−η xjn for all n and j, and for those sectors, we determine prices based on
data from the Regional Price Parity (RPP) indices calculated by the BEA. We directly
1
estate
obtain values for Γ (ξ) 1−η xreal
≡ Pnreal estate from the RPP estimates of the price of
n
real estate services in different cities. For the other non-tradables (“retail, construction,
1
etc.
and utilities”), we choose Pnretail, etc. = Γ (ξ) 1−η xretail,
so that the price of services
n
(other than real estate) relative to tradable goods in the model matches its counterpart
in the RPP. To carry out this calculation, observe that the price index for services
αj


0
j0 P j
Σj 0 ∈Services αj
Σ
α
0
n
j
∈Services
where the service
can be defined by PnServices = Πj∈Services
αj
sectors include retail, etc., wholesale trade, transportation and storage, professional
and business services, other, communication, finance and insurance, education, health,
and accommodation. The price index for goods can be defined analogously where the
goods sector includes all remaining sectors other than real estate.
θj

34

Numerically, the system is easier to solve for

(xjn )
from which we can easily obtain values for xjn ’s.
θj
j
n0 (xn0 )

P

68

8. Computing firm productivity in different sectors, j, and cities, n,
occupation k, λkj
n . From equations (31), we have that


k
wn
kj
λn

1−

wnk Lkj
n = P  k0 1−
wn

X

wnk Lkj
n ,

k

0

k0

associated with

λkn j

which can rewrite as
P
wnk Lkj
n =
P

k0 j
k 0 λn
kj
λn

wnk

P

k0 j
k 0 λn
0
λkn j

k0

1−

wnk0

X
1−

wnk Lkj
n .

k

Thus, for each city n and industry j, we can use K − 1 of those equations to solve for
λkj
n
K − 1 ratios, λ̃kj
k0 j . With some rearrangement, those can be written as
n = P
k0

λn

λ̃kj
n

1
 −1
Lkj
n
=
1 .
 0  −1

P
0 −1
kj
k
Ln
k0 (wn )

wnk


 −1

From equations (45) (obtained by substituting the factor demand equations (31), (32)
and (33) into equations (35)), and the value for rn Hn obtained in step 2, we obtain
!γnj (1−βnj )
j j
γn
βn

Hn

X

1

k0 j

Γ (ξ)− 1−η

λn

(45)

k0



 X

Bnj
= j j
x̃n ΞP 


j

βnj X k kj
wn Ln
1 − βnj k

!βnj "

K 
X
k=1

j
j γn
1−βn
#


J 
1− 1− 
Y
wk

n
kj
λ̃n




0
Pnj

γnj0 j

,

j 0 =1

1

where we set Γ (ξ)− 1−η = 1 since it is common to all sectors and cities and thus
immaterial in any counterfactual exercise. Recall that the use of land and structures
as inputs has been folded in the real estate sector that then sells real estate services
to all other sectors (i.e. βnj = 0 in all sectors but real estate). Then, multiplying
 γnj
both sides of equation (45) by the ratios λ̃kj
gives N K(J − 1) values for the
n
productivity of firms in different sectors, j, and cities, n, associated with occupation
γnj (1−βnj )
γj βj
k, Tnkj ≡ (Hnj ) n n λkj
, which, in the special case where one abstracts from
n
differences in occupational composition across cities, reproduces measured regional and
sectoral productivity in Caliendo et al. (2018).
9. Computing the idiosyncratic amenity distribution parameter ν and amenity shifters Akn
for each city n and occupation k
69

To compute ν we match the estimate for local labor supply elasticity with respect to
local wage estimated by Fajgelbaum, Serrato and Zidar (2018) of 1.14. In our setup,
k
for any occupation k and city n, this elasticity is ν Pwn Cn k . The average elasticity is equal
n
to 1.14 if ν = 2.017. Given ν, we can now back-out amenities from the labor supply
equation 2.35

B.3

Instrumenting for Employment Level and Composition

In order to isolate the residual simultaneity between exogenous productivity variation and
labor allocation, we resort to variants of instruments proposed in the literature. Specifically,
we follow Ciccone and Hall (1996) and use population a century prior to our data period
to capture historical determinants of current population, and we follow Card (2001) and
Moretti (2004a), and use variation in early immigrant population and the presence of landgrant colleges to capture historical determinants of skill composition of cities. We now discuss
the particular instruments in more detail.
Population in 1920 Ciccone and Hall (1996) argue for the validity of historical variables
as instruments under the assumption that, after allowing for the controls described above,
original sources of agglomeration only affect current population patterns through the preferences of workers, and not through their effect on the residual component of productivity.
This reasoning motivates using population almost one hundred years prior to our data period as an instrument and will also serve as motivation for the other instruments, described
below.
Irish immigration in 1920 Next, we use the fraction of Irish immigrants in the population of each city in 1920. This instrument is motivated by Card (2001), who uses the location
of immigrant communities as an instrument for labor supply in different occupations. For
our purposes, we focus on the location of Irish immigrants following evidence reviewed by
Neal (1997), and further studied by Altonji et al. (2005), showing that attending catholic
schools substantially increases the likelihood of completing high school and college education.
We use as an instrument the fraction of Irish immigrants, rather than the overall catholic
population, because Irish immigrants represented the first wave of catholic immigration to
the U.S. and, therefore, historically were the first to invest in education. As additional validation for this instrument, we compile data on the current location of catholic colleges, and
observe that MSAs in which catholic colleges are present had in 1920 more than three times
the fraction of Irish immigrants as other locations.
The Presence of land-grant colleges Lastly, following Moretti (2004a), we also use as
an instrument the presence of a land-grant college within the city. Land-grant colleges were
established as a result of the Morrill Act of 1862, and extended in 1890, a federal act that
sought to give states the opportunity to establish colleges in engineering and other sciences.
35

For given k, labor supply is homogeneous of degree zero in Akn . This implies that amenities are only
determined up to an arbitrary, occupation-specific scaling constant, that is, we can change An |k to Ãkn so
that Ãkn = mk Akn without any observable implications.

70

Since the act is more than a century old, the presence of a land-grant college in the city is
unlikely to be related to unobservable factors affecting productivity in different cities over
our base period, 2011 − 2015. At the same time, as shown in Moretti (2004a), the presence
of land-grant colleges is generally correlated with the composition of skills across cities.

B.4

A check on the instruments: Estimates in Counterfactual
without Externalities

Table 10 below shows the results from carrying out the same estimation exercises as in Table
7 using employment and productivity values obtained from a counterfactual equilibrium in
which externality elasticities, τ R,k and τ L,k , are set to zero but all other model parameters
are kept at their original levels.
Table 10: Estimates with data generated by counterfactual without externalities
(1)
OLS
VARIABLES
k

γnj log( LLnn )
γnj log(Ln )

Observations
K.P. F
S.W.F. Lkn Share
S.W.F. Ln

B.5

(2)
2SLS

(3)
CUE

CNR

non-CNR

CNR

non-CNR

CNR

non-CNR

-1.054***
(0.30)
0.028
(0.06)

-0.109
(0.48)
-0.034
(0.04)

-0.240
(0.73)
-0.020
(0.06)

-0.860
(0.93)
-0.065
(0.04)

-0.073
(0.72)
0.004
(0.06)

0.095
(0.92)
0.003
(0.04)

7,459

7,459

7,459
7,459
7.646
7.742
12.18
12.29
14.84
17.10
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1

7,459
7.646
12.18
14.84

7,459
7.742
12.29
17.10

Model-Implied IV

In this exercise, we estimate externalities using an IV implied by the model. This is obtained
by calculating the counter-factual allocation associated with an economy where, for any given
industry/occupation category, productivity is constant across cities, and using the resulting
counter-factual labor allocation as instruments.
This instrument will correct for a reverse causality problem since, by construction, there
is no exogenous variation in productivity across cities. Table 11 below shows the estimates.
The F-statistics are very large, implying no need to explore GMM-CUE estimates. Moreover,
the coefficients present the same general pattern as our baseline estimates: occupational
externalities are generally stronger than those associated with total population.
71

Table 11: Externality estimates with Model Implied IV’s

VARIABLES
k

γnj log( LLnn )
γnj log(Ln )

(1)

(2)

OLS

2SLS

CNR

non-CNR

CNR

non-CNR

0.900***
(0.12)
0.388***
(0.05)

0.681***
(0.22)
0.322***
(0.04)

0.758***
(0.12)
0.407***
(0.05)

0.390*
(0.21)
0.306***
(0.04)

Observations
7,459
7,459
7,459
R-squared
0.406
0.310
0.406
K.P. F
611
k
S.W.F. Ln Share
1216
S.W.F. Ln
1749
Robust standard errors in parentheses
*** p<0.01, ** p<0.05, * p<0.1

C

7,459
0.308
331.8
596.7
1492

The Planner’s Problem

This section describes the solution to the planner’s problem taking as given that workers
in different occupations can freely choose in which city to live. Under this assumption, the
expected utility of a worker of type k is given by equation (16). Given welfare weights for
each occupation, φk , the utilitarian planner then solves
W=

X

φk Γ

k



 X
N
ν−1
ν

! ν1

k ν

Akn Cn

Lk ,

(46)

n=1

where recall that Cnk aggregates final goods from different sectors:
Y
αj
Cnkj
.
Cnk =

(47)

j

The planner maximizes (46) subject to the resource constraints for final goods,

X
k

Lkn Cnkj +

XZ

0

Mnjj (z)dΦ(z) = 

j0

Z "X

# η−1
η
Qjnn0 (z)

η
 η−1

dΦ(z)

,

(48)

n0

where Qjnn0 (z) are the purchases of intermediate goods produced in city n0 by final goods
firms in city n, the resource constraints for intermediate goods of all varieties z and industries
j produced in all cities n
72

X

Qjn0 n (z)κjn0 n = qnj (z), ∀z ∈ R+
n,

(49)

n0

where
j


j

qnj (z) = zn Hnj (z)βn



# −1
(1−βnj ) γn Y
−1
 
j0 j
kj
j0j
γn

λkj
(L
)
L
(z)
M
(z)
,
n
n
n
n

"
X

j0

k

labor markets constraints in all locations,
XZ
k
Lkj
n (z)dΦ(z) = Ln ,

(50)

j

where labor supply in each city, Lkn , satisfies
Lkn = P

Akn Cnk

n0

ν

Akn0 Cnk0

ν Lk ,

the resource constraints in the use of land and structures,
XZ
Hnj (z)dΦ(z) = Hn ,

(51)

(52)

j

as well as non-negativity constraints applying to both household consumption of different
goods and input flows:
Cnkj ≥ 0 and Qjn0 n (z) ≥ 0.
From the resource constraint on local labor markets (50), and P
the labor
R kjsupply condition
(51), it follows immediately that national labor markets clear (i.e., n,j Ln (z)dΦ(z) = Lk ).

C.1

Solving the Planner’s Problem

We solve the Planner’s problem for interior allocations, (i.e., where Cnk and Lkn are strictly
greater than zero for all n and k). For each city n and sector j, let Pnj be the Lagrange
multiplier corresponding to the final goods resource constraint in city n, sector j (48), Pen the
multiplier corresponding to the aggregation of sectoral goods in each city (47), and pejn (z) the
multiplier corresponding to the intermediate goods resource constraints (49). For each city
n and occupation k, let wnk be the multiplier corresponding to regional labor market clearing
(50), Wnk the multiplier corresponding to the definitions of employment in each occupation
and sector (51). Finally, for each city n, let rn denote the multiplier corresponding to market
clearing for structures (52).
The first-order conditions associated with the planner’s problem are:
Cnk
= Pnj Lkn ,
∂Cnkj : Pen αj kj
Cn
73

(53)

which also defines an ideal price index,
j

 α
Pen Y Pnj
Pn = k =
.
j
Ln
α
j

(54)

In addition,

∂Cnk

k

:



φ Γ

! ν1

 X
ν−1
ν

Akn Cnk


k ν

Akn0 Cn0

P

n0

ν

ν
Akn0 Cnk0

n0

1 k
L
Cnk

(55)


N
k
k
X
∂ζ
0 C
n
Wnk0 ,
= Lkn Pn −
k
∂C
n
n0 =1
where
∂ζnk0

k

C
∂Cnk



 




Lkn
1 − k Lk if n0 = n
   LkL  n
.
=
0
k
n0
 − νk
if
n
=
6
n
L
n
C
Lk
ν
k
Cn

n

Also
∂Lkn :

J
X

Pnj Cnkj − w
enk + Wnk = 0.

(56)

j=1

where
w
enk = wnk

(57)
"

+

∂zn
XZ

j
Hnj (z)βn

hP

k00

00
λkn j

00
(Ln ) Lkn j


i −1
(1−βnj )
 −1

(z)

#γnj

J
Y

j 0 =1

∂Lkn

j

0

j0 j

Mnj j (z)γn

p̃jn (z) dz

denotes the total social marginal value of an extra worker of type k in city n. On the
production side, efficient allocations dictate

i− 1
h
1

 Qj 0 (z) > 0 if κj 0 pej 0 (z) = Pnj (Qjn ) η PN0 Qj 0 (z) η dΦ(z)
nn n
nn
n =1 nn
∂Qjnn0 (z) :
.
(58)
hP
i− η1
1

N
j
 Qj 0 (z) = 0 if κj 0 pej 0 (z) > P j (Qj ) η
dΦ(z)
n
n
nn
nn n
n0 =1 Qnn0 (z)
j
j
This last equation delivers efficient trade shares, πnn
0 , and prices, Pn , using the usual Eaton
and Kortum derivations. In addition,
1−

j
∂Lkj
n (z) : γn (1 −

qnj (z)
βnj ) kj
Ln (z)

k
wn
kj
λ
(L
( n n ))


P

k0

74

j

k0
wn
kj
λn (Ln )

(

)

k

1− pen (z) = wn dΦ(z),

(59)

∂Hnj (z) : γnj βnj
0

0

∂Mnj j (z) : γnj j

qnj (z) j
pen (z) = rn dΦ(z),
Hnj (z)
qnj (z)

pejn (z)
j0j
Mn (z)

0

= Pnj dΦ(z).

(60)
(61)

With the usual manipulations of these equations, one obtains
pejn (z) ≡ pjn (z)dΦ(z) =
where

xjn dΦ(z)
,
zn

γnj
j
n


1−  1−β
1−
 j X
 Y  γnj0 j
k
w
0

n
j  βn 
j

 
xn = Bn rn
Pnj
,

kj


0
λ
(L
)
n
k
j
n

(62)



(63)

and Bnj is defined as above.

D

Characterization of the Planner’s Solution

In the decentralized equilibrium, the budget constraint of a household of type k in city n
satisfies
Pn Cnk = wnk + χk ,
where χk = bk

P

Pn

0

rn0 Hn0

n0 ,j

Lk,j
n0

. In contrast, we now show that the consumption of a household of

type k in city n implied by the planner’s solution satisfies
ν
Pn Cnk =
w
e k + χk + R k ,
1+ν n
and recall that w
enk is the social marginal product of labor associated with occupation k in
city n.
Proof:
Equation (55) may alternatively be expressed as
Lk
φk v k nk
Cn

=

Lkn Pn


−

ν
Cnk



Lkn Wnk

 k 
N 
X
ν
Ln0
+
Lkn Wnk0 ,
k
k
C
L
n
n0 =1

where v k is defined in equation (16). Alternatively, we have that
X  Lk 0 
k k
n
ν
φ v −
Wnk0 = Pn Cnk − νWnk .
k
L
n0
|
{z
}
(1+ν)(χk +Rk )

Substituting for Wnk from (56) in this last expression gives

Pn Cnk = ν w
enk − Pn Cnk + (1 + ν) (χk + Rk )
75

or
Pn Cnk =

ν
w
e k + χk + R k .
1+ν n

(64)


Observe that we can also use (56) to write χk + Rk as a function of prices, w
enk , Pn , and
consumption, Cnk . In particular,



φk v k
ν X Lkn0
k
k
k
k
0C 0 .
χ +R =
−
w
e
−
P
n
n
n
1 + ν 1 + ν n0
Lk
We can then obtain an expression for the total consumption expenditures of households of
type k by adding (64) across cities n, with the expression for χk + Rk substituted in,
X
φk v k Lk =
Pn Cnk Lkn .
(65)
n

Substituting out φk v k back into the expression for χk + Rk and rearranging, we obtain
P
k k
X ν k  Lk 
n
k
k
n Pn Cn Ln
−
w
enk .
χ +R =
k
Lk
1
+
ν
L
n

P
P
Finally, note that n,k Pn Cnk Lkn = n,k wnk Lkn + rn Hn , so that
X

Lk (χk + Rk ) =

X

k

n,k

X ν
X

1
wkn Lkn −
w
enk − wnk Lkn +
rn Hn
1+ν
1
+
ν
n
n,k

(66)

k

The individual values for χ are determined to be such that equation (66) is satisfied.

D.1

The Social and Private Marginal Value of Workers of type k
in city n (Proof of Lemma 1)

Solving the derivative in the equation defining the social value of workers of type k in city n
(57), we obtain
w̃nk − wnk
"
∂znj
=

j
βn

Hnj (z)

XZ

hP

k0

k0 j

k0 j

λn (Ln ) Ln


(1−βnj )
1− 1 i 1−
(z)

#γnj

J
Y

j 0 =1

∂Lkn

j

0

j0 j

Mnj j (z)γn

pjn (z)dΦ(z)

where pjn (z)dΦ(z) = p̃jn (z). This expression is equivalent to

w̃nk − wnk =

X
j,k0

(1 − βnj )γnj P

k

k0
wn
0
λkn j (Ln )


00

1−

k00
wn
k0 j
λn (Ln )

0

∂λkn j (Ln ) j
qn (z)pjn (z)dΦ(z).
1− k0 j
k
∂L
λn (Ln )
n
1

76

Rearranging and integrating equation (59) yields
 k 1−
wnk Lkj
n

= (1 −

βnj )γnj P

wn
λkj
n (Ln )



k0

Z

k0
wn
k0 j
λn (Ln )

qnj (z)pjn (z)dΦ(z),

1−

so that the expression for the deviation of private from social marginal product of labor
simplifies further to
0

0

w̃nk − wnk =

X
j,k0

D.2

0

wnk

Lkn j ∂ ln λkn j (Ln )
.
Lkn
∂ ln Lkn

(67)

Implementation (Proof of Proposition 1)

We now discuss the implementation of the optimal policy. One possible implementation is
to combine a direct employment subsidy to firms that is specific to cities and occupations
(∆kn ), a linear occupation-specific labor income tax (tkL ), combined with occupation-specific
transfers (Rk ).
With externalities in occupations, the social and private marginal products of labor differ.
The first step in the implementation of optimal allocations, therefore, is to subsidize firms in
different locations to hire different occupation types. We define w
enk to be the after-subsidy
wage associated with workers in occupation k living in city n such that
w
enk = wnk + ∆kn ,
where ∆kn is a per-worker subsidy offered to firms in city n hiring workers of type k. With
these subsidized wages in place, we take advantage of various additional taxes and transfers
to implement optimal allocations. In particular, equation (1) becomes
Ink = (1 − tkL )w
enk + χk + Rk , (N K eqs.) .
where transfers have to be such that the government budget balances,
X
X
X
Lkn Rk =
tkL wnk Lkn −
(1 − tkL )∆kn Lkn .
n,k

We also have that labor demand depends only on pre-subsidy wages, wnk ,
 k 1−
wn

λkj
n (Ln )
k kj
γnj 1 − βnj pjn (z)qnj (z), (N KJ eqs.)
wn Ln (z) = P 

1−
k
wn

xjn

 j
= B rnβn
j

(69)

n,k

n,k

k0

(68)

(70)

0

k j
λn
(Ln )
j

"

X
k

n
1− # 1−β
1−

wnk
λkj
n (Ln )

77

γnj

 0 γnj0 j
 Y
Pnj
,

j0

(71)

Definition 1. An equilibrium with taxes and transfers is defined as the equilibrium without
taxes and transfers but with the additional conditions that i) Ink is given by equation (68),
ii) the first-order condition describing intermediate goods producers’ labor demand is given
by (70), iii) the cost index xjn is given by (71), and iv) the government budget constraint
(69) is satisfied.
Proposition. Let
1
1+ν
X 0 Lk0 j ∂ ln λk0 j (Ln )
n
∆kn =
wnk nk
L
∂
ln
Lkn
n
k0 j
tkL =

and Rk such that
φk v k Lk =

X

Pn Cnk Lkn .

n

Then, if the planner’s problem is globally concave, the equilibrium with taxes and transfers
implements the optimal allocation.
Proof. 1) The first order condition for household consumption choice (15) is identical to the
first order condition for consumption in the planner’s problem, (53). The modified budget
constraint for the household (68) implies a relationship between consumption and prices
identical to equation (64), which is derived from the first order conditions (55) and (56) in
the planner’s problem. At the same time, the optimal location decision for the household,
(2) is identical to the free mobility constraint in the planner’s choice (51) for a given set of
consumption Cnk .
2) The first order condition for factor demand for intermediate input producers, (17),
(19) and (70) are identical to the first order conditions for the planner’s problem (59), (60)
and (61) once one uses equation (67) to substitute w̃nk out of (59).
3) The condition that a producer in city n and industry j imports a variety z from city
0
n if and only if κjnn0 pjn (z) = minn00 κjnn00 pjn00 (z)
by the first order condition for the
Pis implied
j
j
planner’s problem (58), given that Qn (z) = n0 Qnn0 (z), p̃jn (z)dΦ(z) = pjn (z).
4) The first order condition associated with the optimalP
use of different varieties by final
j
goods producers (24) is implied by (58) given that Qn (z) = n0 Qjnn0 (z), p̃jn (z)dΦ(z) = pjn (z),
and Pnj (z) = minn0 κjnn0 pjn0 (z).
5) The market clearing conditions for employment (equation 5), structures (equation 6),
final goods (equation 7) and intermediate goods (8) are identical to the resource constraints
faced by the planner, respectively, (50) combined with (51), (52), (48) and (49).

E

A Counterfactual Economy After Eliminating Endogenous Amenities

We now verify whether the planner solution would be likely to change if one were to adjust local amenities to remove the components that Diamond (2016) argues are likely to be
78

endogenous. For that purpose, we carry out two counterfactual exercises. For both exercises, we first extract the exogenous component of amenities as implied by the mapping of
Diamond’s (2016) estimates into amenity spillovers described in Fajgelbaum and Gaubert
Q k0 τak0 k k
k,exo
(2018). Specifically, we calculate a value of Ak,exo
such
that
A
Ln
Cn = 1, with
n
n
nCN R,nCN R
CN R,nCN R
nCN R,CN R
CN R,CN R
= −0.43. In the
= 0.18 and τa
= −1.24, τa
= 0.77, τa
τa
first exercise, we calculate a counterfactual equilibrium where the labor supply equations are
k
k −ν
Cn
(Ak,exo
)
n
k
given by Ln = P
k . In the second exercise, we calculate the optimal allocation
k,exo k −ν
Cn0 )
n0 ( A n0
in that counterfactual environment.
Figures 21 below show the relationship between relative the exogenous part of amenities
implied by that exercise and city size and composition, further discussed in the text.
Figure 22 shows how the distribution of CNR workers in the optimal allocation compares
with the counterfactual equilibrium. As in our baseline economy, the planner has an incentive
to increase labor market polarization by concentrating proportionately more CNR workers
in larger cities. Figure 23 shows that, as in our baseline analysis, this increased polarization
is matched by transfers from the large cities to the small ones.

F
F.1

Quantifying the Model for 1980 and Counterfactual
Exercises
Quantifying the Model for 1980

In order to quantify the model for 1980, we follow similar steps as described in Section B,
with modifications to accommodate data constraints.
Regional Price Parities data are not available for 1980. In the baseline model quantification, we used those in order to calculate the productivity of the non-tradable sectors.
To obtain the productivity of the real estate sector in 1980, we match instead changes in
CoreLogic HPI data, available by county. As for the productivity of the non-tradable sector,
we assume that its spatial distribution does not change. In addition, the model inversion
exercise carried out for our 2011-15 benchmark does not pin down the national average level
of productivity for each industry, only its occupational and spatial variation. In order to
obtain the time variation of those levels, we choose average 1980 productivity levels to match
national level sectoral price series made available by the BEA.
To obtain wages and the occupational composition of cities and industries, we use the
5% sample of the 1980 Census data which is comparable to the ACS. The 1980 Census has
data for 213 MSA’s that account for approximately 85% of US employment in that year.
For the remaining MSA’s, we impute wages and employment by occupation and by sector
by taking the predicted values of a regression of those variables on 1980 CBP employment
by sector and housing prices.

79

0.6

Slope: -0.74
Correlation: -0.2

0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
0.15

0.2

0.25

0.3

0.35

0.4

0.45

0.5

0.55

CNR share of city employment
Slope: 0.12
Correlation: 0.68

0.6
0.4
0.2
0
-0.2
-0.4
-0.6
-0.8
-1
9

10

11

12

13

14

15

16

log city employment
Figure 21: Relative amenities and city size and composition (exogenous part)
Ratio of occupational-specific amenity parameters for each city obtained after extracting
the endogenous part of amenities implied by the parametrization used by Fajgelbaum and
Gaubert (2018). Each observation refers to a CBSA. Marker sizes are proportional to total
city employment.

80

Difference in CNR
share

0.6
0.4
0.2
0.0
-0.2
0.2

0.4

0.6

Difference in CNR
share

CNR share in
counterfactual
ρ = 0.704 , ρ̃ = 0.825
0.6
0.4
0.2
0.0
-0.2
11

Difference in CNR
share
Population

< -0.274

[ -0.137 , 0 )

[ 0.137 , 0.274 )

[ -0.274 , -0.137 )

[ 0 , 0.137 )

>= 0.274

44,000

1,816,000

12

13

14

15

Log population in
counterfactual
ρ = 0.414 , ρ̃ = 0.606

3,587,000

R
Figure 22: Optimal LCN
/Ln with counterfactual amenities (change from counterfactual
n
equilibrium)

Each marker in the map refers to a CBSA. Marker sizes are proportional to total equilibrium
employment in each city. ρ and ρ̃ are unweighted and population weighted correlations
respectively.

F.2

Details of Counterfactual Exercises

In the counterfactual exercises described in Section 6, we separate average changes in productivity or amenities from their geographical and occupational dispersion.
The first step is to study the consequences of changing factor shares. We focus on the
consequences of those changes to factor demand, while keeping unit costs in individual cities
and industries fixed. This exercise implies a set of alternative productivity parameters for
1980, which we then take as our base for comparison with the current period.36 Productivity
γnj (1−βnj ) 37
βj
changes then refer to changes in Tnkj = (Hnj ) n λkj
. The average change in producn
tivity between 1980 and 2011-15 for a given industry is a Tornqvist type index: a geometric
weighted average of the changes in productivity across cities, with the weights given by the
value added by each city/industry as a fraction of total industry value added. Those shares
are first calculated separately for the 1980 and 2011-15 periods, and the weights correspond
36

One advantage of this procedure is that, given that changes in factor shares can be city-specific, implied
productivity changes may otherwise depend on scaling parameters adopted for the different inputs.
37
Specifically, when calculating the average change
sector j and occupation k

 in productivity for ka given

P
w Lkj
we set ln Tnkj,counterfactual = γnj n0 ωnkj0 γ1j ln Tnkj0 , where ωnkj = P nwknLkj , and analogously for other
n0
n0
n0 n0
averages.

81

Net transfers
(per capita, ’000s)

15
0
-15
-30
0.0

0.2

0.4

0.6

0.8

Net transfers
(per capita, ’000s)

CNR share in
counterfactual
ρ = -0.88 , ρ̃ = -0.94
15
0
-15
-30
11

Net transfers (per capita)
Population

< -20500

[ -10300 , 0 )

[ 10300 , 20500 )

[ -20500 , -10300 )

[ 0 , 10300 )

>= 20500

44,000

1,816,000

12

13

14

15

Log population in
counterfactual
ρ = -0.587 , ρ̃ = -0.741

3,587,000

Figure 23: Optimal transfers in with counterfactual amenities
Optimal transfers defined as the difference
allocation between the value conP in the optimal
P
sumed and value added in each city ( k Pn Cnk − k w − nk Lkn − rn Hn ). Each marker in
the map refers to a CBSA. Marker sizes are proportional to total equilibrium employment
in each city. ρ and ρ̃ are unweighted and population weighted correlations respectively.

to the arithmetic average of those shares.38
The model does not allow us to pin down an aggregate trend in amenities since changing
amenities in all cities by a common scaling parameter leaves the equilibrium unchanged.
We thus assume that there was no such trend so as to focus on the welfare implications of
endogenous changes in equilibrium variables. For the baseline economy, this implies keeping
a Tornqvist type index of amenities constant relative to the 2011-15 period: specifically,
we keep a weighted geometric average of changes in amenities equal to 1, with the weights
given by employment shares by city (again the shares are taken for the baseline and 2011-15
periods separately and the weights are given by an arithmetic average).

38

We carry out a similar calculation in order to obtain productivity trends by city/industry/occupation

82

Change in Gini coefficient from
data equilibrium to optimal allocation

0.15
0.1
0.05
0
-0.05
-0.1
-0.15
9

10

11

12

13

14

15

16

Log population in data
Figure 24: Changes in the Gini coefficient between the data and optimal allocation.
Each observation refers to a CBSA. Marker sizes are proportional to total employment. The
solid-black line is a cubic fit of the data. The Gini is constructed using the Lorenz curves
depicting within city wage bill and industry rank.

83