View original document

The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.

Federal Reserve Bank of Chicago

Cyclical Transactions and
Wealth Inequality
Jung Sakong

January 26, 2022
WP 2022-05
https://doi.org/10.21033/wp-2022-05
*

Working papers are not edited, and all opinions and errors are the
responsibility of the author(s). The views expressed do not necessarily
reflect the views of the Federal Reserve Bank of Chicago or the Federal
Reserve System.

Cyclical Transactions and Wealth Inequality*
Jung Sakong†

January 26, 2022

Abstract
Wealth is distributed more unevenly than income, and one contributing factor might be that
richer households earn higher portfolio returns. I uncover one channel that causes portfolio returns to
be increasing in wealth: Poorer households consistently buy risky assets in booms—when expected
returns are low—and sell after a bust—when expected returns are high. Although time-varying
expected returns are a robust empirical fact, theories are ambiguous on whether poorer or richer
households engage in such cyclical trading patterns. I estimate the trading patterns for households
across wealth levels, in the US housing market for 1988-2013. I interact housing ownership patterns
from deeds records with household-level wealth, which I infer from merging owners’ surnames
with their name-based income in the 1940 full Census. The estimated dispersion in expected returns
from this “buy-high-sell-low” channel is large: The interquartile-range difference is 60 basis points
per year. The channel predicts that geographies with historically higher volatility will feature more
wealth inequality than income inequality: I verify this implication in the data. These results suggest
that a government policy intended to boost poorer households’ wealth via homeownership can
backfire if it ignores the status of house prices.

JEL classification: D15, D31, E32, G11, R21
Keywords: Wealth inequality, business cycles, real estate asset, timing of transaction, wealth return
* I am extremely grateful to my committee chair Amir Sufi and members Marianne Bertrand, Raghuram Rajan and
Luigi Zingales for their continuous guidance and support. I also thank Greg Buchak, Peter Chen, Lancelot Henry de Frahan,
Lars Hansen, John Heaton, Erik Hurst, Adam Jorring, Paymon Khorrami, Sehwa Kim, Helen Koshy, Simone Lenzu, Stefan
Nagel, Krisztina Orban, Canice Prendergast, John Shim, Dan Sullivan, Ana-Maria Tenekedjieva, Douglas Xu, Alex Zentefis,
Xiao Zhang, Eric Zwick, as well as seminar participants at the Corporate Finance Reading Group, Economic Dynamics
Working Group, Finance Brownbag and Stigler Lunch Workshop, for helpful suggestions. All errors are my own. The
views expressed in this paper are those of the author and do not reflect those of the Federal Reserve Bank of Chicago or the
Federal Reserve System.
† Federal Reserve Bank of Chicago (email: jung.sakong@chi.frb.org)

1

Introduction
Wealth is distributed more unevenly than income, even below the top 1%, which is the part of

the wealth distribution where the literature has focused most.1 One reason might be that the rate of
return on wealth increases in wealth. If that is the case, poorer households could earn a lower return in
two ways: (1) They participate less in risky assets that yield higher returns, or (2) they consistently
participate at the “wrong” times–when prices are high and expected returns are low. Many papers have
focused on the first channel. The second channel has received less attention.
In this paper, I use the US housing market to study this second channel. Constructing a new dataset,
I estimate the trading patterns of households across wealth levels. Lower-wealth households do indeed
consistently purchase housing when prices are high, and they sell when prices are low. I find that this
“buy-high-sell-low” channel has a significant impact on wealth accumulation: the interquartile range of
annual returns across wealth levels is 60 basis points.
Housing, especially ownership of a primary residence, is often seen as a vehicle for accumulating
wealth by middle- and lower-wealth households.2 Campbell (2006) shows that housing is the asset
class with the highest share of total assets between the 30th and 96th percentiles of the total-asset
distribution. Housing may help wealth accumulation for multiple reasons. One is that present-biased
individuals may benefit by tying up wealth in an illquid asset like a house.3 Partly to encourage
wealth accumulation by the middle class, government policies have also encouraged and incentivized
homeownership at least since the 1930s.4 My findings caution government policies that encourage
buying a home, however. If such policies disproportionately incentivize home purchases when prices
are high, they can backfire by impeding wealth accumulation and worsening wealth inequality.
Before describing the empirical exercise, I should first clarify what I mean by poorer households
“buying high and selling low.” Given any data series, there will always be households who trade at the
“wrong” times ex post. In order to have a lasting impact on wealth accumulation, poorer households
must consistently buy when expected returns are low and sell when expected returns are high. If
expected returns were constant, poorer households might be unlucky in some periods, but this outcome
1 For

example, compared to the bottom 50% of the income distribution, the next 49% make 4.7 times as much in
income, but own 6.5 times as much in net worth, based on the 2013 Survey of Consumer Finance.
2 Charles and Hurst (2002); Wainer and Zabel (2019)
3A long literature on present bias and its implication for savings took off starting with Laibson (1997). In an earlier
work with co-authors, I found in field experiments that experimentally increasing the illiquidity of a savings account
attracted more savings from subjects (Beshears et al. (2015)).
4 Carliner (1998)

1

would balance out in other periods when they are lucky.
When expected returns are time-varying and predictable, however, households who consistently
buy high and sell low will earn lower expected returns in a way that can be anticipated. Whether
any household will regularly buy high and sell low is theoretically ambiguous, and some standard
examples give opposite predictions. For instance, if mortgage availability increases when prices are
high, poorer households might be more likely to buy because at other times they are rationed out of
the credit market. On the other hand, if prices rise in economic booms because investors perceive
overly-optimistic returns, richer households might be more likely buy when prices are high because
they have better capacity to take advantage of the higher expected returns. This theoretical ambiguity
justifies constructing a dataset and estimating who “buys high and sells low.”
To precisely measure who engages in what kind of trading behavior, a dataset that contains both
identifying information and observed actual quantities traded is needed. This is because even within a
broad asset class such as housing or stocks, there are actual assets that differ in how their prices behave.
Therefore, even if I find that poorer households’ housing wealth rises more, I cannot conclude that they
bought more housing units, because they may just own houses whose prices rise more. Luckily for
housing, all trades are publicly observable from deeds records. Private information beyond just names
and residential addresses is missing though. For this reason, the wealth of home buyers and sellers
needs to be imputed.
My empirical solution is to use the house ownership data and attribute wealth levels to surnames.5
Surnames are passed down through generations. Wealth levels can be estimated by surname using the
1940 full-count Census, which was the first Census to ask about income and is the last Census that is
publicly available in full detail, because the Census Bureau only releases a full Census after 72 years.6
In my concurrent work with a co-author, we find that the income averaged at the surname level from
the 1940 Census is a strong predictor of those surnames’ average-wealth levels today, constructed from
individual-ownership-level data (Henry de Frahan and Sakong (2020)).
Sorting surnames into percentiles using their historical income from the 1940 Census, I find that
5 Using

surname-level variation in wealth works in the US, because there are about 160,000 surnames with 100 or
more individuals. In China, by contrast, 100 most common surnames account for 85% of the population and hence using
surname-level variation would not be informative. In another paper, I use this latter fact to identify Chinese buyers in the
US housing market (?).
6“This ’72-Year Rule’ 92 Stat. 915; Public Law 95-416; October 5, 1978) restricts access
to decennial census records to all but the individual named on the record or their legal heir.”
https://www.census.gov/history/www/genealogy/decennial_census_records/the_72_year_rule_1.html?CID=CBSM+history

2

poorer households buy more housing (in quantity units) than rich households when prices increase. In
other words, lower income households have a higher sensitivity, or “beta”, in their choices of housing
quantity to price. The negative slope in beta along the wealth distribution is shown in Figures 2a and
2b.7
The overall negative relationship between these betas and the wealth level is driven by differences
between racial groups: Non-White population groups exhibit highly pro-cyclical ownership of housing.
By constrast, after controlling for the racial share at the surname level, the betas are slightly increasing
in wealth level. Two interpretations are possible: (1) Belonging to a racial-minority group may be an
independent predictor of low wealth, or (2) racial minorities may be particularly vulnerable to cyclical
downturns.8
Going back to the overall negative relationship between the betas and my proxies for wealth levels
(i.e., surname-level historical incomes from 1940), I wish to know how much dispersion in return
on housing is generated by the timing of trades? To convert the estimated betas into interpretable
differences in returns along the wealth distribution, I make two sets of transformations: First, I map the
wealth-proxies to the present-day percentiles in the wealth distribution, and second, I map the betas to
returns on housing.
To map each percentile of the 1940 income by surnames to the corresponding place in the presentday wealth distribution, I take two steps: (1) Using surname-level data on average primary residence
value in 2012-2013, I map each 1940-income-percentile to its future housing consumption; and (2)
To map housing consumption to the corresponding place in the wealth distribution, I estimate the
relationship between these two variables in the 2013 Survey of Consumer Finances (SCF). Combining
these two steps, I convert the the surname-level 1940 income to the present-day wealth percentiles.
To map the estimated, housing-quantity-to-price betas to returns on housing, I use the formulas
I derived linking these two quantities along with estimates of expected-return variations taken from
Cochrane (2011).
After conducting these transformations, I find that returns on housing go up 60 basis points per
7I

construct and use two samples: One that maximizes the number of counties covered and another that maximizes the
number of years. Sample selection is discussed in detail in Section 3.
8 In an earlier work, I found empirical evidence that economic downturns in a local geographical area cause racial
prejudice in that area to rise (Sakong (2018)). In another earlier work, I used close electoral victory of Black politicians as
an instrument for local areas’ racial prejudice against Black people, I found that such increase in racial prejudice caused
Black people’s employment to fall and mortgages to be denied more (?). Combining the two results, it is possible that
business cycles disproportionately affect racial-minority groups through counter-cyclical racial prejudice. Also see Bayer
et al. (2016); Bayer et al. (2017).

3

year between the interquartile range of the wealth distribution (Figure 4c).9
I connect the estimated return differentials to the level of wealth inequality using a wealth accumulation equation and a back-of-the-envelope calculation. Simple manipulations of this equation reveal
that two key factors largely determine how return differentials translate to wealth inequality above and
beyond differences in income. First, even if some households earn lower returns on wealth, their wealth
share does not vanish because labor income replenishes wealth; hence, the labor-income-to-wealth
share modulates the impact of differential returns on wealth. Second, this stabilizing effect of labor
income is itself softened by expenditures out of current income; hence, the consumption-expenditureto-current-income ratio matters. Using estimates of these two quantities from widely used survey data,
I calculate that the estimated 60-basis-point return differential explains roughly 20% of the observed
wealth inequality between the interquartile range in the US above the part attributable to income
inequality.
Beyond explaining part of wealth inequality in the aggregate, the “buy-high-sell-low” channel has
a cross-sectional prediction: Geographies with larger time-variation in expected returns in the housing
market should have greater wealth inequality, over and above income inequality. This is because in
those areas, even the same beta-differences will generate a greater dispersion in wealth returns between
rich and poor households. And the greater dispersion in wealth returns persists in the geographical area,
because households typically own housing assets near where they live even for investment homes and
because families are reluctant to move once settled. I test and confirm this cross-sectional implication
of the channel.
To test this cross-sectional implication, I first sort US counties by historical business-cycle cyclicality, which itself predicts how much expected housing returns would vary (Cochrane (2011)). Using a
new set of imputed inequality measures and controlling for labor income inequality, I indeed find that
current wealth inequality is greater in those areas with higher historical cyclicality.
Executing this cross-sectional test faces additional data issues: Data on wealth are rare, and there
are no existing measures for wealth-inequality levels across US geographies. I impute wealth-inequality
levels by metropolitan areas, by combining multiple administrative data sources on the assets and debts
held by US zip codes.10 The imputed, between-zip-code wealth-inequality measures correlate strongly
9 By comparison, Fagereng et al. (2016) look at total returns on financial wealth in Norway and find about a 1% return
differential between the interquartile range per year.
10 Mian et al. (2013); Saez and Zucman (2016)

4

with historical cyclicality (Figure 6a), consistent with the “buy high and sell low” mechanism of this
paper.
The rest of the paper proceeds as follows. Section 2 presents the theoretical framework connecting
housing transactions to wealth inequality. Section 3 presents the data, empirical methodology, and
estimation results. Section 4 presents the geographical cross-sectional results. Section 5 concludes.
The appendix contains theoretical derivations, additional empirical tests and data details.

1.1

Contribution to the Literature

This paper sits at the intersection between literatures on wealth inequality, household portfolio
choice, and business cycles. Relative to the wealth-inequality literature, I study a dynamic mechanism
and micro-found the return heterogeneity with micro-data evidence. Relative to the portfolio-choice
literature, I focus on the rebalancing. This focus is more important in highly incomplete market settings,
where a full set of Arrow-Debreu securities cannot be used to fully replicate all dynamic trading.
The literature on wealth inequality decomposes contributions to wealth inequality into three
categories: income inequality, differences in savings rates, and differences in returns generated on
wealth portfolios. This paper is in third category. Even within the literature on the heterogeneity
in returns on wealth, there are four broad subcategories: (1) By far the largest literature examines
differences in the average risky-asset-market participation, mainly in housing, stocks and pensions;11
(2) A largely structural literature uses heterogeneous-agent frameworks to quantify how much return
differential there must be in order to generate the observed wealth inequality, given observed data on
income inequality and savings rates, often without taking a stance on what would generate such return
differential1213 ; (3) Another literature targets the upper tail of the wealth distribution by exploring a
particular mechanism of entrepreneurial return; and (4) A recent empirical literature uses rich microdata on wealth holdings to calculate the actual returns earned on wealth by households across wealth
levels, thus far exclusively with financial assets.14 My paper differs in a few dimensions: (1) I estimate
11 Most theoretical mechanisms that have been proposed to explain this phenomenon are static in nature: Low-wealth
households do not participate in high-return activity (e.g., Campbell et al. (2018); Boulware and Kuttner (2020) for racial
gap).
12Wealth distribution is wider than income distribution (Bisin and Benhabib (2017)). To explain why, the literature on
wealth inequality finds an important role for return heterogeneity and return loading on wealth (Benhabib et al. (2015a);
Gabaix et al. (2016); De Nardi and Fella (2017)).
13Theoretical papers have used the heterogeneous-agent model to understand wealth inequality (Bisin and Benhabib
(2017); De Nardi and Fella (2017); Guvenen (2011); Heathcote et al. (2009); Quadrini and Ríos-Rull (2015)).
14 Recent empirical work is starting to find evidence that wealth returns are increasing in income and wealth (Bach et al.

5

returns to the housing portfolio; (2) I take a stance on what generates the differential expected returns,
and impute the expected-return differential from trading behavior; and (3) The mechanism I propose is
dynamic in that it deals with the timing of trades.
The literature on household portfolio choice has a huge sub-literature on housing-market participation. Relative to that literature, my paper focuses on the timing of trades, or “portfolio rebalancing”
or “active changes” to use language from the closest paper, Calvet et al. (2009). What enables me to
study this under-explored dynamic mechanism is the construction of a panel on housing ownership by
wealth levels.
The literature on business cycles and asset pricing has begun to incorporate the interaction between
time series volatility and the cross-sectional distribution. More of the literature has rightly focused
on how cross-sectional inequality affects the behavior of aggregate quantities such as asset prices
or gross domestic product (GDP). By contrast, I focus on how time series volatility can affect the
cross-sectional distribution, highlighting a potential two-way feedback loop.
Methodologically, this paper contributes to the greater housing literature in general. Being able to
identify ownership of housing by poorer and richer households over the annual frequency can be useful
for questions beyond the impact on housing portfolio returns and wealth: For example, in a work in
progress, I study the cyclical behavior of residential segregation by income and race, using the panel
data I construct in this paper.

2

What to Measure and What It Means
I first begin by spelling out the behavior that corresponds to consistently “buying high and selling

low” and that will be estimated in the data. A contrast with an alternative measurement may be most
helpful: For some sample period, I can observe ownership spells for which I observe both the purchase
and the sale; with these I can estimate a realized return on that set of trades. This realized return differs
from the average-return differential that leads to wealth inequality for three reasons.
First, conceptually I use variations in expected returns as opposed to realized returns. For example,
from 2007 to 2008, average US housing stock experienced a realized return of -8% (or -20% relative to
the time series average). But standing in 2007, based on just the rent-to-price ratio at that point and
(2016) in Sweden; Fagereng et al. (2016) in Norway; Garbinti et al. (2017) in France; Kuhn and Rios-Rull (2016) and Wolff
(2017) in the US).

6

what we knew from the return predictability literature, the expected return was 8% (or -4% relative to
the average). Because I study persistent "buy high, sell low" behaviors, I use the -4% expected return as
opposed to the -20% realized return. That is, given any finite time period, there will always be people
who buy and sell at the ex-post "wrong" time; but over the long run, differences in the unpredictable
realized returns would cancel each other out. Given a long enough sample, the realized returns will
average out to an average-return difference. In this paper I have at most 25 years of data, so I estimate
changes in expected returns directly.
Second and related to the first reason, I estimate how housing-ownership levels in quantity change
with house-price changes as well as how expected returns change with house-price changes, and
then multiply them. This is because I can compute expected returns only at some market level. More
specifically, I impute expected returns as a linear function of the rent-to-price ratio, which can only be
observed at some local level. Therefore, the formulas derived in this section specify how to convert the
observed housing quantities and prices to units of portfolio returns.
Third, I compute expected returns on the whole stock of housing owned, and not just on those
that are traded. The alternative measurement using actual trades would omit households who just hold
onto the housing and thus generate a 0% realized return. The return differential conditional on trading
would be higher, but for overall wealth inequality, I would have to average that with households that do
not trade.
For these reasons, I derive formulas that convert the sensitivity of housing holdings in quantity
to some market-level house prices to wealth returns, and ultimately to the contribution to wealth
inequality levels, in the first sub-section. The second sub-section highlights that in theory, richer
or poorer households could exhibit higher sensitivity of housing holdings in quantity to changes in
house prices. The last sub-section highlights how the conceptual object maps to the data, and what
complications arise when using data that do not meet the requirements.

2.1

How Would the Timing of Ownership Affect Expected Returns

I take a top-down approach, to start from a wealth-accumulation equation, to zero in on the term
that needs to be estimated in the data. The following assumption specifies how observed wealth is
accumulated.
Assumption 1 (Measured-wealth accumulation). Household i’s measured-wealth accumulation is
7

given by
dWit
=
Wit



Yit
Cit
−
Wit Wit



dt + ∑ θitk dRtk
|k {z }
≡dRit

for measured wealth Wit of household i in time t. Asset classes are denoted by k with investment
share θitk and return dRtk , labor income flow Yit and expenditure Cit inclusive of both paid rents and the
imputed user cost of housing (i.e. the opportunity cost for owner-occupants). In particular, assets k are
defined such that all households get the same return, namely, dRkit = dRtk ∀i.
First note that the income variable Yt here is only labor income, because the cash flow component of
capital incomes is included in the total returns dRtk . Non-durable expenditure Cit is defined to include
the user cost of housing for owner-occupants, as with rent paid for renters. For owner-occupants, the
hypothetical rents (i.e., cash flow) from dRtk of housing and the hypothetical user cost included in Cit
cancel each other out. The wealth Wt is all non-human wealth, inclusive of financial assets and real
assets (i.e., the wealth concept in Piketty (2015)). It is important to note that Wit is the measured wealth;
the equation above is an accounting equation.
The assumption is straightforward for investment assets and housing owned as investment homes.
Further explanations are necessary for owner-occupied housing, for which economists have explored
whether housing wealth is actual wealth. Changes in the current price of housing for owner-occupants
appear on both the income side (i.e., in dRtk for housing) and the expenditure side (i.e., the user-cost-ofhousing component of Cit ). There is no net effect on observed wealth. Changes in the future price of
housing that lead to changes in house prices show up only on the income side, in the form of house-price
appreciation in dRtk . In economic terms, the household is not richer, in that the increase in the house
price on the wealth side corresponds to the net present value of higher expected costs of housing
going forward.15 But in accounting terms, the house-price appreciation is nevertheless an increase in
measured wealth. In terms of accounting wealth, the equation is true for both owner-investors and
owner-occupants.
The accounting wealth Wit need not correspond to the economic concept of wealth. In particular,
endogenous policy functions Cit and θitk are functions of economic concepts of wealth. For example,
increases in Wit driven by an increase in the price of the primary residence for owner-occupants need
15 Sinai

and Souleles (2005)

8

not lead to an increase in Cit . How Cit behaves is important when translating an average return to a
level of wealth inequality later. For most of the paper, I deal with the accounting concept of wealth.
The empirical literature on wealth inequality uses the accounting, observed measure of wealth as well.
The key assumption here is that the assets k are defined such that all households earn the same
return on them (more precisely, they have to have the same expected return). Surveys often bundle
together assets (e.g., housing), but heterogeneous asset returns are likely, especially in an asset class
such as housing. How violation of this assumption can affect estimated average returns is discussed in
more detail below.
Lemma 2 (Return decomposition). The average return on wealth can be decomposed:

n  
 o 



E dRit − dRt = ∑ E θitk − E θtk E dRtk
k






 



k
k
k
k
+ ∑ cov θit , Et dRt − cov θt , Et dRt
|

{z
}
k 


"timing"

Note that the covariance is with respect to the expected return, Et dRtk . Importantly, the second term is
non-zero only if expected returns are time-varying.
The lemma follows immediately from the fact that the expectation of the product is the sum of the
product of expectations and the covariance, that is, E [θt Et (dRt )] = cov (θt , Et (dRt ))+E (θt ) E (Et (dRt ))
(i.e., first take the conditional expectation, and then take the unconditional one).
The first term on the right-hand side (the product of expectations) is the focus of an enormous
literature in economics, finance, and sociology on risky asset market participation. These papers


typically multiply an average differential in participation E θitk − E θtk by the expected return

E dRtk to obtain the the return differential.
The second term (the covariance) is the focus of this paper. If conditional asset returns were not
time-varying, that is, if Et dRtk = µ k , the covariance terms would drop out. With time-varying expected
returns, agents who increase exposure when expected returns are high accumulate wealth faster.
With time-varying expected returns, the magnitude of the variations in the expected return partly
determines the contribution of the timing of trades on average returns. While realized returns clearly
vary over time, whether expected returns vary over time and in predictable ways has been one of
9

the central questions in the asset pricing literature. In particular, Cochrane (2011) estimates expected
returns using the rent-price ratio by regressing the realized return,


Dk
k
log µtk ≡ log Et Rt+1
= ak + bk log kt
Pt
An extensive literature documents return predictability in the housing market, with different papers
using different house-price indices.16
The contribution of the trade timing to returns is in terms of wealth shares θitk , which are themselves
products of prices and quantities in units. The co-movement with expected returns consists of two parts:
passive and active change, mirroring the decomposition of risky portfolio share in Calvet et al. (2009).17
In particular, in response to price changes in asset k, the asset share θitk moves in the same direction in
the absence of active adjustment, and the expected return Et dRtk likely moves in the opposite direction.
The active change can offset or amplify the change in asset share θitk . I work with formulas linking
each piece to price changes d log Ptk to derive an easily computable formula.
Lemma 3 (Covariance approximation). Given that expected returns are a function of the rent-to-price
ratio, co-movement between kth asset share and the expected return can be decomposed as





    



k
k
k
k 
cov θitk , µtk ≈ E θitk E µtk 
cov
log
P
−
logW
,
log
µ
+
cov
log
Q
,
log
µ
it
t
t
it
t 

|
{z
} |
{z
}
passive

active

See Appendix for the derivation.
The formula shows that passive change induces a negative relationship between θ and µ. However,
with a big enough contrarian active change, the covariance can even be positive.
Proposition 4 (Return differential from active trades). The timing of active changes widens wealth
inequality if and only if


 

   cov log Qk , log Pk 
it
k
k
k
k
k
 t
acov θit , µt ≈ −b E µt var log Pt E θitk
k
var log Pt
16 Relatively fewer papers study return predictability in local housing markets with proper returns. The key data limitation

is measuring local rent levels at a frequency higher than the decade frequency, for the cash-flow component. In more recent
years, both the Department of Housing and Urban Development and Zillow have begun to release local rent indices. Several
papers study return predictability with log house price instead (e.g., see papers in the survey in Ghysels et al. (2013)). Other
papers that document return predictability in the housing market are Piazzesi and Schneider (2016) (using Zillow Research
for local house prices), Glaeser and Nathanson (2015) (using Federal Housing Finance Agency data for the price index
and Department of Housing and Urban Development data for rent), Campbell et al. (2009), Davis and Van Nieuwerburgh
(2015), and Cochrane (2011) for aggregate return.
17Table 4 in Calvet et al. (2009) discusses heterogeneity by household financial characteristics.

10

is increasing in wealth level and bk ̸= 0 (return is predictable in asset k).
See Appendix for the derivation.
The rest of the paper estimates the empirical elasticity,

2.2

cov(log Qkit ,log Ptk )
.
var(log Ptk )

Theoretical Ambiguity on Who Would Own When

Before addressing the empirical challenges in estimating what type of households would hold
more housing when prices are high, I first discuss how even standard theories predict that it could
be either poor or rich households. The theoretical ambiguity arising from commonly accepted forces
further justifies dealing with the empirical challenges. In this sub-section, I discuss the possibilities in
words; for a more rigorous discussion based on a standard consumption-savings problem with portfolio
choices, refer to the Appendix.
On one hand, suppose cyclical booms are accompanied by independent increases in the supply
of household credit. Such pro-cyclical expansion of credit may be driven by market forces or by
government policies that assist homeownership especially in boom years when budgets are plentiful.18
Increased access to mortgages and other household credit would disproportionately affect poorer
households who were likely to be closer to borrowing or collateral constraints. Therefore, pro-cyclical
credit supply would lead to poorer households buying and owning more housing in booms than in
busts, relative to richer households.
On the other hand, suppose cyclical booms are accompanied by widespread expectations of higher
asset returns, for example, because households mistakenly believe high past asset returns will continue
on.19 Again, if poorer households are closer to borrowing or collateral constraints, richer households
are better positioned to change portfolio holdings to take advantage of the perceived higher expected
returns in booms. Therefore, extrapolative expected returns in the housing market would lead to richer
households buying and owning more housing in booms than in busts, relative to poorer households.20
Even widely studied theories on what happens over business cycles have opposing predictions for
what types of households would own assets more in booms versus busts, and it is possible that both
forces are simultaneously at play. Therefore, who accumulates wealth faster through the timing of
18 Rajan

(2011)
et al. (2015)
20This implication can be seen in Kaplan et al. (2017), where a positive shock to expected house-price appreciation
leads to a lower homeownership rate.
19 Barberis

11

trade is an open, empirical question.

2.3

Empirical Challenge in Estimating the Timing of Ownership

Whether rich or poor households trade against expected returns is an empirical question that cannot
be conclusively answered with any accessible, off-the-shelf dataset. This section briefly discusses what
issues arise if I attempt the estimation in commonly used datasets.

Estimating the timing-of-ownership, cov θitk , Et dRtk , is difficult in the Survey of Consumer Finances (SCF), for example, where we observe only the product of price and quantity, because of
the assumption, dRkit = dRtk ∀i. That is, dRi = ∑k θik dRk is true only if returns are common across
individuals. Data at such a level of disaggregation are rare: Wealth-tax data (e.g., Calvet et al. (2009))
and housing microdata are two examples.
With coarser asset class K (suppressing time subscripts where obvious),
dRi = ∑ θiK dRK
i
K

where dRiK retains the i superscript because the asset composition within the asset class K affects the
expected returns. The two terms can be decomposed into:
θiK =

∑ θik
k∈K

dRK
i =

θik k
∑ K dR
k∈K θi

θiK can be calculated in the SCF, with only wealth share, but dRK
i cannot be. Effectively, what I

K
do when I assume dRK for average return to asset class is ignore the variation in dRK
i − dR . This
omission would compress the dispersion of return heterogeneity, in the same sense that assuming
common dR instead of dRi does. Using only the coarse asset classes, the actual individual-specific
expected return terms are


 K  K


K
K
K
E θitK dRK
+ cov θitK , dRK
it = E θit E dRit + cov θit , dR
it − dRt


Both the return-on-asset-class-K earned by household i, E dRK
it , and the co-movement of that

K
household-i-specific return and share, cov θitK , dRK
it − dRt , cannot be estimated from the SCF.
If we use average return instead of individual-specific returns for coarse asset classes (let dRK

K
denote the average return on asset class K), the last term, cov θitK , dRK
it − dRt , can differ due to skill
12

or different portfolio risk profiles. Note from above that passive change makes cov (θ , dR) negative me
K
chanically. The discrepancy cov θitK , dRK
it − dRt will be more negative for poor households because
of passive change, if they live in areas with more volatile house prices.
Most importantly, asset pricing tells us that if covariance is driven by price dynamics, a tight


K
negative link exists between cov θitK , dRK (as well as cov θitK , dRK
it − dRt ) (i.e., household i takes


on more risk quantity) and E dRK
it (i.e., higher risk premium for taking on higher risky).
In the housing micro-data from deed records in which I get to observe the quantity, I can bypass
the issue discussed in this sub-section by dealing with quantity changes directly and multiplying
them using changes in expected returns. Furthermore, by going from aggregate housing returns to
location-specific returns, I get closer from K to k.
For a discussion of why this measurement is not possible in existing datasets such as survey data
and mortgage origination data, see the Appendix.

3

Estimating the Timing of Ownership by Wealth Levels
In order to find out who holds more housing in booms versus busts and to estimate the consequent

wealth-return gradient against wealth levels, in this section I construct the dataset, plot raw-data
patterns, estimate the housing-quantity-to-price betas by wealth proxies and finally convert those betas
into wealth returns.

3.1

Compiling the Dataset

For the main dataset, I merge a panel of housing ownership (by property and year) to the owners’
wealth levels, using surnames of the owners. Constructing the dataset is a non-trivial exercise, but it
allows me to observe unit-quantity-holdings of housing by wealth levels, which is essential to estimate
the “buy high, sell low” channel. On the asset side, the housing data from CoreLogic (more details
below) are comprehensive, disaggregated and reliable. What was missing had been the identity of the
owners, buyers and sellers, for whom we see the names and their mailing addresses, but do not have
readily usable covariates such as income, wealth or race. The essence of the dataset-construction is in
extracting such covariates from the names and the mailing addresses.
The final dataset is dominated by survey data on the precision of the owner characteristics, but is

13

superior on the asset-side details. For studying dynamic portfolio choice, for which one major challenge
is in dealing with assets of different price dynamics, this trade-off is useful.21
After describing the CoreLogic data and samples in the first subsection, I explain in detail how
I infer household characteristics from names. I then describe the data sources with demographic
information on names, followed by validation exercises.
3.1.1

CoreLogic Data and Samples

I first describe the CoreLogic data on housing and how I select the main samples for the analyses
in this paper. CoreLogic is a private data-provider that acquired DataQuick, which compiles public
records on housing assessments and transaction deeds from various jurisdictions into a unified data
set. I use two components of the CoreLogic (formerly DataQuick) data: The assessor file and the
transaction deed records. I describe each component in turn.
The assessor file collects a single cross section in 2012-2013 of property assessment, for assessing
the amount of property tax. Because the purpose of the assessment is to assign a value to the property,
there are a lot of details on the property (e.g., number of bedrooms, total square footage, number of
floors, whether it has a view) and its value (e.g., assessment value, the value it would get if sold on the
market, the latest actual transaction value). Each property is also associated with the owner’s name, the
type of owner (individual or institutional), whether it is a primary residence, and the mailing address of
the owner. Other versions of the CoreLogic data contain multiple years of cross sections, but I have
access to only a single year. The assessor file contains roughly 104 million records, from jurisdictions
covering roughly 94% of the US population.
The transaction-deed records are the official records that are signed and mailed when some
transaction takes place involving a real estate property. Transactions that could lead to a deed record
are sales of an existing property, sales of newly constructed property, mortgage originations, etc.22 I
focus on deeds that result from an ownership transfer, whether of an existing or a new property. For
each transaction, the deed data contain the date of transfer, the value of transaction, and the names of
both buyers and sellers, among a few other details.
These CoreLogic data come from multiple jurisdictions. Jurisdictions are based on counties,
21This dataset is also useful in questions that require more detail on the asset side. Another example is residential
segregation. In an ongoing project, I use the same dataset to study the high-frequency dynamics of residential segregation
by wealth and race.
22 Recording a deed is not required in every place, but it is almost always done.

14

and most jurisdictions are unique within a county.23 In particular for the transaction-deed records,
jurisdictions are added to the CoreLogic database over time. I know when a jurisdiction enters the
database, after which I have the full set of transactions that took place with the properties in that
jurisdiction.
I want to construct a balanced sample of properties to track who owns them over time. Over time,
more jurisdictions are added to the CoreLogic database. At the same time, I do not want the results to
be driven by the selection of jurisdictions over time; I want to form samples of jurisdictions for which I
can construct consistent panel. There is a trade-off: For a longer time series, I am forced to use fewer
jurisdictions, whereas to use more jurisdictions, I am forced to use a shorter time series. This trade-off
is shown in Figure A.1c: For each year in the x-axis, the figure plots the number of counties that would
be included consistently between that year and 2013 (in blue, dashed line; left y-axis) and the share of
total US population covered in that sample (in hollow circles and red, solid line; right y-axis). Figure
A.1c shows that many jurisdictions were added in 1996-1998 and then in 2004.
To maximize the amount of data used, I pick two samples. The first sample spans 1998-2013, and is
selected to cover the largest fraction of the population as possible, while giving a full picture of at least
one boom-bust episode. The second sample spans 1988-2013, and is selected to retain the longest time
period. The counties included in each sample are graphed in Figures A.1a (for the 1998-2013 sample)
and A.1b (for the 1988-2013 sample). The first 1998-2013 sample spans 36 states and more than 60%
of the US population; the second 1988-2013 sample spans 11 states and 21% of the population (Figure
A.1c).24 Even the broader, 1998-2013 sample contains counties that are more likely to be urban and are
not representative of the US as a whole (Figure A.1d shows that the house price boom-bust was larger).
For each sample, I construct a balanced sample of ownership, at the property and year level. For
each sample, I start from the annual cross section of ownership in the 2012-2013 assessor file. Then, I
work backwards and change owners when there is a transaction, using the transaction-deeds data. For
some properties, there are multiple ownership changes within a year. In constructing the annual panel,
I keep the owner on December 31 of each year.25
I omit properties that do not exist in 2012-2013. Properties that were constructed in the middle
23The

exceptions occur in six states: CT (21), MA (25), ME (37), NH (19), RI (8), and VT (18), with the average
number of jurisdictions per county in parentheses.
24The second sample contains 148 counties (674 jurisdictions) in AZ, CA, CT, MA, NC, NJ, NV, OR, RI, TN, and WA.
25 Higher-frequency panels can also be constructed (up to the daily frequency), and may be more useful for studying
cycles.

15

of the sample period appear throughout, but are not assigned owners until they are constructed and
transferred to individuals. How I deal with new constructions is important, especially given that the
cyclicality of constructions is stronger in some areas than others. This issue is intimately tied to the
empirical specification, and will be discussed in more detail.
In the main analysis, I only use the information that can be extracted from surnames. There are two
additional sources of information. First, for investment homes, where the owners live is informative
about how rich they may be, and residential address can be inferred from the mailing address (recorded
to receive the signed deed). Second, rarer full names can be matched to individuals for whom public
data are available. In particular, I have lists of financial brokers, medical doctors, corporate executives,
hedge-fund managers and politicians, by name. Individuals in these high-paying occupations are likely
richer than the population average. I use both of these auxiliary approaches in the online appendix.
3.1.2

Information on Surnames

In the main analysis of this paper, I use surnames in the housing records to infer the owners’ wealth
levels. The information on the surnames comes from two sources.
The first is the 1940 full-count Census, as processed in my earlier work with a co-author (Henry de
Frahan and Sakong (2020)). The 1940 Census is the latest full Census that has been released to the
public following the 72-Year Rule.26 It is also the first Census that explicitly asked for households’
income.27 In addition to the full names and household income, the full-count Census also contain
variables on rents paid for renters and value of homes for homeowners, years of education, race, and
where they live.
The main variable used to assign wealth levels to surnames is the household-level wage income.
To be precise, we average the wage incomes of households whose head has the surname Smith, and
assign that as a proxy for the average wealth of all Smiths in today’s data. In our earlier work, we
document that this variable is a strong predictor of various proxies for wealth in today’s data. For
example, it strongly predicts the average primary-residence value of Smiths who are homeowners
in the CoreLogic assessor data for 2012-2013 (Figure 1a and also in Henry de Frahan and Sakong
26“This

’72-Year Rule’ 92 Stat. 915; Public Law 95-416; October 5, 1978) restricts access
to decennial census records to all but the individual named on the record or their legal heir.”
https://www.census.gov/history/www/genealogy/decennial_census_records/the_72_year_rule_1.html?CID=CBSM+history
27 Previous waves of the Census do contain a variable called “occupational prestige score,” which basically assigned the
average income of the occupation to households by the household head’s occupation.

16

(2020)). The historical wage income is predictive of today’s wealth through both the human capital
(i.e., grandsons of high-income grandfathers are more likely to earn higher income today) and the
non-human capital (i.e., the higher-income grandfathers left more wealth for their descendants).
The raw estimations in this paper are at the surname-level (i.e., Mackenzies consistently behave
differently from the Smalls in the past 30 years). Interpreting the surname-level estimation for familylevel relationships requires additional assumptions. The full econometric framework to interpret the
surname-level estimation is discussed in our earlier work (Henry de Frahan and Sakong (2020)).
The second source of information on last names is the recent Census tabulations of surnames in
2000 and 2010. Recent waves of the Decennial Census include tabulations of surnames, for which
there are 100 or more individuals with that surname, along with those surnames’ composition by major
racial groups (Word et al. (2008)). In 2000, the criteria of having 100 or more people left 151,671
surnames and 242 million people covered by the surname data, relative to the total population of 282
million, implying an 85.8% coverage. In 2010, the same criteria left 162,254 surnames and 295 million
people, implying a 95.6% coverage. Two sets of variables are used from these data: the counts by each
surname and the shares that of the major racial/ethnic groups (Asian, Black, Hispanic and White).
The surname-level population counts are used for two purposes: (1) They are used as denominators
in computing the per-capita-housing holdings (i.e., I divide the number of properties held by Smiths
in the CoreLogic ownership panel by the total number of Smiths in the US from the Census); (2)
They are used to weigh the surname-level data. In the main analysis, I only use the surnames for
which I can observe the total counts in both 2000 and 2010. For the years other than 2000 and 2010,
I linearly interpolate and extrapolate in logs to get the population count (i.e., I assume a constant
population-growth rate by surname).
3.1.3

The Constructed Dataset

Here I briefly describe the structure of the constructed dataset. The structure is common across the
1998-2013 and the 1988-2013 samples.
Each balanced panel has observations at the property and year levels. Each property-year is
associated with a set of surnames, one for each owner in that year (often the property-years have just
one owner). For property-years with multiple owners, still the weights for that property-year add up to
one. For each surname, there is an associated surname-level average-household-wage income from

17

the 1940 Census and the corresponding percentile value (I will refer to this variable as the “1940
income percentile”). For property-years before the construction of the property, there is no associated
owner. Later, I collapse such a panel to “1940 income percentile” and year levels, to facilitate visual
examination and estimation.
Each surname is also associated with a total population count for the given year as well as racial
shares for major racial groups.
The main quantity unit is the number of properties, that is, each observation in the property-yearlevel panel has the same weight. In robustness analysis, I use two other quantity units: number of
bedrooms and square-footage. These quantity measures are available in the 2012-2013 assessor file
only. Therefore, each property is assigned the same number of bedroom sand the same square-footage
throughout the years.
3.1.4

Validation against Census Data

My methodological contribution is to use information on surnames to attribute wealth levels to
owners on housing records. While intuitive, the newly constructed dataset needs to be validated. Here,
I provide one additional validation against 2000 Census. More validation exercises can be found in my
earlier paper with a co-author (Henry de Frahan and Sakong (2020)).
Here is the main idea: I take the 2000 cross section from the constructed panel for the 1998-2013
sample, covering roughly 60% of the US population. For each zip code in the sample, I compute
the average 1940 income across all owners in that zip code, separately for owner-occupied housing
and investment housing. Note again that the average 1940 income was assigned to each owner in the
property-level housing data using the surnames. I take the zip-code-level averages of average 1940
income computed solely from surnames, and compare them to zip-code-level income measures from
the 2000 Census. Note that while I use the historical incomes from 1940 as a proxy for today’s wealth,
I use the zip-code-level income from the 2000 Census, because there is no comprehensive wealth
measure even in the Census.
I run three sets of validation regressions, with the results reported in Table 1.
The first set of tests are for averaged incomes on the CoreLogic side for owner-occupants only (the
first two columns of Table 1). I verify that the average 1940 household-wage income of homeowners in
a zip code (from CoreLogic) is correlated with the median household income from the Census. Column

18

(2) adds county fixed effects and the correlation is still strong.
The second set of tests are for averaged incomes on the CoreLogic side for investment homes
only (columns (3) and (4) of Table 1). For investment housing, there is a zip code associated with the
location of the property and a zip code associated with where the owner lives (the zip codes can be the
same too). I regress the Census 2000 income of the zip codes where the owners live on two variables:
the averaged 1940 income from surnames (from the CoreLogic data) and the Census 2000 income
of the zip codes where the properties are located. Column (4) adds county fixed effects. Here is an
illustration of what I verify: If a Mackenzie from a high-income neighborhood own a property in a
low-income neighborhood, I expect the wealth level attributed to Mackenzies to be associated with the
high income level of the where the Mackenzie lives.
The last sets are for all housing on the CoreLogic side (columns (5) through (7) of Table 1). The
right-hand side variables in these regressions are the two averaged incomes from the CoreLogic panel,
one for owner-occupied properties and another for investment properties. The prediction is that the zip
codes’ incomes from the 2000 Census should be more informative about the permanent income of the
owners who live in the area more than those that do not (i.e., investment-owners). The left-hand side
variables are from the 2000 Census: zip-code-level income for columns (5) and (6), and zip-code-level
median house price for column (7). Columns (5) and (6) show the area’s household income is more
highly correlated with the average 1940 income of owner-occupants, because those owners are the ones
whose incomes are reported to the Census. For home values in column (7), the two types of owners
have similar magnitudes of correlation.
Across the three sets of validation, the correlations are as expected and strong, adding confidence
to the use of surnames and their associated historical income from the 1940 Census in assigning wealth
or permanent income proxies to the housing owners in the CoreLogic data.

3.2

Raw-Housing-Ownership Patterns by Wealth Proxy

In this sub-section, I show raw-data patterns by transforming the CoreLogic housing panel constructed above, one step at a time. This sub-section serves two purposes: (1) show the differences in
housing ownership between rich and poor households are evident even from raw plots, and (2) show
exactly what variations in the data are used for the estimation of “betas” in the next sub-section (i.e.,
the elasticity of housing quantity to house price, by wealth levels).

19

Starting with the CoreLogic panel at the property and year levels for the 1998-2013 sample covering
a larger set of counties, I sum over the “1940 income percentile” assigned using the owners’ surnames,
to create an annual time series of total number of housing properties owned, for each of the 100 “1940
income percentile” groups.28 For each percentile group, I divide the total number of properties owned
by owners with surnames in that percentile group, by the total number of individuals in that percentile
group, to compute the per-capita ownership by number of properties. I will refer to this per-capita
number of housing properties owned as qit for percentile group i in year t. For alternative quantity
measures using number of bedrooms or square footage, see the online appendix.29
Raw time series: Figure 1b plots the average qit for selected decile groups (i.e. the second decile
includes percentiles 11-20), divided by the corresponding level in 1998.30 Higher deciles correspond
to surnames with higher historical incomes in the 1940 Census. In the plot, all holdings are increasing
over time because of new constructions.31 Beyond the broad-based increase in holdings, the rate of
increase is higher for the lower decile groups during the boom years up to 2007 (marked with the red
vertical line), then either decreases or plateaus afterward.
That poorer households bought more pre-2007 and less afterwards is evident from the raw patterns.
To calculate the differences in elasticity of quantity to price, however, I need to transform the data
further.
q

Cyclical variation in housing ownership: Figure 1c plots the residuals εit from
q

log qit = αi + αt + γit + εit

for “1940 income percentile” group i in year t. I describe each transformation in turn. The percentilegroup-fixed effect αi allows us to focus on changes rather than differential levels across groups.32
The year-fixed-effect αt allows us to exclude variations driven by new constructions; for the
purposes of understanding return differences between wealth groups, what matters is the differences
in when they own. Where new constructions occur is an important issue in and of itself, but it is
28 For

the raw patterns for the longer 1988-2013 sample for fewer counties, see the online appendix. The actual
estimation-results are included in the next sub-section for both samples.
29 Both raw-data patterns and estimation results are qualitatively the same when I use either the number of bedrooms per
capita or square footage per capita, instead of the number of properties per capita.
30 I plot the relative quantity, because higher-percentile groups have higher level of q throughout the sample period. See
it
the online appendix for qi,1998 . It looks just like the average-primary-residence values by percentile groups in Figure 1a.
31Also, given that the set of properties in my sample are those in the 2012-2013 assessor file, I omit properties that had
been in place but were not owned by anyone by 2012-2013. This omission would lead to an over-estimation of the growth
rate in the number of properties held by all groups.
32 It is similar to what I did when I plotted the raw time series relative to some base year.

20

orthogonal to observing who owns more when. First, even if new houses are more likely to be built
in poorer neighborhoods in booms, who owns those houses may be the new owner-occupants of that
area who switch from renting or rich owners who buy the new houses and rent them out. Second,
even if the poorer residents of those neighborhoods own the newly constructed units in booms, for
portfolio returns, it is true that poorer households are acquiring risky assets when expected returns
are lower. Because the outcome of interest in this paper is return differential and wealth inequality,
how households acquire housing is not central (although that question is extremely important in and of
itself).
The 1940-income-percentile-group-specific linear trend γi removes any long-term trends that differ
between the percentile groups. The goal is to remove the effects of long-term changes in population,
inequality and homeownership, which do not vary at the same frequency that expected returns vary at.
I discuss the rationale in more detail below, after I present the de-trended price time series.
After these transformations, the boom-bust patterns by wealth groups are more evident. Figure 1c
shows exactly the variation that will be used in the estimation in the next sub-section.
Cyclical variation in house price: Figure 1d plots the residuals ε̃tp from
log Pt = γ̃t + ε̃tp
where Pt is the national house-price index from CoreLogic.33 Because house prices are growing over
time, I remove the linear trend in logs, akin to the transformation for quantities.
I remove a log-linear trend from both the housing quantities and the house price series. There are
several reasons, but they all spring from having a finite sample period, for which realized returns can
differ from expected returns even after averaging. I use house-price level as the proxy for expected
returns (Cochrane (2011)). House-price levels are non-stationary, and ideally I could use the rent-toprice ratio, but the observed rent series does not correspond to the house prices the same way dividends
correspond to stock prices. In this context, taking out a linear trend is akin to assuming that the rent
series increases at a constant rate. This is effectively how Cochrane (2011) deals with the rent series.34
Similarly for quantities, suppose one group increases its holding of housing at a constant rate due to
secular changes in inequality. Over time, that group would earn no extra return due to the timing of
33 CoreLogic

constructs the house-price indices from the transactions micro-data that I use in this paper. The data
documentation states, “The CoreLogic HPI measures changes in housing market prices from 1976 to present. The HPI is a
repeat sales, value weighted, econometric Home Price Index Model. Base year is 2000 set at 100.”
34 He takes the rent value every ten years and interpolates for the years in-between.

21

trades. A log-linear trend removes this variation from the housing-quantity series. With the shorter
1998-2013 sample, using a linear trend to remove long-term trend may be problematic, for the precise
estimation. The 1988-2013 sample is longer and this problem is less egregious, but the short time-series
dimension of the panel is an over-arching issue in this paper.
The estimated “betas” in the next sub-section are basically the ordinary-least-squares (OLS)
q

regression-coefficients of εit (the quantity residual) on ε̃itp (the price residual).

3.3

Estimation of Betas

The raw plots show quantity time series for poor and rich households differ. The estimation turns the
transparent, visual relationship into one number that summarizes the co-movement between quantity
and price, by wealth levels. I basically regress the quantity-residuals plotted in Figure 1c on the
price-residuals plotted in Figure 1d, percentile-group by percentile-group.
For the shorter but wider sample covering 1998-2013, Figure 2a plots the estimated betas (that is,
the housing-quantity-to-price elasticities) βi for each percentile group i, from

log qit = βi log Pt + αi + αt + γit + ξit
where qit is the per-capita number of housing properties held by “1940 income percentile” group i in
year t, and Pt is the national house-price index. This estimation framework uses the same variation as
Hoopes et al. (2016).35 The estimated betas average to zero by construction.
Except for a few percentiles on the extremes, the estimated betas are decreasing in the “1940
income percentile,” which proxies for wealth. That is, poorer households hold more housing more
pro-cyclically. The pattern is especially pronounced for surnames in the bottom 20% of the 1940
average income distribution: For households with those surnames, a 10% increase in prices is associated
with more than 1% higher ownership of housing relative to the population average.
35To

see that I use the same variation as Hoopes et al. (2016), my specification is log qit = βi log Pt + αi + αt + γit + ξit .
I take first difference to obtain, ∆ log qit = βi ∆ log Pt + ∆ αt + γi + ∆ εit . Hoopes et al. (2016) run a two-stage estimation.
In the first stage, they estimate percentile means of log changes in quantity (i.e., ∆ log qit ) after regressing out time fixed
effects (this plays the role of ∆ αt ); in the second stage, they regress the percentile means estimated in the first stage against
a group-specific constant (i.e., playing the role of γi ) and changes in the aggregate state (i.e., playing the role of ∆ log Pt ).
The group-specific linear trends, and equivalently group-specific constant in changes in Hoopes et al. (2016), are more
justified in the daily-frequency setting of Hoopes et al. (2016). In the annual-frequency setting in this paper, the short
sample length makes the distinction between cyclical variation and long-term variation difficult. One solution is to obtain a
longer time series, which I plan to do in future research.

22

The relationship is similar for the longer but narrower sample covering 1988-2013 (Figure 2b).
We want to know whether the differences in cyclical ownership are specific to the recent boom-bust
episode or true more generally. To that end, I run the same estimation for the 1988-2013 sample, but
only for the 1988-2002 sub-period.36
Figure 2c shows that the betas are similar for the bottom 20% of the “1940 income percentile”
distribution, but for the top 80%, the betas are increasing, unlike for the full-period estimation.
These estimation results highlight two issues. First, the results are not robust for the 1988-2002
sub-period, for the top 80%. Second, the stark non-linearity is problematic: Because I have already
aggregated up to surnames and then to the corresponding “1940 income percentile” groups, even
non-linearity in the individual relationship would largely be smoothed out. The issues suggest that the
surname-level aggregation may be picking up variation apart from the differences in their incomes in
1940.
Figure 3a plots the White racial share in 2010 of the individuals with surnames in each “1940
income percentile” group. Surnames are informative about the racial composition (Henry de Frahan and
Sakong (2020)) and surnames that are more White included higher-income households in 1940. Most
notably, the White-share-to-“1940 income percentile” relationship shows the same non-linearity as in
the estimated betas, in the bottom 20%. I explore the variations in betas between-race and within-race
in the next sub-section.

3.4

Betas Between- and Within-Race

To see why this is a decomposition,

yi =

∑ αir + β xi + εi

r∈R

yℓ =

∑ γ r Sℓr + β xℓ + ε ℓ

r∈R

The estimated betas (i.e., housing-quantity-to-price elasticity) exhibited a non-linear pattern (Figure
2a) , which had a similar shape as the shape of each “1940 income percentile” group in the White
racial group (Figure 3a). Therefore, in this sub-section, I decompose the variation in the estimated
betas to the variation between-race and within-race.
36The

end-point 2002 was chosen arbitrarily, to include the most years before the 2000s housing boom starts.

23

For a quick summary, I find that the negative beta-wealth relationship is driven by the between-race
variation: Surnames that include more non-White individuals in 2010 have households who hold more
housing in booms (these are also surnames with lower average income in 1940). Conditional on the
non-White share, the relationship between the estimated betas and the “1940 income percentiles” is in
fact positive.
Before describing the details and presenting results, there is an important caveat in interpreting the
results in this section. The observed race indicator proxies for multiple conceptual factors, including
race per se (e.g., racial prejudice of Becker (1971)) as well as wealth and permanent income (e.g.,
White people may own more wealth than Black people conditional on wage income, even in 1940).
Therefore, it is misleading to interpret the results in this sub-section as independent contributions of
race and wealth. Rather, I interpret them as variation in the estimated betas between racial groups
(e.g., difference between White and non-White population groups) and within the racial groups (e.g.,
difference conditional on racial-group differences), treating races just as measurement. Moreover,
the composite relationship between betas and “1940 income percentile” (Figures 2a and 2b) is still
the overall relationship between the betas and wealth levels as predicted by surname-level historical
income.
Keeping the caveat in mind, I plot residuals from linear regressions of the 100 estimated betas
against the share of each percentile group that is White or against the actual percentile values, residualized by the other variable. That is, Figure 3b plots the residuals ξ˜i from β̂i = δ̃1 (1940 income percentile)i +
ν̃i for “1940 income percentile” group i, against the share of group i that is White in 2010, for the
shorter-but-broader-1998-2013 sample. Figures 3d and 3f plot the equivalent values for the longerbut-narrower-1988-2013 sample; Figure 3d is for the full period, and Figure 3f is for the sub-period
1988-2002.
By contrast, I plot the residuals from regressing the estimated betas on the White shares, against the
“1940 income percentile” in Figure 3c, for the shorter but broader 1998-2013 sample. The equivalent
plots for the longer-but-narrower-1988-2013 sample are presented in Figures 3e (for the full period)
and 3g (for 1988-2002).
That is, the fitted slopes in the figures on the left (Figures 3b, 3d and 3f) are δ0 , and the slopes in
the figures on the right (Figures 3c, 3e and 3g) are δ1 , from
β̂i = δ0 (share White)i + δ1 (1940 income percentile)i + νi
24

for the “1940 income percentile” groups i in the respective samples.
Put together, these plots show that across the samples and sub-sample, the pro-cyclicality of
housing ownership is decreasing in the White share but increasing in the “1940 income percentile”
conditional on the racial shares. That is, the overall negative relationship between the estimated betas
(the housing-quantity-on-price elasticities) and the wealth proxies is driven by the differences between
racial groups.
Consistent with the decomposition, the gap between wealth inequality and income inequality is
the widest between racial/ethnic groups (see Figures A.3a and A.3b for the wealth gaps and income
gaps from the Survey of Consumer Finances, for Black and Hispanic households respectively). An
enormous literature studies the Black-White gap in wealth (e.g., ??); see the online appendix for a
discussion of the racial wealth inequality.
As discussed above, the race indicator can be interpreted as race per se or as another proxy for
wealth. Distinguishing the two possibilities is beyond this paper, but I discuss one under-explored
possibility by which racial minorities may be disproportionately affected by business cycles. In an
earlier work, I found some empirical evidence that cyclical downturns cause racial prejudice of a
metropolitan area to rise (Sakong (2018)). In another earlier work, I use close electoral victory of Black
politicians in local elections as an instrumental variable that increase those areas’ racial prejudice, and
found that an increase in the local racial prejudice causes Black people to lose more jobs and face more
mortgage denials than White people (?). Putting these two together, business cycles may affect racial
minorities disproportionately, because racial prejudice itself is counter-cyclical. This possibility will be
explored more in future research.
For the rest of this paper, as discussed above, the overall relationship between the estimated betas
and the wealth levels as proxied for using the “1940 income percentile” is still valid. The quantification
exercises in the next two sub-sections use the overall relationship between the estimated betas and the
“1940 income percentile” groups. For the corresponding quantification exercises for the between-race
and within-race variations, see the online appendix.

3.5

Conversion from Betas to Return Differentials

Thus far, I have estimated how housing-ownership quantity co-varies with the national house-price
index differentially by the “1940 income percentile.” Both the betas and the wealth proxies do not

25

correspond to anything that can be interpreted with respect to portfolio returns or wealth inequality.
Therefore, in this sub-section, I convert the estimated beta-“1940 income percentile” relationship to a
meaningful relationship between the consequent return differential against today’s wealth groups. The
goal of this sub-section is to arrive at the following summary result: Between the interquartile range of
today’s wealth distribution, richer households earn an annual 60-basis-point higher return on housing
than poorer households, because of the differences in the timing of ownership.
I proceed in two steps. First, I translate the right-hand-side variable, “1940 income percentile,” to
today’s wealth percentile. This step takes two sub-steps: (1) I map the “1940 income percentile” to
the average home value among homeowners today; and (2) I translate the average home value to the
corresponding place in today’s wealth distribution, using a linear relationship estimated from the 2013
Survey of Consumer Finances (SCF). Second, I translate the left-hand-side variable, the “betas,” to a
return differential, using the first-order approximation in Proposition 4.
For the quantification in this sub-section, I use the estimate from the shorter-but-wider 1998-2013
sample. The estimates from the other sample are similar.
3.5.1

From 1940 Income to Today’s Wealth

I first translate the “1940 income percentile” to the predicted place in the 2013 wealth distribution.
The basic idea has two components: (1) For each surname, I can observe the average primary-residence
value in the 2012-2013 assessor file (Henry de Frahan and Sakong (2020)); and (2) Because richer
households live in more expensive homes, I infer how rich the surnames must be based on the value of
their first homes (Engel (1857)). I use the value of first homes rather than total housing wealth, because
rich households will live in more expensive homes even if they choose to invest in other assets for their
investment portfolios (Campbell (2006)).37
First, I translate the “1940 income percentile” groups to their average home values in 2012-2013, to
estimate how the estimated betas co-vary with today’s home values. This estimation takes the following
two-stage form: For “1940 income percentile” group i, I estimate
βi = γxi + εi
xi = Zi Γ
37While

only monotonicity is required, it helps for the second transformation that housing consumption also has an
income elasticity close to one (Henry de Frahan and Sakong (2020)).

26

where xi ≡ E [log home value | own]i is the average home value among homeowners in the 2012-2013
CoreLogic assessor file (Figure 1a), and Zi is a vector of dummies for each of the “1940 income
percentile” groups. This “second-stage” relationship between βi and xi is plotted in Figure 4a.
Second, I infer what wealth levels today must have led to the observed home values, xi . For this
relationship, I use the 2013 Survey of Consumer Finances (SCF), which is the most reliable source of
micro-data on wealth (Pfeffer et al. (2016)). Figure 4b groups households by the net-worth percentile
they belong to and then plots the average log-first-home-value, only among the homeowners in that
group. The bin scatter (by percentiles) shows that for most of the distribution in the middle, the
log-home-value is linearly increasing in the wealth percentiles.
The slope of the home-value-on-wealth-percentile relationship in Figure 4b is estimated from
x j ≡ E [log home value | own] j = a + b wealth percentile j

≡ f wealth percentile j



for household j. I use the estimated linear relationship to turn the average home-values by “1940
income percentile” groups from the CoreLogic data, into their corresponding wealth percentiles, i.e.,
(wealth percentile)i = f −1 (xi ).38 This variable will be on the x-axis in the final return-differential-onwealth-percentile relationship.
3.5.2

From Estimated Betas to Return Differentials

The estimated betas measure the elasticity of the quantity of housing held by one group of surnames
in response to the national house-price index. Proposition 4 gives a simple, linear approximation
of how this beta translates to a return differential arising from the timing of trades. As a reminder,


k −1
the linear relationship is given by −var µtk b̃k Dk
β̂i , where β̂i are the estimated betas for the
P

“1940 income percentile” groups i. The coefficient on β̂i is composed of expected-return properties

of housing: var µtk measures the variance of expected returns, b̃k measures how expected returns
co-vary with the rent-to-price ratio

Dtk
,
Ptk

k

and

D
k
P

is the time-averaged rent-to-price ratio. Given those

housing-market-level values, the estimated betas β̂i can be translated to a return differential linearly.
38 Because

I estimate the function f (·) at the household-level in the 2013 Survey of Consumer Finances (SCF), but apply
the inverted function to surname-level data, I make the assumption is that the non-wealth determinants of homeownership
and home values relate to wealth in the same way at the individual level as at the surname level. For a formalization of what
assumptions are required to translate surname-level relationships to household-level relationships, see Henry de Frahan and
Sakong (2020).

27

In this section, I take the characteristics of the expected returns of the national housing market from
Cochrane (2011), who showed that return predictability is comparable between the aggregate stock
market and aggregate housing market. I also verify similar numbers by using the national house-price

index from CoreLogic myself. I assume the following numbers: var µtk ≈ (0.0546)2 and b̃k ≈ 3.8
k

from Cochrane (2011) for the aggregate stock market, and the average rent-to-price ratio of

D
k
P

≈

1
16 .

Note that for transparency, I used the characteristics of the aggregate housing market (i.e., the asset k
is the aggregate housing stock). In reality, expected-return dynamics may vary by local housing markets
and may be correlated with the beta-against-wealth-level relationships. I explore this heterogeneity by
local housing markets k, in the next section on the geographical cross-section.
3.5.3

Interpreting the Return Differential by Wealth Percentiles

Figure 4c plots the imputed return differential against the imputed net-worth percentile. The linear
fit shows that a 10% increase in the net-worth percentile translates to a roughly 12-basis-points-higher
annual return on wealth.
The estimate is quite large. For the sake of comparison, I “extrapolate” onto the interquartile range,
with a 60-basis-point return differential.39 Note that this extrapolation is more justified than appears:
Because I have aggregated population to surname-level data, the linear relationship is the first-order
approximation of the true relationship in the population (see a similar explanation for zip-code-level
data in Mian and Sufi (2016)).
There is only a limited number of estimates of how portfolio returns vary by wealth level.40 The
most comprehensive estimate is from Fagereng et al. (2016), who study financial wealth in Norway.
They find a total financial-portfolio-return-differential of roughly 1% in the interquartile range.
If we think the overall return differential for the US housing portfolio is similar (in the absence of
any real estimate), then my estimation argues that 60% of the overall return differential may be due to
the timing of trades given the same assets. Whereas most of the literature on wealth-return differentials
focuses on the asset-side heterogeneity, this estimate argues that the timing of trades matters, even
though it is much less explored.
39A

related calculation is how much realized higher returns were for the richer households, given the realized returns
in the recent boom and bust (1998-2013). That calculation uses the same formula, but instead of using the variance in
expected return, I would use the variance of realized returns, which was roughly 0.01 post-1999. This estimate translates to
an average annual return differential of 2% from the timing of trades.
40 Some examples include Piketty (2014) (using university endowments), Saez and Zucman (2016) (using US foundations), and Piketty and Zucman (2015) (using Forbes global wealth rankings over 1987-2013).

28

3.6

From Return Differentials to Wealth Inequality

In this sub-section, I derive a back-of-the-envelope calculation to translate the estimated return
differential to wealth inequality. This last calculation is non-trivial. While wealth inequality exhaustively
decomposes into contributions from income inequality, savings-rate differences, and portfolio-return
differences, the contribution of return-differentials to wealth-inequality levels depends on the other
two factors. For example, in the neoclassical benchmark in which human capital is tradable (i.e.,
idiosyncratic labor-income shocks are fully insured), any agent who can earn a systematically higher
return will own all the wealth in the long-run, however small the return differential (Levy (2003)).
In a more realistic model where human capital is not tradable, labor income flows keep the wealth
distribution from diverging even with systematically different returns; given a fixed level of labor
income, as my wealth gets larger, the labor income acts as a higher proportional inflow into my
wealth portfolio (Gabaix et al. (2016) call it “stabilization”). How much labor income can “stabilize”
wealth portfolios in turn depends on how much households spend out of the labor-income inflow. The
back-of-the-envelope calculation makes approximate assumptions on these forces.
it
As a reminder, the accumulation of measured, non-human wealth Wit was given by: dW
Wit =


Yit
Cit
Wit − Wit dt + dRit , where Yit is labor income, Cit is expenditure inclusive of user cost of hous-

ing, and dRit is wealth return.
For the back-of-the-envelope calculation, I assume idiosyncratic labor incomes are not fully insured
and shut down more complex savings-rate differences with the following assumption.
Assumption 5 (Approximate expenditure policy). Assume the following approximation for the expenditure Cit :

Cit = cyYit + cwWit
for the same constants cy and cw for all household types.
This expenditure policy encompasses some benchmarks,41 but it is not theoretically sound. For
example, the approximation predicts that house-changes for homeowners would lead to proportional
changes in expenditures, but the wealth effect of house-price changes is debated (e.g., see Berger et al.
41 For

example, a hand-to-mouth agent would have cw = 0. Heterogeneous-agent models with two states (income and
asset) would imply Ct = cy (Yt ,Wt )Yt + cw (Yt ,Wt )Wt . Here the arguments of the average propensity to consume out of
income and wealth are suppressed.

29

(2017)). The policy also implies a marginal propensity to consume (MPC) that does not vary with
wealth level, which is at odds with the data. For these reasons, the approximate policy is meant as an
approximation and simplification.
To be precise, what matters for the accumulation of measured wealth is the average propensity to
consume (APC).42
Lemma 6 (Measured wealth share). Given all Assumptions and that aggregate labor income (Y ) and
measured wealth (W ) are co-integrated, in the long-run stationary distribution even with aggregate
shocks,



 
E dRit − dRt
Yit
Yt
E
−E
=−
Wit
Wt
1 − cy


Wealth shares inherit income shares if returns are identical.
The lemma follows immediately from E [d logWit − d logWt ] = 0 for agent types i relative to the
population average in the long-run stationary wealth distribution.43 This formula translates the portfolio


return differential, E dRit − dRt , to the difference between wealth inequality to income inequality:
h i
h i
Yit
Yt
−
E
E W
Wt .
it
The actual equation mapping the return differential to wealth inequality is difficult to manipulate
due to the expectations. To get an order of magnitude and for insight into the formula, I treat all
quantities as constants for a back-of-the-envelope calculation.
Corollary 7 (Back-of-the-envelope calculation). The relationship between the wealth share and income
share of household type i is approximate by



income stabilization


z
}|
{


Y
 W 1
Wi 
i
≈ 1 + EdRi − EdR
W 
Y
{z
} Y 1 − cy 
|




return differential
See the appendix for the derivation.
The relationship between wealth share and income share of an agent type i is given by the return
differential, offset by the average income flow rate, tempered by (1 − cy ) (i.e., savings rate out of
42While

MPC is decreasing in income and liquid wealth, the APC gradient is theoretically ambiguous, even among
models with downward-sloping MPC. Models with precautionary savings by the poor would exhibit increasing APC
(Aiyagari (1994)). Models in which the rich prefer to save (i.e., wealth is a luxury) would exhibit decreasing APC (Carroll
(2000)). The interaction is insignificant in the Panel Study of Income Dynamics (PSID).
43The short-term drift E [d logW ] need not equate, in the presence of aggregate shocks.
t
t

30

current income). Again, the APC out of current labor income (cy ) works to scale the return differential
up and down in determining wealth-to-income inequality. To get the order of magnitude, I estimate a
rough consumption policy from the PSID, which has information for consumption, income, and wealth.
The estimated coefficient is cy ≈ 0.25. See the online appendix for the estimation.
Using the measured-wealth-to-labor-income ratio of

W
Y

≈ 10 from the SCF and cy ≈ 0.25, the

interquartile return differential per year of 60 basis points translates to a wealth share that is 8% higher
than the income share. Comparing this to the wealth-to-income elasticities in the SCF (Figure A.5a),
the timing of trades explains roughly a fifth of the residual wealth inequality above and beyond income
inequality.
The approximation exercises in this section are complementary to fully fledged models of steadystate wealth distributions.44

4

Comparing across Geographies
Given poor and rich households’ different ownership elasticity to price, the magnitude of the return

differential depends on the variability of expected returns in the housing market. Since households are
more likely to own housing assets near where they live and residence is sticky over time, we would
expect the trade-timing mechanism to lead to greater return differential and greater wealth inequality
in housing markets with more volatility. In this section, we test this cross-sectional implication of this
paper’s main mechanism.
In the first sub-section, I sort local housing markets by US counties by one predictor of variation
in expected returns: how much the local economy fluctuates along national business cycles. The
predictive power is true in the data, and justified by: (1) Returns in housing as an asset class have
strong geographical components (Piazzesi and Schneider (2016)), and (2) Business cycles are strong
predictors of expected returns (Cochrane (2011)).
In the second sub-section, I test whether the historical cyclicality of the area did accompany both
more volatile prices and higher elasticities of quantity to price for poor households.
In the last sub-section, I test for the long-run, ultimate implication of whether wealth inequality is
indeed higher. The second test can be considered the ultimate outcome, but I have to first overcome an
44 Early examples include Champernowne (1953), Vaughan (1979), Laitner (1979), Stiglitz (1969) (using the neoclassical

growth model), and Hopenhayn and Prescott (1992). Recent models that relate returns and wealth inequality include
Benhabib et al. (2011), Benhabib et al. (2015b), and Nirei et al. (2009).

31

empirical challenge: A local measure of wealth inequality does not exist.

4.1

Defining Housing Asset Sub-classes

I first sort geographies on the cyclicality of their expected returns and consequently the effect of
the timing of trades on wealth inequality. The sorting variable defined in this sub-section will be the
independent variable in the next two sub-sections, where I show that it predicts (1) higher gradient
of the housing-quantity-on-price elasticity (i.e., the “beta”) and (2) higher wealth-inequality levels
relative to income inequality.
Housing markets are potentially segmented between distant geographies (Piazzesi and Schneider
(2016)). I divide housing assets into sub-classes by counties in which those properties are located. In
this paper, I focus on the cyclicality of the county’s local economy. The local business-cycle cyclicality
is computed by regressing county-level log income change on the aggregate log change using data
from the Bureau of Economics Analysis for 1969-2015. That is, I take πc from
∆ logYct = πc ∆ logY t + ηct
where Yct is the per-capita income in county c in year t, and Y t is the national per-capita income
in year t. This regression is estimated using per-capita income data from the Bureau of Economic
Analysis, for 1969-2015. The coefficient πc captures the cyclicality of the local economy in county c.
The distribution of cyclicality is seen in Figure A.4c.
An alternative way to sort locations by the expected-return volatility is to use proxies for the supply
elasticity in the housing market. With more inelastic supply of housing, prices will be more volatile,
leading to more variations in the expected returns. One measure of the housing-supply elasticity is from
Saiz (2010). Using the Saiz (2010) housing-supply elasticity instead of the πc leads to qualitatively the
same results.

4.2

Beta-gradient by Local-Market Cyclicality

Before moving onto the level of wealth inequality, I check whether the beta-gradients (i.e., the
relationship between the housing-quantity-to-price elasticity and wealth levels) are higher or lower in
areas where expected returns are more volatile. The direction is theoretically ambiguous. The higher
expected-return variation could have induced poorer households to engage in less selling in busts via a
32

strong price effect. Alternatively, bigger credit deterioration in those areas with bigger price drops may
have more adversely affected poor households and caused them to sell more via a strong income effect.
To see how the elasticity gradient relates to the cyclicality in the local market, I run the following
two regressions, for each county c:

log qict = δ̃c (log Pct × 1940 income percentilei ) + αic + αct + γict + ξict
where qict is the number of real estate properties located in county k, held by individuals with surnames
in the “1940 income percentile” group i in year t, and Pct is the CoreLogic house-price index in county
c in year t. This regression is estimated using the 1998-2013 CoreLogic sample covering 60% of
the US population. The estimate δ̃c captures the extent to which poorer households hold properties
pro-cyclically in county c.
Figure 5 plots the county-level beta-gradient δ̃c against business-cycle loading πc . The plot shows
the elasticity gradient was more negative in areas with bigger cycles. That is, for real estate properties in
geographical areas with historically higher cyclicality, poorer households exhibited more pro-cyclical
holdings.

4.3

Wealth-inequality Level versus Cyclicality

The return-gradient driven by timing of trades is larger in areas with more cyclical economies
because the expected returns vary more and the beta-gradient is steeper (i.e., poor households’ housingquantity ownership responds more sensitively to house prices). Consequently, in areas with higher
historical cyclicality, we should see a bigger wealth gap than income gap, compared to households
who invest in less volatile housing markets.
Conveniently for testing, housing investment exhibits a strong home bias. Owner-occupants live in
that same house. Even for investors, because housing is heterogeneous and requires local information
to invest, investors too would exhibit strong home bias in terms of geography. These factors imply
that in cities where business cycles and house prices are more volatile, we would see greater wealth
inequality above and beyond income inequality. This sub-section documents the correlation between
business-cycle cyclicality and wealth inequality, above and beyond income inequality. This correlation
is not meant to rule out other mechanisms, but it is a necessary implication of the mechanism discussed
in this section.
33

4.3.1

Measuring CBSA-level Wealth Inequality

I first address an empirical challenge: Local-level measures of wealth inequality do not exist. To
overcome this challenge, I first form zip-code-level balance sheets, and then form between-zip-code
inequality measures for metropolitan areas, and argue that they are informative for household-level
inequality measures.
Following Mian et al. (2013) and Saez and Zucman (2016), the balance sheet is given by
NW = F i + F ni + H − D
for net worth NW . F i is income-generating financial assets. F ni is non-income-generating financial
assets: life insurance and pension funds, currency and non-interest deposits (~1% of total wealth today),
and offshore wealth held through foreign institutions (~4% of net financial wealth) (Saez and Zucman
(2016))45 . H is housing (both owner-occupied and investment housing). D is liability.
I first describe components for which I have direct measures at the zip-code level. Data on housing
ownership come from the assessor file of the CoreLogic data, described in detail in the second section.
I assign housing to zip codes, using the mailing address of the owner. As described earlier, I have the
single cross-sectional assessor data for 2012-2013, so I form the rest of the balance-sheet measures for
2012.
Data on household liability come from Equifax. Equifax is a consumer-credit-reporting agency,
which collects data on consumer-credit histories to assign credit scores. I have access to zip-code-level
aggregate amounts of various household-debt instruments. I have access to Equifax annual panel up to
2011, so I use the zip-code-level debt amounts for that year.
Zip-code-level financial-asset holdings require imputation, because no data on financial-wealth
holdings by zip codes exist. The basic idea is to obtain cash flows by asset categories (e.g., dividend
for equity and interest for bond), and to capitalize them into the stock, assuming households earn the
same yield within a given asset class, following Mian et al. (2013) and Saez and Zucman (2016).4647 I
45 Non-taxable

fixed income claims (state/local government bonds) are tax-exempt but reported on individual tax returns
since 1987 (Saez and Zucman (2016)). Wealth held by individuals through trusts flows directly to dividends, realized capital
gain, interest, and to Schedule E fiduciary income (rents/royalties).
46 Saez and Zucman (2016) use capital income: “Capital income includes dividends, taxable interest, rents, estate and
trust income, the profits of S-corporations, sole proprietorships and partnerships; we also present a series including realized
capital gains... For the post-1962 period, we impute wealth at the individual level by assuming that within a given asset
class, everybody has the same capitalization factor.”
47The capitalization technique has a long history, going back to King (1927), Stewart (1939), Atkinson and Harrison

34

obtain zip-code-level total dividends, total interests and total private-business profits from the Internal
Revenue Services (IRS) Statistics of Income (SOI). I take the capitalization factors from Saez and
Zucman (2016) Appendix Table A11: “Capitalization factors by asset class.” In 2004, for example, the
capitalization factor is 51.4 for taxable interest and 43.6 for dividends.48
Lastly, I also get zip-code-level labor income from the IRS SOI (“wage and salary”). This variable
is used to form labor-income-inequality measures, used as controls so that I can focus on residual
wealth inequality above and beyond income inequality.
After I form the zip-code-level NW , I form two measures of inequality for each metropolitan area
(or core-based statistical areas (CBSA)).
The first inequality measure is the coefficient of variation (CV), defined as the standard deviation
divided by the mean. The squared CV is subgroup-decomposable, so the total CV in wealth at the
household level can be decomposed into between-zip-code variance (observed) and within-zip-code
variance. A key assumption in using the between-zip-code CV is that the between- and within-zip-code
variances are proportional.
Figure A.4a plots the imputed CV in net worth by CBSAs, and Figure A.4b plots the imputed CV
for labor income. Note the two maps have overlap but also have differences.
I validate this between-zip-code measure using total income, for which I can form individual-level
inequality measures at the CBSA level using the American Community Survey accessed via Integrated
Public Use Microdata Series (IPUMS). Note the IPUMS data are a sample and thus have sampling error.
Figure 6c plots the zip-code-level total-income CV from the IRS against the IPUMS household-level
total-income CV. A significant positive relationship exists across CBSAs.
The second inequality measure is elasticity of wealth to labor income, or the wealth-to-labor-income
ratio. The mechanism posits that the same amount of income would translate to more wealth if returns
are increasing in wealth. The wealth-to-labor-income ratio can be calculated at the zip-code level. The
zip-code-level wealth-to-labor-income elasticity of 1.4 for net worth (Table 3b column 1) is close to
the household-level relationship in Figure A.5a, computed from the Survey of Consumer Finances.
(1978), and Greenwood (1983)
48 Capital gains are ignored for now, although they may be useful for inferring equity holdings.

35

4.3.2

Wealth-inequality Level Results

Figure 6a plots the between-zip-code CV in net worth for CBSAs against the average πc (income
cyclicality) in each CBSA, controlling for the CV in wage income. The strong positive relationship is
shown in Table 3a, in column (2), using the specification,
CVm = φ πc + γwage CVm + ΓXc + εc
where CVm is the coefficient of variation in assets or net worth in CBSA m, πc is the cyclicality
measure computed in the first sub-section, and Xc is a vector of controls.
The significant positive relationship is robust to varying the empirical specification. In both tables,
column (3) shows the coefficient on the business cycle goes up if controlling for size of the CBSA or
the average price level. Column (4) uses the average equity and bond holdings over 2003-2012 (formed
using each year’s IRS SOI dividend and interest income and each year’s capitalization factors from
Saez and Zucman (2016)) instead of the value for 2012. Column (5) adds state fixed effects to focus on
more local differences. Across these specifications, higher cyclicality in the past half-century predicts
higher wealth inequality if the area had higher loading on the aggregate cycle.
An alternative test is to see if wealth-to-labor-income ratios are increasing in the local economy’s
cyclicality. Table 3b reports results from:




logWz =ψ log (wage)z × πc + Γ1 log (wage)z × Xc
+ Γ0 Xcz + δ πc + γ log (wage)z + εz
for net worth Wz in zip code z in county c, where Xcz includes log population size and log house-price
level.
The average elasticity in column (1) is the same as the household-level elasticity in Figures A.5a
and A.5b.49 Column (2) shows the elasticity is higher (i.e., wealth-to-labor-income ratio is higher) in
counties with higher cyclicality. This correlation is robust to controlling for city size and average house
price (column 3), using average capital income over 2003-2012 (column 4), and including state fixed
effects (column 5).
49 Using

the 2013 Survey of Consumers Finances (SCF), I plot log net worth against log income (Figure A.5a) and log
total asset against log income (Figure A.5b). Taking logs, these plots restrict the sample to positive amounts of wealth.
The elasticities are roughly constant and significantly greater than 1, so that higher income translates to disproportionately
higher wealth. The coefficient is roughly 1.4 for net worth and 1.5 for total asset.

36

Interpreting the level evidence requires several caveats: (1) The inequality measures are computed
using data at the zip-code level as opposed to individual-level data (one main concern is residential
segregation by income, differentially between CBSAs); (2) I use the capital-income flow to impute
financial-wealth-stock based on constant capitalization factors; (3) The relationship is not causal,
from cyclicality to wealth inequality; (4) Even if I can establish causality, mechanisms other than the
trade-timing mechanism of this paper can generated the causal relationship. Yet the cross-sectional
implication on wealth inequality is borne out in the correlations.

5

Conclusion
Why is wealth distributed so unevenly even among the bottom 99%, and even more so than income

is? This paper gives one partial answer: Poorer households own more housing during booms when
house prices are high and expected returns are low, and vice versa in busts. The return-differential
generated from this channel is large: 60 basis points per year between the interquartile range of the
wealth distribution.
To arrive at this estimate, I construct and use a panel dataset on quantity of housing held by wealth
levels. I assign wealth levels to owners in the housing-deed records from CoreLogic, by matching
them by surnames to the average incomes of those surnames in the 1940 full-count Census. I derive
approximations that translate the quantity-ownership patterns to return differentials.
This trade-timing mechanism behind wealth differentials arises because expected returns on housing
are time-varying and predictable. It further implies that time-series volatility would widen wealth
inequality: I verify this implication across US metropolitan areas.
This paper also makes broader points: Wealth is about accumulation, so dynamic mechanisms are
important and asset-price movements (not just average returns) are important. On the methodological
side, the nexus between deed records and full count Census can be extended back much further in
history. This nexus can be a lens through which to study cycles (and other topics in economics) going
back hundreds of years.

37

References
Aiyagari, S. R. (1994). Uninsured idiosyncratic risk and aggregate saving. The Quarterly Journal of Economics,
109(3):659–684.
Atkinson, A. B. and Harrison, A. J. (1978). Distribution of personal wealth in Britain. Cambridge Univ Pr.
Bach, L., Calvet, L. E., and Sodini, P. (2016). Rich pickings? risk, return, and skill in the portfolios of the
wealthy.
Barberis, N., Greenwood, R., Jin, L., and Shleifer, A. (2015). X-capm: An extrapolative capital asset pricing
model. Journal of financial economics, 115(1):1–24.
Bayer, P., Casey, M., Ferreira, F., and McMillan, R. (2017). Racial and ethnic price differentials in the housing
market. Journal of Urban Economics, 102:91–105.
Bayer, P., Ferreira, F., and Ross, S. L. (2016). The vulnerability of minority homeowners in the housing boom
and bust. American Economic Journal: Economic Policy, 8(1):1–27.
Becker, G. S. ([1957] 1971). The Economics of Discrimination. University of Chicago press.
Benhabib, J., Bisin, A., and Luo, M. (2015a). Wealth distribution and social mobility in the us: A quantitative
approach. Technical report, National Bureau of Economic Research.
Benhabib, J., Bisin, A., and Zhu, S. (2011). The distribution of wealth and fiscal policy in economies with
finitely lived agents. Econometrica, 79(1):123–157.
Benhabib, J., Bisin, A., and Zhu, S. (2015b). The wealth distribution in bewley economies with capital income
risk. Journal of Economic Theory, 159:489–515.
Beraja, M., Hurst, E., and Ospina, J. (2016). The aggregate implications of regional business cycles. Technical
report, National Bureau of Economic Research.
Berger, D., Guerrieri, V., Lorenzoni, G., and Vavra, J. (2017). House prices and consumer spending. The Review
of Economic Studies, 85(3):1502–1542.
Beshears, J., Choi, J. J., Harris, C., Laibson, D., Madrian, B. C., and Sakong, J. (2015). Self control and
commitment: can decreasing the liquidity of a savings account increase deposits?
Bisin, A. and Benhabib, J. (2017). Skewed wealth distributions: Theory and empirics. Journal of Economic
Literature.
Boulware, K. D. and Kuttner, K. N. (2020). Wealth stratification and portfolio choice. In AEA Papers and
Proceedings, volume 110, pages 411–15.
Calvet, L. E., Campbell, J. Y., and Sodini, P. (2009). Fight or flight? portfolio rebalancing by individual investors.
The Quarterly journal of economics, 124(1):301–348.
Campbell, J. Y. (2006). Household finance. The journal of finance, 61(4):1553–1604.
Campbell, J. Y., Ramadorai, T., and Ranish, B. (2018). Do the rich get richer in the stock market? evidence
from india.
Campbell, J. Y., Viceira, L. M., Viceira, L. M., et al. (2002). Strategic asset allocation: portfolio choice for
long-term investors. Clarendon Lectures in Economic.
Campbell, S. D., Davis, M. A., Gallin, J., and Martin, R. F. (2009). What moves housing markets: A variance
decomposition of the rent–price ratio. Journal of Urban Economics, 66(2):90–102.
Carliner, M. S. (1998). Development of federal homeownership policy. Housing Policy Debate, 9(2):299–321.
Carroll, C. D. (2000). Why do the rich save so much? Does Atlas Shrug?: The Economic Consequences of
Taxing the Rich, page 465.
Champernowne, D. G. (1953). A model of income distribution. The Economic Journal, 63(250):318–351.
Charles, K. K. and Hurst, E. (2002). The transition to home ownership and the black-white wealth gap. The
Review of Economics and Statistics, 84(2):281–297.
Cochrane, J. H. (2011). Presidential address: Discount rates. The Journal of finance, 66(4):1047–1108.
Davis, M. A. and Van Nieuwerburgh, S. (2015). Housing, finance, and the macroeconomy. In Handbook of
regional and urban economics, volume 5, pages 753–811. Elsevier.
De Nardi, M. and Fella, G. (2017). Saving and wealth inequality. Review of Economic Dynamics, 26:280–300.
Engel, E. (1857). Die produktions-und konsumptionsverhältnisse des königreichs sachsen. Zeitschrift des
Statistischen Bureaus des Königlich Sächsischen Ministeriums des Innern, 8:1–54.
Fagereng, A., Guiso, L., Malacrino, D., and Pistaferri, L. (2016). Heterogeneity and persistence in returns to
wealth. Technical report, National Bureau of Economic Research.
Gabaix, X., Lasry, J.-M., Lions, P.-L., and Moll, B. (2016). The dynamics of inequality. Econometrica,
84(6):2071–2111.

38

Garbinti, B., Goupille-Lebret, J., and Piketty, T. (2017). Accounting for wealth inequality dynamics: Methods,
estimates and simulations for france (1800-2014).
Ghysels, E., Plazzi, A., Valkanov, R., and Torous, W. (2013). Forecasting real estate prices. In Handbook of
economic forecasting, volume 2, pages 509–580. Elsevier.
Glaeser, E. L. and Nathanson, C. G. (2015). Housing bubbles. In Handbook of regional and urban economics,
volume 5, pages 701–751. Elsevier.
Greenwood, D. (1983). An estimation of us family wealth and its distribution from microdata, 1973. Review of
Income and Wealth, 29(1):23–44.
Guvenen, F. (2011). Macroeconomics with hetereogeneity: a practical guide. Economic Quarterly, (3Q):255–
326.
Guvenen, F., Ozkan, S., and Song, J. (2014). The nature of countercyclical income risk. Journal of Political
Economy, 122(3):621–660.
Heathcote, J., Storesletten, K., and Violante, G. L. (2009). Quantitative macroeconomics with heterogeneous
households. Annu. Rev. Econ., 1(1):319–354.
Henry de Frahan, L. and Sakong, J. (2020). Intergenerational elasticity of consumption and income.
Hoopes, J., Langetieg, P., Nagel, S., Reck, D., Slemrod, J., and Stuart, B. (2016). Who sold during the crash of
2008-9? evidence from tax-return data on daily sales of stock. Technical report, National Bureau of Economic
Research.
Hopenhayn, H. A. and Prescott, E. C. (1992). Stochastic monotonicity and stationary distributions for dynamic
economies. Econometrica (1986-1998), 60(6):1387.
Kaplan, G., Mitman, K., and Violante, G. L. (2017). The housing boom and bust: Model meets evidence.
Technical report, National Bureau of Economic Research.
King, W. I. (1927). Wealth distribution in the continental united states at the close of 1921. Journal of the
American Statistical Association, 22(158):135–153.
Kuhn, M. and Rios-Rull, J.-V. (2016). 2013 update on the us earnings, income, and wealth distributional facts:
A view from macroeconomics. Quarterly Review, Federal Reserve Bank of Minneapolis, April, pages 1–75.
Laibson, D. (1997). Golden eggs and hyperbolic discounting. The Quarterly Journal of Economics, 112(2):443–
478.
Laitner, J. (1979). Household bequest behaviour and the national distribution of wealth. The Review of Economic
Studies, 46(3):467–483.
Levy, M. (2003). Are rich people smarter? Journal of Economic Theory, 110(1):42–64.
Longstaff, F. A. (2001). Optimal portfolio choice and the valuation of illiquid securities. The Review of Financial
Studies, 14(2):407–431.
Mian, A., Rao, K., and Sufi, A. (2013). Household balance sheets, consumption, and the economic slump. The
Quarterly Journal of Economics, 128(4):1687–1726.
Mian, A. and Sufi, A. (2016). Who bears the cost of recessions? the role of house prices and household debt. In
Handbook of Macroeconomics, volume 2, pages 255–296. Elsevier.
Mian, A., Sufi, A., and Verner, E. (2017a). Household debt and business cycles worldwide. The Quarterly
Journal of Economics, 132(4):1755–1817.
Mian, A., Sufi, A., and Verner, E. (2017b). How do credit supply shocks affect the real economy? evidence from
the united states in the 1980s. Technical report, National Bureau of Economic Research.
Nirei, M. et al. (2009). Pareto distributions in economic growth models. Institute of Innovation Research
Working Paper, (09-05).
Pfeffer, F. T., Schoeni, R. F., Kennickell, A., and Andreski, P. (2016). Measuring wealth and wealth inequality:
Comparing two us surveys. Journal of economic and social measurement, 41(2):103–120.
Piazzesi, M. and Schneider, M. (2016). Housing and macroeconomics. In Handbook of Macroeconomics,
volume 2, pages 1547–1640. Elsevier.
Piketty, T. (2014). Capital in the Twenty-First Century. Harvard University Press.
Piketty, T. (2015). About capital in the twenty-first century. American Economic Review, 105(5):48–53.
Piketty, T. and Zucman, G. (2015). Wealth and inheritance in the long run. In Handbook of income distribution,
volume 2, pages 1303–1368. Elsevier.
Quadrini, V. and Ríos-Rull, J.-V. (2015). Inequality in macroeconomics. In Handbook of Income Distribution,
volume 2, pages 1229–1302. Elsevier.
Rajan, R. G. (2011). Fault lines: How hidden fractures still threaten the world economy. princeton University
press.

39

Saez, E. and Zucman, G. (2016). Wealth inequality in the united states since 1913: Evidence from capitalized
income tax data. The Quarterly Journal of Economics, 131(2):519–578.
Saiz, A. (2010). The geographic determinants of housing supply. The Quarterly Journal of Economics,
125(3):1253–1296.
Sakong, J. (2018). Impact of economic condition on racial prejudice.
Sinai, T. and Souleles, N. S. (2005). Owner-occupied housing as a hedge against rent risk. The Quarterly
Journal of Economics, 120(2):763–789.
Stewart, C. (1939). Income capitalization as a method of estimating the distribution of wealth by size groups. In
Studies in Income and Wealth, Volume 3, pages 95–146. NBER.
Stiglitz, J. E. (1969). Distribution of income and wealth among individuals. Econometrica: Journal of the
Econometric Society, pages 382–397.
Vaughan, R. (1979). Class behaviour and the distribution of wealth. The Review of Economic Studies, 46(3):447–
465.
Viceira, L. M. (2001). Optimal portfolio choice for long-horizon investors with nontradable labor income. The
Journal of Finance, 56(2):433–470.
Wainer, A. and Zabel, J. (2019). Homeownership and wealth accumulation for low-income households. Journal
of Housing Economics, page 101624.
Werning, I. (2015). Incomplete markets and aggregate demand. Technical report, National Bureau of Economic
Research.
Wolff, E. N. (2017). Household wealth trends in the united states, 1962 to 2016: Has middle class wealth
recovered? Technical report, National Bureau of Economic Research.
Word, D. L., Coleman, C. D., Nunziata, R., and Kominski, R. (2008). Demographic aspects of surnames from
census 2000. Unpublished manuscript, Retrieved from http://citeseerx. ist. psu. edu/viewdoc/download.

40

F IGURE 1: Raw-data pattern, sorting by proxies
These figures show raw-data patterns, sorting owners by surnames (and associated 1940 income). Panel
(a) plots the average value of primary residence conditional on owning in 2012-2013, for CoreLogic’s
assessor record, which covers almost the entire US population in a single year. Panel (b) plots the per
capita holdings of any real estate asset (i.e., count), for selected decile groups, relative to the 1998
levels. Panel (c) plots the residuals εit for the same set of selected deciles from the regression:
log (qit ) = αi + αt + γit + εit
where the regression is weighted by the number of individuals in each decile group, and qit is the
holdings of all real estate by number of property by members of the decile group in a given year. For
comparison, panel (d) plots the same residuals for CoreLogic national house-price index, i.e., εt from
log (Pt ) = γ0t + εt
where Pt is the house-price index. The vertical red line indicates 2007.
( A ) By surname: primary-residence value

( B ) By surname: Per capita holding vs. 1998

( C ) By surname: Detrended log residual

( D ) National house price: Detrended log residual

41

F IGURE 2: Estimated Quantity Elasticity versus Wealth Level
These figures plot the elasticity βi from
log (qit ) = βi log (Pt ) + αi + αt + γit + ξit
where qit is the total number of properties held per capita for the percentile group i, where the
percentiles are sorted using the associated surnames’ average household-wage income from the 1940
full Census, and Pt is CoreLogic national house-price index. Panel (a) is estimated using the 1998-2013
CoreLogic sample covering roughly 60% of the US population. Panels (b) and (c) are estimated using
the 1988-2013 CoreLogic sample covering roughly 25% of the US population. Panel (b) uses the entire
1988-2013 period for estimation; panel (c) uses only 1988-2002 to exclude the subprime boom and
bust.
( A ) For 1998-2013 sample

42

F IGURE 2: Estimated Quantity Elasticity versus Wealth Level (continued)
( B ) For 1988-2013 sample: Full period

( C ) For 1988-2013 sample: Only 1988-2002

43

F IGURE 3: “Beta” between- and within-racial share
Panel (a) plots the average share of each “1940 income percentile group” that is racially White. To
create the rest of the figures, I first estimated the elasticity βi from
log (qit ) = βi log (Pt ) + αi + αt + γit + ξit
where qit is the total number of properties held per capita for the percentile group i, where the percentiles
are sorted using the associated surnames’ average household-wage income from the 1940 full Census,
and Pt is CoreLogic national house-price index. Then, for different samples, I plot the βi against the
average White share, controlling linearly for the numerical “1940 income percentile” values, in panels
(b), (d) and (f); I plot the βi against the numerical “1940 income percentile” values, controlling for the
average White share, in panels (c), (e) and (g). Panels (b) and (c) are estimated using the 1998-2013
CoreLogic sample covering roughly 60% of the US population. Panels (d), (e), (f) and (g) are estimated
using the 1988-2013 CoreLogic sample covering roughly 25% of the US population. Panels (d) and (e)
use the entire 1988-2013 period for estimation; panels (f) and (g) use only 1988-2002 to exclude the
subprime boom and bust.
( A ) White share (2010) by 1940 surname income

( B ) 1998-2013 sample: White share (residual)

( C ) 1998-2013 sample: 1940 income (residual)

44

F IGURE 3: “Beta” between- and within-racial share (continued)
( D ) 1988-2013 sample (full): White share (residual)

( E ) 1988-2013 sample (full): 1940 income (residual)

( F ) 1988-2013 sample (1988-2002): White share
(residual)

( G ) 1988-2013 sample (1988-2002): 1940 income
(residual)

45

F IGURE 4: Conversion to return differential
These figures convert the estimated housing quantity elasticity to house price by surnames’ associated
1940 income percentiles, to return differential by the corresponding wealth percentiles today. Panel
qit
(a) plots dd log
log Pt estimated from the holdings panel for each income-percentile group (by income in
1940 Census), against those percentile groups’ average primary-residence value in 2012, conditional
on owning. Each dot represents a percentile group. The plotted relationship can be viewed as a “second
stage” of quantity elasticity against wealth level as proxied for using home value, with surnames as the
instruments.
Panel (b) is estimated using the 2013 Survey of Consumer Finances (SCF), in order to map primaryresidence value to the corresponding place in the wealth distribution, using an Engel curve argument.
Panel (b) plots average log home value conditional on owning primary residence, against the net worth
percentile. Estimation yields:
E [log home value | own] = 0.026 net worth percentile + 10.408
≡ f (net worth percentile)
Panel (c) plots imputed return differential (relative to population average) against imputed net worth
percentile today. Each dot represents a percentile group defined by surnames’ associated 1940 income,
as with panel (a). primary-residence value in 2012-2013 is converted to net worth percentile using
qit
f −1 (E [log home value | own]). Elasticity dd log
log Pt is converted to return differential using
  

D
d log qit
return differentiali = −var (Et dRt ) b̃
θ
d log Pt
P
with the following coefficients from Cochrane (2011): var (Et dRt ) ≈ (0.0546)2 , b̃ ≈ 3.8. I further use
D
1
and θ ≈ 1. All regressions are estimated using the 1998-2013 CoreLogic sample covering
≈ 16
P
roughly 60% of the US population.
( A ) Beta vs. primary-residence value

46

F IGURE 4: Conversion to return differential (continued)
( B ) SCF: primary-residence value vs. wealth percentile

( C ) Return differential vs. wealth percentile

47

F IGURE 5: Elasticity gradient by local cyclicality
This figure plots a bin scatter at the county-level. For each county k, it plots the quantity-to-price
elasticity gradient β̃k against business-cycle loading δk . β̃k are estimated from:
log qikt = β̃k (log Pkt × 1940 income percentilei ) + αik + αkt + γikt + ξikt
where qikt is the number of real estate properties located in county k, held by individuals of surname i
in year t, and Pkt is the house-price index in county k in year t. The estimate β̃k captures the extent to
which poorer households hold properties procyclically in county k. δk are estimated from:
∆ log ykt = δk ∆ logYt + νkt
where ykt is the per-capita income in county k in year t, and Yt is the national per-capita income in year
t. δk captures the cyclicality of the local economy in county k. All regressions are estimated using the
1998-2013 CoreLogic sample covering roughly 60% of the US population.

48

F IGURE 6: Wealth inequality level: Coefficient of variation from zip code data
Panel (a) plots Core-based Statistical Area (CBSA)-level coefficient of variation of asset in 2012
against the area’s “income loading,” controlling for the CBSA-level coefficient of variation of wage
income. Income loadings have been calculated at the county-level by regressing changes in county-level
log per-capita income on changes in aggregate log per-capita income, using data from the Bureau
of Economic Analysis 1969-2015. Coefficients of variations in asset, wealth and wage have been
calculated for CBSAs using zip code-level variation. Wage comes directly from “Salaries and wages”
in the IRS Statistics of Income. Asset and net worth are imputed using capital income from the IRS
Statistics of Income, capital income capitalization factors from Saez and Zucman (2016), housing
ownership from CoreLogic assessor records, and zip code-level debt stocks from Equifax.
Panel (b) plots the CBSA-level coefficient variation of net worth against the area’s income loading,
again controlling for the CBSA-level coefficient of variation of wage income.
In panel (c), CBSA’s are grouped by their coefficient of variation of zip code-level adjusted gross
income from the IRS Statistics of Income, computed as above for 2012. It plots the household-level
coefficient of variation of household income from the 2008-2012 5-year American Community Survey,
accessed via IPUMS. The two measures are both meant to measure household-level inequality in
income: the x-axis variable uses zip code-level data to compute; the y-axis variable is based on sample
of household-level data.
( A ) Net worth coefficient of variation (CBSA)

49

F IGURE 6: Wealth inequality level: Coefficient of variation from zip code data (continued)
( B ) Asset coefficient of variation (CBSA)

( C ) Household income coefficient of variation: IPUMS vs. IRS

50

TABLE 1: Validation of surname-based historical income against Census 2000

51

These regressions validate the use of surnames to predict housing owners’ wealth. From the 1998-2013 sample of CoreLogic data with
owners matched via surnames to their average 1940 income, I take the single-year data for 2000. I then average the average-1940-incomes by
the zip code where the property is located and the zip code where the owner lives. Each regression relates the zip-code-level averages from
the CoreLogic data to the corresponding zip-code-level data from the 2000 Census. Columns (1) and (2) are for owner-occupied housing
only: they regress the log-median-household-income from Census 2000 against CoreLogic’s 1940 income. Column (2) includes county
fixed effects. The data for columns (1) and (2) are at the zip-code-level. Columns (3) and (4) are for non-owner-occupied housing only: they
regress the log median household income of the zip code where the owners live (i.e., the mailing address zip code) against CoreLogic’s 1940
income, controlling for the median income of the zip code where the property is located. Column (4) includes county fixed effects for both
the property site and the owners’ residential area. The data for columns (3) and (4) are at the zip code of property site × zip code of owner’s
residence level, and standard errors are clustered by the zip code of owner’s residence. Columns (5), (6) and (7) include both owner-occupied
and non-owner-occupied housing, and for each tenure status there is a separate variable for CoreLogic’s 1940 income. Columns (5) and (6)
regress the log median household income of the zip code of the property site from Census 2000 against CoreLogic’s 1940 income, with
separate variables for owner-occupants and investor-occupants. Column (6) includes county fixed effects. The data for columns (5), (6) and
(7) are at the property-site-zip-code level. Column (7) runs the same specification as column (6), but with the average log home value from
Census 2000 in the property-site-zip-code as the dependant variable.
Census income

1940 log wage

owner residence zip income

(1)

(2)

(3)

(4)

1.633∗∗∗
(0.036)

1.824∗∗∗
(0.061)

0.169∗∗∗
(0.005)
0.192∗∗∗
(0.003)

0.071∗∗∗
(0.002)
0.071∗∗∗
(0.002)

site area log income
1940 log wage (not own)
1940 log wage (owner occupants)
Constant
Adjusted R2
county FE
# of clusters
Observations
Standard errors in parentheses
p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

∗

-0.535∗
(0.247)

7.588∗∗∗
(0.058)

0.216

0.481
O

9878

9727

0.046
28271
1796362

0.460
O
28099
1793999

property-site income

home value

(5)

(6)

(7)

0.586∗∗∗
(0.112)
1.162∗∗∗
(0.110)
-1.324∗∗∗
(0.230)

0.882∗∗∗
(0.120)
1.236∗∗∗
(0.113)

1.587∗∗∗
(0.180)
1.423∗∗∗
(0.163)

0.226

0.490
O

0.696
O

9808

9666

9654

TABLE 2: Race decomposition
( A ) Return (imputed from "beta"; unit in percent)

1998-2013

wealth percentile (2013)

1988-2002

(1)

(2)

(3)

(4)

(5)

(6)

0.014∗∗∗
(0.001)

-0.003∗∗∗
(0.000)
-0.003
(0.034)
-0.170∗∗∗
(0.009)
-0.419∗∗∗
(0.006)

0.020∗∗∗
(0.001)

-0.001∗
(0.001)
0.018
(0.044)
-0.153∗∗∗
(0.012)
-0.544∗∗∗
(0.008)

0.005∗∗∗
(0.001)

-0.014∗∗∗
(0.001)
0.146∗
(0.063)
-0.246∗∗∗
(0.025)
-0.477∗∗∗
(0.016)

0.152
119420

0.554
119420

0.090
118667

0.299
118667

0.001
117835

0.015
117835

Asian share
Black share
Hispanic share
Adjusted R2
Observations

1988-2013

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

( B ) Average primary residence value ("first stage")

2013 wealth percentile

1940 income

(1)

(2)

(3)

15.480∗∗∗
(0.193)

10.207∗∗∗
(0.152)
13.350∗∗∗
(0.756)
-10.236∗∗∗
(0.260)
-6.865∗∗∗
(0.142)

-0.155∗∗∗
(0.031)
-0.865∗∗∗
(0.011)
-0.606∗∗∗
(0.014)

0.662
119432

0.462
119531

Asian share
Black share
Hispanic share
Adjusted R2
Observations

1940 income

0.504
119432

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

52

TABLE 3: Level of wealth inequality
For county c in CBSA m,
CVm = β income loadingc + γwage CVm + ΓXc + εc
where Xc includes log population size and log house price level. Income loadings have been calculated
at the county-level by regressing changes in county-level log per-capita income on changes in aggregate
log per-capita income, for 1969-2015. Coefficients of variations in asset, wealth and wage have been
calculated for CBSAs using zip code-level variation. Wage comes directly from “Salaries and wages”
in the IRS Statistics of Income. Asset and net worth are imputed using capital income from the IRS
Statistics of Income, capital income capitalization factors from Saez and Zucman (2016), housing
ownership from CoreLogic assessor records, and zip code-level debt stocks from Equifax. All standard
errors are clustered at the CBSA-level.
( A ) CBSA coefficient of variation

networth c.v. (2012)

beta: income per cap

(1)

(2)

(3)

(4)

(5)

0.747∗∗∗
(0.151)

0.541∗∗∗
(0.110)
1.489∗∗
(0.452)

0.078
(0.149)

0.556∗∗
(0.197)
1.181
(0.608)
0.282
(0.145)
0.569
(0.592)
-5.395
(4.230)

0.651∗∗∗
(0.188)
1.149∗
(0.523)
0.265∗
(0.108)
0.523
(0.594)

0.406∗∗∗
(0.113)

0.804∗∗∗
(0.177)
1.317∗∗
(0.473)
0.070
(0.044)
-0.330∗
(0.135)
0.780
(0.734)

0.032

0.312

0.321

0.213

881
1707

881
1707

649
1092

649
1092

0.306
O
649
1091

wage c.v.
log population
log house price
Constant
Adjusted R2
state FE
# of CBSA
Observations

with avg cap income 2003-2012

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

53

TABLE 3: Level of wealth inequality (continued)
For zip code z in county c,




log (Y )z =β log (wage)z × income loadingc + Γ1 log (wage)z × Xc
+ Γ0 Xcz + δ income loadingc + γ log (wage)z + εz
where Xcz includes log population size and log house price level. Income loadings have been calculated
at the county-level by regressing changes in county-level log per-capita income on changes in aggregate
log per-capita income, for 1969-2015. Wage, asset and net worth vary at the zip code level. Wage
comes directly from “Salaries and wages” in the IRS Statistics of Income. Asset and net worth are
imputed using capital income from the IRS Statistics of Income, capital income capitalization factors
from Saez and Zucman (2016), housing ownership from CoreLogic assessor records, and zip code-level
debt stocks from Equifax. All standard errors are clustered at the county-level.
( B ) Zip code wealth-wage elasticity

log networth per capita (’12)
(1)

(2)

(3)

(4)

(5)

(6)

0.809∗∗∗
(0.195)

0.689∗∗
(0.223)
0.057∗∗
(0.022)
0.228
(0.207)
-7.074∗∗
(2.381)
-0.674∗∗
(0.238)
-1.771
(2.258)
-1.135
(1.092)
20.478
(11.931)

0.735∗∗∗
(0.204)
0.065∗∗
(0.023)
0.151
(0.220)
-7.457∗∗∗
(2.172)
-0.763∗∗
(0.249)
-1.170
(2.403)
-0.864
(1.145)

0.958∗∗∗
(0.197)
0.066∗∗
(0.023)
0.565
(0.316)

0.316

0.377
O

0.502

1.399∗∗∗
(0.018)
-3.792∗∗∗
(0.188)

0.573∗∗
(0.201)
4.778∗
(2.105)

0.769∗∗
(0.244)
0.056∗
(0.028)
0.093
(0.221)
-7.922∗∗
(2.604)
-0.600∗
(0.302)
-0.245
(2.426)
-0.397
(1.181)
11.489
(12.938)

0.267

0.270

0.300

income beta × wage
population × wage
house price × wage

-8.413∗∗∗
(2.041)

income beta
log population
log house price
log wage
Constant
Adjusted R2
state FE
county FE
# of counties
Observations

21723

with avg cap income ’03-’12

2854
21517

1196
15413

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

54

1196
15400

1196
15400

-0.739∗∗
(0.241)

-2.904
(1.599)

O
1158
15362

Appendix
A

Additional Theoretical Results

A.1

Theoretical Ambiguity the Quantity Elasticity versus Wealth Level

Working with a standard household problem, I show that theoretically rich or poor households
could exhibit more procyclical housing ownership. Mainly, I distinguish poor households from rich
households by more frequently binding credit constraints, and consider the comparative static in
quantities that vary over the cycle.50
They are meant to demonstrate the theoretically ambiguous prediction for

d log q
d log P

and are not a

comprehensive analysis of the cross section of trading behaviors. In addition to demonstrating the
ambiguous theoretical prediction, the model exercise serves two more roles: (1) It explains how the
panel data on housing ownership map to the drivers and incidence of business cycles, and (2) the
implication of wealth inequality relies on the pattern happening through time - assessing if it is true is
partly an empirical exercise, but knowing if forces considered universal can generate the selling pattern
is also useful.
I organize the discussion around a simple two-asset consumption-savings model of an individual.
Households maximize
∞

∑ β τ u (Ct+τ )
τ=0

subject to a standard two-asset budget constraint and a borrowing constraint on the risk-free asset:
Yt + Bt + Ht (Pt + Dt ) = Ct + Ht+1 Pt + Bt+1 Qt
{z
}
|
Wt

Bt+1 ≥ B
Define wealth W as cum-dividend wealth. Let ξt denote the multiplier on the borrowing constraint,
scaled by marginal utility and bond price, that is, ξt ≡

multiplier on borrowing constraintt
Qt λt

≥ 0, because both

50 Households of different wealth levels have many possible differences that could generate the observed transacting
behaviors. For example, (1) except for the top 2%, lower-income households have more cyclical income (Guvenen et al.
(2014)), (2) credit-supply fluctuations disproportionately affect lower-income households (Mian et al. (2017a); Mian et al.
(2017b)), (3) lower-income households may be more myopic or extrapolative, or (4) lower-income households may lack
market knowledge and importantly timing skill, among other possible differences. Here, I show that even with just the
difference in the tightness of credit constraint, theoretical predictions are ambiguous.

55

Qt > 0 and λt > 0 always. ξt is the scaled shadow cost of the borrowing constraint and only enters the
Euler equation for the risk-free asset. Derivations of key equations closely follow Viceira (2001) and
Campbell et al. (2002),51 and can be found in the Appendix.
First consider the Merton benchmark. Define the housing share as θt ≡

Ht+1 Pt
Ht+1 Pt +Bt+1 Qt .

To keep θt at

a fixed level, those with a higher level of θt will buy in boom. Their Wt increases disproportionately
more, and thus they have to acquire more Ht+1 . Because the average risky share is higher for richer
households, this simplest benchmark shows clearly that from valuation shocks alone, higher-wealth
households should be the ones exhibiting more procyclical net purchases.
Combining the two Euler equations yields

 1
γCov (ct+1 − ct , r1,t+1 ) ≈ E r1,t+1 − r f ,t + ξt + Var (r1,t+1 )
2
where r1,t+1 ≡

Pt+1 +Dt+1
Pt

for housing and r f ,t ≡

1
Qt

for the risk-free bond.52

A reduction in ξt from the relaxed borrowing constraint acts like a reduction in the risk-free rate for
portfolio choice.53 This lower effective risk-free rate is a price effect that applies to all risky assets. Note
that this lower effective risk-free rate due to reduction in ξt is different from a shock to βt , which would
affect all assets equally. An effect on portfolio allocation via the effect on consumption would arise,
but the direct price-effect channel is gone. This distinction is important: Papers in macroeconomics
often take βt shocks as a reduced-form way to model incomplete market demand shocks (Werning
(2015); Beraja et al. (2016)). With multiple assets, this reduced form would miss the price-effect
and portfolio-choice channel. In fact, most macroeconomic models would miss this price-effect, and
only capture the forces related to consumption (i.e., the direct effect of β and consumption-income
co-movement).
Constraints would also affect the consumption rule. The Euler equation for consumption growth in
terms of portfolio return is

 1


[log β + (1 − θt ) ξt ] − γEt [ct+1 − ct ] + Et r p,t+1 + Var r p,t+1 − γ (ct+1 − ct ) ≈ 0
2
51As

opposed to the more common numerical solution route, these papers use Taylor approximations to solve for
approximate solutions, while taking care to expand properly inside expectations to preserve the precautionary and assetpricing forces. The positive probability of a zero income and a permanent retirement state allows for strictly positive
non-human wealth always. Strictly positive non-human wealth allows for log-linearization. Because I primarily study
homeowners, I make a similar assumption of a permanent zero-income possibility and study W > 0.
52 Note that unlike w and y , and like c , ξ too is an endogenous policy. These equations are not full solutions.
t
t
t t
53This effect is similar with a collateral constraint as well, because a borrowing limit scales risky asset holdings by a
loan-to-value ratio given by some parameter less than 1, that is, φ < 1.

56

where θt ≡

Ht+1 Pt
Ht+1 Pt +Bt+1 Qt .

With higher ξt , the agent simultaneously chooses a lower ct and lower θt ,

and consequently a lower Et r p,t+1 .
For the clearest comparison, I make the following simplification: Assume ξit = ξi for household
type i. Denote “poorer households” as those households with higher ξi . With this simplification, the
solution takes a form similar to that in Viceira (2001). With this simplification, household consumption
ct and portfolio share in risky asset θt are given by
ct − yt = b0 + b1 (wt − yt )

µ − r f + ξ + 12 σu2 − (1 − b1 ) σyu
θt =
b1 σu2
where b1 =

ρw −1
ρc

)ξ
> 0 and b0 = − constant+(1−θ
with b0 < 0 and
b1 ρc

db1
dξ

> 0.

Note that b1 behaves like the marginal propensity to consume (MPC) out of liquidity in the
consumption-savings literature. Shocks to labor income yt are all permanent, so yt represents the
permanent income from which households would borrow. wt is liquid assets that they can access
immediately. Hence, the hedging term implies that if MPC is high, permanent income shocks translate
less to consumption growth ((1 − b1 ) is lower), and so hedging demand is lower. With higher ξ , b0 is
even more negative and consumption is suppressed. Hence,

ρc
ρw

↓, and b1 ↑. We find that the elasticity of

consumption to liquid wealth is higher (again, wt is like cash on hand in this set-up, because yt shocks
are permanent). Higher b1 has two opposite effects. On the one hand, human capital is worth less
because one cannot borrow against it (i.e., the b1 in denominator). On the other hand, innovations to
permanent income “matter less” in that they translate less to consumption growth, so hedging demand
is lower. Given that income shocks are positively correlated with housing returns, this force increases
the demand for housing.
Credit supply affects demand for housing via three channels. Taking comparative statics with
respect to ξ ,




σyu 
dθ
1
db1 
θ


=−
+
− 2 2 +

2
2
dξ
b 1 σu
dξ  b1 σu b1 σu 
| {z
|
} |{z} | {z
{z
}
}
<0

>0

<0

>0

First, note in the combination of the two Euler equations that ξt > 0 acts as if the risk-free return is
higher for those households; a pure price effect would lead to allocation toward more positive risk-free
allocation and toward less of all risky assets. Second, with higher ξ and hence higher b1 , future labor
57

income is less valuable, and thus households behave as if they are more risk averse (i.e., the b1 in the
denominator). Third, because future labor income is less valuable, its covariance with asset prices (i.e.,
background risk) is also less of a concern. The last force moves in the opposite direction of the other
two forces. These three forces apply to all risky assets.54
To understand who buys in booms and busts, I consider comparative statics on θt , because I have
already assumed common price dynamics. Given a one-time permanent shock to a parameter of the
model, I consider whose θt would increase more.
First, consider a procyclical credit supply. Suppose a looser credit constraint lowers ξi more for
households with higher levels of ξi . With σyu not too high, lower ξ would translate to higher demand
for housing for those households for whom constraints had bound more. Therefore, procyclical credit
supply naturally leads to more procyclical net purchase behavior for poorer households who are closer
to borrowing and collateral constraints. The relaxation of credit supply in booms can come from market
forces or from government policy.
Next, consider a procyclical perceived expected return on risky asset µ̃. This comparative applies
to factually higher return in cycle downturns, as well as to extrapolated higher return in booms in
behavioral models. Comparative static yields:
1
∂θ
=
∂ µ̃
b1 σu2
A perceived return would predict that either no transfer of housing would occur along the wealth
distribution or a transfer would occur from poor to rich households, who have better capacity to
capitalize on the higher return expectation.5556
54There

are additional forces not in this simple example that would apply specifically to housing. First, see from the
portfolio return Euler equation that ξt > acts as if β is higher. Household would tilt towards savings from consumption, and
consequently owner-occupied housing demand would be lower. This force is specific to durable consumption, of which
housing is the primary example. Second, illiquid assets are discounted due to illiquidity, and more difficult self-insurance
would increase the cost of illiquidity (Longstaff (2001)). This force is specific to housing as an illiquid asset (i.e., asset with
large transaction costs).
55 Even with an expected housing return shock, an increase in house prices will loosen borrowing/collateral constraints.
This effect would be omitted in partial equilibrium. However, even with a relaxation in some credit constraint from a
house-price increase, the dominant effect is still the rise in house price (a cost) from the perspective of a buyer. Given
a typical loan-to-value constraint b ≥ φ ph, for example, an increase in p would increase b only by φ < 1 fraction. This
general-equilibrium effect cannot overturn the comparative static. In fact, this discussion highlights a key reason for looking
at net housing as opposed to gross borrowing. A causal mechanism going from house price to a looser constraint can
increase borrowing by the poor, yet will not get housing to transfer to them on net.
56This implication is also seen in Kaplan et al. (2017), in which a price-belief shock by itself reduces homeownership,
because owning becomes more expensive than renting. That prices rise so that poor households switch out of homeownership
is equivalent to saying poor households experience a lower demand increase than richer households. Endogenous price
movement inherits directly the excess demand in a partial-equilibrium set-up.

58

Therefore, depending on which forces are stronger and what changes are happening at cyclical
frequencies, in theory rich or poor households could be holding housing procyclically.

B

Derivations

B.1

Wealth Inequality

Suppose
log µtk = ak + bk log

Dtk
Ptk

log θitk = log Ptk + log Qkit − logWit
Then


h
i
h i h i
cov θitk , µitk =E θitk µitk − E θitk E µitk
h i h ih
 

i
=E θitk E µitk exp cov log θitk , log µtk − 1
h i h i


k
k
k
k
≈E θit E µit cov log θit , log µt
h i h ih 




i
=E θitk E µitk cov log Ptk , log µtk + cov log Qkit , log µtk − cov logWit , log µtk
Focus on middle term:


h i h i


acov θitk , µitk ≡ E θitk E µitk cov log Qkit , log µtk
h i h i


= −bk E θitk E µitk cov log Qkit , log Ptk


h i h i cov log Qk , log Pk 
t
it
k
k
k
k

var log Pt
= −b E θit E µit
var log Ptk
cov log Qkit , log Ptk

≈ − (0.1987424) (1) (1.123208) (.1578269)2
var log Ptk

cov log Qkit , log Ptk

≈ 0.0056
var log Ptk
The assumptions are: (1) process for xtk ≡



Dtk
, (2) lognormal distributions, (3) approximation around
Ptk

covarinace of 0.

59

B.1.1

Back-of-envelope

Wi
W
Yi
Y

≈

1
1

1 − (µ i − µ)

(1−cy ) WY
W 1
≈ 1 + µi − µ
Y 1 − cy

where the last approximation is a Taylor expansion around 0.
Plug


h i h i cov log Qk , log Pk 
t
it
k
k
k

var log Pt
−b E θit E µit
var log Ptk
k


into µ i − µ .
B.1.2

Estimating behavioral consumption rule

Estimating consumption rule in levels is difficult, so I modify log-linearization method used in
macroeconomics. Start from
Ci = cwWi + cyYi
The log deviation of some household’s consumption from C = cwW + cyY , is logCi − logC. First-order
approximation gives,
logCi ≈ logC +

Ci
−1
C

Plug into the level consumption policy:




W
Y
logCi ≈ cw
logWi + cy
logYi + K
C
C
where K is a constant given by K = logC − cw W
logW − cy CY logY .
C
Based on national account numbers, labor income is roughly 70% of output and consumption is
roughly 70% of output, so cy CY ≈ cy .
Estimation of the following regression in the PSID,
logCit = γ̂y logYit + γ̂w logWit + εit
yields γˆy ≈ 0.25 and γ̂w ≈ 0.05. Note that this estimation omits individuals with non-positive income
or wealth. Note also that the estimated coefficients along with the consumption rule are not internally
consistent. This part is a rough approximation.
60

B.2

Micro-foundation

Then, log Euler equations for the two assets,
1
0 = log β − γEt [ct+1 − ct ] + Et [r1,t+1 ] + Var [r1,t+1 − γ (ct+1 − ct )]
2
1
0 = log β − log (1 − ξt ) − γEt [ct+1 − ct ] + r f ,t + Var [−γ (ct+1 − ct )]
2
where r1,t+1 ≡

Pt+1 +Dt+1
Pt

for housing and r f ,t ≡

1
Qt

for risk-free. Taking difference:


 1
γCov (ct+1 − ct , r1,t+1 ) = E r1,t+1 − r f ,t + Var (r1,t+1 ) + log (1 − ξt )
2

 1
≈ E r1,t+1 − r f ,t + Var (r1,t+1 ) − ξt
2

 1
= E r1,t+1 − r f ,t + ξt + Var (r1,t+1 )
2
for ξt ≥ 0.
Log budget constraint
wt+1 − yt+1 ≈ k + ρw (wt − yt ) − ρc (ct − yt ) − ∆ yt+1 + r p,t+1

1
r p,t+1 = θt r1,t+1 − r f ,t + r f ,t + θt (1 − θt )Var (r1,t+1 )
2
where θt ≡

Ht+1 Pt
Ht+1 Pt +Bt+1 Qt .

Euler with portfolio return:

 1


log β − γEt [ct+1 − ct ] + Et r p,t+1 + Var r p,t+1 − γ (ct+1 − ct ) = log (1 − ξt (1 − θt ))
2
≈ − (1 − θt ) ξt
Simplifications: Assume a fixed ξ and stationary environment with fixed expected return as in
Viceira (2001). Comparative static on ξ . (For notational convenience, just stick with ξ with no t
subscript.) With actual time-varying expected return, have to keep track of another state variable,
as opposed to the one-state set-up (in wt − yt ). Comparative static should be clearest in terms of
highlighting the forces. Switch to numerical sooner. For now, just assume shock to µ, i.e., higher
expected return in bust as well as higher perceived house price appreciation in early 2000s are all just


MIT shocks to µ = E r p,t+1 . With fixed ξ , solutions are entirely the same, with a few changes in the
coefficients.
Solution still takes the form:
ct − yt = b0 + b1 (wt − yt )
61


µ − r f + ξ + 12 σu2 (1 − b1 ) σyu
θ=
−
b1 σu2
b1
σu2
Note that b1 is like the MPC out of liquidity we are used to in the consumption-savings literature.
Shocks to labor income yt are all permanent, so yt represents permanent income that households would
borrow from. wt is liquid assets that they can immediately access. Hence, the hedging term implies: if
high MPC, permanent income shocks translate less to consumption growth ((1 − b1 ) is lower), and so
there is lower hedging demand.
Doing the same trick with trivial inequality: ct+1 − ct = (ct+1 − yt+1 ) + (yt+1 − yt ) − (ct − yt ), and
for γ = 1, arrive at



b0 + b1 k + (1 − b1 ) g + b1 E r p,t+1 − (b1 ρc + 1) b0 + b1 ρw − b21 ρc − b1 (wt − yt )

 1

= log β + (1 − θ ) ξ + E r p,t+1 + Var r p,t+1 − (ct+1 − ct )
2
where g = E (yt+1 − yt ). Setting coefficients to zero,
b1 =
b0 = −

ρw − 1
ρc

constant + (1 − θ ) ξ
b1 ρ c

with b0 < 0.
With higher ξ , b0 is even more negative, and consumption is suppressed. Hence

ρc
ρw

↓, and b1 ↑. We

get that there is higher elasticity of consumption to liquid wealth (again, wt is like cash-on-hand in this
set-up, since yt shocks are entirely permanent). Higher b1 has two opposite effects. On the one hand,
human capital is worth less since cannot borrow against it (i.e., the b1 in denominator). On the other
hand, innovations to permanent income “matter less” in that they translate less to consumption growth,
so there is lower hedging demand. Given that income shocks are positively correlated with housing
returns, this force actually increases the demand for housing.

62

F IGURE A.1: CoreLogic samples
These figures describe the two samples from CoreLogic used in this paper. In Panel (a), counties
colored in blue are included as a balanced panel in the 1998-2013 sample. These areas cover roughly
60% of the US population. In Panel (b), counties colored in blue are included as a balanced panel
in the 1988-2013 sample. These areas cover roughly 25% of the US population. In Panel (c), the
blue line (with scale on left axis) plots the number of counties that would be included in a balanced,
consistent panel of counties in a sample that starts from a given year in the x-axis and ends in 2013.
The red line plots the fraction of the US population that would be covered in each sample using
concurrent population for each year, while the green line uses the 1990 county population to calculate
the population share. The sample that starts in 1988 only includes a few counties but still covers roughly
a quarter of the US population. The counties that appear earlier in CoreLogic are not representative
also along other dimensions. Panel (d) plots the average house-price index for counties that are in the
1998-2013 CoreLogic sample (red line) and those that are not (blue line). The sample counties had
bigger house price boom and bust.
( A ) Counties in the 1998-2013 sample

63

F IGURE A.1: CoreLogic samples (continued)
( B ) Counties in the 1988-2013 sample

64

F IGURE A.1: CoreLogic samples (continued)
( C ) Fraction of the US population included

( D ) House price of in-sample counties for 1998-2013

65

F IGURE A.1: CoreLogic samples (continued)
( E ) Aggregate (Fed)

( F ) Aggregate (Census)

66

F IGURE A.2: Raw-data pattern, sorting by residence Census income
These figures show raw-data patterns, sorting owners by surnames (and associated 1940 income) on
the left column, and sorting owners by the mailing addresses’ Census block group (and associated 200
income, within county) on the right column.
Panels (a) and (b) plot the average value of primary residence conditional on owning in 2012-2013, for
CoreLogic’s assessor record, which covers almost the entire US population in a single year. Panels
(c) and (d) plot the per capita holdings of any real estate asset (i.e., count), for selected decile groups,
relative to the 1998 levels. Panels (e) and (f) plot the residuals εit for the same set of selected deciles
from the regression:
log (qit ) = αi + αt + γit + εit
where the regression is weighted by the number of individuals in each decile group, and qit is the
holdings of all real estate by number of property by members of the decile group in a given year. For
comparison, panel (g) plots the same residuals for CoreLogic national house-price index, i.e., εt from
log (Pt ) = γ0t + εt
where Pt is the house-price index. The vertical red line indicates 2007.
( B ) By block group: Per capita holding vs. 1998

( A ) By block group: primary-residence value

( C ) By block group: Detrended log residual

67

F IGURE A.3: Wealth inequality vs. income inequality
( A ) White / Black

( B ) White / Hispanic

( C ) Above median income / below

( D ) College / no college

68

F IGURE A.4: Wealth inequality level: Geographical variation
Panel (a) plots Core-based Statistical Area (CBSA)-level coefficient of variation of net worth in 2012.
Net worth coefficient of variation has been calculated for CBSAs using zip code-level variation. Zip
code-level net worth has been imputed using capital income from the IRS Statistics of Income, capital
income capitalization factors from Saez and Zucman (2016), housing ownership from CoreLogic
assessor records, and zip code-level debt stocks from Equifax. Panel (b) plots CBSA-level coefficient
of variation of wage income in 2012, calculated using zip code-level wage information. Wage comes
directly from “Salaries and wages” in the IRS Statistics of Income. Panel (c) plots business cycle
income loadings, calculated at the county-level by regressing changes in county-level log per-capita
income on changes in aggregate log per-capita income, using data from the Bureau of Economic
Analysis 1969-2015.
( A ) Net worth coefficient of variation by CBSA (2012)

69

F IGURE A.4: Wealth inequality level: Geographical variation (continued)
( B ) Wage coefficient of variation by CBSA (2012)

( C ) Business cycle loading by county (1969-2015)

70

F IGURE A.5: Wealth vs. income (SCF)
From the Survey of Consumer Finances for 2013, panel (a) plots log net worth against log household
income, and panel (b) plots log asset against log income. Panel (a) shows a slope of 1.397 of log net
worth against log income. Panel (b) shows a slope of 1.530 of log asset against log income.
( A ) Log net worth vs. log income (2013)

( B ) Log asset vs. log income (2013)

71

TABLE A.1: Level of wealth inequality: CBSA coefficient of variation (continued)
( A ) Asset

asset c.v. (2012)

beta: income per cap

(1)

(2)

(3)

(4)

(5)

0.557∗∗∗
(0.108)

0.377∗∗∗
(0.077)
1.202∗∗∗
(0.284)

0.009
(0.095)

0.347∗
(0.165)
1.104∗∗
(0.410)
0.246
(0.129)
0.675
(0.524)
-5.624
(3.756)

0.482∗∗
(0.166)
1.106∗∗
(0.350)
0.217∗
(0.095)
0.651
(0.515)

0.256∗∗
(0.081)

0.603∗∗∗
(0.122)
1.105∗∗∗
(0.302)
0.043
(0.030)
-0.120
(0.103)
-0.038
(0.494)

0.034

0.375

0.387

0.232

926
1759

926
1759

670
1114

670
1114

0.333
O
670
1113

wage c.v.
log population
log house price
Constant
Adjusted R2
state FE
# of CBSA
Observations

with avg cap income 2003-2012

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

72

TABLE A.2: Level of wealth inequality: Zip code wealth-wage elasticity (continued)
( A ) Asset

log asset per capita (’12)
(1)

(2)

(3)

(4)

(5)

(6)

0.555∗∗∗
(0.142)

0.557∗∗∗
(0.161)
0.046∗∗∗
(0.013)
-0.036
(0.119)
-5.816∗∗∗
(1.727)
-0.458∗∗
(0.142)
0.916
(1.302)
0.429
(0.596)
4.513
(6.499)

0.456∗∗
(0.141)
0.045∗∗∗
(0.013)
-0.215
(0.125)
-4.569∗∗
(1.515)
-0.468∗∗
(0.143)
2.533
(1.366)
1.407∗
(0.631)

0.600∗∗∗
(0.128)
0.057∗∗∗
(0.013)
0.071
(0.173)

1.487∗∗∗
(0.012)
-3.980∗∗∗
(0.124)

0.918∗∗∗
(0.148)
1.857
(1.558)

0.609∗∗∗
(0.166)
0.041∗∗
(0.013)
-0.083
(0.122)
-6.368∗∗∗
(1.782)
-0.372∗
(0.145)
1.453
(1.332)
0.720
(0.609)
0.936
(6.670)

0.473

0.474

0.542

0.527

0.614
O

27643

3059
27383

1231
17820

1231
17820

1231
17820

income beta × wage
population × wage
house price × wage

-5.688∗∗∗
(1.497)

income beta
log population
log house price
log wage
Constant
Adjusted R2
state FE
county FE
# of counties
Observations

with avg cap income ’03-’12

Standard errors in parentheses
∗ p < 0.05, ∗∗ p < 0.01, ∗∗∗ p < 0.001

73

-0.581∗∗∗
(0.135)

-0.122
(0.877)

0.741
O
1220
17809