View original document

The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.

Working Paper Series

Selection and Monetary Non-Neutrality in
Time-Dependent Pricing Models

WP 12-09R

Carlos Carvalho
PUC-Rio
Felipe Schwartzman
Federal Reserve Bank of Richmond

This paper can be downloaded without charge from:
http://www.richmondfed.org/publications/

Selection and Monetary Non-Neutrality in Time-Dependent Pricing
Models∗
Carlos Carvalho
PUC-Rio

Felipe Schwartzman
Federal Reserve Bank of Richmond
July 30, 2014
Working Paper 12-09R

Abstract
For a given frequency of price changes, the real effects of a monetary shock are smaller if
adjusting firms are disproportionately likely to have last set their prices before the shock. This
type of selection for the age of prices provides a complete characterization of the nature of
pricing frictions in time-dependent sticky-price models. In particular: 1) The Taylor (1979)
model exhibits maximal selection for older prices, whereas the Calvo (1983) model exhibits
no selection, so that real effects are smaller in the former than in the latter; 2) Selection is
weaker and real effects of monetary shocks are larger if the hazard function of price adjustment
is less strongly increasing; 3) Selection is weaker and real effects are larger if there is sectoral
heterogeneity in price stickiness; 4) Selection is weaker and real effects are larger if the durations
of price spells are more variable.
JEL classification codes: E10, E30
Keywords: price setting, monetary non-neutrality, general hazard function, selection effect,
heterogeneity.

∗

For comments and suggestions, we thank participants at AEA 2013, LAMES/LACEA 2012, EEA 2012, SED
2012, ESWM 2012, and ESEM 2011. We also thank Andrew Owens for excellent research assistance. Some of the
mathematical results presented here were first derived in Carvalho and Schwartzman (2008) – a legacy working paper
that we never submitted for publication. The views expressed in this paper are those of the authors and do not
necessarily reflect the position of the Federal Reserve Bank of Richmond or the Federal Reserve System. E-mails:
cvianac@econ.puc-rio.br, felipe.schwartzman@rich.frb.org.

1

1

Introduction

Infrequent price changes at the microeconomic level do not necessarily imply that monetary disturbances have large real macroeconomic effects. For the same frequency of price changes, the real
effects of a monetary shock are small if the firms adjusting their prices are also the ones most likely
to change prices by a large amount. The importance of this selection effect has been well understood at least since Caplin and Spulber (1987). In their model, large price adjustments by a small
fraction of firms completely offset monetary shocks and induce money neutrality. This is because
in Caplin and Spulber (1987), as in menu-cost models more generally, there is self-selection: firms
always have the option of incurring a menu cost to adjust their prices, so that adjusting firms are
also the ones which would like to adjust their prices by the greatest amount.1
In this paper we argue that selection effects do not necessarily hinge on self-selection. In fact,
we show that selection is relevant in time-dependent sticky-price models, where the probability of
a price change depends only on the time elapsed since the price was last reset. This is because the
real effects of a monetary shock differ depending on whether adjusting firms are more or less likely
to have prices that pre-date the shock. More fundamentally, we show that in such an economy, for
a given average frequency of price changes, the real effects of a monetary shock depend solely on
this type of selection. In particular, the real effects of nominal shocks are larger if older prices are
relatively less likely to be adjusted.
A proper understanding of the fundamental role of selection in determining the real effects of
monetary shocks in time-dependent models is important, since such models are prevalent in the
sticky-price literature. While originally used for tractability, subsequent literature has shown that
time-dependent pricing rules emerge optimally in the presence of information costs as in Caballero
(1989), Bonomo and Carvalho (2004), and Reis (2006).
We tie selection to features of the distribution of price spells, some of which have been singledout in previous literature as important in determining the extent of monetary non-neutrality in
time-dependent price-setting models. In particular, we show that:
1) Calvo (1983) pricing implies no selection, as the probability of price changes does not depend
on the age of the price. In contrast, Taylor (1979) pricing implies maximum selection, since changing
prices are always the ones which have been in place for longest. This explains why, for a given
frequency of price changes, Taylor pricing generates a lower degree of monetary non-neutrality than
Calvo pricing (Kiley, 2002).
2) If the hazard of price adjustment is increasing, then selection for older prices is relatively
1

While Caplin and Spulber (1987) do not consider menu costs explicitly, the state-dependent pricing rule that
they postulate can be rationalized by the presence of such costs.

2

stronger, and the real effects of a nominal shock are smaller than under Calvo pricing. This conforms
with discussions by Dotsey, King, and Wolman (1997) and Wolman (1999). Moreover, the more
increasing the hazard of price adjustment is, the larger selection effects are.
3) Under certain conditions, we can show that cross-sectoral heterogeneity in price stickiness
is associated with lower (and possibly negative) selection, as sectors with low frequency of price
changes have both a larger proportion of old prices and a lower probability of price changes. This
clarifies and generalizes the finding in Carvalho (2006) that heterogeneity in price setting can lead
to larger real effects of monetary shocks.
4) When comparing two economies, one in which the distribution of the duration of price spells
is a mean preserving spread of the other, selection for older prices is weaker – and the real effects of
the shock are larger – in the economy with more variable price spells. In particular, for a commonly
used specification for monetary shocks, the mean and the variance of the duration of price spells
are sufficient statistics for the real effects of nominal disturbances.2
Our framework encompasses a great degree of generality. As in Dotsey, King, and Wolman
(1997, section 3), price changes arrive according to a generic function of the time elapsed since
the last price adjustment. We are able to analyze the impact of quite general monetary shocks
thanks to an equivalence between the real effects of monetary shocks in sticky-price models and
in sticky-information models, as in Mankiw and Reis (2002). For most of the paper, we focus on
an environment in which the optimal price for a given firm is neither a strategic substitute nor a
strategic complement to the prices set by other firms – what we refer to as strategic neutrality in
price setting. As a robustness check, we investigate the role of selection in settings with strategic
complementarity or substitutability through numerical simulations. The results suggest that the
relationship that we uncover between selection and real effects of monetary shocks is robust to
those strategic interactions.
Our paper is not the first one to identify a role for selection in time-dependent pricing models.
Using a recursive formulation in a discrete-time setting, Sheedy (2010) shows that selection for older
prices is associated with higher inflation persistence – an issue that we do not examine. He does not,
however, examine the implications of selection for the real effects of monetary shocks. Subsequent
work by Alvarez, Le Bihan, and Lippi (2014) elaborates on the link that we uncover between timedependent selection and monetary non-neutrality in a different setting. Another related paper is Yao
(2014), who uses numerical examples to show how differences in the distribution of price durations
2

Without linking it to selection, we first proved this result in Carvalho and Schwartzman (2008). Subsequently,
Vavra (2010) and Alvarez et al. (2012) provided alternative proofs of the same result. The result is also complementary
to the one in Alvarez, Le Bihan, and Lippi (2014) which, in a different setup, connects the real effects of monetary
shocks to a moment of the distribution of price changes rather than moments of the distribution of price spells.

3

affect the dynamics of the economy in response to shocks. Finally, Vavra (2010) explores the
empirical distribution of price durations estimated from micro-data for the U.S. to study monetary
non-neutrality.
We proceed as follows: In Section 2 we lay out the model, which is a continuous time, perfect
foresight version of the baseline New Keynesian model, with general distribution of price durations.
Section 3 introduces our concept of selection in time-dependent price-setting models, and states
the key propositions linking selection to the real effect of a monetary shock. Section 4 shows how
selection relates to various ways of summarizing the distribution of price spells, that is, it states
and discusses results 1) to 4) listed above. In section 5 we present the numerical results for cases
allowing for strategic interactions in price setting. The last section concludes.

2

Model

There is a representative household which derives utility from a continuum of differentiated consumption goods aggregated in a Dixit-Stiglitz composite, and supplies a continuum of firm-specific
varieties of labor. Labor is hired by monopolistically competitive firms that produce the goods.
The household owns these firms, so it receives back whatever profits they generate. Firms hire
labor in competitive markets. We assume a cashless economy with a risk-free nominal bond in zero
net supply as in Woodford (2003), and abstract from fiscal policy.
In our analysis, we rely on a first-order approximation of the model around a zero inflation
steady state. This allows us to resort to the certainty equivalence principle and focus on the dynamic response of the economy to one-time, zero probability shocks in a world of otherwise perfect
foresight. We use a continuous-time formulation since it yields tractable closed form solutions,
although none of the key results or intuitions rely on the continuous-time assumption. The representative household maximizes:
"Z

∞

Et

C (t)1−σ − 1
−
1−σ

−ρt

e
0

Z

Z
0

1

1
1+ ψ

Lj (t)
1 + ψ1

!
dj

#
dt

1

Wj (t) Lj (t) dj − P (t) C (t) + T (t) , for t ≥ 0,

s.t. Ḃ (t) = i (t) B (t) +
0

and subject to a no-Ponzi condition. Here ρ is the discount rate, σ is the inverse of the elasticity
of intertemporal substitution, ψ is the Frisch elasticity of labor supply, C (t) is consumption of
the composite good, Lj (t) is the quantity of labor supplied for the production of variety j, Wj (t)
is the nominal wage for labor of variety j, T (t) are firms’ flow profits received by the consumer,
B (t) denotes bond holdings that accrue a nominal interest at rate i (t), and P (t) is a price index

4

to be defined below. Et is the expectations operator with respect to information available at time
t. Given the assumption of perfect foresight except for a one-time, zero probability shock, we can
ignore the expectations operator for the solution of the household problem.
The composite consumption good is given by:
1

Z
C (t) ≡

Cj (t)

ε−1
ε

ε
 ε−1

dj

,

0

where Cj (t) is consumption of the variety of the good produced by firm j. The elasticity of
substitution between varieties is ε > 1. Denoting by Pj (t) the price charged by firm j at time t,
the corresponding consumption price index is:
Z

1

P (t) =

1−ε

Pj (t)

1
 1−ε

dj

.

0

The first-order conditions for the representative consumer’s optimization problem are:
1
Wj (t)
= C (t)σ Lj (t) ψ ,
P (t)

(1)

"
#
Ċ (t)
Ṗ
(t)
= σ −1 i (t) −
−ρ ,
C (t)
P (t)

Cj (t) = C (t)

Pj (t)
P (t)

−ε
, j ∈ [0, 1] .

(2)

Firms transform labor into output one for one. They sell their products at a nominal price that
they only change infrequently. In the meantime, they commit to producing as much as necessary to
satisfy the demand for their output given their chosen price. The timing of those occasional price
changes depends probabilistically on the time elapsed since the firm’s last price change – i.e., price
setting is time dependent. Particular examples of time-dependent models include Taylor (1979) and
Calvo (1983). We follow Section 3 in Dotsey, King, and Wolman (1997), and consider a general
time-dependent setting. We denote the probability of a price surviving for a period of length less
than s by a generic cumulative distribution function G (s). At this point, the only restrictions we
impose are that G (s) depends only on the time elapsed since the price was last reset, but not on
the particular date in which it was reset. Note that G being a c.d.f. implies that lims→∞ G (s) = 1,
so that all price spells come to an end with probability one. Certain results require additional
restrictions on G that we will introduce as needed.

5

A firm that resets its price at time t chooses the price Xj (t) to solve:
Z

e

max Et

Xj (t)

∞

−ρs


(1 − G (s)) [Xj (t) Yj (t + s) − Wj (t + s) Nj (t + s)] ds

0

s.t. Yj (t + s) = Nj (t + s) ,


Xj (t) −ε
Yj (t + s) =
Y (t + s) ,
P (t + s)

(3)

where Nj (t + s) is the amount of labor demanded by the firm, and where the demand function
already takes into account that goods market clearing implies Cj (t) = Yj (t). The first-order
condition yields:

Xj (t) = Et

ε
ε−1

R∞
0


e−ρs (1 − G (s)) P (s)ε Y (s) Wj (s) ds
R∞
.
−ρs (1 − G (s)) P (s)ε Y (s) ds
0 e

As is usual in the literature, we focus on the symmetric equilibrium in which all adjusting firms
choose the same nominal price. This allows us to drop the j subscripts and denote the price set by
any firm at time t as X (t). Moreover, we assume uniform staggering of pricing decisions, so that
the price index satisfies:
Z

t

Λ (1 − G (t − v)) X (v)

P (t) =

1−ε

1
 1−ε

dv

,

−∞

where Λdt ≡

R ∞
0

(1 − G (s)) ds

−1

dt is the constant “fraction” of prices which are changed over

an infinitesimally small interval dt. We refer to Λ as the average frequency of price changes in the
economy. Using integration by parts, it is straightforward to show that:
Λ

−1

Z

∞

=

sdG (s) ,
0

which is the average duration of price spells.
The model is closed by a monetary policy specification that ensures existence and uniqueness
of a rational expectations equilibrium. Following standard practice in the price-setting literature
(e.g. Mankiw and Reis, 2002), we leave the details of monetary policy unspecified and assume an
exogenous path for nominal aggregate demand, M (t) = P (t) Y (t).
We log-linearize the model around a zero-inflation steady state. In this log-linear environment,
firms that change prices at time t set (lowercase variables denote log-deviations from the steady
state):

R ∞
x (t) = xj (t) = Et

0


e−ρs (1 − G (s)) wj (t + s) ds
R∞
.
−ρs (1 − G (s)) ds
0 e

(4)

Log-linearizing the labor supply condition in equation (1), and combining the log-linear versions
6

of the production function (equation 3), the household’s demand for varieties (equation 2), and the
market clearing condition Cj (t) = Yj (t) yields the following equilibrium expression for nominal
wages:

wj (t + s) = p (t + s) + σ + ψ −1 y (t + s) − εψ −1 (x (t) − p (t + s)) .
Note that wj (t + s) is the same for all j, so that, consistent with the symmetry assumption above,
xj (t) is also the same for all j. We can also use m (t + s) = p (t + s) + y (t + s) to substitute out
y (t + s), rearrange slightly, and obtain:


wj (t + s) = 1 − σ − ψ −1 p (t + s) + σ + ψ −1 m (t + s) − εψ −1 x (t + s) .
Substituting the expression above in the first-order condition for the firm’s problem (equation
4) and rearranging yields:
R ∞
x (t) = Et
where α =

0


e−ρs (1 − G (s)) [αm (t + s) + (1 − α) p (t + s)] ds
R∞
,
−ρs (1 − G (s)) ds
0 e

(5)

σ+ψ −1
.
1+εψ −1

According to equation (5), the model implies strategic neutrality in price setting if α = 1. This
means that the marginal cost of production for a given firm and, therefore, its desired price, only
depends on the exogenous process m (t + s) and not on decisions made by other firms. This requires
specific constellations of primitive parameters such as, for example, σ = 1 and ψ → ∞ (log utility in
consumption and linear disutility of labor). More generally, pricing decisions will be either strategic
substitutes or strategic complements. If α < 1, there is strategic complementarity in price setting,
meaning that firms will choose prices close to what they expect the aggregate price level to be.
With α > 1 pricing decisions are strategic substitutes.
Finally, the aggregate price level is given by:
Z

t

Λ (1 − G (t − v)) x (v) dv.

p (t) =

(6)

−∞

2.1

Monetary shocks

The economy starts with a constant level of nominal aggregate demand M old , with associated
pricing decisions X old , the aggregate price level P old , and constant output Y old . We analyze the
impact of a one-time, unforeseen shock to nominal aggregate demand. The shock hits the economy
at t = t0 , yielding thereafter a new path for nominal aggregate demand M new (t), and associated
paths for pricing decisions, aggregate price level, and output – respectively, X new (t), P new (t), and
Y new (t). The assumptions that price setting is purely time dependent and that price changes are
7

uniformly staggered over time allow us to set, for notational convenience, t0 = 0 without loss of
generality.
In log-linear terms, the ex-post path of nominal income is:
(
m (t) =

mold , if t < 0,
mnew (t) , if t ≥ 0.

(7)

The assumption of a one-time unforeseen shock implies that Et [M (t + s)] = M old if t < 0 and
Et [M (t + s)] = M new (t) if t ≥ 0, and analogously for X (t), P (t) and Y (t). Thus, from this point
onward, we drop the expectations operator and use the superscripts “new” and “old” instead.

3

Selection and monetary non-neutrality

In this section we introduce a concept of selection appropriate for time-dependent models. We show
that, for a given frequency of price changes, this type of selection contains the same information
as the distribution of price durations, so that a characterization of how selection evolves over time
provides a complete description of nominal rigidities in the model. Finally, we show how selection
affects monetary non-neutrality, with lower selection for prices set before the shock being associated
with higher real effects of nominal shocks.

3.1

Selection

In statistics, there is a selection bias if a sample is not a random draw from the population. In that
case, sample moments provide biased estimates of population moments. By analogy, the prices
being reset at a given point in time are a sample of the population encompassing all existing prices.
As a measure of selection bias, we focus on the fraction of prices set before the shock (“old prices”)
being reset at t, as compared to the corresponding fraction of old prices in the population still in
place at t.
Because the distribution of the duration of price spells, G, is time-invariant, at any time t ≥ 0,
the fraction of old prices among changing prices is equal to 1 − G (t) – which is the probability that
Rt
a price survives for t or longer. In turn, 1 − ω (t) ≡ 1 − 0 Λ (1 − G (s)) ds is the fraction of old
prices in the population at time t. In this context, we say there is positive selection for old prices if
1 − G (t) > 1 − ω (t) and negative selection otherwise. This suggests a natural measure of selection
for old prices at each point in time after a shock.
Definition 1. For all t such that ω (t) < 1, selection (at t), denoted by µ (t), is defined as
µ (t) ≡

1 − G (t)
− 1,
1 − ω (t)
8

and for t such that ω (t) = 1,

µ (t) = 0.
The extension of the definition for the cases in which ω (t) = 1 is natural, since with ω (t) = 1
all adjusting prices as well as all prices in the population are set after the shock (that is, they are
all “new”). Hence, the “sample” of prices which can adjust at any point in time has the same
composition as the population and there is no selection bias.
The state of the economy at any t ≥ 0 is a function of the history of selection for old prices
starting at the time of the shock. To capture this history, we also employ a related measure,which
emphasizes not selection at a given point in time, but cumulative selection since the shock hit:
Definition 2. Cumulative selection (at t), denoted by Ξ (t), is defined as
t

Z
Ξ (t) ≡

µ (s) ds.
0

We refer loosely to selection for old prices in economy A being stronger than in economy B if
either µA (t) > µB (t) ∀t and/or ΞA (t) > ΞB (t) ∀t. It is easy to see that the first ordering implies
the second, but that the converse is not necessarily true.
We now proceed to show how the population of old prices at any point in time is determined
by the history of selection up to that point. After the monetary shock hits, the pool of new prices
ω (t) increases as firms a) have the opportunity to change prices (this is given by the frequency of
price changes, Λ) and b) are doing so for the first time after the shock (this applies to a fraction
1 − G (t) of price changers). Therefore:
∂ω (t)
= Λ (1 − G (t)) .
∂t

(8)

Solving the differential equation (8) with ω (0) = 0 as a boundary condition, and using the
definitions above yields the following:3
1 − ω (t) = e−Λt−Λ

Rt
0

µ(v)dv

= e−Λt−ΛΞ(t) .

(9)

Equation (9) suggests that, given Λ, G can be obtained from µ (and vice versa). As the following
lemma shows, this is indeed the case:4

3
4

See Lemma A.1 in appendix for a formal statement and proof.
All proofs are in the Appendix.

9

Lemma 1. Let M be the set of all functions µ : [0, ∞) → R that can be constructed using DefiR∞
nition 1. Let G Λ be the set of all functions G : [0, ∞) → [0, 1] satisfying 0 (1 − G (s)) ds = Λ−1 .
Then, there is a mapping f : M → G Λ that allows us to recover G from µ.
The last result implies that, given the frequency of price changes Λ, µ and G are equally valid
primitives for the general class of time-dependent pricing models that we consider.

3.2

Monetary non-neutrality

We now examine the link between selection and monetary non-neutrality. We focus on the case
with strategic neutrality in price setting, since it allows us to isolate the role of selection from the
well-known effects of interactions between firms’ pricing decisions. Using numerical simulations, in
Section 5 we examine whether our main results survive the presence of pricing interactions.
The effects of the shock on real output are given by:
y new (t) − y old .

(10)

We measure the degree of monetary non-neutrality by the discounted cumulative effect of the shock
on output. More specifically, our measure of non-neutrality is given by:
Z

∞

Γ=

i
h
e−ρt y new (t) − y old dt.

0

In the Appendix we show that, up to a first-order approximation, this measure is proportional to
the ex-post utility impact generated by the monetary shock. We refer to Γ generically as the real
effects of the monetary shock.
3.2.1

Level shocks

We start by analyzing the commonly used case where, following the shock, the level of nominal
income changes once and for all, that is, mnew (t) = mnew = mold + ∆m for some constant ∆m.
Apart from being a common benchmark, this case is interesting because the link between selection
and the real effects of monetary shocks is particularly transparent.
From (5) it follows immediately that:
xold = mold , xnew = mnew .
Taking into account the different price-setting decisions made before and after t = 0, we can

10

then write the evolution of the aggregate price level for t ≥ 0 as:
pnew (t) = p (t) = ω (t) mnew + (1 − ω (t)) mold ,
where ω (t) is the fraction of firms with new prices in the population (i.e., who last set their prices
after the shock).
The effects of the shock on real output are thus given by:


y new (t) − y old = mnew − pnew (t) − mold − pold = ∆m (1 − ω (t)) .
In words, the output effect at t is proportional to the size of the shock ∆m and to the fraction of
firms with old prices at t, 1 − ω (t). Thus, for a given sized shock, the real effects at t are larger if
the pool of old prices is larger.
Using (10), we can write the real effects of the shock as:
Z

∞

Γ = ∆m

e−ρt (1 − ω (t)) dt.

(11)

0

The real effects are increasing in the integral over time of the fraction of old prices in the population.
The longer the fraction of old prices in the population takes to shrink to zero after the shock, the
larger are its real effects. It follows from (9) that:
Γ
=
∆m

Z

∞

−ρt

e

Z

∞

(1 − ω (t)) dt =

0

−(ρ+Λ)t−Λ

e
0

Rt
0

µ(v)dv

Z
dv =

∞

e−(ρ+Λ)t−ΛΞ(t) dt.

(12)

0

We can thus derive the following immediate implications, summarized in the lemma below:
Lemma 2. Given Λ and strategic neutrality (α = 1), the effects of a shock to the level of nominal
income (m1 = m0 + ∆) are larger if either
1) selection, µ (t), is smaller for all t, or
2) cumulative selection, Ξ (t), is smaller for all t.
3.2.2

General shocks

One difficulty in establishing analytical results for general shocks is that, differently from the simple
case with a level shock, the cross-sectional distribution of prices is in general not concentrated on
only two values – one for old prices and one for new prices. In spite of that, we are able to handle
these cases thanks to the following proposition:5

5

Since we rely on a log-linear approximation to the model around a zero inflation steady state, these more general
shocks should not involve permanently non-zero inflation.

11

Proposition 1. Consider an economy characterized by a distribution of price spells G and strategic
neutrality (α = 1). The real impact of a monetary shock of the general form considered in equation
(7) is

∞

Z
Γ=



e−ρt (1 − ω (t)) mnew (t) − mold .

0


This proposition holds in spite of the fact that, in general, y new (t)−y old 6= (1 − ω (t)) mnew (t) − mold .
The fact that it holds is a consequence of optimality of firms’ price-setting decisions. Given strategic neutrality in price-setting, a firm j choosing its price after the shock would like to set xj (t) =
mnew (t + s) for all s, but this is impossible if mnew (t + s) varies over time. As a “compromise”,
it optimally sets xj (t) to be equal to a weighted average of mnew (t + s), with weights given by
the probability with which it expects the price to remain in place at each date t + s. For some
period of time, xj (t) will remain below mnew (t + s), and for some other period it will remain
above. Over time these differences, as weighted by the probabilities of the price remaining in
place, cancel out exactly, so that overall the real effects are the same as if the firm was able to set
pj (t + s) = mnew (t + s) for all s ≥ 0.
In the Appendix, we show that Proposition 1 can be alternatively formulated as stating that
the real effects of a monetary shock in a sticky-price model are identical to those effects in a
sticky-information model, so long as the distribution of price spells in the former is identical to the
distribution of price plans in the latter. Thus all the analytical results in this paper translate to
an equally large class of models with sticky information, as in Mankiw and Reis (2002).
As in Section 3, we can write the result in Proposition 1 in terms of cumulative selection:
Z
Γ=

∞



e−(ρ+Λ)t−ΛΞ(t) mnew (t) − mold dt.

(13)

0

From equation (13), it is evident that, given Λ, so long as mnew (t) > mold for all t (or vice
versa), the monetary shock has smaller real effects in the economy with larger cumulative selection
Ξ (t) everywhere. This last result implies a generalization of Lemma 2:
Proposition 2. Consider a shock to nominal aggregate demand characterized by mnew (t) ≥ mold
for all t.
Consider the impact of the shock in two economies, A and B, characterized by distributions of
R∞
R∞
price durations GA (t) and GB (t), with 0 (1 − GA (t)) dt = 0 (1 − GB (t)) dt = Λ−1 . Then, ΓA <
ΓB if either
1) µA (t) ≥ µB (t) ∀t or,
2) ΞA (t) ≥ ΞB (t) ∀t.

12

4

Selection and the distribution of price durations

We now turn to results concerning how selection is related to different properties of the distribution
of price spells. We start by discussing the benchmark cases of Taylor and Calvo pricing. We then
revisit two topics which have been the subject of previous work: the slope of hazard functions and
ex-ante heterogeneity in price setting. Finally, we show that there is a link between selection and
the variance of price durations, and explore conditions under which this variance is an accurate
scalar measure of the real effects of monetary policy shocks.

4.1

Benchmark cases: Taylor and Calvo pricing

We start with a discussion of the two most widely used time-dependent models, which are the ones
proposed by Taylor (1979) and Calvo (1983). We show that these cases are polar opposites insofar
as selection is concerned. In particular, Taylor pricing implies maximal selection and Calvo pricing
implies no selection. Thus, the real effects of monetary shocks will be minimal under Taylor and
larger in Calvo than in any model with non-negative selection.
4.1.1

Taylor pricing

Firms set prices for a fixed period of time (given by Λ−1 ). Thus, the distribution of price durations
is degenerate at Λ−1 . This specification has been very influential in the sticky-price literature,
and, apart from Taylor (1979), it has been used in prominent papers such as Chari, Kehoe, and
McGrattan (2000).
For a given frequency of price durations Λ, we can define Taylor pricing in terms of our notation
as:

(
GT aylor (t) =

0 if t < Λ−1 ,
1 otherwise.

Under Taylor pricing, selection at time t is:
(
µT aylor (t) =

− 1 if t < Λ−1 ,
0 otherwise.

1
1−Λt

(14)

Selection is equal to zero for t ≥ Λ−1 since from that point onward the pool of old prices is
thoroughly depleted, so that 1 − ω T aylor (t) = 0. Selection is positive elsewhere. Within the range
where selection is positive, it is also maximal, since all changing prices were set before the shock.
We formalize the point in the following Lemma:
Lemma 3. Consider an arbitrary time-dependent economy with distribution of price durations
characterized by G (t) and with average frequency of price durations Λ. Let µ (t) and Ξ (t) be,
13

respectively, the corresponding selection and cumulative selection functions. Let µT aylor (t) and
ΞT aylor (t) be, respectively, the selection and cumulative selection functions for a Taylor economy
with average frequency of price changes Λ. Then µT aylor (t) ≥ µ (t) for all t < Λ−1 and ΞT aylor (t) ≥
Ξ (t) for all t.
Given Proposition 2, it follows immediately that, for a given Λ, Taylor pricing implies the
smallest real effects among all time-dependent pricing models.6
4.1.2

Calvo pricing

A further leading example of time-dependent pricing used in the literature is the one proposed
by Calvo (1983), which is the key building block of the canonical New Keynesian model. In this
setting, the probability of a given firm changing its price over any given period of time does not
depend on the time elapsed since it last adjusted. This implies an exponential decay of the survival
probability of a price.
In terms of our notation, we can denote the cumulative distribution of price durations under
Calvo as:
GCalvo (t) = 1 − e−Λt .
It is easy to verify that:
ω Calvo (t) = 1 − e−Λt ,
so that selection is given by:
µCalvo (t) =

e−Λt
− 1 = 0.
e−Λt

Thus, under Calvo pricing there is no selection. In other words, price changing firms are a representative draw from the population.

4.2

Hazard functions

The empirical literature on price-setting has devoted substantial effort to estimating the shape of
the hazard function of price adjustment.7 The motivation is that, at least since the work of Dotsey,
King, and Wolman (1997, 1999), and Wolman (1999), it has been clear that the shape of the hazard
function matters for the real effects of monetary shocks.

6

Vavra (2010) provides a different proof of the fact that the real effects under Taylor are minimum. Without
linking it to selection, we first proved that result in Carvalho and Schwartzman (2008).
7
For a recent review of this literature, see Klenow and Malin (2010, section 5.3).

14

Assuming G is differentiable, the hazard function can be defined as:
h (s) =

∂G(s)
∂t

1 − G (s)

.

We start by showing that the concept of selection and hazard functions are closely related. Specifically, the following holds:
Lemma 4. Let µ and h be, respectively, the selection function and hazard function associated with
a differentiable c.d.f. G. Let t1 ∈ [0, ∞) be the smallest value of t such that ω (t1 ) = 1. Then, for
t < t1 ,
Z

t1

µ (t) =
t

h (s)
Ψt (s) ds − 1,
Λ

(15)

where
Ψt (s) ≡ R t1
t

1 − G (s)
(1 − G (v)) dv

.

is the density of prices of age s among all prices older than t.
Thus, up to a constant, selection at t is proportional to a weighted average of the hazard function
evaluated at t and later. The intuition is as follows: Selection at t is tied to the probability of prices
set before time 0 changing at t. Given stationarity, this is equivalent to the probability of prices
changing at age t or afterwards. Since the hazard function is the continuous-time analogue of the
probability of a price changing at a given age, conditional on it having survived up to that age,
selection at date t can be obtained from integrating the hazard function from t to infinity using
Ψt (s) as weights. The normalization by the average frequency of price changes Λ reflects the
fact that, unlike the hazard function, selection does not depend on the average frequency of price
changes.
From (15), we can show that if a hazard function is strictly increasing, then there is positive
selection at all t:
Lemma 5. For a given distribution of price durations G (t), consider the corresponding hazard
h (t) =

∂G(t)
∂t / (1

− G (t)) and selection µ (t) =

1−G(t)
1−ω(t)

− 1 functions. If h (t0 ) > h (t) for all t0 > t,

then µ (t) > 0 for all t > 0.
The result is intuitive. An increasing hazard function implies positive selection, since the probability of a price change increases with the age of the price. An immediate implication is that any
economy featuring an increasing hazard of price adjustment will feature higher selection than an
economy featuring Calvo pricing. The Lemma thus verifies the intuition spelled out by Wolman
(1999) for the reason why, as compared to Calvo pricing, increasing hazard functions are associated
with smaller real effects of monetary shocks.
15

The general intuition behind Lemma 5 extends to the comparison of two hazard functions. In
this case, the c.d.f.’s can be ranked in terms of the associated cumulative selection. Given two
economies, one with a more increasing hazard function than the other, the economy with the more
increasing hazard function features higher cumulative selection and lower monetary non-neutrality:8
Proposition 3. For two economies A and B with the same average frequency of price changes
(ΛA = ΛB ) and for which the relevant moments and derivatives are defined, if either
1) there is a single crossing at some t∗ so that hA (t) ≥ hB (t) for t ≤ t∗ and hA (t) < hB (t) for
t > t∗ ,or
2)

∂hA (t)
∂t

<

∂hB (t)
∂t

∀t,

Then ΞA (t) < ΞB (t) ∀t.
Given Proposition 3, it follows immediately from Proposition 2 that monetary shocks are associated with smaller real effects in economies in which the hazard function increases more quickly.

4.3

Heterogeneity in price stickiness

In this section we show that selection effects also shed light on, and allow us to generalize the
findings in Carvalho (2006), that a one-sector model calibrated to the average frequency of price
changes is likely to understate the real effects of nominal shocks relative to a model with crosssectoral heterogeneity in price stickiness. These findings are of particular importance because,
as documented by Bils and Klenow (2004) and others, there is substantial heterogeneity in the
frequency of price changes.
Consider a heterogeneous sticky-price economy with K sectors indexed by k, each with a measure Φk of firms and sector-specific distribution of price-durations Gk (t).9 For notational convenience, we use E [·] to denote cross-sectoral weighted averages:
E [xk ] ≡

K
X

Φk xk .

(16)

k=1

The price level in the heterogeneous economy is:
p (t) = E [pk (t)] ,

8

The result is actually stronger than this, as all that is required is a single-crossing condition on the two hazards
(see the proof in the Appendix).
9
For brevity we do not specify the whole multisector model here, and borrow the required log-linear equations
directly from Carvalho and Schwartzman (2008).

16

where pk (t) is the price level in sector k. These sectoral price levels are aggregates of past pricing
decisions:

t

Z

Λk [1 − Gk (t − s)] xk (s) ds,

pk (t) =
−∞

where Λk ≡

R ∞
0

−1
(1 − Gk (s)) ds
is the average frequency of price changes in sector k.

Definition 1 for selection does not apply to the heterogeneous economy, but it is possible to
construct a natural extension. First, if the fraction of new prices in sector k at time t is ωk (t), it
follows that the fraction of new prices in the economy as a whole (which we denote by ω het (t)) is
just the average of the fraction of new prices across sectors:
ω het (t) = E [ωk (t)] .
Calculating the fraction of new prices among changing prices is slightly more involved. Here,
we have to take into account that the mass of prices changing in a given sector at any given interval
dt is given by Φk Λk dt – the mass of firms in the sector, Φk , multiplied by the frequency of price
changes Λk and by the length of time dt. If we denote the economy-wide fraction of new prices
among changing prices at time t by Ghet (t), then:
G

het




Λk
(t) = E
Gk (t) .
E [Λk ]

We can now generalize Definition 1 to the heterogeneous economy:
Definition 3. For all t such that E [ωk (t)] < 1, selection (at t), denoted by µhet (t), is defined as
µhet (t) ≡

1 − Ghet (t)
− 1.
1 − ω het (t)

(17)

For t such that ω het (t) = 1,
µhet (t) = 0.
Definition 2 also generalizes to the heterogeneous economy in the natural way, so that Ξhet (t) =
Rt
0

µhet (s) ds. Given those definitions, it is possible to extend Proposition 2 to heterogeneous

economies:
Proposition 2’. Consider the real effects of a shock to nominal aggregate demand given by mnew (t) ≥
mold for all t in two economies, A and B, characterized by sector specific distribution of price du
 B
 A KA
 B K
KA
KB
rations GA
k (t) k=1 and Gk (t) k=1 and by sectoral weights Φk k=1 and Φk k=1 . Suppose,
moreover, that the cross-sectoral average of the frequencies of price changes in both economies is
 
 B
the same, that is, E ΛA
k = E Λk . Then, ΓA < ΓB if either
17

het
1) µhet
A (t) ≥ µB (t) ∀t, or
het
2) Ξhet
A (t) ≥ ΞB (t) ∀t.

We are now ready to show the role of selection in generating the results in Carvalho (2006).
In Carvalho all sectors feature Calvo pricing, with different hazards of price adjustment. This is
a particular example of economies where the relevant source of heterogeneity across sectors in the
distribution of price durations is summarized by a single sector-specific scaling parameter.
R∞
0

Specifically, let the c.d.f. of price durations in sector k be given by Gk (t) = Ḡ (Λk t), with

1 − Ḡ (t) dt = 1. Note that Ḡ is a generic c.d.f. common to all sectors, but that the average

frequency of price change in sector k is equal to Λk .
Given this parametrization of the heterogeneous economy, we can compare it to a counterfactual
one-sector economy with c.d.f. of price durations Ḡ (E [Λk ] t), defined below:
Definition 4. Consider a multisector economy characterized by sector specific distribution of price


R∞
K
durations Ḡ (Λk t) k=1 , where 0 1 − Ḡ (t) dt = 1, and sectoral weights Φk . The counterfactual
one-sector economy is an economy with one sector and c.d.f. of price durations given by Ḡ (E [Λk ] t) .
The following proposition compares the cumulative selection function in both economies:
Proposition 4. Let Ξhet (t) denote cumulative selection of a multisector economy characterized


R∞
K
by the sectoral c.d.f.’s of price durations Ḡ (Λk t) k=1 , where 0 1 − Ḡ (t) dt = 1, and sectoral
weights Φk , and let Ξcount (t) denote cumulative selection of its counterfactual one-sector economy.
Then,
Ξhet (t) < Ξcount (t) ∀t.

(18)

Thus, cumulative selection in the multisector economy is always smaller than in the counterfactual one-sector economy. It follows immediately from Proposition 2’ that a shock to nominal
aggregate demand in the multisector economy has larger real effects than in the counterfactual
one-sector economy.
The intuition for Proposition 4 is easiest to understand in the case considered by Carvalho
(2006), where the hazard of price adjustment is constant within each sector, as in Calvo (1983). In
this economy, Ḡ (t) = 1 − e−t , so that:
Gk (t) = 1 − e−Λk t , ωk (t) = 1 − e−Λk t .

(19)

Each sector features Calvo pricing so that within each sector there is no selection. This, however,

18

is not true in the aggregate. We can check that selection in the heterogeneous economy is negative:

µhet (t) =

E [Λk e−Λk t ]
E[Λk ]
E [e−Λk t ]

−1=

cov (Λk , 1 − ωk (t))
< 0,
E [Λk ] E [1 − ωk (t)]

(20)

where cov denotes the cross-sectional covariance given sectoral weights Φk .
The covariance term in equation (20) neatly summarizes the intuition behind the main result
in this section. In the heterogeneous economy, price changes are disproportionately selected from
sectors with high frequency of price changes (high Λk ). However, exactly because these sectors
have a high frequency of price changes, they have a smaller fraction of old prices (low 1 − ωk (t)).
Therefore, cov (Λk , 1 − ωk (t)) < 0 and selection is negative. In contrast, the counterfactual onesector economy is just a Calvo economy, so that selection is zero. Thus, the heterogeneous economy
features lower selection than its counterfactual one-sector counterpart, and higher real effects of
monetary shocks.

4.4

The variance of price durations

For economists trying to calibrate time-dependent sticky-price models, the results presented so far
may seem a bit discouraging. They imply that the average frequency of price changes is far from
a being a sufficient statistic for the real effects of nominal shocks. Rather, they suggest that one
cannot do without the whole distribution of price durations, since it is the shape of that distribution
that determines selection.10
Our next results show that it may not be necessary to account for the entire distribution of
price durations. They make the case that, for a given frequency of price changes, the variance of
price durations may be a good scalar metric of selection effects and, in some particular cases, a
sufficient statistic.
As a first step, we compare selection among two distributions of price durations where one is a
mean preserving spread of the other. Proposition 5 states the result:
Proposition 5. Consider two economies, A and B, characterized by the distribution of price spells
GA and GB , where GA is obtained from a mean preserving spread of GB . Then, ΞA (t) ≤ ΞB (t)
∀t. Conversely, if ΞA (t) ≤ ΞB (t) and ΛA = ΛB , then GA is a mean preserving spread of GB .
Thus, if we restrict ourselves to comparing economies that can be ordered in terms of cumulative
selection, the variance of price durations is a sufficient statistic for that ordering. Given that
10

For a model calibrated with microeconomic estimates of the full distribution of the duration of price spells, see
Vavra (2010). As shown in that paper, an alternative to our approach is to consider the distribution of remaining
durations of prices in place. Given stationarity, one is just a transformation of the other.

19

restriction, in the context of Proposition 5, if the variance of price durations in economy A is
higher than in economy B, then, selection is lower in A than in B.
For a given frequency of price changes, the variance of price durations is, furthermore, a sufficient
statistic for the real effects of nominal disturbances in the case of shocks to the level of nominal
income discussed in Section 3.2.1:11
Proposition 6. Suppose an economy is characterized by a distribution of price spells G with finite
mean and variance given by Λ−1 and σ 2 . The real effects of a permanent level shock to nominal
aggregate demand of size ∆m satisfy

Γ
1 −1
=
Λ + Λσ 2 .
ρ→0 ∆m
2
lim

(21)

Note that, unlike Proposition 5, Proposition 6 does not require the economies under comparison
to be ordered by degree of selection. In that sense, it applies more broadly than previous results
that hinged on an ordering by cumulative selection.
Furthermore, the Proposition presents a closed-form expression for the real effects of the monetary shock. As an example, it allows us to easily calculate the real effects of a level shock under
Taylor and Calvo pricing. They are given by

Λ−1
2 ∆m

for Taylor and Λ−1 ∆m for Calvo. Hence, the

real effects of a level shock are twice as large under Calvo pricing than under Taylor pricing.
The variance is not a sufficient statistic for more complicated shocks. Proposition 6 is a special
case of Proposition 6’, which applies to any shock whose impulse response function can be well
approximated by a polynomial function.
Proposition 6’. Suppose an economy is characterized by a distribution of price spells G with finite
moments of order between 1 and K + 1. Let the random variable τ be the realized duration of price
P
k−1 are:
spells. The real effects of a monetary shock characterized by mnew (t) − mold = K
k=1 ak t
lim Γ =

ρ→0

K
X
k=1



E τ k+1
ak
.
k (k + 1) E [τ ]

The proposition states that the number of (uncentered) moments necessary to characterize
the real effects of a monetary shock increases with the number of polynomial terms necessary to
approximate the new trajectory of nominal income. For example, consider the effects of permanent
shocks to the growth rate of nominal aggregate demand. Such shocks are relevant for periods of

11

This is not the case more generally because orderings by variance do not imply an ordering by mean preserving
spreads.

20

disinflation such as the early 80s in the United States. The growth rate shock is:12
(
m (t) =

mold , t < 0,
mold + bt, t ≥ 0.

(22)

In that case, the real effects of the shock are given by:
lim Γ =

ρ→0


b −2
Λ + 3σ 2 + Λησ 3 ,
6

where σ 2 is the variance of price durations and η is the skewness. It is, again, straightforward to
calculate the real effects of this kind of shock under Taylor and Calvo pricing. They are, respectively,
Λ−1
6 b

and Λ−1 b, so that the real effects of the shock are 6 times as large under Calvo than under

Taylor.

5

Interactions in pricing decisions

The analytical results presented above hold under strategic neutrality in price setting. In this
section we perform some numerical exercises to assess whether our main results extend to more
general cases.
We consider the real effects of a shock across different sticky-price economies indexed by T and
characterized by the following family of survival functions:
(
1 − GT (t) =
with θ such that:
Z

1 − e−θt if t < T,
0 if t ≥ T,

(23)

∞


1 − GT (t) dt = D for all T .

(24)

0

That is, for different values of T , we adjust θ to ensure that the average duration of price spells
equals D. We take the unit of time to be a quarter and set D = 2, so that the average price-spell
lasts 2 quarters in all economies.
This family includes the two leading cases of constant duration (Taylor, 1979) and constant
hazard (Calvo, 1983). The first obtains if T = 2 and θ = 0. Calvo pricing obtains with T → ∞
and θ → 21 .
For any T 00 > T 0 , it is easy to check that the survival function parameterized by T 00 is a mean
preserving spread of the survival function parameterized by T 0 .13 Thus, from Proposition 5 it
12

While this shock involves permanently non-zero inflation, Carvalho (2008, Appendix A.6) shows that, as long
as the discount rate (ρ) is not strictly equal to zero, our result is a good approximation for temporary but highly
persistent shocks to the growth rate of nominal income, so that inflation converges slowly back to zero.
Rt
Rt
00
0
13
In particular, it is straightforward to verify that for any T 00 and T 0 with T 00 > T 0 , 0 GT (s) ds ≥ 0 GT (s) ds

21

follows that, as T increases, cumulative selection decreases.
To perform the simulations, we consider the discrete-time analogue of the model in Section 2.
The discrete-time analogue of the family of survival functions described in equations (23) and (24)
is:

(
1−

with θ such that:

∞
X

GTt

=

θt if t < T,
0 if t ≥ T.

(1 − Gt ) dt = 2 for all T .

t=0

The reset price chosen by all firms adjusting in period t is:
xt =

X

(1 − Gt ) [αmt + (1 − α)pt ] ,

(25)

where α determines whether pricing decisions are strategic complements or strategic substitutes.
In our experiments we compare results with α = 1 (strategic neutrality), α = 1/3 (strategic
complementarity) and α = 3 (strategic substitutability).14
In order to perform the numerical exercises, we also need to parameterize the shock process.
We follow Mankiw and Reis (2002) and consider a process that is mean reverting in the growth
rate of nominal aggregate demand:
∆mt = 0.5∆mt−1 + t .

(26)

For given T , equations (25), (26) and the discrete-time analogue of the aggregate price-level equation
(6) define a standard linear rational-expectations model in {pt , mt , xt }, which we solve using Dynare.
Figure 1 shows the log cumulative real effects of monetary shocks described in equation (26)
for different levels of strategic interactions, and different T ’s. As expected, the real effects are, for
a given T , largest under strategic complementarity (α = 13 ) and smallest under strategic substitutability (α = 3). Furthermore, for given α, they also increase noticeably as selection decreases
(T increases).
Lastly, note that moving from a model with Taylor pricing (T = 2 and θ = 0) to one approaching
a constant hazard of price adjustment (T = 20 and θ = 0.4998) implies an increase in the real
effects by a factor of approximately two under strategic neutrality (α = 1). This is close to
what is implied by the analytical result in Proposition 6, even though the nominal income process
in equation (26) does not imply instantaneous level shifts as assumed in the proposition. With
for all t.
σ+ψ −1
14
Recall that α = 1+θψ
−1 . The parametrization with strategic complementarities obtains if σ = 1, ψ = 1, and
θ = 5. The parametrization with strategic substitutability obtains if σ = 3, ψ → ∞, and any θ.

22

strategic complementarities, real effects increase by a factor greater than two, whereas with strategic
substitutability, they increase by a factor smaller than two.

2

1.8

α = 1/3

Cumulative Real Effect

1.6

α=1
α=3

1.4

1.2

1

0.8

0.6

0.4

0.2

2

4

6

8

10

12

14

16

18

20

T

Figure 1: Cumulative real effect a shock to the level of nominal income: Same average duration,
different variances.
Note: Average price duration is equal to 2. Distribution of price durations follow equations (23)
and (24), see Appendix for details.

6

Summary and conclusion

We investigate the different ways in which the shape of the distribution of duration of price spells
affects the real effects of nominal aggregate demand shocks. We highlight a mechanism that so far
has barely been given attention in the literature: a selection for the time in which prices were last
adjusted. In fact, we show that selection provides a complete characterization of the distribution
of price durations in time-dependent sticky-price models.
The results in the paper suggest that a careful characterization of the distribution of price
durations is of crucial importance for the proper evaluation of the real aggregate effect of monetary
shocks. While the results are derived for the case of time-dependent pricing, there is no reason
23

why the selection effect identified here should not hold some relevance more broadly, whenever the
timing of price changes is not entirely up to the discretion of the firms. This suggests that further
research on price setting would do well to focus on models that are able to fully account for the
distribution price spells and investigating the extent to which the mechanisms emphasized here
continue to matter.

24

References
[1] Alvarez, F., F. Lippi, and L. Paciello (2012), “Monetary Shocks in a Model with Inattentive
Producers,” mimeo.
[2] Alvarez, F., H. Le Bihan and F. Lippi (2014), “Small and Large Price Changes and the
Propagation of Monetary Shocks”, NBER Working Paper 20155.
[3] Bomono, M. and C. Carvalho (2004) “Endogenous time-dependent rules and inflation inertia.”
Journal of Money, Credit and Banking: Vol. 36, N. 6, pp. 1015-1041.
[4] Benigno, P. and M. Woodford (2005), “Inflation Stabilization and Welfare: The Case of a
Distorted Steady State,” Journal of the European Economic Association 3: 1185-1236.
[5] Bils, M. and P. Klenow (2004), “Some Evidence on the Importance of Sticky Prices,” Journal
of Political Economy 112: 947-985.
[6] Caballero, R. (1989), “Time dependent rules, aggregate stickiness and information externalities,” Columbia University Working Paper.
[7] Calvo, G. (1983), “Staggered Prices in a Utility Maximizing Framework,” Journal of Monetary
Economics 12: 383-98.
[8] Caplin A. and D. Spulber (1987), “Menu Costs and the Neutrality of Monery,” Quarterly
Journal of Economics 102: 703-726.
[9] Carvalho, C. (2006), “Heterogeneity in Price Stickiness and the Real Effects of Monetary
Shocks,” Frontiers of Macroeconomics: Vol. 2 : Iss. 1, Article 1.
[10]

(2008), Heterogeneity in Price Setting, Ph.D. dissertation, Princeton University.

[11] Carvalho,
havior

and

C.

and

F.

Aggregate

Schwartzman
Dynamics:

(2008),
Some

“Heterogeneous

General

Results,”

Price
mimeo

Setting

Be-

available

at

http://cvianac.googlepages.com/papers.
[12] Chari, V.V., P. Kehoe and E. McGrattan (2000), “Sticky price models of the business cycle:
can the contract multiplier solve the persistence problem?” Econometrica 68.5: 1151-1179.
[13] Dotsey, M., R. King and A. Wolman (1997), “State-Dependent Pricing and Dynamics of
Business Cycle” Federal Reserve Bank of Richmond Working Paper Series, No. 97-2.

25

[14]

(1999), “State-Dependent Pricing and the General Equilibrium Dynamics of
Money and Output,” Quarterly Journal of Economics 114: 655-690.

[15] Kiley, M. (2002), “Partial Adjustment and Staggered Price Setting,” Journal of Money, Banking and Credit 34: 283-298.
[16] Klenow, P., and O. Kryvtsov (2008), “State-Dependent or Time-Dependent Pricing: Does It
Matter for Recent U.S. Inflation?” Quarterly Journal of Economics 123: 863-904.
[17] Klenow, P. and B. Malin (2010), “Microeconomic Evidence on Price Setting,” Handbook of
Monetary Economics, Ed. 1, vol. 16(6): 231 - 284.
[18] Mankiw, G. and R. Reis (2002), “Sticky Information Versus Sticky Prices: A Proposal to
Replace the New Keynesian Phillips Curve,” Quarterly Journal of Economics 117: 1295-1328.
[19] Reis, R. (2006), “Inattentive Producers”, The Review of Economic Studies 73 (3): 793-821.
[20] Sheedy (2010), “Intrinsic Inflation Persistance,” Journal of Monetary Economics 57: 10491061.
[21] Taylor, J. (1979), “Staggered Wage Setting in a Macro Model,” American Economic Review
69: 108-113.
[22] Vavra, J. (2010), “The Empirical Price Duration Distribution and Monetary NonNeutrality,”

mimeo

available

at

https://sites.google.com/site/jvavra/docs/paper10-29-

10 fordistribution.pdf?attredirects=0.
[23] Wolman, A. (1999), “Sticky Prices, Marginal Cost, and the Behavior of Inflation,” Federal
Reserve Bank of Richmond Economic Quarterly 85 (Fall 1999), 29-48.
[24] Wooodford, M. (2001), “Inflation Stabilization and Welfare,” Contributions to Macroeconomics: Vol. 2: Iss.1, Article 1.
[25]

(2003), Interest and Prices: Foundations of a Theory of Monetary Policy,
Princeton University Press.

[26]

(2014), “ A Note on Increasing Hazard Functions and the Monetary Transmission Mechanism”, Macroeconomic Dynamics, Forthcoming.

26

A

Appendix

A.1

A welfare-based measure of real effects

We measure the degree of monetary non-neutrality by the discounted cumulative effect of the shock
on the output gap:

∞

Z
Γ=

e−ρt [y1 (t) − y0 ] dt,

0

where ρ is the discount rate and y0 is the counter-factual path for output that would have held if the
shock had never happened. Here we show that, up to a first-order approximation, Γ is proportional
to the total impact of the shock on the representative agent’s utility. To see this, recall that the
utility function is:
Z

"

∞

e

U (t) =

−ρs

t

1
1+ ψ

R1

C (t + s)1−σ
−
1−σ

Lj (t + s)
1 + ψ1

0

dj

#
,

where
Z

1

C (t + s) ≡

Cj (t + s)

ε−1
ε

ε
 ε−1

dj

.

0

Taking the total derivative of U (t) with respect to ln (Cj (t + s)) and ln (Lj (t + s)) yields the
first-order approximation:
Z

∞

−ρs

e

u (t) =


C

1−σ

1

Z

cj (t + s) dj − L

1
1+ ψ

0

t

Z

1


lj (t + s) dj dt,

0

where u (t) is the linear approximation of the utility function as measured from t = 0 onward,
C is steady-state consumption, L is steady-state aggregate hours worked and, as before, cj (t + s)
and lj (t + s) are log-linear deviations from steady state of consumption of different varieties and
employment at different firms, respectively.
In equilibrium, cj (t + s) = lj (t + s) = yj (t + s). Also, up to a first-order approximation,
R1
y (t) = 0 yj (t + s) dj. Finally, in steady state, C = L = Y . Hence,


u (t) = Y

1−σ

−Y

1
1+ ψ

Z

∞

e−ρs y (t + s) dt.

t

When comparing the utility under two different trajectories for nominal aggregate demand we
can write:
u1 (0) − u0 (0) =



=



Y 1−σ − Y

1
1+ ψ

Z

1
1+ ψ



0

Y

1−σ

−Y
27

Γ.

∞

e−ρt [y1 (t) − y0 ] ,

Thus, the utility impact of the shock is, up to a first-order approximation, proportional to Γ.
It remains to prove that the first term, in brackets, is greater than zero, to be sure the effect goes
in the right direction.
Optimal price-setting in the zero inflation steady state implies that:
P =

1
ε
ε
ε
W =
P L ψ Cσ =
PY
ε−1
ε−1
ε−1

1
+σ
ψ

,

where the second equation follows from the optimality condition for labor supply (equation 1) and
the third equality follows from the production technology, the goods market clearing condition and
the assumption of a symmetric steady-state equilibrium. It follows that:

 1
1 σ+ ψ1
Y = 1−
,
ε
and, since ε > 1, σ > 0 and ψ > 0,
Y 1−σ − Y

1
1+ ψ

> 0.

At this point it is important to pause for a comparison with the literature, in particular Woodford (2001) and Benigno and Woodford (2005). First, in contrast to Woodford (2001), we do not
include in the model a subsidy that undoes the monopolistic distortion. This generates a first-order
positive utility impact of a surprise increase in nominal aggregate demand. The reason is that, by
surprising firms, the nominal shock reduces their markup leading to more efficient production.
The second point is that, in contrast with both Woodford (2001) and Benigno and Woodford
(2005), we only approximate the utility function up to first order. By doing this we miss the
component of the welfare cost of inflation most emphasized in these papers: the ensuing dispersion
in prices that distorts allocation across varieties of products. Thus, our results do not speak to the
impact of nominal aggregate demand shocks on this component of welfare.
Lastly, we do not mean to imply that we have a superior welfare criterion to analyze optimal
policy than Woodford (2001) or Benigno and Woodford (2005), or that our results suggest that
surprise positive nominal shocks are a commendable policy. In particular, Benigno and Woodford
(2005) show that even in the presence of monopolistic distortions, for most reasonable parameterizations, optimal policy under commitment should concern itself primarily with price stabilization
and should not attempt to undo the monopolistic distortion by surprising firms. Still, we contend
that to the extent that monetary surprises do take place, it is important to understand how they
impact household’s welfare, and Γ provides a useful measure of this impact.

28

A.2

The sticky-information model

We now describe a sticky-information model analogous to the sticky-price model in the main text.
The comparison with this model plays a key role in the proof of Proposition 1 which, as we show,
can be alternatively formulated as stating that the real effects of a monetary shock in a sticky-price
model are identical to those effects in a sticky-information model, so long as the distribution of
price spells in the former is identical to the distribution of price plans in the latter. One immediate
implication of the proposition thus formulated is that all the results in the paper apply equally to
time-dependent variants of Mankiw and Reis (2002).
We lay out a continuous-time version of Mankiw and Reis (2002) with general survival functions
for price plans. The household and the market structure are identical to the one laid out in Section
2. Firms are also identical, except that instead of choosing prices that remain in place for a period
of time, firms choose price plans, which are insensitive to new information. In particular, under
sticky information, a firm that resets its price plan Xj (t, s) at time t solves:
Z
max Et

Xj (t)

∞

−ρ(s−t)

e


(1 − G (s)) [Xj (t, s) Yj (t + s) − Wj (t + s) Nj (t + s)] ds

0

s.t. Yj (t + s) = Nj (t + s) ,


Xj (t, s) −ε
Yj (t + s) =
Y (t + s) ,
P (t + s)
where Nj (t + s) is the amount of labor demanded by the firm, and where the demand function
already takes into account that goods market clearing implies Cj (t) = Yj (t).
Firms operate under perfect foresight, except for the fact that they do not anticipate the shock.
Given that, the first order condition implies that:
Xj (t, s) =

ε
Et [Wj (t + s)] ,
ε−1

where Et [Wj (t + s)] = Wjold (t + s) for t < 0 and Et [Wj (t + s)] = Wjnew (t + s) for t ≥ 0.
The monetary shock is specified in the same way as in Section 2. As before, we log-linearize the
model around a deterministic, zero-inflation symmetric steady state. In this log-linear environment,
the optimal reset price plan for firms that change prices at time t is (lowercase variables denote
log-deviations from the steady state):

x (t, s) = Et [wj (t + s)] .

29

(27)

A sequence of substitutions analogous to the ones laid out in Section 2 yields:
x (t, s) = Et [αm (t + s) + (1 − α) p (t + s)] ,
where α =

σ+ψ −1
.
1+εψ −1

Finally, the aggregate price level is given by:
Z

t

Λ (1 − G (t − v)) x (v, s) dv.

p (t) =
−∞

In analogy to the sticky-price model, we can partition firms into those with “old” price plans
and those with “new” price plans, depending on whether the price plan was set before or after the
shock. Under sticky information, all firms with the same information set choose the same price
path. Hence, at any given point in time, all firms with old price plans set the same price, and all
firms with new price plans also have the same price, irrespective of the path of nominal income.
It follows that, for α = 1:
p (t) = ω (t) mnew (t) + (1 − ω (t)) mold .
The real effects of a nominal shock are (using the fact that pold = mold ):
si

Γ

∞

Z
=

Z0 ∞
=




e−ρt mnew (t) − pnew (t) − mold − pold dt =


e−ρt (1 − ω (t)) mnew (t) − mold .

0

Note that Proposition 1 states that, in a sticky-price economy with strategic neutrality in

R∞
price setting, Γ = 0 e−ρt (1 − ω (t)) mnew (t) − mold . Hence, if α = 1 and the distribution of
durations of price plans in a sticky-information economy is the same as the distribution of price
spells in a sticky-price economy, Proposition 1 states that the real effects of a monetary shock in
both economies are the same. Importantly, since we can define selection in a sticky-information
economy just as in the sticky-price economy, it follows that all the analytical results in the paper
concerning the relationship between selection and real effects of nominal shocks are equally valid
in a sticky-information environment.

A.3

Proofs

Lemma A. 1. Let G (t), ω (t), µ (t) and Ξ (t) be as defined in Section 3. Then
1 − ω (t) = e−Λt−Λ

Rt
0

µ(v)dv

= e−Λt−ΛΞ(t) .

Proof of Lemma A.1. The second equation follows from the definition of cumulative selection. We
30

prove that the first equation holds.
Let t1 be the smallest t such that ω (t) = 1. Since ω (t) is monotonically increasing, ω (t) = 1
for all t ≥ t1 and ω (t) < 1 otherwise.
We consider two cases: t ∈ [0, t1 ) and t ∈ [t1 , ∞). We use a guess and verify procedure.
1) t ∈ [0, t1 ):
Rt

First, note that for t = 0, 1 − ω (t) = 1 and e−Λt−

0

µ(v)dv

= 1. Thus, we confirm that the guess

works for t = 0. We verify the guess also holds for t ∈ [0, t1 ) if we can show that the derivative of
both sides of the equation with respect to t are identical over that range.
The derivative of the left-hand-side is:
∂(1 − ω (t))
∂ω (t)
=−
= −Λ (1 − G (t)) ,
∂t
∂t
and the derivative of the right-hand-side is:
Rt

Rt
∂e−Λt−Λ 0 µ(v)dv
= −Λ (1 + µ (t)) e−Λt−Λ 0 µ(v)dv
∂t

Given our guess, e−Λt−Λ

Rt
0

µ(v)dv

= 1 − ω (t), so that

−Λ (1 + µ (t)) e−Λt−Λ
By definition, 1 + µ (t) =

1−G(t)
1−ω(t)

Rt
0

µ(v)dv

= −Λ (1 + µ (t)) (1 − ω (t)) .

so that
Rt

∂e−Λt−Λ 0 µ(v)dv
= −Λ (1 − G (t)) .
∂t
2) t ≥ t1 :
We know that 1−ω (t) = 0 for t ≥ t1 . We need to show that this also the case for e−Λt−Λ

Rt
0

µ(v)dv

.

First, note that, given the result in (1) above, we know that, taking the left-limit as t → t−
1 , we
find that limt→t− e−Λt−Λ

Rt
0

µ(v)dv

1

= limt→t− (1 − ω (t)) = 0.
1

Since 1 − ω (t) = 0 for t ≥ t1 , it remains to show that it is also the case that e−Λt−Λ,

Rt
0

µ(v)dv

=0

for all t ≥ t1 . From the proposed solution, for any t and tprime such that t > t0 , 1 − ω (t) =
0

(1 − ω (t0 )) e−Λ(t−t )−Λ

Rt

t0

µ(v)dv

. Taking the left-limit again and applying the definition of µ, so that

µ (t) = 0 for t > t1 , it follows that, for t > t1 :

31

1 − ω (t) =
=
Given that e−Λt−Λ,

Rt
0

0

Rt

lim

1 − ω t0



e−Λ(t−t )−Λ

lim

1 − ω t0



e−Λ(t−t ) = 0.

t0 →t−
1
t0 →t−
1

t0

µ(v)dv

0

= 0 for all t > t1 , it follows that limt0 →t+ e−Λt−Λ,

µ(v)dv

both the left and right limits coincide and e−Λt1 −Λ,

R t1
0

Rt
0

µ(v)dv

1

µ(v)dv

= 0. Thus,

= 0.

Proof of Lemma 1. We check that f is given by
G(t) = 1 − (1 + µ(t))e−Λt−Λ

Rt
0

µ(s)ds

∀t.

Integrating equation (8) we find that
Z
1 − ω (t) = 1 − Λ

t

(1 − G (s)) ds.
0

From the sequence of equalities in (9) it follows that
Z
1−Λ

t

(1 − G (s)) ds = 1 − ω (t) = e−Λt−Λ

Rt
0

µ(s)ds

.

0

Thus, we can write



Z t
G(t) = 1 − (1 + µ(t)) 1 − Λ
(1 − G (s)) ds .
0

Use the definition of µ (t) to substitute it out, to get, for t such that ω (t) < 1
G (t) = 1 −

!

Z t
1 − G (t)
1+
−1 1−Λ
(1 − G (s)) ds
Rt
1 − Λ 0 (1 − G (s)) ds
0

= G (t) .
For t such that ω (t) = 1,


Z t
G (t) = 1 − (1 − 1) 1 − Λ
(1 − G (s)) ds = 1

(28)

0

Rt
Rt
Since ω (t) = 1 − Λ 0 (1 − G (s)) ds, it follows that if ω (t) = 1, 0 (1 − G (s)) ds = Λ−1 . But,
R∞
by definition, Λ−1 = 0 (1 − G (s)) ds. Hence, if ω (t) = 1, G (t) = 1, as implied by equation (28).

32

Proof of Lemma 2. 1) and 2) follow from inspection of equation (12).



Proof of Proposition 1. Let Γsp mold , mnew (t) ; G and Γsi mold , mnew (t) ; G be the cumulative
real effects of an unexpected change in the path of nominal aggregate demand from m to mnew (t)
in, respectively, a sticky price as defined in the text and in a sticky information economy as defined
in Appendix A.2, with G the c.d.f that summarizes the arrival of adjustment opportunities in each
of these economies. We show that, if α = 1, Γsp (m0 (t) , m1 (t) ; G) = Γsi (m0 (t) , m1 (t) ; G) =

R∞
new (t) − mold dt, where the second equality follows from the discussion in Ap0 (1 − ω (t)) m
pendix A.2. We denote the “new” and “old” paths of the various endogenous variables in the sticky
information and sticky price economies by y si,new (t), y si,old , y sp,new (t), y sp,old , etc.
Define ∆Γ ≡ Γsi − Γsp . Then:
∞

Z
∆Γ =

Z


e−ρt y si,new (t) − y si,old dt −

Z0 ∞
0

∞

Z
−
Z ∞0
=



e−ρt y sp,new (t) − y sp,old dt =

0
−ρt

e

=

∞



new

m

(t) − p

si,new



(t) − mold − psi,old
0



dt




e−ρt mnew (t) − psp,new (t) − mold − psp,old dt =

e−ρt

h

i h
i
psp,new (t) − psp,old − psi,new (t) − psi,old dt.

0

Since mold is a constant, it follows that psi,old = psp,old = mold and
∞

Z
∆Γ =


e−ρt psp,new (t) − psi,new (t) .

0

Recall that
p

sp,new

t

Z

Λ (1 − G (t − v)) x

(t) =

sp,new

Z

Λ (1 − G (t − v)) xsp,old (v) dv

−∞

0
t

Z

0

(v) dv +

=
0

Λ (1 − G (t − v)) xsp,new (v) dv + psp,old
=
0

t

Z

Λ (1 − G (t − v)) xsp,new (v) dv + m0 ,

=
0

and, analogously,

p

si,new

Z
(t) =

t

Λ (1 − G (t − v)) xsi,new (v, t) dv + m0 ,

0

33

so that:
Z

∞

−ρt



Λ (1 − G (t − v)) xsp,new (v) − xsi,new (v, t) dv

e

∆Γ =
0

We prove that

t

Z


dt.

0

R∞
0

h
i
si,new
sp,new
(v) − x1
(v, t) dvdt = 0. First write the
0 Λ (1 − G (t − v)) x

Rt

e−ρt

integral as:
∞Z ∞

Z
0



1l (v ≤ t) e−ρt Λ (1 − G (t − v)) xsp,new (v) − xsi,new (v, t) dvdt,

0

where 1l (.) is the indicator function. Now do the substitution: v = z and t = z + w:
Z

∞Z ∞

0



e−ρ(z+w) 1l (z ≤ z + w) Λ (1 − G (w)) xsp,new (z) − xsi,new (z, z + w) dwdz

0

Note that the indicator function is always equal to 1 for all z and w ≥ 0, so we can write:
Z

∞Z ∞

0



e−ρ(z+w) Λ (1 − G (w)) xsp,new (z) − xsi,new (z, z + w) dwdz

0

Now, we show that the inner integral is equal to zero. We can take all multiplicative terms that
only depend on z out of the inner integral and rearrange it to get
Z

∞



e−ρ(z+w) Λ (1 − G (w)) xsp,new (z) − xsi,new (z, z + w) dw =
0
Z ∞
Z ∞
e−ρw (1 − G (w)) xsi,new (z, z + w) dw =
e−ρw (1 − G (w)) dw × xsp,new (z) − Λe−ρz
= Λe−ρz
0
0
"
#
R ∞ −ρw
Z ∞
si,new (z, z + w) dw
e
(1
−
G
(w))
x
−ρz
−ρw
sp,new
R∞
.
= Λe
e
(1 − G (w)) dw × x
(z) − 0
−ρ(z+w) Λ (1 − G (w)) dw
0
0 e
Optimal price-setting implies:
R∞
x

sp,new

(z) =

0

e−ρw (1 − G (w)) mnew (z + w) dw
R∞
,
−ρw (1 − G (w)) dw
0 e

xsi,new (z, z + w) = mnew (z + w) ,
so that

R∞
x

sp,new

(z) =

0

e−ρw (1 − G (w)) xsi,new (z, z + w) dw
R∞
.
−ρw (1 − G (w)) dw
0 e

It follows that
Z
0

∞Z t



Λ (1 − G (t − v)) xsp,new (v) − xsi,new (v, t) dv = 0.

0

34

Proof of Proposition 2. 1) and 2) follow from inspection of equation (13) and from the fact that 1)
implies 2).
Proof of Lemma 3. Consider any alternative pricing rule G with the average frequency of price
changes Λ. If G(t)=1, µ(t) = 0 and the result follows trivially. Otherwise, we denote the corresponding selection at t as µ (t) :
µ (t) =

1 − G (t)
− 1.
1 − ω (t)

(29)

The proof of the result follows from a comparison of numerators and denominators in (14) and (29)
for a given Λ. First, it is immediate that 1 − G (t) ≤ 1. Second, we can show that 1 − ω (t) ≥ 1 − Λt:
Z
1 − ω (t) = 1 − Λ

t

Z
(1 − G (s)) ds = 1 − Λt +

0

t

G (s) ds ≥ 1 − Λt.
0

Thus, wherever µT aylor (t) is positive, µT aylor (t) ≥ µ (t) for any G.
Cumulative selection under Taylor pricing is:
(
ΞT aylor (t) =

− t if t < Λ−1 ,
− ln(1−Λt)
Λ
∞ otherwise.

Since under Taylor pricing selection is the highest possible for t < Λ−1 , and for t ≥ Λ−1 cumulative selection is infinite, it follows that cumulative selection is the highest possible everywhere.

Proof of Lemma 4. We verify that the statement is true through a sequence of substitutions. SubRt
R 1−G(s)
stituting out Ψt (s) in the right-hand-side, we get that µ (t) = t 1 h(s)
Λ t∞ (1−G(v))dv ds − 1. From
Rt
equation (8) it is easy to verify that Λ t 1 (1 − G (v)) dv = 1 − ω (t). Since it does not depend
R t1
1
on s we can take it out of the integral to get µ (t) = 1−ω(t)
t h (s) (1 − G (s)) ds − 1. Finally,
∂G(s)
R t1 ∂G(s)
1
∂s
note that h (s) (1 − G (s)) = 1−G(s)
(1 − G (s)) = ∂G(s)
∂s , so that µ (t) = 1−ω(t) t
∂s ds − 1 =
R
∞ ∂G(s)
1−G(t)
1
∂s ds − 1 = 1−ω(t) − 1, where the second equality follows from the fact that G (t) = 1
1−ω(t) t
for t > t1 . This is exactly how µ (t) is defined in Definition 1 for ω(t) < 1.

Proof of Lemma 5. First, note that µ (0) = 0 always, since G (0) = ω (0) = 0. Second, we can
h
i
write µ (t) = EΨ0 h(s)
|s
≥
t
− 1, where the conditional expectation is taken with respect to
Λ
the probability measure Ψ0 . If the hazard function is (weakly) increasing, it follows that µ (t) =

35

EΨ0

h

h(s)
Λ |h (s)

i
≥ h (t) − 1 > 0. Also, since, by assumption, h (t) increases in t, so does µ (t). Since

µ (0) = 0, µ (t) > 0 ∀t > 0.

Proof of Proposition 3. 1) We prove this in two steps:
i) There is a unique t∗∗ > 0 such that GA (t∗∗ ) = GB (t∗∗ ) < 1, 1 − GA (t) < 1 − GB (t) if t < t∗
and 1 − GA (t) > 1 − GB (t) if t > t∗ .
First we show that a t∗∗ with GA (t∗∗ ) = GB (t∗∗ ) < 1 exists. Suppose not, then GA (t) >
GB (t) ∀t or vice versa (otherwise, since G is differentiable, it is continuous and t∗∗ must exist
by the intermediate point theorem). But this contradicts the assumption that ΛA = ΛB , since
R∞
R∞
ΛA = 0 (1 − GA (t)) dt and ΛB = 0 (1 − GB (t)).
Second, we show that t∗∗ > t∗ . Note that:
1 − GA (t) = e−

Rt

1 − GB (t) = e−

Rt

0

0

hA (s)ds

,

hB (s)ds

.

Since hA (t) > hB (t) ∀t < t∗ , it follows that
1 − GA (t) < 1 − GB (t) ∀t < t∗ .
It follows that t∗∗ > t∗ .
Let t∗∗ be the first crossing point. Since e−
1 − GA (t) − (1 − GB (t)) = e−

Rt

R t∗∗
0

hA (s)ds

= e−

R t∗∗
0

hB (s)ds

, we can write

Rt

− e− 0 hB (s)ds
 Rt

R
R t∗∗
− ∗∗ hA (s)ds
− t∗∗ hB (s)ds
= e− 0 hA (s)ds e tt
− e tt
if t > t∗∗ .
0

hA (s)ds

Now, recall that t∗∗ > t∗ , so that if t > t∗∗ , then hA (t) < hB (t). Thus, the expression in parenthesis
is strictly positive. Thus there is no crossing point to the right of t∗∗ . There is also no crossing
point to the left of t∗∗ , since in that case we could repeat the exercise above to show that t∗∗ cannot
exist. Thus, t∗∗ is unique.
ii) If there is t∗∗ so that 1 − GA (t) < 1 − GB (t) for t < t∗∗ and 1 − GA (t) > 1 − GB (t) for
t > t∗∗ , then ΞA (t) > ΞB (t) ∀t.
For t < t∗∗ it follows trivially that

Rt
0

GB (s) ds <

Rt
0

GA (s) ds ∀t with the inequality strict for

t above a certain range. For t > t∗∗ ,
∂

Rt
0

[GB (s) − GA (s)] ds
= GB (t) − GA (t) > 0.
∂t
36

This means that we can bound

Z

Rt
0

[GB (s) − GA (s)] ds above as follows:

t

∞

Z
[GB (s) − GA (s)] ds <

[GB (s) − GA (s)] ds ∀t ≥ t∗∗

0

0

Using integration by parts plus the condition that the expected values are the same implies that
the bound is zero:
Z
0

Thus

Rt

0 GB (s)ds <

∞


−1
[GB (s) − GA (s)] ds = − Λ−1
=0
B − ΛA

Rt

0 GA (s)ds ∀t. We can verify from equation (8) that ω (t) = Λ

Rt
0

(1 − G (s)) ds.

Hence, it follows that 1 − ωB (s) > 1 − ωA (s) ∀t. Since, from equation (9), 1 − ω(t) = e−Λt−ΛΞ(t) ,
it follows that ΞA (t) > ΞB (t) ∀t.
2) Suppose the two functions do not cross. The either hA (t) > hB (t) for all t or vice versa.
In the first case, we have that GA (t) = 1 − e−

Rt

hA (v)dv

> 1 − e−

Rt

hB (v)dv

= GB (t) for all t (and
R∞
vice versa in the opposite case). But both of these violate the condition that 0 (1 − GA (t)) dt =
R∞
−1
−1
0 (1 − GB (t)) dt which is necessary for ΛA = ΛB .
R t∗ A (s)
Let t∗ be a crossing point. Then, for any t < t∗ , hA (t) = hA (t∗ ) − t ∂h∂s
ds and hB (t) =
R
∗
t
∂h
(s)
∂h
(s)
∂h
(s)
B
A
B
hB (t∗ ) − t
< ∂s
and hA (t∗ ) = hB (t∗ ) it follows that hA (t) > hB (t).
∂s ds. Since
∂s
R
R t∗ B (s)
∗
t
A (s)
Likewise, for any t > t∗ , hA (t) = hA (t∗ ) + t ∂h∂s
ds and hB (t) = hB (t∗ ) + t ∂h∂s
ds so that
0

0

hB (t) > hA (t). Thus there is a single crossing point and part 1) applies.

Proof of Proposition 2’. The proposition relies on the fact that we can build a one-sector econh
i
Λk
omy characterized by G̃ (t) = E E[Λ
G
(t)
and Λ̃ = E [Λk ] in which monetary shocks have
k
k]
the same real effects as in the heterogeneous economy.15

Given α = 1, firms in each sector

do not interact with one another, so that each sector behaves as if it were a separate economy. From Proposition 1 the real impact of the shock in an economy characterized by Gk is

R∞
Rt
Γk = 0 e−ρt (1 − ωk (t)) mnew (t) − mold dt, where ωk (t) = Λk 0 (1 − Gk (t)) dt. The real impact of the shock in the multisector economy is just the cross-sectoral average of the real effects in
each sector:
het

Γ

Z
= E [Γk ] =

∞



e−ρt (1 − E [ωk (t)]) mnew (t) − mold dt.

0

We now show that the real effects of any given monetary shock in the heterogeneous economy
k Gk (t)]
. The
described above are identical to the effects in a one sector economy with G̃ (t) = E[ΛE[Λ
k]

R ∞ −ρt
real impact of the shock in that economy is Γ̃ = 0 e (1 − ω̃ (t)) mnew (t) − mold dt, where

15

Note that this is not the counterfactual one-sector economy discussed subsequently.

37

ω̃ (t) = E [Λk ]

Rt
0

1−

E[Λk Gk (t)]
E[Λk ]



Z

E [ωk (t)] = E Λk
0

t



dt. Since


Z t
E [Λk Gk (s)]
1−
(1 − Gk (s)) ds = E [Λk ]
ds ds = ω̃ (t) ,
E [Λk ]
0


it follows that Γ̃ = Γhet .
Finally, we can show that the average frequency of price changes in this one-sector economy is the
hR
i−1  E Λ R ∞ (1−G (t))dt −1
]
[ k 0
∞ E[Λk (1−Gk (t))]
k
=
=
same as in the multisector economy. It is Λ̃ = 0
dt
E[Λk ]
E[Λk ]


−1
E [Λk Λ−1
k ]
= E [Λk ].
E[Λk ]
Note that G̃ (t) = Ghet (t) defined in the text, just as ω̃ (t) = ω het (t). Thus, µhet (t) and Ξhet (t)
correspond to selection and cumulative selection for the one-sector economy with the same real
effect and the same average frequency of price changes as the multisector economy. The last step
of the proof then follows by applying Proposition 2.

Proof of Proposition 4. First, we note that for the counterfactual one-sector economy,
1 − ω count (t) = e−E[Λk ]t−Ξ
where ω count (t) ≡

Rt
0

count (t)

,


1 − Gcount (s) ds and for the heterogeneous economy,
het (t)

1 − ω het (t) = e−E[Λk ]t−Ξ

,

so that Ξcount (t) ≥ Ξhet (t) for all t so long as 1 − ω count (t) ≤ 1 − ω het (t) for all t.
Use Definition 4 and integrate both sides of equation (8) to write 1 − ω count (t):
1−ω

count

Z

∞


1 − Ḡ (E [Λk ] s) ds,

(t) = E [Λk ]
Zt ∞


= E [Λk ]
1 − Ḡ (v) E [Λk ]−1 dv,
E[Λk ]t
Z ∞

=
1 − Ḡ (v) dv,
E[Λk ]t

= 1 − ω̄ (E [Λk ] t) .
We can show that any ω(t) = Λ

Rt

0 (1

− G(s))ds is concave, i.e.:



ω λx + (1 − λ) x0 > λω (x) + (1 − λ) ω x0 .
This is trivial to see if G (t) is differentiable, since in that case
38

∂ 2 ω(t)
(∂t)2

= −Λ ∂G(t)
< 0. More
∂t

generally, given its definition, ω is concave if and only if
Z

λx+(1−λ)x0

Z

x

Z

[1 − G (s)] ds.

[1 − G (s)] ds + (1 − λ)

[1 − G (s)] ds > λ

0

0

0

x0

W.l.o.g., let x0 > x. Then, we can write
Z x0
Z x
[1 − G (s)] ds
[1 − G (s)] ds + (1 − λ)
[1 − G (s)] ds > λ
0
0
0
" R
#
"
#
R x0
λx+(1−λ)x0
[1
−
G
(s)]
ds
[1
−
G
(s)]
ds
0
0
⇐⇒ λ
> (1 − λ)
Rx
R λx+(1−λ)x
0
− 0 [1 − G (s)] ds
− 0
[1 − G (s)] ds
Z x0
Z λx+(1−λ)x0
[1 − G (s)] ds
[1 − G (s)] ds > (1 − λ)
⇐⇒ λ
Z

λx+(1−λ)x0

λx+(1−λ)x0

x

R λx+(1−λ)x0


[1 − G (s)] ds
⇐⇒ (1 − λ) x0 − x λ x
> λ (1 − λ) x0 − x
0
(1 − λ) (x − x)
R x0
R λx+(1−λ)x0
[1
−
G
(s)]
ds
λx+(1−λ)x0 [1 − G (s)] ds
>
.
⇐⇒ x
(1 − λ) (x0 − x)
λ (x0 − x)


R x0

λx+(1−λ)x0

[1 − G (s)] ds

λ (x0 − x)

We can verify that the inequality holds since G is increasing, implying that

R λx+(1−λ)x0
x

[1 − G (s)] ds
>
(1 − λ) (x0 − x)

R λx+(1−λ)x0
x


[1 − G (λx + (1 − λ) x0 )] ds
= 1 − G λx + (1 − λ) x0 ,
0
(1 − λ) (x − x)

and

R x0

λx+(1−λ)x0

R x0

[1 − G (s)] ds

λ (x0

− x)

<

λx+(1−λ)x0

[1 − G (λx + (1 − λ) x0 )] ds
λ (x0

− x)

In particular, given that any ω is concave, so is ω̄ (t) ≡ 1−

Rt
0

= 1 − G λx + (1 − λ) x0




1 − Ḡ (s) ds with Ḡ (s) as defined

in the statement of the proposition is also concave. Therefore, from Jensen’s inequality, it follows
that E [ω̄ (Λk t)] < ω̄ (E [Λk t]), and
1 − ω het (t) = 1 − E [ω̄ (Λk t)] > 1 − ω̄ (E [Λk ] t) = 1 − ω count (t) .
The last step of the proof follows from 1 − ω(t) = e−Λ−ΛΞ(t) . Since, by construction, Λhet =
Λcount = E [Λk ], and since 1 − ω het (t) > 1 − ω count (t), it follows that Ξhet (t) < Ξcount (t).

39

Proposition 6 is a special case of Proposition 6’, proved below:
Proof of Proposition 6’. Consider first a case with bounded support, i.e., there is z such that ω (t) =
1 ∀t ≥ z:
K
∞X

Z
lim Γ =

ρ→0

0

Z
=

=

(1 − ω (t)) ak tk−1 dt =

k=1
K
zX

(1 − ω (t)) ak tk−1 dt =

0 k=1
 k
K
X
z

ak

k=1

= 0+
Z
= Λ

K
X

k

(1 − ω (z)) − 0 × (1 − ω (0)) −
z k
t

ak

0 k=1


∂ (1 − ω (t))
dt =
k
∂t

0

Z

k=1
K
zX

z k
t

Z

0

ak

∂ω (t)
dt =
k ∂t

tk
(1 − G (t)) dt.
k

Then,
Z
lim Γ = Λ

ρ→0

ak

0 k=1
K
X

"
= Λ

K
zX

Z
= Λ

tk
(1 − G (t)) tdt =
k

z k+1
(1 − G (z)) − 0 × (1 − G (0)) −
ak
k (k + 1)

k=1
K
zX

ak

0 k=1
K
X

Z
0

z

#
tk+1
d (1 − G (t)) =
k (k + 1)

tk+1
dG (t) =
k (k + 1)

i
h
ak
E τ k+1 =
k (k + 1)
k=1


K
X
E τ k+1
ak
=
,
k (k + 1) E [τ ]
= Λ

k=1

where the last line follows from Λ−1 = E [τ ].
The case with unbounded support can be obtained by constructing a sequence of distribution
functions Gz (t) defined as:
Gz (t) =

G (t)
1l (t ≤ z) + 1l (t ≥ z) ,
G (z)

40

with associated Λz =

R ∞
0

(1 − Gz (t)) dt

−1

. We take the limit
K
∞X

Z
lim lim Γ = lim Λz

z→∞ ρ→0

z→∞

0

ak

k=1

tk
(1 − Gz (t)) dt.
k

We then use Lebesgue’s dominated convergence theorem (see, for example, Kolmogorov and
R∞
k
Fomin 1970) to show that this limit is equal to Λ 0 ak tk (1 − G (t)). Let {z1 , z2 , ..., zn , ...} be an
infinite sequence such that zk+1 > zk ∀k and z1 > 0. Then, for all t, limn→∞ Λzn (1 − Gzn (t)) tk =
Λ (1 − G (t)) tk . Furthermore there is a Λ̄ < ∞ such that, Λzn (1 − Gzn (t)) tk < Λ̄ (1 − G (t)) tk .
That 1 − Gzn (t) < 1 − G (t) follows trivially from the definition of Gzn . To see that such a Λ̄
−1
R ∞
R∞
and that for all n > 1, 0 (1 − Gzn (t)) dt =
exists, recall that Λzn = 0 (1 − Gzn (t)) dt




h
 i−1
R zn
R z1
R z1 
G(t)
G(t)
G(t)
1
−
1
−
1
−
.
dt
>
dt.
Thus,
it
is
enough
to
pick
Λ̄
>
0
0
0
G(z )
G(z1 )
G(z1 ) dt
R
R∞ n
∞
If 0 (1 − G (t)) tk dt < ∞ then Λ̄ 0 (1 − G (t)) tk dt < ∞ and all the conditions of the
R∞
R∞
theorem are satisfied. It follows that limn→∞ 0 (1 − Gzn (t)) tk dt = 0 (1 − G (t)) tk dt and
R ∞ PK
R ∞ PK
tk
tk
Λ 0
k=1 ak k (1 − G (t)) dt = limz→∞ Λz 0
k=1 ak k (1 − Gz (t)) dt.

A.4

Implementation of the Numerical Simulation

The numerical analysis is based on a log-linearized, discrete-time version of the model solved using
Dynare. The heart of the model is a pricing rule, dependent on α. Let xt be the reset price chosen
by firms who get to choose their prices at t in log-deviation form. The discrete time analogue of
equation (4) is

xt = Λ

∞
X

β s (1 − Gs ) [αmt+s + (1 − α) pt+s ] ds,

s=0

The price level at t is:

pt = Λ

∞
X

(1 − Gs ) xt−s

s=0

and output is

yt = mt − pt
The law of motion for mt is:

41

(30)

∆mt = 0.5∆mt−1 + εt
IRF be the impulse response function of a unit shock to ε that hits the economy at t. We
Let yt+s
t

calculate the cumulative real effect as

Γ=

∞
X

IRF
β s yt+s

s=0

This system of four equations in four variables describes how output reacts to shocks. Note
that mt is not stationary, so, as written this model is not amenable to be solved using conventional
methods. We can make the model stationary by rewriting it in terms of p̃t = pt −mt and x̃t = xt −mt .
Then we have the equivalent model:

x̃t = Λ
p̃t = Λ

∞
X
s=0
∞
X

β s (1 − Gs )

" s
X

#
∆mt+v + (1 − α) p̃t+s ds,

(31)

v=0

(1 − Gs ) x̃t−s −

s
X

!
∆mt−s+v

(32)

v=0

s=0

yt = −p̃t

(33)

∆mt = 0.5∆mt−1 + εt

(34)

This model as written is still not solvable in Dynare because it involves infinite sums, even if
convergent. We solve it for cases where there is some J such that Gs = 1 for all s > J. In those
cases the state space becomes finite.
For the truncated exponentials case, we have that, given the maximum duration T and the
decay parameter θ, the average duration of price-spells is:

1
E [τ ] = 1 − (1 − θ)T −1
+ (1 − θ)T −1
θ
For any T , we look for θ such that E [τ ] = 2. We consider T = {2, ..., 20}. Table (1) gives the
corresponding θ’s, together with the mean, variance and skewness of price durations for some of
these cases. Note that the variance is increasing in T .

42

Table 1: Parameters and Moments for Truncated Exponential Models
T
2
4
6
8
10
12
14
16

θ
0.000
0.456
0.491
0.498
0.500
0.500
0.500
0.500

mean
2.000
2.000
2.000
2.000
2.000
2.000
2.000
2.000

variance
0.000
1.234
1.718
1.903
1.969
1.990
1.997
1.999

43

skewness
0.703
1.430
1.804
1.988
2.070
2.103
2.115