View original document

The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.

Federal Reserve Bank of Chicago

Mind the gap! Stylized dynamic facts
and structural models.

Fabio Canova and Filippo Ferroni

September 1, 2020
WP 2020-29
https://doi.org/10.21033/wp-2020-29
*

Working papers are not edited, and all opinions and errors are the
responsibility of the author(s). The views expressed do not necessarily
reflect the views of the Federal Reserve Bank of Chicago or the Federal
Reserve System.

Mind the gap! Stylized dynamic facts and structural models.
Fabio Canova ∗
Norwegian Business School, CAMP and CEPR
and
Filippo Ferroni
Federal Reserve Bank of Chicago
September 1, 2020

Abstract
We study what happens to identified shocks and to dynamic responses when the data generating process features q disturbances but q1 < q variables are used in an empirical model.
Identified shocks are linear combinations of current and past values of all structural disturbances
and do not necessarily combine disturbances of the same type. Theory-based restrictions may be
insufficient to obtain structural dynamics. We revisit the evidence regarding the transmission of
house price and of uncertainty shocks. We provide suggestions on how to validate the dynamics
of larger scale DSGEs with smaller scale VARs.

Key words: Deformation, state variables, dynamic responses, structural models, house price
shocks, uncertainty shocks.
JEL Classification: C32, E27, E32.

∗

We thank Giorgio Primiceri (the editor), three anonymous referees, Thomas Drechsel, Stefano Neri, Jesper Linde’,
Adrian Pagan, Marti Eichembaum, Jordi Gali, Makram el-Shagi, Juan Dolado, Jesus Gonzalo, Alex Richter and the
participants of the Bank of Finland Workshop on Empirical Macroeconomics; Norges Bank NBR spring institute;
The CREI-Bank of Canada conference: Recent Developments in Macroeconomic Modeling, Barcelona; The EABCN
conference Advances in business cycle analysis, Madrid; The third research conference: Macroeconomic modeling and
model comparison network, Frankfurt; The QMUL Workshop on structural VARs, London; The EABCN conference:
New approaches for understanding business cycles, Mannheim; the 2019 IAAE annual meetings, Cyprus and of seminars
at Henan University, UC3M, the Riksbank, the Bank of England, University of Sydney, University of Adelaide, Cardiff
University, UIUC for comments and suggestions. The views presented in this paper do not reflect those of the Federal
Reserve Bank of Chicago, or the Federal Reserve system.

1

1

Introduction

It is common in macroeconomics to collect stylized facts about the dynamic transmission of certain
identified shocks using (small scale) vector autoregressive (VAR) models and then build (larger scale)
dynamic stochastic general equilibrium (DSGE) models to explain the patterns found (see e.g. Galı́
[1999]; Iacoviello [2005], Basu and Bundick [2017] among many others).
Several authors, including Ravenna [2007], Fernández-Villaverde, Rubio-Ramı́rez, Sargent, and
Watson [2007], and Giacomini [2013] emphasized that such a matching exercise is imperfect as the
linear solution of a DSGE model has a vector autoregressive-moving average (VARMA) format. To
reduce the mismatch, the VAR should feature a large number of lags; but even a generous lag length
may be insufficient in endemic cases. When long lags can not be used due to short data, the noninvertibility or non-fundamentalness problem is typically taken care by i) simulating data from the
linear decision rules of the same length as the actual data, ii) running the same VAR on both actual
and simulated data, and iii) comparing the dynamics of the endogenous variables in the two systems
after shocks are conventionally identified (see Chari, Kehoe, and McGrattan [2005]).
In recent years, the term non-invertibility has been employed generically, to cover misspecification
problems preventing researchers to get information about theoretical quantities using a VAR. Thus,
the presence of anticipated disturbances, Leeper, Walker, and Yang [2013]; news, Forni, Gambetti,
and Sala [2018]; news and noise, Blanchard, L’Huillier, and Lorenzoni [2013]; omitted variables Kilian
and Lutkepohl [2017]; and latent variables have been listed as causing non-invertibility.
This paper studies a related mismatch problem, which may also prevent researchers to get information about the objects of interest from a VAR and could be important for deciding which theory
is consistent with the data. We call it deformation. It is an aggregation distortion and occurs when
the data generating process (DGP) features q structural disturbances, but only q1 < q variables enter
in the empirical model. We investigate two questions. Given that not all structural disturbances can
be obtained, will the innovations provide information about ”classes” of disturbances? Will they give
information about a particular disturbance? In general, the answer is negative.
Deformation makes identified shocks mongrels with little economic interpretation for two reasons.
Identified shocks are unlikely to combine structural disturbances of the same type, making it difficult
to relate, say, identified technology shocks with the TFP disturbances present in a model. Furthermore, when the empirical model is too small, shock identification requires stringent conditions, which
limit the type of disturbances one can analyze in practice. Perhaps more importantly, the shocks
one can identify will be, in general, linear combinations of current and past structural disturbances.
Thus, they will display stronger propagation than the corresponding disturbances in the DGP.
The first problem (named cross sectional deformation) emerges when the DGP is such that several
structural disturbances contemporaneously affect the variables entering the small scale empirical
model. The second problem (named time deformation) instead occurs whenever the small scale
2

empirical model is specified without paying sufficient attention to the theory used to explain the data
and it is exacerbated when the empirical model does not respect the theoretical relationship between
endogenous and state variables or alters the law of motion of the state variables. Cross sectional
deformation makes robust theoretical restrictions insufficient to obtain the structural disturbances.
Time deformation alters the information flow of the structural disturbances.
The plan of the paper After an illustrative example in section 2, to enhance the intuition and
to differentiate deformation from standard non-invertibility problems, section 3 derives the formal
results, assuming a linear state space representation for the DGP. Our focus is on general equilibrium
models, but deformation has identical implications in partial equilibrium settings, since the linear
solution of such models also has a state space representation. We provide sufficient conditions for the
identification of a ”class” or a particular disturbance, highlight the distortions when the mismatch is
due to the omission of control or state variables, and give conditions for the VAR-DSGE comparison
exercises to be valid.
Section 4 provides a constructive approach to compare a larger scale DGP and a small scale
empirical model, when one has an idea of the process that may have generated the data. With a
standard New Keynesian model as DGP, we show the problems occurring when the empirical model
is too small; how time deformation can be reduced by more explicitly linking the empirical model to
the theory; and which disturbances are more likely to be identified in different empirical systems.
Section 5 reverts the viewpoint of section 4, starts from an arbitrarily small scale empirical model,
and examines how the matching exercise is affected by disturbances potentially omitted from the
theory. We take the four variable VAR used by Iacoviello [2005] as given and compare the dynamics
induced by identified house price shocks and by preference disturbances in a model with either the
original four disturbances or the original four plus a disturbance to the borrowing constraints of
entrepreneurs, which is nowadays employed to explain the macro-financial linkages present in the
data, see e.g. Linde’ [2018]. While the dynamics induced by identified house price shocks and
preference disturbances are closely aligned in the baseline scenario, this is not the case when the
theory features five disturbances. In fact, the responses to identified house price shocks also reflect
the dynamics induced by monetary policy and the borrowing constraint disturbances.
Section 6 extends the analysis to DGPs displaying non-linear terms, such as those generated by
higher order perturbed solutions of equilibrium models, nowadays used to analyze risk or uncertainty
disturbances. We demonstrate that the results of section 3 hold unchanged, that deformation biases
are likely to be more severe, and use Basu and Bundick [2017]’s model to show them.
Section 7 concludes and provides suggestions to users who want to avoid the deformation trap in
practice. Given that deformation may be pervasive, the practice of comparing small scale VAR and
larger scale DSGE responses should be considerably refined. Showing that the pattern of responses

3

to interesting impulses is similar is insufficient for a structural model to be considered successful.
While there is no set of recommendations always applicable, carefully selected exercises, like those
discussed in sections 4 and 5, may provide information about the extent of deformation deficiencies
and the quality of the DGSE-VAR match.
Apart from using small scale VARs to validate the implication of a theory, it is popular to use
them to cross off theories inconsistent with the data see e.g. Angeletos, Collard, and Dellas [2019],
or to estimate structural parameters via response matching, see e.g. Christiano, Eichenbaum, and
Evans [2005]. With deformation, the magnitude and persistence of the responses obtained from an
identified VAR shock are generally unreliable. Thus, it is dangerous to exclude theories using, say,
the magnitude of multipliers or the share of the variance explained, or to provide policy advices
based on the structural estimates. For the exercises to be valid, one needs empirical facts that are
insensitive to deformation.
Contribution to the literature Our work is related to Canova and Hamidi Sahneh [2018], who
analyze the effects of cross sectional deformation on Granger causality tests, and to Miranda Agrippino and Ricco [2019], who examine the conditions for shock identification in SVAR-IV under partial
identificability. Early work by Blanchard and Quah [1989], Hansen and Sargent [1991], Marcet [1991],
Lutkepohl [1984], Braun and Mittnik [1991] and Faust and Leeper [1997] is also relevant as it discusses similar issues but in different settings. Some of the results we present have similar flavor as
Wolf [2018], but they are due to deformation rather than insufficient identification restrictions. Our
analysis is also linked to the large literature investigating non-invertibility (recently studied in, e.g.
Beaudry, Feve, Guay, and Portier [2016], Plagborg Moller and Wolf [2019], Pagan and Robinson
[2018], Chahrour and Jurado [2018]). In particular, it is connected to Kilian and Lutkepohl [2017]
and Forni et al. [2018], who have pointed out that rectangular systems, like those we analyze, always
generate non-invertibility.
Our contribution is to formally derive the mapping between the larger scale DGP and the smaller
scale empirical model when particular endogenous variables are absent from the empirical system;
to bring to light cases where informational sufficiency conditions may fail; and to stress that deformation issues may arise even in ideal conditions when the DGP features no news or anticipated
shocks, all theoretical quantities are observables, and the standard invertibility condition holds, but
short samples or identification convenience make applied researchers work with small scale empirical
models. Although the working paper version of Fernández-Villaverde et al. [2007] also derives a mapping between reduced form innovations and structural disturbances valid for the cases we consider,
they analyze only square systems, where the number of observable variables equal the number of
theoretical disturbances, and the ”poor man invertibility condition” they derive is valid only in that
framework.

4

2

Some intuition

Consider a simple consumption-saving problem when there are disturbances to TFP (Zt ), to the price
of investment (Vt ), and to preferences (Bt ) 1 . The representative agent maximizes:
∞
X

β t Bt U (Ct )

(1)

α
Ct + It = Ot = Zt Kt−1

(2)

max
Ct

t=1

subject to the constraints

Kt = (1 − δ)Kt−1 + Vt It

(3)

We assume that 0 < α < 1, 0 < β < 1 and that (Zt , Vt , Bt ) are iid with unitary means and standard
deviation σi , i = Z, V, B. When U (Ct ) = log Ct and δ = 1, the solution is
log Ot = α log Kt−1 + log Zt

(4)

log Ct = log(1 − αβ) + α log Kt−1 + log Bt + log Zt

(5)

log Kt = log(αβ) + α log Kt−1 + log Vt + log Zt

(6)

The theory has three endogenous variables and three disturbances (two supply (Zt , Vt ) and one
demand Bt ). In a VAR with ot = log Ot , ct = log Ct , kt = log Kt , all structural disturbances are
identifiable from the innovations using theory-based recursive restriction (zt = log Zt can be obtained
from the innovations in ot ; given zt , the other two innovations determine vt = log Vt and bt = log Bt ).
Deformation Suppose a researcher employs an empirical model with only two observables. Given
that at most two disturbances can be obtained, would she be able to identify a ”demand” and a ”supply” disturbance? Would she be able to trace out the dynamics due to the preference disturbance?
The answers depends on the variables used.
Suppose (kt , ct ) are employed. Integrating out ot (a control) from the problem, the solution of
the theory is:
kt = log(αβ) + αkt−1 + u1t

(7)

ct = log(1 − αβ) + αkt−1 + u2t

(8)

where u1t = vt + zt , u2t = bt + zt . Note that u2t mixes demand (bt ) and supply (zt ) disturbances
and that recursivity is lost. Thus, a VAR featuring (kt , ct ) exhibits cross sectional deformation,
because three structural shocks are mapped into two innovations. Here, current and past values of
1

We are grateful to Thomas Drechsel for suggesting a version of this example.

5

the observables do not provide enough information to extract a supply or the preference disturbance
because the theoretical restrictions, valid in the original three variable system, fail.
Suppose instead (ot , ct ) enters the empirical model. Integrating out kt (a state) from the problem,
the solution of the theory is:
ct = bc + αct−1 + u1t

(9)

ot = by + αot−1 + u2t

(10)

where u1t = zt + bt − αbt−1 + αvt−1 , u2t = zt + αvt−1 , and bc , by are constant. Note that omission of
kt causes two new states ct−1 , ot−1 to appear in the solution. In addition, recursivity is lost and u1t
mixes demand and supply disturbances, but now with different timing. Thus, a VAR with (ct , ot )
displays both cross sectional and time deformation. In such a system the (recursive) cross correlation
between ujt and current and lagged values of any of the structural disturbances does not go to
one, even when the number of lags goes to infinity. Thus, it is impossible to recover the relevant
disturbances using current and lagged values of observables. Because adding future values of ct , ot
does not help, the recoverability condition of Chahrour and Jurado [2018] also fails. Note that, also
in this case, theoretical motivated restrictions will not identify any structural disturbance.
Is there a two variable system which allows the identification of a supply and a demand disturbance? If the two great ratios, (kt − ot ) and (ct − ot ) are used as observables, one can recover vt , bt
from the innovations. Thus, while individual variables may not allow the identification of classes or
particular disturbances, linear combinations of observables of the original model might. This happens
in our example, because each disturbance enters the decision rule of one linear combination only.
Relationship with non-invertibility For the readers familiar with the ”invertibility” language,
one may note that the systems (7)-(8) and (9)-(10) are non-invertible in the observables, although
for different reasons. Furthermore, non-invertibility is not driven by the properties of the structural
disturbances (there is no news or anticipated disturbances) or by the intrinsic dynamics of the original
system (here α < 1), but by the scale of the empirical model. The system with great ratios is, instead,
invertible because (the history of) each combination of variables carries unique information about
one structural disturbance.
How different is deformation from traditional non-invertibility? We explicitly consider empirical
systems featuring less observables than structural disturbances (”rectangular” systems), while the
literature focuses on informational deficiencies present in systems with as many observable variables
as structural disturbances (”square” systems). (4)-(6) could be one such square system; and it is easy
to verify that with kt , ct , ot as observables, the ”poor man invertibility” condition (see FernándezVillaverde et al. [2007]) is satisfied and, as mentioned, all disturbances can be obtained from the
innovations of a VAR.
6

Deformation and omitted variables It is useful to stress that omitting variables present in the
theory does not necessarily generate deformation problems. What it is crucial for deformation is that
the omission causes a mismatch between the number of VAR variables and the number of structural
disturbances. To illustrate, consider the original consumption-saving model, but now assume that
the TFP disturbance Zt is an AR(1) with persistence ρ 2 . The solution is:
log Ot = α log Kt−1 + ρ log Zt−1 + log ezt

(11)

log Ct = log(1 − αβ) + α log Kt−1 + log Bt + ρ log Zt−1 + log etz

(12)

log Kt = log(αβ)α log Kt−1 + ρ log Zt−1 + log Vt + log ezt

(13)

log Zt = ρ log Zt−1 + log ezt

(14)

In this system there are three disturbances and four endogenous variables. Suppose that a researcher
uses a VAR with (ot , ct , kt ). It is easy to check that the ”poor man invertibility” condition holds,
despite the fact that the exogenous state zt is omitted. Moreover, when the VAR features sufficient
lags, it is possible to recover the three structural disturbances using theoretically motivated recursive
restrictions. To restate the concept differently, deformation occurs when the empirical system is not
large enough relative to the vector of structural disturbances. Omission of theory relevant variables
is neither a necessary nor a sufficient condition for deformation to emerge.
Deformation and measurement errors Although in the theory all disturbances are structural,
deformation would emerge unchanged if the theory, instead, is driven by a mixture of structural
disturbances and measurement errors. Suppose, for instance, that vt is a measurement error. Then,
a VAR with (kt , ct ) will still display cross sectional deformation and a VAR with (ct , ot ) will display
both cross sectional and time deformations. Finally, in the VAR with the two great ratios, a researcher
will be able to identify the preference disturbance (but not the TFP disturbance).
To sum up, deformation may emerge even when traditional forms of non-invertibility are absent
and it is produced by a dimensionality mismatch between the empirical model and the disturbances
of the DGP. In this situation, the variables entering the empirical system determine the informational
content of the reduced form innovations and the dimensionality mismatch problem becomes more
severe when state variables are omitted. In general, strict conditions are needed to recover a ”class”
or a particular disturbance and one needs to verify they hold for the vector of observables used. The
next section formalizes these conclusions.
2

We thank one of the referees for suggesting such an example.

7

3

The analytical results

This section derives the mapping between structural disturbances and reduced form innovations when
the empirical model contains different combinations of endogenous states and controls (propositions
1 and 2) and compares the dynamic responses in the theory with those obtained in various empirical
systems (proposition 3). We employ the generic term ”empirical system” throughout the section
because the implications we derive hold when a researcher estimates a VAR but also a state space
model. We assume that the DGP is of the form:
xt = A(θ)xt−1 + B(θ)et

(15)

yt = C(θ)xt−1 + D(θ)et

(16)

where xt is a k × 1 vector of endogenous and exogenous states, yt is a m × 1 vector of endogenous
controls, et ∼ N (0, Σ(θ)) is a q × 1 vector of disturbances, Σ(θ) a diagonal matrix and θ a vector of
structural parameters; A(θ) is k×k, B(θ) is k×q, C(θ) is m×k, D(θ) is m×q. For convenience, we let
the eigenvalues of A(θ) to be all less than one in absolute value. Thus, if there are disturbances with
permanent effects, (15)-(16) represent a properly scaled version of the process generating the data.
Predictable disturbances or news about future disturbances are not considered to leave standard
non-invertibility issues aside. While (15)-(16) are general, in our applications they are produced by
the (log-) linear solution of the optimality conditions of a structural macroeconomic model.
In general, m ≥ q and some of the endogenous variables may be latent. Hence, the variables
entering the empirical model are zt = S[xt , yt ]0 , where S is a selection matrix. Fernández-Villaverde
et al. [2007] assume S = [0, I] and consider m = q; Ravenna [2007] and Pagan and Robinson [2018]
assume that either S=I and consider m + k = q, or S = [0, I] and consider m = q. In general, S is
chosen so that the dimension of zt matches the number of structural disturbances.
The reduced form (innovation representation) corresponding to (15)-(16) is
xt = A(θ)xt−1 + Kx (θ)ut

(17)

yt = C(θ)xt−1 + Ky (θ)ut

(18)

where ut = zt − Et [zt |Ωt−1 ] is a q × 1 vector of innovations, Ωt−1 includes (at least) lags of zt , Kx (θ)
and Ky (θ) are steady state Kalman gain matrices, and for those xt and yt belonging to zt , Ki (θ) has
a row with zeros except in one position.
Given (17)-(18), the identification of the structural disturbances requires a mapping from ut into
et . When the empirical model is a VAR, Sims and Zha [2006], Plagborg Moller and Wolf [2019],
developed sufficient conditions to obtain et from current and past zt ; Chahrour and Jurado [2018]
discuss sufficientconditions
 to recover et from current, past and future zt . Here, when S = I, one
B(θ)
needs to invert
et = ut ; when S = [0, I], one needs to invert D(θ)et = ut . In both cases,
D(θ)
standard conditions apply, see Rubio Ramirez, Waggoner, and Zha [2010].
8

In the identification exercise two assumptions are implicitly made. First, there is no misspecification in (15)-(16), at least, as far as sources of disturbances are concerned, so that dim(zt ) = dim(et ).
If disturbances are left out, the identification exercises becomes problematic, even when excluded
disturbances are orthogonal to included ones, and included disturbances account for a large portion
of the variability of zt . Second, when zt = yt , and dim(zt ) = dim(et ), Ωt−1 it is typically specified to
include long lags of zt to take care of omitted states. When disturbances are left out, having a rich
Ωt−1 is generally insufficient to make the identification problem well behaved.
Three small empirical systems In our analysis dim(zt ) < dim(et ). Thus, we focus on the
situation when, say, a two variable VAR is used to collect stylized facts but the DGP features
more than two disturbances. A researcher who wants to interpret the dynamics of the small scale
empirical system may employ a theoretical model that is less complex than the DGP and may specify
only enough disturbances to match the number of empirical variables. We show that the dynamics
produced by such model may not be relevant for the comparison and omitted disturbances may
play a crucial role. To ease the notation, from now on we will omit the dependence of the reduced
form matrices A, B, C, D, Kx , Ky , Σ on the structural parameters θ, unless it creates confusion. Let
zit ≡ Si [xt , yt ]0 , where Si is a qi × q, and dim(zit ) = qi < dim(et ) = q, ∀i. We consider three Si
matrices.
• Case 1: S1 = [I, S12 ]. This choice of S generates an empirical system which retains the states
but integrates out part of the controls. The DGP in terms of z1t = [xt , y1t ]0 , y1t ≡ S12 yt is:
xt = Axt−1 + Bet

(19)

y1t = C1 xt−1 + D1 et
(20)





A 0
B
A 0
or z1t = F1 z1t−1 + G1 et , where F1 =
and G1 =
. Let F =
.
C1 0
D1
C 0
• Case 2: S2 = [S21 , S22 ]. This choice of S generates an empirical system which integrate out


part of the states and part of the controls. Let xt = (x1t , x2t ), yt = (y1t , y2t ), where (x1t , y1t ) are the
variables excluded from the empirical system. The DGP in terms of z2t = [x2t , y2t ], where x2t ≡ S21 xt ,
y2t ≡ S22 yt , is
x2t = A2 x2t−1 + B2 et + w1t−1

(21)

y2t = C2 x2t−1 + D2 et + w2t−1
(22)




A2 0
B2
or z2t = F2 z2t−1 + G2 et + wt−1 , where F2 =
and G2 =
, where w1t−1 = H2 x1t−1
C2 0
D2
and H2 = [A21 C21 ]0 . Alternatively, using (15) to separate observable and non-observable states, and
integrating x1t out, the DGP for z2t is
x2t = Ã21 x2t−1 + Ã22 x2t−2 + B̃20 et + B̃21 et−1

(23)

y2t = C̃21 x2t−1 + C̃22 x2t−2 + D̃20 et + D̃21 et−1

(24)

9

(21)-(22) point out the misspecification present using a first VAR for z2t . (23)-(24) shows that DGP
for the observables is a VARMA(2,1).
• Case 3: S3 = [S31 , 0]. This choice of S generates an empirical system which repackages the
states and eliminates the controls. The DGP in terms of z3t = x3t ≡ S31 xt is
x3t = A3 x3t−1 + B3 et + w3t−1

(25)

where w3t−1 is a function of the repackaged states. Analogously with case 2, one may write (25) as
z3t = Ā31 z3t−1 + Ā32 z3t−2 + B̄30 et + B̄31 e3t−1

(26)

The processes for zit , i = 1, 2, 3 are obtained integrating out the relevant variables from the decision
rules. They can also be equivalently obtained substituting optimality conditions into others, prior
to the computation of the decision rules. The matrices characterizing these solutions generally
differ from those obtained solving the original model and crossing out the rows corresponding to the
variables absent from zit , because not all the original states are necessarily used in the computation of
the decision rules. Section 4 provides examples of smaller scale empirical systems which correspond
to (19)-(20), (23)-(24), and (26) for a specific DGP.
The innovation representation of (15)-(16), when zit are observables is
xit = Axit−1 + K̂ix uit

(27)

yit = Cxit−1 + K̂iy uit

(28)

where uit = zit − Et [zit |Ωit−1 ] is a qi × 1 vector of innovations, K̂ix , K̂iy are steady state Kalman gain
matrices featuring some rows with zeros except in one position.
We study the mapping between uit and eit when qi < q. Given that not all disturbances can
be identified, we ask whether a researcher can recover a ”class” of disturbances or a particular
disturbance appearing in the DGP. We then examines whether the dynamic induced by identified
shocks match those in the DGP.
The mapping between innovations and structural disturbances when the empirical system eliminates theoretical controls We analyze the relationship between u1t and et , when
E[z1t |Ω1t−1 ] = F̃1 z1t−1 and thus
u1t = z1t − F̃1 z1t−1

(29)

Proposition 1 i)If F̃1 = S1 F S1∗ ≡ F1 , then u1t = λ1 et , where S1∗ is the generalized inverse of S1 ,
λ1 = S1 G depends on θ, and is a q1 × q matrix.
ii) A block diagonal G1 is sufficient to identify classes of disturbances.
iii) If G1 has at most one non-zero element in row k, one can obtain ejt , for some k and j.
10

(The proof of all the propositions is in appendix A).
As point i) indicates, when z1t is used in the VAR, the innovations u1t respect the timing protocol
of the structural disturbances et , but cross sectionally deform them because λ1 is a q1 < q matrix.
Because G1 is rectangular, one may ask when elements of the innovation vector carry enough information to recover some structural disturbances. Suppose that structural disturbances are order by
classes, i.e. disturbances 1 to j1 belong to class 1, disturbances j1 + 1 to j2 belong to class 2, etc. As
point ii) indicates, the k-th element of u1t compresses a class of structural disturbances only if G1 has
a block diagonal structure. Finally, as point iii) suggests, the k-th element of u1t carries information
about ejt if G1 has at most one non-zero element in row k in position j.
The restrictions in ii) and iii) are strong and unlikely to be satisfied in a large class of general
equilibrium models. They require that the theory features many ”conveniently” placed delay restrictions so that, contemporaneously, either a reduced number of disturbances of the same class affects
the k-th variable of the empirical model or only one structural disturbance affects the k-th variable.
Proposition 1 determines the properties of u1t , given et . Thus, u1t will be a mean zero process
and its autocovariance function will be restricted by
E(u1t u01t−s ) = E(λ1 et e0t−s λ01 ),

s≥0

(30)

When et are iid, the variance of u1t and et differ and the magnitude of the amplification depends on
the properties of λ1 . Thus, a ejt disturbance with a small variance or small loadings λ1j will be hard to
identify. Similarly, the serial correlation properties of u1t depend on the structure and magnitude of
the λ1 polynomial and its row dimension. However, even when λ1j = G1j , cross sectional distortions
may make the autocovariance function of u1t insufficient to recover the autocovariance of some ejt ,
unless additional restrictions are imposed.
The mapping between innovations and structural disturbances when the states in the
empirical and the theoretical models differ We analyze the relationship between uit , i = 2, 3
and et when E[zit |Ωit−1 ] = F̃i zit−1 , i = 2, 3 so that
uit = zit − F̃i zit−1

(31)

Proposition 2 i) uit = λi (L)et , where λi depends on θ and is qi × q for each L, ı = 2, 3.
ii) uit = ψi (L)u1t , i = 2, 3, where ψ(L) is a function of  and A, K̂ and K, x̂t and xt .

Point i) states that when the empirical system eliminates state variables, u2t does not respect
the timing protocol of the structural disturbances et and cross sectionally deform them. However,
an empirical system including only the state variables of the DGP does not solve time deformation
11

problems since their law of motion may be altered. Thus, also u3t will in general carry too little
∗
= F̃3 are insufficient to avoid time
information to recover a etj . Note that S2 F S2∗ = F̃2 , or S31 AS31

deformation problems.
Point ii) indicates that, in general, uit 6= u1t , i = 2, 3 and the timing of information they contain
differs even when Si F Si∗ = F̃i , ∀i. In other words, it matters which variables enter the empirical
system. To clearly see this, let λ1 (L)∗ be the generalized inverse of λ1 (L). Then:
uit = λi (L)λ1 (L)∗ u1t ≡ ψi (L)u1t

(32)

By construction ψi0 = I. Thus, an impulse in u1t and uit , i = 2, 3 has identical effects on the variables
present in both z1t and zit , i = 2, 3 but will last longer when zit are the observables - persistence is
altered. Hence, the dynamics induced by identified shocks in small scale empirical systems of the
same dimension but featuring different variables will generally differ.
(31) is misspecified when states are omitted or repackaged. What happens when uit are constructed using a larger information set, e.g., uit = zit − F̃i (L)zit−1 L = 1, 2, . . .? Because both z2t
and z3t are VARMA processes, standard issues discussed in the literature apply. In principle, F̃i (L)
must be non-zero for L → ∞ for time deformation biases to disappear. Still, even when L → ∞,
cross sectional deformations will remain.
Proposition 1 is related to the aggregation results of Faust and Leeper [1997]. Because their DGP
is a VAR, they can not analyze the consequences of omitting states or altering their law of motion.
Proposition 2 has the same flavor as the main result in Fernández-Villaverde et al. [2007]. The main
difference is that here uit , i = 2, 3 are reduced ranked moving averages of et and the reason is time
deformation rather than non-invertibility.
Dynamic responses Consider zit responses to an impulse in the shocks. In the DGP they are:


B
zit = Si
et
D


Ah B
zit+h = Si
et i = 1, 2, 3; h = 1, 2, . . .
(33)
CAh−1 B
In the empirical system with z1t as observables, they are:
z1t = u1t
z1t+h = F̃1h u1t

(34)

The impact effect differs because ut = G1 et and G1 is not a square matrix. Thus, having the correct
B, D matrices may be insufficient
torecover some ejt , unless G1 only has one non-zero element in the

A
j-th row. However, if F̃1 =
responses at longer horizons to a properly identified shock are
S12 C
12

proportional to those of the DGP. Thus, qualitatively, (34) provides a good approximation to (33),
if some ekt can be recovered from u1t .
The responses computed in systems with zit , i = 2, 3 as observables are instead:
zit = uit
zit+h = νij uit + F̃ih uit

(35)

Here, the dynamic responses of zit will be distorted, even in the (unlikely) case that some of the etj
can be recovered from the uit vector. Thus, both quantitatively and qualitatively, the dynamics of
these systems may have nothing to do with those of the DGP. We summarize the discussion in a
proposition.
Proposition 3 i) Identified
 impulse
 responses constructed in a z1t system could match those of the
A
structural model if F̃1 =
and G1 has at most one non-zero element in one row.
S12 C
ii) Even if the conditions in i) holds, the dynamic responses obtained from identified shocks in a zit
system, i = 2, 3, differ from those of the DGP.
(34)-(35) provide an analytic approach to compute deformation biases in impulse responses. Braun
and Mittnik [1991] derived a similar expression when the empirical model and the DGP are VARs.
Summary When qi < q, the variables entering in the empirical model determine the quality of the
(small) VAR- (large) DSGE matching exercises. Eliminating controls creates innovations that cross
sectionally combine the structural disturbances, but eliminating states or repackaging their law of
motion creates both cross sectional and time distortions. However, an empirical model with all the
theoretical states (and none of the controls) may not be enough for proper inference. When the VAR
omits or repackages some of the states, long lags are needed for a VAR to reproduce the VARMA
of the DGP and for identified shocks to have any relationship with the structural disturbances.
When long lags can not be used because of short samples, careful variable selection may reduce time
deformation - see section 4 for an example. In general, the qualitative and quantitative dynamics
produced by the identified shocks under deformation may have nothing to do with those of the
structural disturbances.
4

Given a theory, how do I choose the variables of a small scale VAR?

To illustrate the practical implications of the propositions and the problems that may emerge matching a larger DGP to a small scale VAR model we use a standard New Keynesian model featuring
five structural disturbances: a permanent at and a transitory ζt TFP shock, a preference χt shock, a
cost push µt shock, and a monetary policy εt shock. The optimality conditions are (see Canova and
Ferroni [2011] for details):
13

1
h
Et gt+1 +
gt + rt − Et πt+1
1 − h
1−h

h
gt + (1 + σn ) nt + kp (µt − χt )
= Et πt+1 β + kp
1−h
= ζt + (1 − α) nt

χt = Et χt+1 −

(36)

πt

(37)

ot

(38)

rt = ρr rt−1 + (1 − ρr ) (φy gt + φp πt ) + εt

(39)

gt = at + ot − ot−1

(40)

where (36) is the Euler equation, (37) is the Phillips curve, (38) is the production function, (39) is
the Taylor rule, and (40) is the definition of output growth. ot is output and gt its growth rate, nt
is hours worked, πt is the inflation rate, rt is the nominal interest rate and ct is consumption. h
is the coefficient of (external) consumption habit, β the discount factor, σn the inverse of the Frish
elasticity of labor supply, κp the slope of the Phillips curve, α the labor share in production, φy , φπ
the coefficients of the Taylor rule. The disturbances evolve as AR(1) processes with persistence
0 < ρj < 1j = z, a, χ, µ, ε while ρ is assumed to be zero.
We solve the model using a first order perturbation setting α = 0.33; β = 0.99; σn = 1.5; h =
0.9; kp = 0.05; φy = 0.1; φp = 1.5; ρr = 0.8; ρz = 0.1; ρa = 0.5; ρχ = 0.5; ρµ = 0.1, ρε = 0.0. We obtain
decision rules of the form (15)-(16), where the minimal state vector is xt−1 = [ot−1 , rt−1 , ζt−1 , at−1 , µt−1 , χt−1 ]0 ,
and the control vector is yt = [gt , ot , πt , nt , rt ]0 . Thus, A(θ) is 6 × 6, B(θ) is 6 × 5, C(θ) is 5 × 6 and
D(θ) is 5 × 5. It is easy to verify that the ”poor man invertibility” condition holds when zt = yt
and that all disturbances are identifiable from the VAR once a sufficient number of lags and proper
identification restrictions are employed.
Smaller scale VARs Given that the theory has 5 disturbances, we consider systems with less than
5 variables. We ask i) which deformation distortions each system displays; ii) which disturbance could
be identified using theory-based restrictions; iii) whether there is a minimum size of the VAR below
which all identified shocks become mongrels.
The first system employs four observable variables, zt = (ot , πt , nt , rt ). The theory corresponding
to this system, can be obtained integrating out gt from the solution. Alternatively, one can use (40)
in (36)-(39) and solve the resulting set of equations (the optimality conditions of all smaller scale
models discussed in this section are in appendix B). Since gt is a control, the minimal state vector
remains xt−1 = [ot−1 , rt−1 , ζt−1 , at−1 , µt−1 , χt−1 ]0 . It is easy to verify that A(θ), B(θ) are unaltered.
This system corresponds to case 1 of section 3. Because five structural disturbances are mapped into
four innovations, proposition 1 tells us that cross sectional deformation will be present.
The second empirical system employs three variables, zt = (ot , πt , nt ). It is obtained integrating
out gt , rt from the solution or substituting (40) in (36)-(39) and then (39) in the remaining equations.
14

Here an endogenous control, gt , and an endogenous state, rt−1 , are eliminated. Thus, this empirical
system corresponds to case 2 of section 3. When rt is integrated out, the minimal state vector
is x∗t−1 = [ot−1 , ot−2 , ζt−1 , at−1 , µt−1 , χt−1 ]0 , because the Euler equation becomes a second difference
equation. Proposition 2 tells us that the innovations of this system will mix et−s , s ≥ 0, cross
sectionally; and proposition 3 that dynamic biases will be larger than in the four variables system.
The third system employs zt = (πt , nt , rt ) as observables. In this VAR an endogenous control, gt , and an endogenous state, ot−1 , are integrated out. Here the minimal state vector is now
x̃t−1 = [nt−1 , rt−1 , ζt−1 , at−1 , µt−1 , χt−1 ]0 because the optimality conditions remain a system of first order difference equations. Since, given ζt−1 , nt−1 proxies for ot−1 , states are simply repackaged. Thus,
deformation problems should be less pronounced than in an empirical system with zt = (ot , πt , nt ).
Time deformation To evaluate whether time deformation distortions are present, it is sufficient
to check if the autocorrelation function of the innovations of the three systems, which we calculate
analytically from the solution and the innovation representation, have any element significantly different from zero. Figures C1-C3 in appendix C present the function for each system, together with
a 95% asymptotic tunnel for the hypothesis that the autocorrelation at each horizon is zero - which
would hold if time deformation is absent.
Figure 1: Cross correlation function, innovations in the (ot , πt , nt ) system and structural shocks.
Permanent technology

Transitory technology

Shock 1

0.1

0

0

-0.1

-0.1

Mentary Policy
0.1
0

-0.2

-20

Shock 2

Markup
0.1

0
0
-0.1
0

20

-0.4
-20

0.1

0.1

0

0

0

20

0

20

0.1

-20

0

20

-0.2
-20

0.1

0

0

0

-0.2

-0.1

-0.1

0

20

0

20

0

20

-0.4
-0.2
0

20

0

-20

0

20

0

-0.2

-20

0

20

-20

0.1

0.1

0

0

0

20

0
-0.1

-0.1

-0.1
0

Lag

20

-0.4
-20

-0.6
-20

0.1

-0.2

-0.4
-20

-20

-0.1

-0.1

-0.1
-0.2
-20

Shock 3

Preferences
0.1

0

20

-20

0

Lag

Lag

20

-20

0

20

-0.2
-20

Lag

Lag

Note: Parallel lines delimit 95 % asymptotic tunnel for the hypothesis of zero cross correlations.

As expected, the (yt , πt , nt ) system has innovations displaying considerable serial correlation and
numerous elements of the autocorrelation function are significant. The other two systems have
serially uncorrelated innovations. Figure 1 provides evidence on the causes of time deformation in
15

the (ot , πt , nt ) system. It presents the cross-correlation function between the innovations and the
structural disturbances together with a 95% asymptotic tunnel for the hypothesis that they are
all zero - absent time deformation, only the contemporaneous elements should be significant. The
innovations correlate with several lags of the transitory TFP and monetary policy disturbances.
Thus, the shocks that one may be able to identify in this system will be time contaminated.
Cross sectional deformation Each of the three system displays cross sectional deformation. To
examine whether one will still be able to identify, say, a stationary technology or a monetary policy
disturbances using theory-based restrictions, we present in table 1, the λ0 matrix, the contemporaneous mapping between innovations and structural shocks.
Innovations
at
u1t
u2t
u3t
u4t

0.018
-0.15
-1.46
-0.04

u1t
u2t
u3t

-0.09
-0.20
-1.63

u1t
u2t
u3t

-0.15
-1.46
-0.04

Structural shocks
ζt
χt
µt
(ot , πt , nt , rt ) system
-0.72 0.08 -0.005
-0.30 0.04 0.04
-1.07 0.13 -0.007
-0.08 0.01 0.01
(ot , πt , nt ) system
0.70 0.10 0.05
-0.28 0.04 0.07
-1.04 0.16 0.08
(πt , nt , rt ) system
-0.30 0.04 0.04
-1.07 0.13 -0.007
-0.08 0.01 0.01

t
-0.30
-0.71
-0.45
0.77
-0.26
-0.71
-0.39
-0.71
-0.45
0.77

Table 1: Entries of the λ0 matrix
With four observables, the monetary policy disturbances remains identifiable as it will maintain,
for example, a unique set of theory-based sign restrictions on the four observable variables. However,
positive stationary TFP and negative preference disturbances will be confused when sign restrictions
are used for identification, as they both produce an instantaneous fall in (ot , πt , nt , rt ).
In the (ot , πt , nt ) system, distortions are magnified. Here sign restrictions can not separate any
of the stationary structural disturbances. Intuitively, larger distortions occur for two reasons. First,
the Euler equation defines an dynamic aggregate demand in output and inflation, while the Philips
curve and the production function define a dynamic aggregate supply equation in the same variables.
Because they are both instantaneously moved by e.g., TFP and preference disturbances, it will
be impossible to separate them using output, inflation and hours data. Second, the Euler equation

16

depends on at−1 , ζt−1 and, because ot−2 enters the equation, also on ζt−2 . Thus, the aggregate demand
equation evolves more persistently in response to disturbances than in the original model.
In the (πt , nt , rt ) system, the sign and the magnitude of the loadings of the structural disturbances
are the same as in the four variable system. As compared with the (ot , πt , nt , rt ) system, we loose
the possibility to distinguish stationary TFP, permanent TFP and preference shocks. However,
there is no change in the ability to recover monetary policy disturbances. Hence, a careful choice
of observables in a smaller scale system may minimize time deformation distortions and allow the
identification of monetary policy disturbances using theory-based restrictions.
Cholesky factors Baumeister and Hamilton [2015] have argued that the Haar prior, typically
employed to generate candidates to check for sign restrictions, may determine the shape of the VAR
responses. While the points made in the previous paragraph, in particular, that theory-based restrictions may be insufficient to identify certain disturbances, are independent of the way sign restrictions
are imposed, it is easy to show that the same conclusions hold if one instead uses triangular restrictions. Table 2 displays the Cholesky factors of the covariance matrix of the innovations of original
model (assuming disturbances have unit variance and with the rows and columns corresponding to
the variables solved out eliminated) and of the three smaller systems. While the entries of λ0 (θ) are
such that zero restrictions are unlikely to identify structural disturbances, applying the same recursive restrictions to the innovations of the original and of the reduced systems makes the comparison
meaningful, see e.g. Chari et al. [2005].
Observables
(ot , πt , nt , rt )

(ot , πt , nt )

(πt , nt , rt )

Original
0.75
0.68 0.26
1.06 1.14
-0.42 -0.13
0.75
0.68 0.26
1.06 1.14
0.26
1.14 0.95
-0.13 0.16

system

0.95
0.16

0.95

0.07

Reduced
0.78
0.55 0.57
1.14 0.44
0.07 -0.22 -0.70
9.55
5.16 1.50
15.36 -0.02
0.79
1.11 1.50
-0.65 0.36

system

1.42
0.26 0.07

1.52

0.23

Table 2: Cholesky factors
In the (ot , πt , nt , rt ) system the signs of the Cholesky factor match those of the original model, but
magnitudes are altered, sometimes substantially (see the (3,2) or (4,2) elements). A similar picture
emerges for the (π, nt , rt ) system. Thus, instantaneous responses to orthogonal shocks in these two
systems qualitatively mimic those of the original model, but display magnitude distortions.
17

For the (ot , πt , nt ) system, biases are more significant as the signs and magnitudes are affected.
For example, while in the original system an orthogonal unitary shock to nt implies a roughly similar
instantaneous effect on ot and πt , the same shock in the (ot , πt , nt ) system has a 15 times larger effect
on ot and a negative effect on πt .
Impulse responses We show dynamic deformation distortions when we identify shocks with contemporaneous sign restrictions.
Figure 2: Responses to identified monetary policy shocks, (πt , nt , rt ) system
0.2

0

1

0.8

0

0.6
0.4

-1
-0.2

0.2
-2
-0.4

0
-3
-0.2

-0.6
-4

-0.4
-0.8
-5
-1

Small system 016
Small system 084
True MP shock

-0.6
Small system 016
Small system 084
True MP shock

-6

-1.2

-7
0

5

10

15

inflation responses

20

25

Small system 016
Small system 084
True MP shock

-0.8
-1

0

5

10

15

20

25

0

Hours responses

5

10

15

20

25

Interest rate responses

Note: The dashed regions report 68 % interval obtained accounting for rotation uncertainty. The solid line reports
the responses in the DGP.

Figure 2 presents the responses to a monetary policy shock in the (πt , nt , rt ) system when policy
disturbances are identified assuming that an increase in rt leads to a contemporaneous fall in πt , nt .
Figure C.4 in appendix C has the responses to a monetary shock in the (ot , πt , nt , rt ) system. Dotted
lines represent 68% credible sets across rotations satisfying the restrictions. Superimposed as continuous lines are the responses of the original five variable model. The three variable system encodes
enough information to recover monetary policy disturbances and omitting output and its growth rate
does not affect our ability to interpret the responses to identified monetary shocks, provided hours
enter the empirical system. Given over 25 years of empirical literature investigating the dynamics
induced by monetary policy disturbances, it is comforting to find that these shocks can be identified
with conventional restrictions, even in trivariate VARs models.
Recall that the entries of λ0 imply that positive stationary TFP and negative preference disturbances have the same contemporaneous sign implications in the four variable system. Figure 3,
which plots the responses to sign-identified stationary TFP disturbances, shows that indeed the size
of estimated impact responses is significantly off; and that dynamic responses are more persistent in
18

Figure 3: Responses to identified stationary TFP shocks, (ot , πt , nt , rt ) system
0.5

0.1
0

0

-0.1
-0.5
-0.2

Small system 016
Small system 084
True z shock

-1

Small system 016
Small system 084
True z shock

-0.3

-1.5

-0.4
0

5

10

15

20

25

0

5

Output responses

10

15

20

25

Inflation responses
0

0
-0.5

-0.1
-1
-0.2

Small system 016
Small system 084
True z shock

-1.5
-2

Small system 016
Small system 084
True z shock

-0.3
0

5

10

15

20

25

0

Hours responses

5

10

15

20

25

Interest rate responses

Note: The dashed regions report 68 % interval obtained accounting for rotation uncertainty. The solid line reports
the responses in the DGP.

the smaller system. Hence, theory-based restrictions valid in the five variable model, only identify
a linear combination of the two disturbances, a reminiscent of the masquerading effect discussed in
Wolf [2018].
An empirical model with only the theoretical states Omission of the theoretical states or
failure to proxy for them generates time deformation. However, an empirical system with only the
states (and none of the controls) does not necessarily produce interpretable identified shocks.
Starting from the original five variable system and integrating out all but zt = (ot , rt ) produce
a solution where the state vector is unchanged. However, the optimization problem is different
because, for example, ot+2 and rt+1 now appear in the equilibrium conditions. Since (Ā(θ), B̄(θ))
differ from the original (A(θ), B(θ)) matrices, this system will also feature timing distortions and
mongrel identified shocks. Figure 4, which plots the cross correlation of the innovations with the five
structural disturbances, confirms this fact: the innovations ut are serially correlated and load on a
number of lags of the monetary policy disturbance.
Cross sectional deformation also matter. With zt = (ot , rt ), one can at most identify a linear
combination of the five disturbances via sign restrictions. However, no combination separates, say, a
supply from a demand type disturbance. For example, identified monetary policy shock will combine
markup and monetary policy disturbances. Hence, a two variable VAR is too small to make economic
sense of the shocks one recovers.

19

Figure 4: Cross correlation function, innovations in (ot , rt ) system and structural shocks.
Permanent technology
0.1

0.2

Transitory technology

Preferences

Markup

Monetary Policy

0.15

0.15

Shock 1

0.1
0.05

0

0

-0.2

-0.4

-0.05
-0.1
-20

0

20

-0.6
-20

Lag

0.1
0.05

0

0

-0.1

-0.05

-0.05

-0.2

-0.1

-0.3

-0.1
-0.15
0

20

-20

Lag

0

-0.15
20
-20

Lag

0

0

20

-0.4
-20

Lag
0.15

0.1

0.1

0.8

0.05

0.05

0.05

0.05

0.6

0

0

0

0

0.4

-0.05

-0.05

-0.05

-0.05

-0.1

-0.1

-0.1

-0.1

-0.15
-20

0

Lag

-0.15
20
-20

0

Lag

20

1

0.15

0.1

0

Lag

0.1

0.15

Shock 2

0.1
0.05

0.2
0

-0.15

-0.15
20
-20

0

20

Lag

-20

0

Lag

20

-20

0

20

Lag

Note: Parallel lines describe the 95 % asymptotic tunnel for the hypothesis of zero cross correlations.

Permanent technology shocks and hours worked In the literature it is common to use a
VAR with output growth (or labor productivity) and hours to identify permanent TFP shocks. The
dynamics are then compared with the dynamics permanent TFP disturbances produce in standard
RBC or new Keynesian models, see e.g. Galı́ [1999]. While the comparison is meaningful when the
DGP features, say, a permanent TFP and a monetary policy disturbances, it may be inappropriate
when the model of this section generates the observed data.
When zt = (gt , nt ), lagged output growth and lagged hours become state variables. Since the
states and their law of motion are altered, the innovations of the (gt , nt ) system are related to several
lags and leads of the structural disturbances. For example, lags of the permanent TFP disturbances
and of the preference disturbances load significantly on the second innovation (see figure C.5 in
appendix C). Hence, in this system, there is no guarantee that the identified technology shock will
only capture the permanent technology disturbance.
Figure 5 shows that if the DGP only has a permanent TFP and monetary policy disturbance,
the responses obtained identifying a permanent supply shock in a VAR with zt = (gt , nt ) replicate
well the dynamics produced by permanent TFP disturbances (compare the dashed blue and the
solid black lines). Instead, when the model of this section is the DGP, magnitude and persistence
distortions are important (see the red dashed line). Here, the model can not be reduced to a bivariate
system with output growth and hours and meaningful innovations. Once again, a two variable VAR
is too small for identified permanent TFP shocks to make sense. One needs at least a four variable
VAR for identified permanent technology shocks to bear any resemblance with the permanent TFP

20

disturbances the theory features.
Figure 5: Responses to identified permanent TFP shocks, (gt , nt ) system.
0.5

0.3
Reduced system
Two shocks
Permanent TFP shock

0

0.25

-0.5
0.2
-1
0.15
-1.5
Reduced system
Two shocks
Permanent TFP shock

0.1
-2

0.05

-2.5

0

-3
0

5

10

15

20

25

0

Output growth

5

10

15

20

25

Hours

R2 for invertibility It is common in the literature to check whether a structural disturbance
can be obtained from a particular vector of VAR variables using the R2 of a regression of that
disturbance on the reduced form innovations (alternatively, on the variables of the empirical system),
see e.g. Sims and Zha [2006] or Plagborg Moller and Wolf [2019]. While the approach is appealing
when the VAR includes as many variables as disturbances in the theory, it may give misleading
information in the cases we consider. The reason is that a R2 measures whether there is enough
information in the observables, but it does not tell us if theory-based identification restrictions are
valid. To clarify the point, consider the zt = (ot , πt , nt , rt ) system. The R2 of a regression of the
stationary TFP disturbance on zt is 0.98, suggesting that there is enough information to recover the
disturbance. However, as we have already discussed, stationary TFP and preference disturbances have
the same sign implications on these four variables. Hence, if one imposes theory-based restrictions,
she will end up with a mongrel mixing preference and stationary TFP disturbances (see figure 3).
A similar issue also emerges in smaller systems. For example, in the zt = (ot , πt , nt ) system, the
monetary policy disturbance has an R2 of 0.99 on zt , but theory-based sign restrictions will confuse
stationary technology, preference, markup, and monetary policy disturbances. Thus, in rectangular
systems, having a high R2 is a necessary but not a sufficient condition to be able to identify a
structural disturbance. As we have discussed, the sign and the magnitude of the entries of λ0 ,
provide complementary information to understand which vector of observable variables allows the
identification of a disturbance of interest.

21

5

Given a small scale VAR, does a theory match the facts?

The dynamics of output and inflation following house price disturbances have become of primary
policy importance following the 2008 financial crisis. Starting with Iacoviello [2005] many authors
have tried to understand whether the responses obtained in a SVAR can be rationalized with a
structural model featuring housing services, leveraged agents, and standard macroeconomic frictions.
Since house price disturbances are not necessarily a major source of macroeconomic fluctuations, at
least in normal times, the theoretical models employed to interpret the data typically contain several
other disturbances, see e.g. Rabanal [2018], Linde’ [2018] for recent examples. However, apart from
obvious core choices, it is not clear which other disturbances should be included.
Iacoviello [2005] sidesteps the problem by selecting the minimum number of disturbances needed
to map the empirical evidence into a structural model. He uses a four variable VAR to construct
the dynamic responses to recursively identified house price shocks and a model with preferences,
monetary policy, technology, and cost push disturbances to estimate the structural parameters; and
then interprets the SVAR dynamics through the lenses of preference disturbances. Here we take the
four variable VAR and the identified house price shocks as given, and ask whether they would still be
interpretable though the lenses of preference disturbances when the theory is enlarged to include LTV
disturbances to the entrepreneurs’ problem, which have been extensively used to study the dynamics
of house prices since Iacoviello’s seminal work. In other words, we ask whether deformation problems
could prevent a researcher to map preference disturbances into identified house price shocks and, if
this is the case, what identified house price shocks would capture.
To be clear about the scope of the exercise, in section 4 we take a theory as given, and ask which
small empirical model allows the identification of interesting disturbances and with what restrictions.
Here we reverse that viewpoint, take a VAR and an identification scheme as given, and ask whether
omitted disturbances alter our perception of the match between the theory and the VAR.
The properties of the enlarged model The optimality conditions and the law of motion of
the disturbances are in Appendix D. The model economy features 8 endogenous states (lagged house
holdings of impatient consumers and of entrepreneurs, lagged bond holdings of patients and impatient
consumers, lagged capital shock, lagged output, lagged nominal interest rate, and lagged inflation)
and 15 endogenous controls. When the VAR includes output, nominal rate, inflation, house prices,
and the stock of housing, the ”poor man invertibility” condition holds (all eigenvalues of A − BD−1 C
are less than one in absolute value). Furthermore, the R2 of a regression of each disturbances on the
simulated data is 1. Thus, when at least these five variables enter the VAR, there are no informational
deficiencies and all structural disturbances are potentially recoverable.
We take data for real GDP (Ot ), the nominal interest rate (Rt ), inflation rate (πt ), and real house
prices (qt ) from the FRED data base for the period 1975:1-2018:3 and identify house price shocks
22

Figure 6: Data and models, qt innovations
data vs. theory 4

Interest rate

Inflation
datau
datal
theory 4

0.6

1

0.4
0.2

0.5

0
0
10

15

20

5

10

15

20

5

10

15

0.02

20

-0.02

0.5

-0.04

0

var4u
var4l
theory4

0.6

1

0

0.4
0.2

5

10

15

20

10

15

20

5

10

15

20

5

10

15

20

5

10

15

20

0
-0.05
-0.1

0

VAR4 vs. Theory5

0.04
0.02
0
-0.02
-0.04
-0.06
-0.08

0.8

5

-0.15
5

10

15

20

5

10

15

20

2
0

0.05

0

-0.05

1

0

-1

var4u
var4l
theory 5

-0.1
-2
0

-0.15
5

VAR4 right states vs. Theory5

Output
0.8

5

VAR4 vs. theory4

House price

0.08
0.06
0.04
0.02
0
-0.02

10

15

20

0

5

10

15

20

-0.1
-0.15
10

15

20

15

-0.1

20

0.5

0

0

-0.1

0.5

-0.5

varRu
varRl
theory 5

-1

0
5

10

1

1.5

-0.05

5

-0.05

5

10

15

20

5

10

15

-0.2

20

Note: The first row reports the responses to preference disturbances in Iacoviello (2005) model and the 68% highest
posterior interval in the data; the second row the responses to preference disturbances in the same model and the
68% highest posterior interval in a four variable VAR on simulated data; the third row the responses to preference
disturbances in a model with 5 shocks and the 68% highest posterior interval in a four variable VAR on simulated
data when only πt , rt , ot are used as states; the fourth row the responses to preference disturbances in a model with 5
shocks and the 68% highest posterior interval in a four variable VAR on simulated data when all the states are used.

using the same lag setting (2 lags), the same data transformation (HP filtering of GDP and house
prices) 3 , and the same recursive identification scheme of Iacoviello [2005]
The evidence The first row of figure 6 plots the posterior 68% response intervals to an identified
house price shock in the data and the responses to preference disturbances in the theory with four
disturbances. Iacoviello’s main result holds with the extended dataset: after a temporary house price
increase, output, inflation and the nominal interest rate persistently rise; and a similar pattern is
generated by preference disturbances, although in the data the maximum output response is delayed.
The second row demonstrates that truncation lags and the use of recursive restrictions (which fail to
hold in the theory) do not affect the mapping between preference disturbances and identified house
price shocks. In fact, comparing the data responses with the theoretical responses to preference
disturbances or with the Cholesky identified house price responses in a VAR(2) on simulated data
gives the same qualitative conclusions, see also Chari et al. [2005]. Hence, in the baseline case, it is
3
While this choice alters the timing of house price shocks and the responses they generate, we decided to stick to
this transformation since the purpose of the exercise is to show the effects of deformation not of filtering.

23

legitimate to interpret identified house price responses in the data through the lenses of model based
preference disturbances. This is not necessarily the case when the theory features one additional
disturbance for two reasons. Because the five disturbances are mapped into four innovations, cross
sectional deformation matter. Furthermore, because only three state variables (lagged output, lagged
inflation and lagged nominal interest rate) enter the VAR, time deformation will also be present.
The third row of table 6 plots the responses to a preference disturbance in the theory with 5
disturbances and the posterior 68% response interval to a Cholesky identified house price shock in a
VAR including output, nominal rate, inflation and house prices simulated from the theory which only
keeps output, inflation, house prices and the nominal rate as endogenous variables 4 . Note that the
sign and the persistence of the responses to identified price shocks in the VAR now differ from those
of the theory: output and the nominal interest rate respond negatively; and inflation is insignificant
after a few quarters. Deformation matters: a four variable VAR is too small to produce identified
house price shocks with the same interpretation as preference disturbances or, put it differently, the
mapping between preference disturbances and identified house price shocks is altered.
Explanations Why are rows 2 and 3 different? Is it time or cross sectional deformation that
changes the pattern of responses? Row 4 of figure 6 presents a counterfactual where time deformation
is absent 5 . Because the responses in rows 3 and 4 have similar sign and quantitative differences are
small, it is cross sectional deformation that alters the signs of output and interest rate responses.
Alternatively, because five structural disturbances are compressed into four VAR innovations, the
mapping between identified house price shocks and preference disturbances is polluted by other
disturbances. Standard information sufficiency measures are incapable of capturing these distortions.
For example, the R2 of a regression of the theoretical preference disturbances on simulated output,
inflation, nominal interest rate, and house prices is 0.94.
To understand what identified house price shocks capture, we compute the matrix of contemporaneous loadings of the four innovations on the five structural disturbances (the λ0 matrix). House
price innovations load on the monetary policy disturbances, eR (-2.01), on the borrowing constraint
disturbances, ei1 (-1.68), while the weight on the preference disturbances ej is small (0.06). Because
positive borrowing constraint disturbances increase output and the nominal rate, the negative output
and interest rate responses observed in row 3 are due to the large negative loading that borrowing
constraint disturbances have on identified house price shocks. To support this interpretation, we
compute the contemporaneous correlation between identified house price shocks and preference disturbances in the model with four and five disturbances. The point estimate in the former is 0.91 (95%
4

The new disturbance has persistence equal 0.75 and standard deviation 1.0. Since we normalize the impulse to
unity, the magnitude of the standard deviation is irrelevant.
5
The scenario is generated simulating data for the four endogeneous variables of interest with the decision rules of
the model with five disturbances and all the states, identifying house price shocks in a VAR as before. Since all the
states are retained, only cross sectional deformation is present.

24

confidence range across simulations [0.90, 0.92]); in the latter it is only 0.66 (95% confidence range
[0.63,0.70]). On the other hand, the contemporaneous correlation of identified house price shocks
with the borrowing constraint disturbances is -0.67 (95% confidence range (-0.70,-0.64)).
Table 3: Loading of structural disturbances on innovations in (Rt , πt , qt , Ot )
Innovations
Rt
qt
Ot
πt

Monetary policy
1.0
-2.01
-2.75
-0.60

Disturbances
Preference Markup
0
0
0.06
-0.62
0.01
-1.80
-0.003
1.29

Technology LTV
0
0
0.14
-1.68
-0.09
4.06
-0.11
0.22

One may ask what is the minimal dimension of the VAR that allows a direct mapping between
identified house price shocks and preference disturbances when the DGP has five disturbances. Figure
D.1 in appendix D shows that when the VAR includes the nominal interest rate, house prices, output,
inflation, consumption, investment and the total stock housing, the dynamics induced by preference
disturbances and identified house price shocks are again qualitatively similar.
6

An extension

The process in (15)-(16) may be restrictive in certain situations. For example, when analyzing risk or
uncertainty disturbances, the model is solved using higher order methods. Hence, a non-linear DGP
specification is needed. This section studies how the conclusions of section 3 change in this case.
As shown in Andreasan, Fernandez Villaverde, and Rubio Ramirez [2018], the pruned solution of
a nonlinear state space model approximated with higher order perturbations can be written as:
Xt = µx (θ) + ν1 (θ)Xt−1 + ν2 (θ)Et

(41)

Yt = µy (θ) + ν3 (θ)Xt

(42)

where, for example in the case of a second order approximation, Xt = ((xft )0 , (xst )0 , (xft ⊗ xft )0 )0 ,
and xft are the states of the first order system, xst are the states of the second order system; Et =
(e0t , (et ⊗ et − vec(Ine ))0 , (et ⊗ xft−1 )0 (xft−1 ⊗ et )0 )0 , where et are the structural disturbances and Ine the
identity matrix of dimension ne ; Yt are the controls and the matrices µx (θ), µy (θ), ν1 (θ), ν2 (θ), ν3 (θ)
are given in the appendix A of Andreasan et al. [2018]. Thus, a higher order DGP has a linear
state space representation but with a larger number of states and of structural disturbances. If a
linear VAR is specified and features Z̃t = S̃[Xt , Yt ] as observables, where S̃ = [S̃1 , S̃2 ], the conclusions
derived in propositions 1-3 still hold. However, cross section and time deformations will be more
25

severe because the dimension of Et is larger, and a larger number of states (in particular, those
involving higher order and cross terms) is omitted 6 .
To highlight the effects of deformation in this situation, we take the model of Basu and Bundick
[2017], which features disturbances to the volatility of the preference shock, to the level of the
technology and to the level of preferences. The model is solved with a third order perturbation so
0 0
0
] where
, E2t
that Et = [E1t

E1t = (e0t , (et ⊗ et − vec(Ine ))0 , (et ⊗ xft−1 )0 (xft−1 ⊗ et )0 (et ⊗ xst−1 )0 )0

(43)

E2t = ((et ⊗ xft−1 ⊗ xft−1 )0 (xft−1 ⊗ xft−1 ⊗ et )0 (xft−1 ⊗ et ⊗ xst−1 )0 (xft−1 ⊗ et ⊗ et )0 (et ⊗ xft−1 ⊗ et )0
(et ⊗ et ⊗ xft−1 )0 ((et ⊗ et ⊗ et ) − E(et ⊗ et ⊗ et )))0

(44)

Since et is a 3 × 1 vector, and xft a 9 × 1 vector including lagged values of consumption, capital,
hours, output, the nominal rate, expected utility and the three disturbances, Xt is a 432 × 1 vector
and Et is a 1112 × 1 vector. They use an eight variables VAR to trace out the effects of uncertainty
shocks, which are identified via a Cholesky decomposition with the VXO index ordered first. The
VAR includes four endogenous states (output, consumption, hours and nominal rate), a proxy for
the capital state (investment), two controls (inflation, and a volatility measure) and a money supply
variable, which is absent from the model.
The evidence The first row of figure 7 presents the point estimates and the 95% response intervals
of output, consumption, investment, hours and VXO to an uncertainty shock in the VAR of the data.
The second row has the responses to an uncertainty shocks in Basu and Bundick [2017]’s original setup
and parameterization: the dashed line reports theoretical responses, and the solid lines the estimated
95% SVAR response intervals in simulated data, identifying the uncertainty shock as in the first
row. The match between the theory and the VAR of the data appears to be good. Furthermore,
theoretical responses and SVAR responses constructed with simulated data are similar.
Two features of the authors’ specification are, however, questionable. Although the nominal
interest rate enters the VAR, the model has little to say about it because it posits a deterministic
Taylor rule with no persistence (see equation (7), page 945). Second, it is not obvious why changes
in uncertainty are only demand driven; second moment shocks to the technology could generate
similar dynamics in real aggregate variables via a precautionary saving channel. Thus, the DGP
potentially features more disturbances than those used in the model and the restrictions used to
identify uncertainty shocks may be insufficient. For illustration, we add a monetary policy disturbance
to the model, keeping the structure and the parameterization unchanged. As row 3 of figure 7 shows
theoretical and the estimated response intervals obtained from simulated data now differ significantly.
Moreover, the response intervals in rows 1 and 3 do not line up.
6

When the class of models suggested by Arouba, Boccola, and Schorfeide [2017] is used, some of the additional
deformation problems are eased.

26

Figure 7: Data and Models, V XO innovations

Data

5

10-3 Output

-3
10Consumption

10-3

Investment

0.02

2

0.01

0

0

-2

0

-0.01

-4

-2
-5
4

8

12

16

20

4

8

12

16

20

BB Model

0.05

4

8

12

16

20

0.25

0

0

-0.05

8

12

16

20

0.05

8

12

16

20

4

8

12

16

20

0.05
0

-0.05

-0.2

-0.15
4

8

12

16

20

4

8

12

16

20

4

8

12

4

8

12

16

20

16

20

16

20

0
-0.15
4

8

12

16

20

-10
4

8

12

16

20
30

1
0.75
0.5
0.25
0
-0.25
-0.5
-0.75

0.2

12

10

-0.05

-0.75

0.15

8

20

-0.5
-0.15

4
30

-0.25

-0.1
4

Estimated SVAR Responses
Actual Data
95% Response Interval
Point estimate

-0.1
4

0.5

0.1

-0.2

Add MP shocks

Implied Stock Volatility

0.2
0.1

0

0

Hours

2

0.15

20

0.05

4

8

12

16

10

-0.05

0

-0.15

-10

20

Estimated SVAR Responses
on Simulated Data
95% Response Interval
True Model

4

8

12

16

20

Note: The solid lines in the first row report 95% response intervals and the dashed line the point estimate using the
actual data; the solid lines in the second and third row report the 95% response interval in the simulated data and the
dashed line the conditional response in the theory.

Explanations Rows 2 and 3 differ because monetary policy and uncertainty disturbances get
mixed up - they both increase the nominal rate and make all other variables fall. While theoretical
responses are constructed conditional on the monetary policy disturbances being zero, in the VAR
with simulated data, the monetary policy disturbances can be positive and negative. Hence, the sign
of the responses of output, consumption, investment and hours to uncertainty shocks depends on the
relative importance of uncertainty and monetary disturbances and the sign of the monetary policy
disturbances at each t. Given that VAR responses are insignificant, identified uncertainty shocks are
likely to pick up positive uncertainty and negative monetary policy disturbances.
To support this conclusion, we compute the contemporaneous correlation of identified volatility
shocks with the volatility disturbances in the original and in the extended model with monetary policy
disturbances. In the former, the point estimate is 0.77 (95% confidence range across simulations
(0.68, 0.86)); in the second it is 0.62 (95% confidence range (0.50,0.74)). In the latter system, the
contemporaneous correlation between identified volatility shocks and monetary policy disturbances
is -0.46 (95% confidence range (-0.50,-0.41)).
Larger scale BVAR Would the estimation of a larger scale BVAR solve the problems? Because
deformation is due both to the fact that an eight variable VAR is too small and that the volatility
and the monetary policy disturbances need both to come first in a Cholesky decomposition to be
properly identified, using a larger scale BVAR in the exercise will not necessarily resolve the issue.
27

In addition to the correctly sized VAR, one needs a set of identification restrictions that differentiate
the two disturbances, see also Wolf [2018].
7

Conclusions and implication for practice

It is common in macroeconomics to collect stylized facts about the transmission of structural shocks
using small scale VAR models and then build larger scale DSGE models to interpret the dynamics
found. This paper argues that important inferential and interpretation distortions may emerge when
the process generating the data features more disturbances than the variables entering a VAR.
Cross sectional deformation makes shock identification hard, because ”classes” of structural disturbances need not be properly compressed into identified shocks, and may make valid theoretical
identification restrictions insufficient. Time deformation complicates the matching process because
the timing of identified shocks and of structural disturbances differs.
We highlight the practical implications of deformation in two ways. First, we take the DGP as
given and show what happens to identified shocks when the empirical model is too small; describe
how to reduce time distortions explicitly linking the empirical model to the theory; and highlight the
disturbances which are recoverable from different small scale empirical systems. Second, we take a
small scale VAR as given and ask what would happen to the perceived match between the theory
and the data when the DGP includes additional disturbances. In both cases, the gap between the
theory and the VAR of the data may be larger than previously thought.
Although it is tempting to associate cross sectional deformation with the elimination of theoretical controls and time deformation with the elimination of theoretical states, such an association
is imperfect. Time distortions emerge also when the empirical system contains all the endogenous
states. Conversely, integrating out controls may induce both biases, if the relationship between the
remaining controls and the states is altered.
While it is common to sweep deformation under the rug, distortions may be pervasive. For
example, Central Banks use structural models with dozens of disturbances to interpret the data and
academic researchers often twist standard models in estimation so that structural parameters become
exogenous disturbances (e.g an elasticity of substitution becomes a markup disturbance) to improve
their fit. If there are more than two or three disturbances driving macroeconomic variables, it is
difficult to take seriously the evidence small scale VAR models deliver.
Clearly, employing a large scale VARcan go a long way to ease deformation problems. However,
while one can estimate large Bayesian VAR models, even with relatively short datasets, their identification is an issue. Hence, small scale VARs are still likely to be preferred by macroeconomists.
In that case, proceedings as in sections 4 and 5, may inform users about potential issues, solidify
inference, and avoid interpretation confusions.
Are there empirical alternatives that could make the gap with the theory smaller? They do exist,
28

but they have to be appropriately rigged to deliver the correct conclusions. For example, one may
be able to reduce time deformation if FAVAR models are employed to build dynamic facts, provided
factors are constructed using the omitted states. However, FAVARs do not necessarily eliminate cross
sectional distortions. In fact, statistical principal components are unlikely to properly combine classes
of structural disturbances and to make the mapping between innovations and structural disturbances
better behaved.
It has become common to use IV approaches to identify certain shocks and local projection
techniques to compute dynamic responses in the data (see e.g. Rossi [2019] for a survey). Would
such methods reduce the deformation gap? They could, but a number of conditions need to be
met. Take, for example, case 2 of section 3, where some states are absent from the empirical model.
The DGP for the observables is a VARMA(2,1) which, in 
a companion from, canbe written 
as
F21 F22
G20 G21
Wt = QWt−1 + Rvt where Wt = [yt , yt−1 ]0 vt = [et , et−1 ]0 , Q =
and R =
.
I
0
0
0
Projecting Wt+h , h = 1, 2, . . . on t-1 information:
Wt+h = Qh+1 Wt−1 + Qh Rvjt + ut+h

(45)

where vjt is the disturbance of interest, ut+h = Qh Rv−jt + Qh−1 Rvt+1 + . . . + Rvt+h , and v−jt are all
the disturbances at t except the j −th one. Because local projections do not rely on VAR innovations,
they are less prone to cross sectional deformation. However, for the projections to be successful in
recovering Qh R, the regressors of the projection equation should be Wt−1 and vjt . When vjt is not
observable, we need proxies that capture the effect of both ejt and ejt−1 . If only a proxy for ejt is used,
the right hand side variables are correlated with the error term, making OLS invalid. Similarly, if
an IV approach is used, the instruments have to be strictly exogenous and able to capture variations
in Wt+h only due to vjt . Predetermined instruments are insufficient, unless the projection equation
includes an infinite number of lags of Wt .
It is well known since Sims and Zha [2006] that if a structural disturbance is invertible in (xt , yt ),
it is unlikely to be invertible in zt = S[xt , yt ] only. However, even in that case, impulse responses
could be identified, if a proper IV procedure is used, see e.g. Miranda Agrippino and Ricco [2019]
among others. Propositions 1 and 2 indicate that when deformation is present invertibility in zt alone
is a very low probability event and impulse response hard to identify. In addition, proper instruments
may be difficult to find when deformation exists.
Our analysis has implications for two related strands of literature. Rather than using small scale
VARs to validate a theoretical mechanism, it is quite common to employ them to cross off theories
inconsistent with the data (see e.g. Galı́ [1999], Angeletos et al. [2019]). While the qualitative features
of the responses are, at times, unchanged by deformation, magnitudes and persistences are generally
affected. Thus, it is dangerous to exclude theories, say, using variance decomposition exercises or the
magnitude of multipliers, as it is done in the literature.
29

It is also popular to estimate the parameters of a theoretical model by matching responses to
certain disturbances in the VAR of the data and in the theory, see e.g. Christiano et al. [2005].
Limited information approaches may avoid certain forms of misspecification of the theoretical model
in estimation. However, they are unsuited to reduce the gap that deformation creates unless a
large scale VAR is employed, see e.g. Christiano, Trabant, and Walentin [2010]. In fact, when
the DGP features more disturbances than VAR variables, three conditions need to be met to make
estimation meaningful. First, to avoid cross sectional deformation, the theory should be reduced
to the same observables used in the empirical model prior to the computation of the decision rules
and to estimation. Second, to avoid time deformation, data responses should be computed using a
generous lag length and carefully selected variables. Third, one needs to check that the disturbances
of interest are identifiable in the small scale empirical system using theory-based restrictions. When
any of these conditions fail, parameter estimates become difficult to interpret.

30

References
Martin Andreasan, Jesus Fernandez Villaverde, and Juan Rubio Ramirez. The pruned state space
system of a non-linear dsge model. Review of Economic Studies, 85:1–49, 2018.
Marios Angeletos, Fabrice Collard, and Harris Dellas. Business cycle anatomy. Technical report,
MIT manuscript, 2019.
Borogan Arouba, Luigi Boccola, and Frank Schorfeide. Assessing dsge nonlinearities. Journal of
Economic Dynamics and Control, 83:34–54, 2017.
Susanto Basu and Brent Bundick. Uncertainty shocks in a model of effective demand. Econometrica,
85:937–958, 2017.
Christianne Baumeister and James Hamilton. Sign restrictions, structural vector uttoregressions,
and useful prior information. Econometrica, 83:1963–1999, 2015.
Paul Beaudry, Patrick Feve, Alain Guay, and Frank Portier. When is non-fundamentalness in var a
real problem? Technical report, University of Toulouse, working paper 16-738, 2016.
Olivier J. Blanchard, Jean-Paul L’Huillier, and Guido Lorenzoni. News, Noise, and Fluctuations: An
Empirical Exploration. American Economic Review, 103(7):3045–3070, December 2013.
Olivier Jean Blanchard and Danny Quah. The dynamic effects of aggregate demand and supply
disturbances. American Economic Review, 79:655–73, 1989.
Philip Braun and Stephan Mittnik. Misspecification in vector autoregressions and their effects on
impulse responses and variance decompositions. Journal of Econometrics, 59:319–341, 1991.
Fabio Canova and Filippo Ferroni. Multiple filtering devices for the estimation of cyclical dsge models.
Quantitative Economics, 2:73–98, 2011.
Fabio Canova and Mehdi Hamidi Sahneh. Are small scale var useful for business cycle analysis?
revisiting non-fundamentalness. Journal of the European Economic Association, 16:1069–1093,
2018.
Ryan Chahrour and Kyle Jurado. Recoverability. Technical report, Boston College., 2018.
Varadarajan Chari, Patrick Kehoe, and Ellen McGrattan. A critique of structural vars using real
business cycle theory. Technical report, Federal Reserve Bank of Minneapolis, 2005.
Larry Christiano, Martin Eichenbaum, and Charles Evans. Nominal rigidities and the dynamic effects
of a shock to monetary policy. Journal of Political Economy, 113:1–45, 2005.
31

Larry Christiano, Matthias Trabant, and Karl Walentin. Dsge models for of monetary policy analysis.
Technical report, NBER working paper 16074, 2010.
Jon Faust and Eric Leeper. When do long run restrictions give reliable results. Journal of Business
and Economic Statistics, 15:345–353, 1997.
Jesus Fernández-Villaverde, Juan F. Rubio-Ramı́rez, Thomas J. Sargent, and Mark W. Watson.
ABCs (and Ds) of Understanding VARs. American Economic Review, 97:1021–1026, 2007.
Mario Forni, Luca Gambetti, and Luca Sala. Structural vars and noninvertible macroeconomic
models. Journal of Applied Econometrics, 34:221–246, 2018.
Jordi Galı́. Technology, employment, and the business cycle: Do technology shocks explain aggregate
fluctuations? American Economic Review, 89:249–271, 1999.
Raffaella Giacomini. The relationship between dsge and var models. in T.B. Fomby, L. Kilian, A.
Murphy, (eds.), Advances in Econometrics, 32, Emerald Press, pages 1–25, 2013.
Lars Peter Hansen and Thomas J Sargent. Two difficulties in interpreting vector autoregressions. in
L.P. Hansen and T.J. Sargent, (eds). Rational expectations econometrics, Westview Press Boulder,
CO, pages 77–120, 1991.
Matteo Iacoviello. House prices, borrowing constraints, and monetary policy in the business cycle.
American Economic Review, 95:739–764, 2005.
Lutz Kilian and Helmuth Lutkepohl. Structural Vector Autoregressive Analysis. Cambridge University
Press, 2017.
Eric Leeper, Todd Walker, and Susan Yang. Fiscal foresight and information flows. Econometrica,
82:1115–1145, 2013.
Jesper Linde’. Dsge models: still useful in policy analysis? Oxford Review of Economic Policy, 34:
269–286, 2018.
Helmuth Lutkepohl. Linear transformation of vector arma processes. Journal of Econometrics, 26:
283–293, 1984.
Albert Marcet. Time aggregation of economic time series. in L.P. Hansen and T. J. Sargent, (eds).
Rational expectations econometrics, Westview Press Boulder, CO, pages 237–281, 1991.
Silvia Miranda Agrippino and Giovanni Ricco. Identification with external instruments in structural
vars under partial invertibility. Technical report, University of Warwick and Bank of England,
2019.
32

Adrian Pagan and Tim Robinson. Implications of partial information for econometric modelling of
macroeconomic systems. Technical report, University of Sydney., 2018.
Mikkel Plagborg Moller and Christian Wolf. Instrumental variable identification of dynamic variance
decompositions. Technical report, Princeton University, 2019.
Pau Rabanal. An estimated dsge model to analysize housing market policies in hong kong sar.
Technical report, IMF working paper, 18-90, 2018.
Federico Ravenna. Vector autoregressions and reduced form representations of dsge models. Journal
of Monetary Economics, 54:2048–2064, 2007.
Barbara Rossi. Identifying and estimating the effects of unconventional monetary policy in the data:
How to do it and what have we learned? Technical report, Universitat Pompeu Fabra., 2019.
Juan Rubio Ramirez, Daniel Waggoner, and Tao Zha. Structural vector autoregressions: Theory of
identification and algorithms for inference. Review of Economic Studies, 77:665–696, 2010.
Christopher Sims and Tao Zha. Does monetary policy generate recessions? Macroeconomic Dynamics, 10:231–272, 2006.
Christian Wolf. Svar (mis-)identification and the real effects of monetary policy. Technical report,
Princeton University, 2018.

33

Appendix A: Proof of the propositions of section 3
Proposition 1: Simply match (29) with (19)-(20).
Proposition 2: To prove part i), we first match (31) and (21)-(22). Then u2t = (S2 F S2∗ − F̃2 )(I −
S2 F S2∗ L)−1 (G2 et−1 + H2 x1t−2 ) + G2 et + H2 x1t−1 . Because x1t has a VARMA(2,1) format: M (L)x1t =
N (L)et , where M (L) is invertible, we have u2t = λ2 (L)et , where λ2 (L) = G2 + (S2 F S2∗ − F̃2 )(I −
S2 F S2∗ L)−1 (G2 + H2 M (L)−1 N (L)L + H2 M (L)−1 N (L)L2 . Matching (31) with (25) one similarly obtains that u3t = λ3 (L)et . Part ii) and iii) are immediate.

34

Appendix B: The optimality conditions of the NK model with a reduced number
of endogenous variables
1) Theory with zt = (ot , πt , nt , rt ).
h
1
Et (at+1 + ot+1 − ot ) +
(at + ot − ot−1 ) + rt − Et πt+1
1 − h
1−h

h
(at + ot − ot−1 ) + (1 + σn ) nt + kp (µt − χt )
= Et πt+1 β + kp
1−h
= ζt + (1 − α) nt

χt = Et χt+1 −

(46)

πt

(47)

ot

rt = ρr rt−1 + (1 − ρr ) (φy (at + ot − ot−1 ) + φp πt ) + εmp t

(48)
(49)

2) Theory with zt = (ot , πt , nt ).
(1 + ρr )χt − ρr Et χt−1 = χt+1 −

1
h + ρr
Et (at+1 + ot+1 − ot ) + (
+ (1 − ρr )φy ) (at + ot − ot−1 )
1−h
1−h

hρr
) (at−1 + ot−1 − ot−2 ) + (ρr + (1 − ρr )φp ) πt + emp t − Et πt+1 (50)
1−h


h
= Et πt+1 β + kp
(at + ot − ot−1 ) + (1 + σn ) nt + kp (µt − χt ) (51)
1−h
= ζt + (1 − α)nt
(52)

− (
πt
ot

3) Theory with zt = (rt , πt , nt ).
χt = χt+1 −

1
(at+1 + ζt+1 − ζt + (1 − α) (nt+1 − nt ))
1−h

h
(at + ζt − ζt−1 + (1 − α) (nt − nt−1 )) + rt − πt+1
(53)
1−h


h
= πt+1 β + kp
(at + ζt − ζt−1 + (1 − α) (nt − nt−1 )) + (1 + σn ) nt + kp (µt − χt ()54)
1−h
= ρr rt−1 + (1 − ρr ) (φy (at + ζt − ζt−1 + (1 − α) (nt − nt−1 )) + φp πt ) + εmp t
(55)

+
πt
rt

35

4) Theory with zt = (ot , rt ).
β
1
(at+1 + ot+1 − ot ) +
(at+2 + ot+2 − ot+1 )
1−h
1−h
h
hβ
(
) (at + ot − ot−1 ) − (
) (at+1 + ot+1 − ot ) + rt − βrt+1
1−
1−h
h

h
1
kp
(at+1 + ot+1 − ot ) + (1 + σn )
(ot+1 − ζt+1 ) − kp (µt+1 − χt+1 )
(56)
1−h
1−α
βrt+1 + ρr rt−1 − βρr rt + (1 − ρr )φy ((at + ot − ot−1 ) − β(at+1 + ot+1 − ot ))
 


h
1
(1 − ρr )φπ kp
(at + ot − ot−1 ) + (1 + σn )
(ot − ζt ) + kp (µt − χt )
1−h
1−α
mpt − βmpt+1
(57)

χt = (1 + β)χt+1 − βχt+2 −
+
−
rt =
+
+

5) Theory with zt = (gt , nt ) (assuming β −1 = (1 − ρr )φπ π + ρr ).
1
ρr + h
gt+1 + (
+ (1 − ρr )φy )gt
1−h
1−h
h
hρr
gt−1 + mpt + κp (
gt + (1 + σn )nt ) + κp (µt − χt )
−
1−h
1−h
= at + ζt + (1 − α)nt − ζt−1 − (1 − α)nt−1

(1 + ρr )χt = ρr χt−1 + χt+1 +

gt

36

(58)
(59)

Appendix C: Additional graphs for the NK model

Sample Autocorrelation Function, Shock 1

1
0.8

0.8

0.6

0.6

0.4

0.4

0.2

0.2

0

0
0

2

4

6

8

10

12

14

16

Sample Autocorrelation Function, Shock 2

1

18

20

0

2

4

6

8

Lag
Sample Autocorrelation Function, Shock 3

1

0.8

0.6

0.6

0.4

0.4

0.2

0.2

0

0
2

4

6

8

10

12

14

16

18

20

12

14

16

18

20

Sample Autocorrelation Function, Shock 4

1

0.8

0

10

Lag

18

20

0

2

4

6

8

Lag

10

12

14

16

Lag

Note: Parallel lines describe 95 % asymptotic tunnel for the hypothesis of zero autocorrelations.

Figure C.1: Autocorrelation function, innovations in (ot , πt , nt , rt ) system.
Sample Autocorrelation Function, Shock 1

1
0.5
0
0

2

4

6

8

10

12

14

16

18

20

14

16

18

20

14

16

18

20

Lag
Sample Autocorrelation Function, Shock 2

1
0.5
0
0

2

4

6

8

10

12

Lag
Sample Autocorrelation Function, Shock 3

1
0.5
0
0

2

4

6

8

10

12

Lag

Note: Parallel lines delimit the 95 % asymptotic tunnel for the hypothesis of zero autocorrelations.

Figure C.2: Autocorrelation function, innovations in (ot , πt , nt ) system.
37

Sample Autocorrelation Function, Shock 1

1
0.5
0
0

2

4

6

8

10

12

14

16

18

20

14

16

18

20

14

16

18

20

Lag
Sample Autocorrelation Function, Shock 2
1
0.5
0
0

2

4

6

8

10

12

Lag
Sample Autocorrelation Function, Shock 3

1
0.5
0
0

2

4

6

8

10

12

Lag

Note: Parallel lines delimit the 95 % asymptotic tunnel for the hypothesis of zero autocorrelations.

Figure C.3: Autocorrelation function, innovations in (πt , nt , rt ) system.
0

0
-0.2

-0.5
-0.4

Small system 016
Small system 084
True MP shock

-1

Small system 016
Small system 084
True MP shock

-0.6

-1.5

-0.8
0

5

10

15

20

25

0

5

Output responses

10

15

20

25

Inflation responses
1

0

Small system 016
Small system 084
True MP shock

-0.5
0.5
-1
Small system 016
Small system 084
True MP shock

-1.5

0

-2
-0.5
0

5

10

15

20

25

0

Hours responses

5

10

15

20

25

Interest rate responses

Note: The dashed regions report 68 % interval obtained accounting for rotation uncertainty. The solid line reports
the responses in the DGP.

Figure C.4: Responses to monetary policy shocks, (yt , πt , nt , rt ) system.

38

Permanent Technology

Shock 1

Transitory Technology

Preferences

0.15

0.15
0.1

0.1

0.05

0.05

0

0

-0.05

-0.05

Monetary Policy
0.15

0.1

0.1

0.1

0.05

0.05

0.05

-0.1

-0.1

-0.15

-0.15

-20

0

20

-20

0
-0.05

0

20

-0.1
-20

0

20

0

0

-0.05

-0.05

-0.1

-0.1

-0.15
-20

-0.15
-20

0

20

0.15

0.15

0.15

0.15

0.1

0.1

0.1

0.1

0.1

0.05

0.05

0.05

0

0

-0.05

-0.05

-0.1

-0.1

0.05

Shock 2

Markup
0.15

0.15

-0.15
-20

-0.05

0

Lag

20

-0.1
-20

0

20

-0.05

0
-0.05
-0.1

20

0

0.05
0

0

0

Lag

20

-0.15
-20

0

20

Lag

-0.15
-20

-0.1

0

Lag

20

-0.15
-20

Lag

Note: Parallel lines describe the 95 % asymptotic tunnel for the hypothesis of zero cross correlations.

Figure C5: Cross correlation function, innovations in (gt , nt ) system and structural shocks.

39

Appendix D: The (linearized) equations of the extended Iacoviello model
rrt = rt − pit+1

(60)

yt = cy ct + (1 − cy − ciiy − iy )cit + ciiy ciit + iy it

(61)

cit = cit+1 − rrt

(62)

it − kt−1 = γ(it+1 − kt ) +

1
(1 − γ(1 − δ)
(yt+1 − xt+1 − kt ) + ( (ct − ct+1 )
ψ
ψ

(63)

qt = γE qt+1 + (1 − γE )(yt+1 − xt+1 − ht ) − mβ rrt − i1,t − (1 − mβ)(ct+1 − ct )
− φE (ht − ht−1 − γ(ht+1 − ht ))

(64)

qt = γH qt+1 + (1 − γH )(jt − hiit ) − miiβ rr + (1 − miiβ)(ciit − ωciit+1 )
− φH (hiit − hiit−1 − βii (hiit+1 − hiit )

(65)

phiH
(h(ht − ht−1 )
hi
+ hii(hiit − hiit−1 ) − βh(ht+1 − ht ) − βhii(hiit+1 − hiit ))

(66)

qt = βqt+1 + (1 − β)jt + ιht + ιii hiit + cit − betacit+1 +

bt = qt+1 + ht − rrt + i1,t
biit = qt+1 + hiit − rrt
η
1−ν−µ
(at + νht−1 + µkt−1 ) −
(xt + αcit + (1 − α)ciit )
η − (1 − ν − µ)
η − (1 − ν − µ)
= βπt+1 − κxt + ut

(67)
(68)

yt =

(69)

πt

(70)

kt = δit + (1 − δ)kt−1

(71)

by
(rt−1 + bt−1 − πt ) − (1 − si − sii)(yt − xt )
β
biiy
= ciiy ciit + qhiiy (hiit − hiit−1 ) +
(biit−1 + rt−1 − πt ) − sii(yt − xt )
β

by bt = cy ct + qhy (ht − ht−1 ) + iy it +
biiy biit

(72)
(73)

rt = (1 − ρR )(1 + ρπ )πt−1 + ρy (1 − ρR )yt−1 + ρR rt−1 + eR

(74)

jt = ρj jt−1 + ej

(75)

ut = ρu ut−1 + eu

(76)

at = ρa at−1 + ea

(77)

i1,t = ρ1 i1,t−1 + ei 1

(78)

tct = cy ct + (1 − cy − ciiy − iy )cit + ciiy ciit

(79)

tht = ht + hiit

(80)

40

Interest rate

Output

Inflation

0.8
0.03

0.6

0.02

0.4

0.05

0

0.2
0.01
0
0

-0.05

-0.2
5

10

15

20

5

House stock

10

15

20

5

10

15

20

House prices

6
1

4

vardatau
vardatal
theory5

2
0.5
0
0

-2
5

10

15

20

5

10

15

20

Note: The solid blue line represents responses to preference shocks in a theory with 5 disturbances. The red dashed
lines represent the 68% highest posterior responses to an identified house price shocks in a VAR with five variables
and data simulated from the model with 5 disturbances.

Figure D1: Responses to a Cholesky identified qt innovations in a 5 variable VAR and theoretical
preferences disturbances.

41

Appendix E: The equations of extended Basu and Bundik model
yt + f ixedcostt = productionconstant ∗ (zt ∗ n)t)( 1 − α) ∗ (ut ∗ kt−1 ))( α)
ct + leverageratio ∗ kt /rrt = wt ∗ nt + det + leverageratio ∗ kt−1

(82)

wt = ((1 − η)/η) ∗ ct /(1 − nt )
vf

= (utilityconstant ∗ at ∗

(
(ct η)

(83)
(

(

∗ (1 − nt ) 1 − η)) (1 − σ)/thetavf )

(

+ β ∗ expvf 1sigmat 1/thetavf ))( thetavf /(1 − σ))
(
vft+1 1

(81)

(84)

− σ)

(85)

wt ∗ nt = (1 − α) ∗ (yt + f ixedcost)/mut

(86)

expvf 1sigmat =

rrkt ∗ ut ∗ kt−1 = α ∗ (yt + f ixedcost)/mut

(87)

qt ∗ deltauprimet ∗ ut ∗ kt−1 = α ∗ (yt + f ixedcost)/mut

(88)
(

kt = ((1 − deltaut ) − (φk /2) ∗ (invt /kt−1 − delta0 ) 2)) ∗ kt−1 + invt
2

deltaut = delta0 + delta1 ∗ (ut − 1) + (δ2 /2) ∗ (ut − 1)

(90)

deltauprimet = delta1 + delta2 ∗ (ut − 1)
sdft = β ∗ (at /at−1 ) ∗
∗

(
((ct η)

∗ (1 − nt )( 1 −

(89)

(
η))/(ct−1 η)

(91)
1
−
σ
)
∗ (1 − nt−1 )( 1 − η)))(
thetavf

(

(ct−1 /ct ) ∗ (vft 1 − σ)/expvf 1sigmat−1 )( 1 − 1/thetavf )

1 = rrt ∗ sdft+1

(92)
(93)

(

1 = rt ∗ sdft+1 ∗ (piet+1 ) − 1)

(94)

1 = sdft+1 ∗ (ut+1 ∗ rrkt+1 + qt+1 ∗ ((1 − deltaut+1 ) − (φk /2) ∗ (invt+1 /kt − δ0 )2
+ φk ∗ (invt+1 /kt − δ0 ) ∗ (invt+1 /kt )))/qt

(95)

1 = sdft+1 ∗ (det+1 + pet+1 )/pet

(96)

log rt = (1 − ρr ) ∗ (log(rss) + ρpie ∗ log(piet /piess) + ρy ∗ log(yt /yt−1 ))
+ ρr ∗ log(rt−1 ) + e

(97)
2

det = yt − wt ∗ nt − invt − (φp /2) ∗ (piet /piess − 1) ∗ yt
qt−1

− leverageratio ∗ (kt−1 − kt /rrt )

(98)

= 1 − φk ∗ (invt /kt−1 − δ0 )

(99)

φp ∗ (piet /piess − 1) ∗ (piet /piess) = (1 − thetamu) + (thetamu)/mut + sdft+1 ∗ φp ∗ (piet+1 /piess − 1)
∗

(yt+1 /yt ) ∗ (piet+1 /piess)

(100)

prof itt = (µt − 1) ∗ yt − f ixedcost

(101)

expret = (det+1 + pet+1 )/pet

(102)

expre2t =
varexpret =

2

(det+1 + pet+1 ) /pe2t
expre2t − expre2t

at = (1 − ρa ) ∗ ass + ρa ∗ at−1 + volat−1 ∗ eat
volat = (1 − ρvola ) ∗ volass + ρvola ∗ volat−1 + volvola ∗ evolat

(103)
(104)
(105)
(106)

zt = (1 − ρz ) ∗ zss + ρz ∗ zt−1 + volz ∗ ezt

(107)

et = (1 − ρe ) ∗ ess + ρb ∗ et−1 + voless ∗ eet

(108)

42