Full text of Working Papers (Federal Reserve Bank of Richmond) : Information-Aggregation Bias, Working Paper 91-06

View original document

The full text on this page is automatically extracted from the file linked above and may contain errors and inconsistencies.

Working Paper Series

Information-Aggregation Bias

WP 91-06

This paper can be downloaded without charge from:
http://www.richmondfed.org/publications/

Marvin Goodfriend
Federal Reserve Bank of Richmond

Working Paper 91-6

INFORMATION-AGGREGATIONBIAS

Marvin Goodfriend*

June 1991

Abstract

Aggregation in the presence of data processing lags distorts the
information content of data, violating orthogonality restrictions that hold at
the individual level. Though the phenomenon is general, it is illustrated
here for the life cycle-permanent income model. Cross-section and
pooled-panel data induce information-aggregationbias akin to that in
aggregate time series. Calculations show that information-aggregation can
seriously bias tests of the life cycle model on aggregate time series,
cross-section, and pooled-panel data. (JEL C2, D12, D82, D91, E21)

*Research Department, Federal Reserve Bank of Richmond, Richmond, Virginia
23261. The paper has benefited from conversations with Marjorie Flavin,
Robert King, Bennett McCallum, and Stephen Zeldes. It has also benefited from
presentations at the Federal Reserve Bank of Richmond, Indiana University,
the Konstanz Seminar on Monetary Theory and Policy, North Carolina State
University, University of Pennsylvania, University of Rochester, and
University of Virginia.

Introduction
Econometric method interprets economic time series as resulting
from the choices of private agents interacting in well-organized markets.
Individual agents are imagined to solve carefully specified dynamic and
stochastic optimization problems. Decision rules yield optimal supply and
demand (e.g., consumption, labor supply, and asset demands) in period t as
functions of period t information on variables exogenous to the individual.
The method is attractive because it yields first-order conditions that imply
readily testable restrictions on time series generating processes.
Restrictions implied by theory at the individual level have been
tested on aggregate data by assuming that they are invariant under
aggregation. This paper shows that orthgonality restrictions implied by
intertemporal optimization, rational expectations, and information processing
need not hold under aggregation of randomly heterogeneous and imperfectly
informed representative agents.

Put more simply, aggregation creates problems

for econometric estimation and evaluation of models that rely critically on
the distinction between expectations and surprises at the individual level.
The life cycle-permanent income hypotheses is one such model.

Tests

for excess sensitivity of consumption have received substantial attention in
recent years.

The tests of consumption theory are based on particularly

simple orthogonality restrictions. Thus, the consumption example provides a
natural context within which to illustrate the information-aggregation bias
that can invalidate orthogonality restrictions generated at the individual
level.

It must be emphasized, however, that the information-aggregation bias

identified here is quite general and will interfere with the estimation and
evaluation of other economic models as well.

- 2 -

The discussion opens in Section I with a brief review of the
testable implications of the life cycle-permanent income hypothesis developed
by Hall [1978]. In contrast to the discussion in Section I, which proceeds
entirely at the level of a representative agent using aggregate variables, the
analysis in Section II is carried out initially at the level of the individual
agent and deals explicitly with the fact that individual agent income is
generated by aggregate and relative components. Because aggregate income is
only observable with a delay due to data processing lags, an individual's
response to innovations in his own income involves signal processing.

In a

manner related to Lucas's [1976] Phillips curve example, aggregating
individual agent decision rules yields some surprising results for the
aggregate "consumption function." For example, information-aggregation bias
invalidates Hall's test of the life cycle-permanent income model using
aggregate data, though the test remains valid for variables, such as stock
prices, that are widely known contemporaneously.
Section III shows that information-aggregationbias can be important
in practice.

To illustrate the point, Flavin's [1981] estimate of the excess

sensitivity of consumption is reinterpreted after taking explicit account of
randomly heterogeneous agent income within the framework of Section II.
Section IV shows that procedures involved in testing the life cycle-permanent
income model on cross-section and pooled-panel data induce informationaggregation bias akin to that in aggregate time series.

Calculations show

that the bias can be important for cross-section and pooled-panel data too.

Hall's Test of the Life Cvcle-Permanent Income Model

Hall [1978] considered a conventional life cycle-permanent income
model under uncertainty in which the household chooses a stochastic

- 3 -

consumption plan to maximize the expected value of its time-additive utility
function

subject to the budget constraint,

t~O(l+r)-t(ct-yt>

(2)

where

- Ao,

E mathematical expectation conditional on all
information available in t, including c
t-i' Yt-i I
and A t l, for i - 0, 1, 2, ...

E subjective rate of time preference

E a constant real rate of interest

Ct
Yt
At

E consumption
E labor income
= assets

The first-order necessary condition for maximization of (1) subject
'to (2) is

,yJ

(3)

= [(1+6)/(l+r)lU’(ct_l>.

As Hall pointed out, (3) implies that no information available in
period t-l apart from the level of consumption, ct 1, helps predict
consumption, c
utility.

in the sense of affecting the expected value of marginal

In particular, income or assets in period t-l or earlier and

consumption in t-2 or earlier are irrelevant once c
is taken into account.
t-l
Using (3), Hall went on to argue that the AR1 process ct = Xct 1 + <,, where
tVt

= 0, should be a good approximation to the stochastic behavior of

consumption under the life cycle-permanent income hypothesis.

The Et term

4 -

reflects consumption adjustments due to period t "news" on current and future
income prospects.
Hall proposed and implemented tests of the life cycle-permanent
income model by checking whether lagged income, lagged stock prices, ,'or
additional lags of consumption help predict consumption after one consumption
lag is taken into account. On the basis of (3), he regarded evidence that
variables other than consumption at one lag help predict consumption as
inconsistent with the life cycle-permanent income model.

II)

Randomlv Heterogeneous and Imnerfectlv Informed Aeents

Hall's test of the life cycle-permanent income model proceeds
entirely at the level of a representative agent using aggregate variables.

contrast, the analysis in this section is carried out initially at the level
of the individual agent and deals explicitly with the fact that individual
agent income is generated by both aggregate and relative components. The goal
is to derive an aggregate consumption generating process by explicitly summing
'decision rules of randomly heterogeneous and imperfectly informed agents.
The economy is populated by n individual agents.
income,

' is assumed to be generated as the sum of an aggregate component,
Yt9

and a relative income component, v1
t’

(4)

where

The ith agent's

such that

i
Yt
E i=o.
i-lvt
The relative income generating process is assumed to be identical

across agents.

Agents differ only by their relative income innovations. They

are, in effect, randomly heterogeneous representative agents. Aggregate and
relative income are assumed to be uncorrelated. As in Section I, agents are

- 5 -

assumed to be infinitely lived. One may think of aggregate and relative
income as generated by ARMA processes. We may write

(5)
where

i
21

agent's expectation conditional on information

= the
set

= the aggregate income innovation in period t
Et
i
- the relative income innovation in period t.
Ut
For what follows, it is convenient to assume that the utility
function in (1) is quadratic, U(ct) = -[l/23(&,)2,

so that, by (3), house-

hold consumption obeys the exact regression ct - c(r-6)/(l+r) +
[(1+6)/(l+r)]ctNl + tt.

Assume, in addition, that the real interest rate

equals the rate of time preference, so household consumption follows a random
walk.
If individual agent consumption obeyed the random walk version of
the life cycle-permanent income model, and if aggregate income were
contemporaneously observable, then using (5) we could write
.

(6)

AC: = rdA$)ct+

where

$A, 4R =

rdRut,

the present value of a revision, at time t,
in the expected path of future income in
response to one unit period t innovations in
the aggregate and relative components of
individual agent income, respectively.

The rtiAand rlR terms represent the period t revisions in permanent income
upon observing the period t innovations in the aggregate and relative
components of individual income. Under the random walk version of the life
cycle-permanent income model, period t consumption adjusts one-for-one with

- 6 -

these terms. .Expression (18) below gives 4 in terms of the parameters of the
underlying income generating process.
With no other complications, aggregating (6) over all n agents using
(5) and i-lv:
E - = 0 yields

(7)

AC,- rdAet.

Since et is an aggregate income innovation, the disturbance in (7)
is serially uncorrelated, and consumption is still a random walk in the
aggregate. Moreover, et is unpredictable on the basis of lagged income.

short, when aggregate income is contemporaneously observable, Hall's test of
the life cycle-permanent income model remains valid for aggregate data.
However, aggregate income is not contemporaneously observable. Data
collection and processing lags delay the observation of national income for at
least one quarter.

To capture this information delay, assume agents observe

aggregate income with a one period lag.1
In this case, agents must infer as best they can the contemporaneous
aggregate income innovation et from their own contemporaneous income
.
. .
innovation, Y:-&Y: * In other words, agents must engage in signal
processing in implementing their optimal consumption adjustment rule.

Using

equation (5), the optimal agent inference of E conditional on observing
t
.

~:-~E;yi may be written

(8)

where

fct

= nn&gY:lP

ith
agent's expectation conditional on information set
E the i
I!
I *-*t

Ytml’

Y,-4’

. ..).

i
and where (5) and statistical independence of 6 and ut
imply
t

- 7 -

(9)

In addition to receiving new information on individual income each
period, agents

receive

a direct observation on the previous period's aggregate

income. Any discrepancy between the previous period's actual income
and last period's prediction of ctel from observing
innovation e
t-l
i
represents new information to the individual agent. Because
Yt-&Y:_l
the new information generally leads to a revision of permanent income, it also
induces an adjustment in consumption.
Decision rule (6) must be modified to take the above considerations
into account.

First, ict and u: must be replaced with their optimal
.
. .
contemporaneous inferences n(y~-tH;y~) and (l-n)(y'
;-fgy:, 9 respectively.
Second, the decision rule must include the $6, and u i inference errors made
t
known when aggregate period t income is published

in period

t+l.

Lagged unit

e and u inference errors cause permanent income to be revised by rdA(l+r) and
r4R(l+r) units, respectively. Hence, equation (6) becomes
.

(10)

AC: = r#An(y~-t+:)

+ r(l+r)dAl~ct-l-n(y:_l-tE~y~
- ,)I

+ r+R(l-n)(yf-tiz:yt)+ r(l+r)dR[u~-l-(l-n)(y~-l-tE~y~_l)l.

Using (5), equation (10) may be rewritten

(11)

AC: - r[dAn+dR(l-n)](&+u:)

+-r(l+r)(dA-dR)(l-n)Jnct-l

+ r(l+r)s2(4R-~A)u~-l.

- 8 -

Substituting with (9) for n in (ll), it is easy to show that
COV[AC:,AC~-~] is zero.

In other words, individual consumption is still

a random walk as it was in (6). COV[+Y~~~I

is also zero.

Imperfect

information does not invalidate Hall's test of the life cycle-permanent
income model at the individual level.
Using the fact that (5) and &vE

.
= 0 imply i$u: = 0, sum (11)

over all n agents to derive the aggregate consumption generating process

(12)

ACt

= r[dAn+#R(l-n)]ft + r(l+r)(l-n)[dA-~RILt-l.

The model underlying the aggregate "consumption function" (12) is
one in which individual agents follow the random walk version of the life
R
cycle-permanent income model exactly. Yet, since O<n<l, unless 4A equals 4
aggregate consumption is not a random walk, i.e., Cov[AcyAct-ll

is not zero.

This is surprising because the decision rules which together determine
aggregate consumption are themselves random walks.

Efficient information

processing guarantees that individual consumption remains a random walk with
randomly heterogeneous and imperfectly informed agents. However, the portion
of the aggregate income innovation contemporaneously unperceived by
individuals, i.e., (l-n)c
affects their consumption both contemporaneously
t'
and with a lag.

Relative income aggregates out but the consecutive effect of

the initially unperceived aggregate income innovation does not, so changes in
aggregate consumption are autocorrelated.
Because Cov[A~~,y~-~] is not zero unless dA equals #R, lagged
aggregate income generally helps predict aggregate consumption changes
generated by (12). Since that finding would cause Hall to reject the life
cycle-permanent income model even though individual agents were following a

version of it exactly, his test is clearly invalid for aggregate data under
imperfect information.
In this example, Hall's test is protected against informationaggregation bias by a simple lagging procedure. With a one period reporting
lag, the life cycle-permanent income hypothesis implies that aggregate income
lagged two or more periods should not help predict aggregate consumption.
Hall's test is valid here for aggregate candidate predictor variables lagged
two periods or more.2
In practice, data are revised over a long period of time, so even
aggregate predictor variables lagged beyond their initial publication date
would still induce information-aggregationbias.

The problem is completely

circumvented by employing as instruments variables, such as interest rates and
stock prices, that are widely known contemporaneously.

III)

ReinterDreting Flavin's Estimate of
The Excess Sensitivitv of Conswnntion

Flavin [1981] used Hall's insights, but went beyond his "reduced
form" test to develop a simple structural econometric system in which she
estimated parameters measuring excess sensitivity of consumption to income.
This section demonstrates the potential size of information-aggregation bias
in the context of Flavin's estimation strategy. Flavin's method may be
illustrated as follows.

She proceeded by adding a current and seven lagged

@Ay terms to an equation like (7) to capture any direct effect of income on
consumption apart from the effect operating through revisions in permanent
income.

I illustrate her consumption generating process here by adding a

current and only two lagged /3Ayterms as follows

(13)

Act

= BOAY, + BlAY,-1 + B2AY,-2 + rdAft.

Flavin noted that the /3coefficients in (13) cannot be consistently
estimated by OLS because Ayt is correlated with the disturbance term r#Act.
Using the fact that agents see their own income contemporaneously, Flavin
pointed out that income in period t-l and earlier cannot be "news" in period t.
On that basis, she used a univariate income autoregression to eliminate yt
in (13). Assuming, for illustrative purposes, that income is generated by an
AR3 process, her substitution yields the following consumption regression
(14)

ACt

- p + [Bo(P1-l)+B1lYt-l+ u$)P2-B,+B21Yt-2

+ [Bop3-82lY,-3

+ [rd +B,l~,

The coefficients in the income autoregression, i.e., pl, p2, and p3,
together with the y coefficients in (14) just identify the /Jo,p,, and @,
coefficients. Flavin argued that ps significantly different from zero should
be interpreted as evidence that consumers do not behave according to the life
cycle-permanent income hypothesis.

In her words, finding &s to be positive

would indicate an excess sensitivity of consumption to current income.
By expressing the AR3 income generating process in moving average
form, (14) can be rewritten as

(15)

ACt = P + =ft-l

where

=1 = Bo(P1-l) + B,
“2 E “lP1

+ 4-93

A3 = +q+P2)

+ *2e t-2 + =gt-3

+ Wt

- Bl + B2
+ h2-7y1)P1+

B(y3-

A
= (r4 +B,)e,
+ (6 terms lagged more than three periods)

- 11 Now suppose that equation (14) is estimated using data generated by
the model from Section II in which individual consumption follows a random
walk, but aggregate information is received with a lag.

In this case, we

recover the ps as follows. Using (12) and the fact that the et 1, et 2, and
%-3

in (15) are orthogonal to each other and to w

we may express the A

regression coefficients as3

(16)

rl- r(l+r)(l-n)(dA-#R)
lr’2 - 0
"3 = 0.

The ps solve equation system (16). For example, the solutions for
PO and j?,are
l-q-P2
(17)

BO -=

[

Pl+P2+P3-l

l-P1-P2

Bl
where

p1+p2+p3-1

9rl[l+(l-Pl)[
Pl+P2+P3-l

< 0 required for stationarity of the

AR3 income autoregression.

The model underlying the expressions for PO and /3, in (17) is one in
which individual agents follow the life cycle-permanent income hypothesis
exactly.

Yet

the ps would all be

require that dA equal dR.

zero

only if ~~-0 which, since 0~0~1, would

The condition #A-dR means that the present value of

a revision in the anticipated path of future income due to an innovation must
be identical for both the aggregate and relative income generating components
of individual income. There is no reason to expect this.

In fact, Friedman

[1957] exploited cross-sectional differences in income generating processes'
for his most striking confirmation of the permanent income hypothesis.

- 12 -

short, once randomly heterogeneous agent income is taken into account in
deriving the aggregate consumption function, there is no reason to expect the
ps to be zero even if agents follow the life cycle-permanent income hypothesis
exactly. Contrary to Flavin's claim, estimating the ps to be significantly
different from zero does not necessarily constitute evidence against the life
cycle-permanent income hypothesis.
Information-aggregation affects Flavin's p estimates by biasing x1
away from zero. We can assess the potential importance of the bias by
focusing on fll. First choose a plausible set of parameter values.

Since

Flavin used quarterly data, if we take the annual interest rate to be 4
percent, then r-0.01. Assuming the innovation variances of the aggregate and
relative components of individual income to be the same, we have S%=O.5.5 In
addition, we need to multiply the expression for x1 by 0.5 to account for the
fact that Flavin used only nondurable consumption as the dependent variable,
which averages roughly half of the total over her sample period.

The result

is 0.0025(dA-dR).6
What is a plausible range for OA-dR?

Flavin, pp. 988-89, shows that

for an ARMA(p,q) income generating process, 4 can be expressed as

(18)

~(Pj,

j-1,

.--,

7,~

S-1,

. . . .

l+r

Consider, for example, an ARMA(l,l) process. Using (18), we can write the
value of 4 corresponding to that process as

(19)

4(P,7)

1 (l+r+7
= l+r l+r--p)'

Expression (19) makes clear that the value of 4 can be extremely
sensitive to values of the autoregressive and moving average coefficients, p

- 13 -

and 7.
p

p-l

and 7-0, then income is a random walk and 4-100.

But values of

only a bit below unity can reduce 4 substantially. For instance,

7-O yield 4-17.
when

and

p-O.95

Finally, differences in the moving average coefficient 7,

is unity or near unity, can affect 4 by as much as a factor of 100.
Since aggregate income is near or actually nonstationary, even small

differences in persistence between relative and aggregate income generating
processes can yield large values for 4A-4R.7

For example, the set of

parameter values chosen above requires 4A-4R to be about 23 for
information-aggregationbias alone to account for Flavin's estimated A
of 0.058.

value

If aggregate and relative income generating processes were each

AEMA(l,l) with

p-l,

the required 4A-4R value could be produced by a difference

between their moving average (7) coefficients of only 0.23. Alternatively, if
aggregate and relative income processes were AR1 and
income, then a
required 4A-4R.

were 0.99 for aggregate

value of about 0.97 for relative income would yield the
It seems fair to say that information-aggregation can

easily yield large enough values of K 1 to be important in the context of
Flavin's strategy for estimating the excess sensitivity of consumption to
income.
Flavin's B estimates are vulnerable to information-aggregation bias
because ytW1 is not a valid instrument for Ayt in (13). The reason is that
<n
ytB1 coincides with the one period reporting lag.
procedure described above applies here too.

However, the lagging

In this example, Flavin's

procedure is protected against information-aggregationbias by simply lagging
the set of income instruments one period beyond the publication lag, e.g., by
using income from t-2 and earlier to instrument for by, and Ay, 1.

mentioned above, the fact that actual data.are revised after their initial
release means that even aggregate variables lagged beyond their initial

- 14 -

publication date may not be free of bias.

Again, widely-known

contemporaneously-observableinstruments such as financial prices completely
circumvent the problem.
It is worth pointing out that while information-aggregation bias may
partly explain the rejections of the life cycle model in aggregate data, it
appears not to be the sole explanation. For example, Campbell and Mankiw
[1990] perform a test that is robust to the information-aggregation problem.
They regress the log change in consumption on the contemporaneous log change
in income, using nominal interest rate changes dated t-2 and earlier as
instruments. They find a large and statistically significant coefficient.

IV)

Imnlications for Cross-Section and Panel Data

A number of studies have been conducted implementing Hall's test of
the life cycle-permanent income model on panel data.8

Because such procedures

are based on data for individual spending units, they would appear to be
immune to misinterpretations arising from information-aggregationbias.

This

would certainly be the case if the tests were performed on individual
household data separately, since the orthogonality conditions implied by (3)
have to hold for individual households following the life cycle-permanent
income model.

However, panel data consist of a large number of households

reporting over a relatively short time period.

For example, Hall and Mishkin

[1982] used a panel of about 2000 households reporting annual consumption over
a seven-year span.

Because the time series on individual households are so

short, panel data is invariably pooled prior to applying statistical
procedures. The purpose of this section is to illustrate how statistical
procedures associated with pooling induce an information-aggregation bias akin
to that in aggregate time series.

- 15 -

To illustrate the point, suppose that we have individual agent data
.
AC: and Ay: generated in the environment of Section II. For such data, we
th
agent,
for the i
know that a simple regression of AC:
.
(20)

where

AC: - ai* + aiAytal+ et,
i

i - 1, 2, .... n;

= the disturbance term,

yields ai-0. We also know that a single regression pooling all the household
data would yield a coefficient on Ayiml of zero.

Assuming that the number of

time series observations is large, consistency in the pooled regression
.
requires only that Cov[Ay:-l,e:] is zero, a condition guaranteed by individual
behavior in accord with the life cycle-permanent income hypothesis.
The number of time series obsenrations is, however, often quite
small in panel data.

For a short panel, a consistent estimate of the a

coefficient in the pooled regression requires the stronger condition that
.
the e: disturbances be uncorrelated. See Chamberlain [1984]. The life
.
'cycle-permanent income hypothesis guarantees that ei is serially uncorrelated.
But because they capture the adjustment in consumption due to news about
.
future income prospects, which includes common components, the e: disturbances
are correlated across agents in any given period.

If, for example, aggregate

income were unexpectedly high this period, then all agents would tend to have
higher than expected consumption this period.
Most studies using pooled data purge individual household
consumption expectational errors of their common component to yield consistent
parameter estimates and standard errors. One technique for doing so, as in
Zeldes [1989], is to use wave dummies to capture the aggregate component of
.

expectation errors.

For example, one could regress Act
' for each agent in the
-.
.
panel on the aggregate Act and use only the residual, AC: E AC: - E[Ac:(Act],

- 16 -

in the pooled regression. For data generated in the environment of Section
II, we would have
“.

(21)

AC: - r[4An+4R(1-n)]u:
me

+ r(l+r)n(4R-4A)u~-l.

absence of E terms in (21) indicates that the procedure does

purge individual consumption data of its common component. However. if 4A+4R,
then neither Co~[Ac:,ac:-~] nor COV[A~~,A~~-~] would be zero. Therefore, an
-.
1
econometrician using the transformed series, ACt' would not find consumption
to be a random walk.

Nor would he find the a2 coefficient to be zero, even

though individual agents were following the random walk version of the life
cycle-permanent income model.

Note the similarity between (21) and (12).

Aggregation eliminates the u terms in (12). Purging individual consumption
changes of their common component eliminates the e terms in (21). Both data
transformations make individual agents appear inefficient relative to life
cycle-permanent income behavior by implicitly imputing to them too much
contemporaneous information.
The transformation would be admissible only if it were unimportant
for individual agents to distinguish relative and aggregate components of
AR
their own income, i.e., if 4 -4
observable.

, or if aggregate income were contemporaneously

In general, however, there is a tension between removing residual

correlation and introducing information-aggregationbias.

In short, because

of statistical procedures associated with pooling, tests of the life cyclepermanent income model on panel data are not immune to information-aggregation
There seems to be no way of avoiding the bias for the coefficient
.
on Ay: 1. But in this example too, the coefficients on a set of Ayi

bias.

regressors lagged at least two periods are free of information-aggregation
bias.

Note once more, however, that data revisions in practice mean that

- 17 -

aggregate variables lagged beyond their initial release date are still not
completely free of bias.
In Section III, we saw that information-aggregationbias could be
large in the context of Flavin's strategy for estimating the excess
sensitivity of consumption to income. We also saw that since Flavin
estimated nl>O, 4A must exceed dR for information-aggregationbias to
contribute to the excess sensitivity she found.

For 4%4R, from (21) we would

predict that if individual agents followed the life cycle-permanent income
model, studies using pooled-panel data would find a negative correlation
between the change in consumption and lagged income.

In fact, studies such as

Hall and Mishkin [1982], Hayashi [1985], and Zeldes [1989] all find
significant negative correlation.
The above result is intriguing. One wonders to what extent
information-aggregation might be responsible for it.
that question here.

But we will not pursue

We are merely interested in gauging the potential

importance of information-aggregation for pooled-panel data.

One way of doing

so is to ask whether information-aggregationbias could account plausibly for
the magnitude of the negative correlation. Hall and Mishkin used annual data
and Zeldes used data in logarithmic form. However, Hayashi's finding is for
simple quarterly changes, so his is readily compared to the results discussed
above for aggregate time series.
For a panel of about 2000 Japanese households, Hayashi [1985] p.
1092, reported a highly significant correlation coefficient of -0.08 between
the change in food expenditure from 1981:4 to 1982:l and the lagged change in
income. By definition, the correlation coefficient is calculated using data
in deviation from mean form.
-.

COW+Y~-~

In other words, Hayashi reported an estimate of
1

]/ (VarAci)(VarAii), where A;: - Av;.
*

Hayashi's correlation

- 18 coefficient was estimated using only a cross section. But informationaggregation causes problems even if a cross section is used in estimation
because the constant term is equivalent to a single time dummy.9
We can use individual consumption data generated in the environment
of Section II subjected to the W-W transform, to get an idea of the size of
the correlation coefficient we would expect to see in practice if individual
agents followed the life cycle-permanent income model exactly but aggregate
income were reported with a one period lag.l"
-.
-.
To do so, we use AC: from (21) to write Co~[Act,Ay:-~] 'i
r(l+r)n(4R-4A)ot and Varbc -

r2[4An+4R(1-n)]2+[r(l+r)n(4R-gA)]2~t.11

The 4A and 4R values in these formulas depend, as discussed in Section III, on
the processes that generate aggregate and relative income. The formula for
VarAyi depends on the characteristics of the relative income generating
process.12
Consider some simple cases.

Suppose aggregate income were a random

walk and the first-difference of relative income were MAl.
n-O.5 as assumed in Section III.

Let r-0.01 and

In this case, a relative income MA

coefficient of only -0.15 yields the -0.08 correlation coefficient found by
Hayashi.

Alternatively, if aggregate and relative income were both trend-

stationary AR1 processes with an AR coefficient of 0.99 for aggregate income,
then an AR coefficient of only 0.9865 for relative income would yield -0.08.
Casual evidence might suggest that the relative income innovation variance
greatly exceeds that for aggregate income innovations, so that n may be
relatively close to zero.

Even so, an Q as low as 0.05 yields a value for

Hayashi's correlation coefficient of -0.09 if aggregate income is a random
walk and relative income is a trend-stationary AR1 with an AR coefficient of
0.98.

- 19 -

Hayashi's Table IV reports an estimated ARIMA (2,1,0) relative
disposable income process with first and second order AR coefficients of -0.92
and -0.35 respectively. As a final example, consider matching Hayashi's
estimated process with a realistic aggregate income process.

Symmetric

treatment argues for using a difference-stationary representation for
aggregate income. Unfortunately, Hayashi does not report an aggregate income
process. But consider one for quarterly labor income estimated on post-war
U.S. data reported in Campbell and Deaton (1989), p. 361.

Their aggregate

income process is AR1 in growth rates with an AR coefficient of 0.443.
For these aggregate and relative income generating processes,
&179

and 4L44.13

In this case, n-O.5 yields a value for the correlation

coefficient of -0.35.

The more realistic value of n-O.05 yields a correlation

coefficient of -0.09.
Before concluding, I wish to return briefly to a point touched on
above. A number of other explanations have been advanced to account for the
correlation between the change in consumption and the lagged change in income.
These include liquidity constraints, time aggregation, adjustment costs for
consumer durables, or simply the fact that income news in a given quarter may
be received after some consumption decisions have already been executed.14
None of the competing explanations, however, predicts the sign of the
correlation estimated on pooled-panel data to be the opposite of that
estimated on aggregate time series. The information-aggregation view alone
suggests an explanation for the change of sign, thereby reconciling apparently
contradictory aggregate- and panel-data findings.

Conclusion
This paper introduced the idea of information-aggregation bias,
illustrated the source of the problem for aggregate time series, cross-

- 20 -

section, and panel data, and gauged its potential importance in practical
work.

The paper should not be construed as demonstrating that information-

aggregation actually explains the rejections in the consumption literature.
Information-aggregationbias may in fact be partially responsible for some
rejections in the consumption literature, and important in a host of other
applications as well.

Evaluating the size of the bias in specific

applications, however, must be left for future work.

To avoid the bias,

aggregate predictor variables employed in model estimation and evaluation
should be lagged one period beyond their initial publication date.

Potential

bias associated with data revisions is completely circumvented by employing as
instruments variables that are widely known contemporaneously.

- 21 -

FOOTNOTES

1. Throughout the paper, I assume that agents observe their own
real income contemporaneously. In fact, they only observe their nominal
income currently. Only when price level information becomes available can
agents calculate their own real income innovations. This source of 'imperfect
information would be sufficient, even without heterogeneous real income, to
create information-aggregationbias. It has been omitted from the discussion
merely to keep the examples as simple as possible.
2. Recent work on time aggregation in consumption by Hall [1988]
provides another reason to exclude the first lag of income from tests of the
permanent income hypothesis. Campbell and Mankiw [1990] discuss other reasons
for lagging the instruments more than one period in such tests.
3.

See Goldberger [1964], pp. 200-01.

See the material in Box and Jenkins [1976], pp. 53-4.

For any AR(p) income generating process the expressions in (17)
generalize to B - x,[*] and /3,- xl[l+(l-pl)[*]], where [*I = l-pl-p2-...
-Pp-l/Pl+P2+...9Pp-l.

The interpretation of Flavin's B estimates in (17) makes sense only
if the y generating process is stationary. If, for example, y is a random
walk, i.e., by -c , then p in (13) can't be estimated, though we can still
estimate the pt akd B, toePficients. Using equation (12), we would find
/3-r(l+r)(l-n)(4A-4R) and p,=O.
1
5. This assumption is made for convenience. If n were closer to
unity because the variance of aggregate-income innovations exceeded the
relative-income-innovationvariance, then information-aggregationbias in
aggregate time series would be smaller. See Barsky, Mankiw, and Zeldes
[1986], Section 2 for a discussion of the magnitude of individual uncertainty.
6. Flavin's income series was quarterly NIPA disposable personal
income in the post World War II period. It was initially released with a two
month lag from 1947 to 19641 and a one month lag thereafter. But a monthly
estimate of personal income has been reported with a half month lag throughout
the period. While the calculations in the text indicate that informationaggregation bias could still be substantial, monthly release of income data
reduces the bias as calculated there. In addition, predictive content of
financial variables for aggregate income reduces further the news content of
lagged quarterly income.
7. Evidence that income is approximately a random walk is found in
Mankiw and Shapiro [1985] and references contained therein. If income were a
random walk, then as pointed out in footnote 4, we could no longer interpret
Flavin's p estimates according to (17). Flavin's rl regression coefficient
could, however, still be interpreted as r(l+r)(l-n)(4A-4R), which would also
be the interpretation of her fi,coefficient. As Mankiw and Shapiro have

- 22 -

pointed out, Flavin's estimate of 8, could no longer be interpreted as an
excess sensitivity to current income in this case.
Based on her estimated AR(8) income autoregression, Flavin reported
a 4 value of about 18. The estimated plcoefficient in the income
autoregression is 0.96 (O.aS>, however, which is well within one standard
error of 1. So the true 4 could easily be near 100.
MaCurdy [1982] presents some estimates of the persistence of
income-generating processes for longitudinal data on wages and earnings.
8. See, for example, Hall and Mishkin [1982]. Hayashi [1986], and
Zeldes [1989].
9. Stephen Zeldes has suggested to me that the informationaggregation issue might also arise for a panel even with no time dummies
included, if the number of time series observations (T) were small and a
constant term were included. Intuitively, it would seem that this effect
would become less important as T increases.
10. During the period covered by Hayashi's panel, Japanese quarterly
national income data was initially published with a two month lag. The first
revision was released with a further three month lag. Some information on
disposable income based on a sample of households was also collected and
released on a monthly basis.
11. We need to multiply the covariance term by .3 to account for
the fact that Hayashi's correlation coefficient was calculated using food
consumption, which averages roughly 30 percent of the total over his sample.
Likewise, the variance term must be multiplied by (.3)2. These adjustments
cancel in the correlation coefficient.
12. The relevant variance formulas for the examples discussed below
are given in Box and Jenkins [1976], pp. 62 and 76. For zt - m-1
+ et + ret-l,
Varz - [1+~~+2~q~E,
l-p2
l-P2

Varz -

and for z

t = qzt-1+ ppt-2 + et'

2
c7
e

F 1+P2 I w-P2)2-P:l’
13.

If income (y) is generated by the difference-stationary

process Ayt - plAyt-1 + p2Ayt-2 + et, then we get 4 - A,
wwP18-P282)

1
where /3= l+r'

See Campbell and Deaton [1989], p. 359.

The above 4 formula may be applied directly to Hayashi's
difference-stationary process. A slightly modified formula is required to
calculate the 4 value for Campbell and Deaton's AR1 process in growth rates.
Equations (8)-(12) of their paper imply that the income process Alogyt p. + plAlogyt-1 + et yields the following analog to my equation (7):

- 23 -

Act - rdyt-let, where 4 -

~1 is the average quarterly rate

1: l-p1 (+$$)I. ’
c

of income growth, and we require r>p. Using the estimates p-0.00451 and
-0.433 reported by Campbell and Deaton together with r-0.01 we get
4a. 179.
p

14.

Hall [1989] is a useful survey.

- 24 -

REFERENCES
Barsky, Robert B., N. Gregory Mankiw, and Stephen P. Zeldes. 'Ricardian
Consumers with Keynesian Propensities." American Economic Review, 76
(September 1986), 676-91.
Box, George E. P., and Gwilym N. Jenkins. Time Series Analvsis: Forecasting
and Control (San Francisco: Holden Day, 1976).
Campbell, John and Angus Deaton. "Is Consumption Too Smooth?" Review of
Economic Studies 56 (July 1989), 357-373.
Campbell, John and N. Gregory Mankiw. "Permanent Income, Current Income, and
Consumption." Journal of Business and Economic Statistics 8 (July
1990), 265-79.
Chamberlain, Gary. "Panel Data.' In Handbook of Econometrics: Volume 2, Zvi
Griliches and Michael D. Intriligator, eds. (Amsterdam: North
Holland Publishing Co., 1984).
Flavin, Marjorie A. "The Adjustment of Consumption to Changing Expectations
About Future Income." Journal of Political Economy 99 (October
1981), 974-1009.
Friedman, Milton. A Theorv of the Consumotion Function
University Press, 1957).
Goldberger, Arthur S. Econometric Theorv
Inc., 1964).

(Princeton: Princeton

(New York: John Wiley and Sons,

Hall, Robert E. "Consumption." In Modern Business Cvcle Theory,
Robert J. Barro, editor, Cambridge, Mass: Harvard U. Press, 1989.
. "Intertemporal Substitution in Consumption." Journal of Political
Economy 96 (April 1988), 339-57.
. 'Stochastic Implications of the Life Cycle-Permanent Income
Hypothesis: Theory and Evidence." Journal of Political Economy, 86
(December 1978), 971-88.
and Frederic Mishkin. "The Sensitivity of Consumption to Transitory
Income: Estimates From Panel Data on Households." Econometrica 50
(March 1982), 461-81.
Hayashi, Fumio. "The Permanent Income Hypothesis and Consumption Durability:
Analysis Based on Japanese Panel Data." Ouarterlv Journal of
Economics 100, (November 1985), 1083-1113.
Lucas, Robert E., Jr. 'Econometric Policy Evaluation: A Critique." m
Phillips Curve and Labor Markets, Carnegie-Rochester Conference
Series on Public Policy, K. Brunner and A. H. Meltzer, eds.
(Amsterdam: North Holland, 1976).

- 25 -

MaCurdy, Thomas E. "The Use of Time Series Processes to Model the Error
Structure of Earnings in a Longitudinal Data Analysis." Journal of
Econometrics 18 (January 1982), 83-114.
Mankiw, N. Gregory and Matthew D. Shapiro. "Trends, Random Walks, and Tests
of the Permanent Income Hypothesis." Journal of Monetarv Economics
16 (September 1985), 165-74.
Zeldes, Stephen. "Consumption and Liquidity Constraints: An Empirical
Investigation." Journal of Political Economv 97 (April 1989),
305-46.

Full text of Working Papers (Federal Reserve Bank of Richmond) : Information-Aggregation Bias, Working Paper 91-06

FRASER