The Federal Reserve Board eagle logo links to home page

Skip to: [Printable Version (PDF)] [Bibliography] [Footnotes]
Finance and Economics Discussion Series: 2008-15 Screen Reader version

Lack of Signal Error (LoSE) and Implications for OLS Regression: Measurement Error for Macro Data

Jeremy J. Nalewaik*


Keywords: Measurement error models, news and noise

Abstract:

This paper proposes a simple generalization of the classical measurement error model, introducing new measurement errors that subtract signal from the true variable of interest, in addition to the usual classical measurement errors (CME) that add noise. The effect on OLS regression of these lack of signal errors (LoSE) is opposite the conventional wisdom about CME: while CME in the explanatory variables causes attenuation bias, LoSE in the dependent variable, not the explanatory variables, causes a similar bias under some conditions. In addition, LoSE in the dependent variable shrinks the variance of the regression residuals, making inference potentially misleading. The paper provides evidence that LoSE is an important source of error in US macroeconomic quantity data such as GDP growth, illustrates downward bias in regressions of GDP growth on asset prices, and provides recommendations for econometric practice.


1 Introduction

This paper proposes a simple generalization of the classical measurement error model and studies its implications for ordinary least squares (OLS) regression. The usual model starts with the true variable of interest and adds noise, which we call the classical measurement error (CME) term; see Fuller (1987) or virtually any econometrics textbook. The generalization discussed here incorporates a different kind of measurement error that subtracts signal from the true variable; this new error term is called the Lack of Signal Error, or LoSE for short. This additional term adds some much-needed flexibility to the classical measurement error model: it allows the mismeasured variable to have either more or less variance than the true variable of interest, in contrast to the classical model which imposes that the mismeasured variable have greater variance. This restriction does not hold in some important applications in macroeconomics and elsewhere.

The implications of LoSE for OLS regression are opposite the usual intuition about measurement error, which is applicable to CME only. The CME intuition says that measurement error in the dependent variable  Y of a regression poses no real problems for standard estimation and inference. Parameter estimates are unbiased and consistent, while hypotheses are more difficult to reject because CME increases the variance of regression residuals and parameter estimates; inference under these circumstances has a cautious slant. CME in the explanatory variables  X causes the real problems for OLS regression, namely attenuation bias and inconsistency. However with LoSE these results are reversed. For the baseline case considered here, LoSE in the explanatory variables  X produces no bias or inconsistency, and similar to CME in  Y, LoSE in  X boosts the variance of regresson residuals and hence standard errors. And, LoSE in the dependent variable  Y introduces an attenuation-type bias and inconsistency into the regression under some circumstances (in particular, when the explanatory variables contain some signal missing from the dependent variable). In addition, LoSE in  Y shrinks the variance of the regression residuals, thus shrinking parameter standard errors compared to what they would be without this type of mismeasurement. Standard errors are potentially misleadingly small, complicating inferences about the relation between true  Y and  X. Under these conditions, a more cautious approach to inference than has been taken in the past may be warranted.

Mismeasurement in many types of data used for empirical work in economics and other disciplines may be better described by the generalized measurement error model with LoSE than by the pure classical measurement error model. This paper focuses on US macroeconomic quantity data such as gross domestic product (GDP) and gross domestic income (GDI), which attempt to measure the same underlying concept using different source data; see Fixler and Nalewaik (2007) and Nalewaik (2007a). These data pass through numerous revisions, and the more poorly-measured initial estimates have less variance than the revised estimates, providing a concrete example of measurement error that cannot be CME; see also Mankiw and Shapiro (1986). Section 2 of the paper makes this point while motivating the generalization of the CME model outlined here.

Section 3 of the paper discusses the nature of the source data used to compute US macroeconomic quantity data, and points out some reasons why LoSE may be present in the estimates even after they have passed through all their revisions. The fact that GDP growth should equal GDI growth, but does not in the fully revised quarterly or annual frequency data, proves that some mismeasurement remains in either GDP or GDI growth. The evidence in Fixler and Nalewaik (2007) and Nalewaik (2007a,b) supports the notion that this mismeasurement is largely LoSE, with more LoSE in GDP growth than in GDI growth. Those arguments are recounted briefly in this section, and some simple calculations comparing GDP and GDI growth show that the LoSE in GDP growth is likely substantial: after 1984, at least 30% of the variance of the true growth rate of the economy appears to be missing.

Some of the implications of substantial LoSE in GDP growth are fairly obvious. Realizations of GDP growth are simply less informative about the true growth rate of the economy than many macroeconomists currently believe, given the common but incorrect presumption that fully-revised GDP growth is measured with little error. In a macro forecasting context, true forecast errors are larger, on average, than forecast errors computed using data mismeasured with LoSE. The implications for regression estimation are more complicated, and working out those implications is the main contribution of the paper. This is done in section 4, which illustrates the effect of LoSE in the dependent and explanatory variables under different sets of assumptions. For concreteness, examples of popular regressions from macroeconomics that may conform to each set of assumptions are provided.

In a wide variety of econometric specifications employed in the macroeconomics and finance, variables like GDP growth, investment growth and consumption growth are regressed on asset prices - interest rates, stock price changes, exchange rate changes, etc. These regressions are of particular interest because asset prices potentially capture some signal missing from the mismeasured quantities, implying attenuation-type biases in the coefficients. Section 4 tests for these biases, regressing different output growth measures contaminated with more or less LoSE on a fixed set of stock or bond prices. When the dependent variable is contaminated with more LoSE, the regression coefficients are smaller, and the differences across regressions are often statistically significant. For example, the coefficients increase when we switch the dependent variable from the early GDP growth estimates based on limited source data to later GDP growth estimates based on more-comprehensive data. Tellingly, the coefficients increase again when we switch the dependent variable from GDP growth to GDI growth. The hypothesis that measurement error in the dependent variable does not bias OLS regression coefficients, a core piece of conventional wisdom in the profession, is rejected by the data, just as the paper predicts if the measurement error is LoSE. Section 5 concludes the paper.

2 A Generalization of the Classical Measurement Error Model

Let  Y_t^\star be the true value of the variable of interest, while  Z_t is an  \left(1 \times l \right) vector of possibly stochastic variables used to construct  Y_t^\star. The mismeasured estimate of the variable of interest is  Y_t. In many cases a government statistical agency or some other organization computes  Y_t based on information from surveys, administrative records, and other data sources (source data for short); then  Z_t will be variables drawn from the source data, possibly including non-linear functions of the original source data.

Under the classical measurement error model,

\displaystyle Y_t \displaystyle = \displaystyle Y_t^\star + \varepsilon_t.  

The term  \varepsilon_t is "noise" or the classical measurement error (CME) in the estimate. In the current context this is taken to imply independence of  \varepsilon_t and  Y_t^\star, although the weaker assumption  \operatorname{cov}\left(Y_t^\star,\varepsilon_t\right) = 0 suffices for many purposes. The CME may arise from estimation errors or other sources; since many estimates  Y_t are based on surveys, survey sampling errors are often thought to be a source of CME.

Under the generalized model of mismeasurement considered here, the mismeasured estimate  Y_t is as in Fixler and Nalewaik (2007):

\displaystyle Y_t \displaystyle = \displaystyle E\left(Y_t^\star \vert Z_t \right) + \varepsilon_t. (1)

The CME term  \varepsilon_t is assumed independent of  Z_t and  Y_t^\star. It can be seen immediately that the classical measurement error model is a special case of this more general model, where  Z_t spans  Y_t^\star so  E\left(Y_t^\star \vert Z_t \right) = Y_t^\star.

Define the deviation of the variable of interest from its conditional expectation as:

\displaystyle \zeta_t \displaystyle = \displaystyle Y_t^\star - E\left(Y_t^\star \vert Z_t \right). (2)

This deviation represents the information about  Y_t^\star not contained in  Z_t, and is independent of all functions of  Z_t. With  \operatorname{cov}\left(E\left(Y_t^\star \vert Z_t \right),\zeta_t\right) = 0, the variance of the true variable of interest may be decomposed into the variance of the conditional expectation plus the variance of  \zeta_t, and:  \operatorname{var}\left(\zeta_t\right) = \operatorname{var}\left(Y_t^\star\right) - \operatorname{var}\left(E\left(Y_t^\star \vert Z_t \right)\right). The variance of  \zeta_t represents the variance of the information about  Y_t^\star missing from the conditional expectation. Substituting into (1):
\displaystyle Y_t \displaystyle = \displaystyle Y_t^\star - \zeta_t + \varepsilon_t. (3)

Thinking of  \varepsilon_t as mismeasurement from noise,  \zeta_t represents an opposite kind of mismeasurement, mismeasurement from lack of signal about  Y_t^\star in the information used to construct  Y_t. As such,  \zeta_t may be labelled the Lack of Signal Error, or LoSE for short.

Since the CME is independent of  Y_t^\star, it is naturally independent of the LoSE as well. However the LoSE is clearly correlated with  Y_t^\star, with  \operatorname{cov}\left(Y_t^\star,\zeta_t\right) = \operatorname{var}\left(\zeta_t\right) in fact. Taking variances of (3):

\displaystyle \operatorname{var}\left(Y_t\right) \displaystyle = \displaystyle \operatorname{var}\left(Y_t^\star\right) + \operatorname{var}\left(\zeta_t\right) - 2\operatorname{cov}\left(Y_t^\star,\zeta_t\right) + \operatorname{var}\left(\varepsilon_t\right)  
  \displaystyle = \displaystyle \operatorname{var}\left(Y_t^\star\right) - \operatorname{var}\left(\zeta_t\right) + \operatorname{var}\left(\varepsilon_t\right). (4)

Depending on whether the variance of the LoSE is greater than or less than the variance of the CME, the variance of the estimate  Y_t may be greater than or less than the variance of the true variable of interest  Y_t^\star. With CME alone, the variance of the estimate must exceed the variance of the true variable. The key limitation of the CME model is the assumption that  \operatorname{cov}\left(Y_t - Y_t^\star, Y_t^\star \right) = 0; the generalization allows this covariance to range from 0 to a lower bound of minus  \operatorname{var}\left(Y_t - Y_t^\star \right), in which case all the mismeasurement arises from LoSE.

The restrictions imposed by the CME model on variances and covariances have obvious drawbacks, as it is easy to think of hypothetical and actual counterexamples. As a hypothetical counterexample, assume that  Y_t^\star has positive variance, while the estimate  Y_t is just a constant for all  t. The estimate  Y_t is clearly mismeasured, but the classical measurement error model cannot handle this case. The growth rates of US GDP and GDI provide an actual counterexample. These estimates pass through numerous revisions that plausibly reduce measurement error, since they incorporate more comprehensive and higher-quality source data. For example, suitable source data is simply unavailable for many components of the "advance" current quarterly GDP estimate released about a month after the quarter ends. Source data for some of those components is incorporated into the revised "final" current quarterly estimate released about two months later, and higher-quality data are incorporated at subsequent annual and benchmark revisions, likely bringing the estimate closer to its true value.1 Then an early estimate of GDP growth or GDI growth can be modelled as a later revised estimate (the analog to  Y_t^\star) plus a measurement error term, which disappears with revision. Table 1 shows that the initial estimates have less variance than the revised estimates, violating the variance restrictions of the CME model.2 The generalized model here implies that the bulk of the measurement error is LoSE, as noted by Mankiw and Shapiro (1986).

While the generalized model here is less restrictive than the CME model, some restrictions do remain. Writing:

\displaystyle Y_t + \zeta_t \displaystyle = \displaystyle Y_t^\star + \varepsilon_t, (5)

the independence of  \zeta_t from  Y_t is a restriction, implied by the first term in (1) being a conditional expectation. However systematic biases in the estimate, on top of those caused by noise,3 violate this assumption. For concreteness, assume  E\left(Y_t^\star \vert Z_t \right) = Z_t\gamma. Consider an estimate of  Y^{\star}_t based on  Z_t that misuses the information, so  Y_t = Z_t\widetilde{\gamma} + \varepsilon_t with  \widetilde{\gamma} \neq \gamma. The estimate "misses" in a systematic way.4Taking variances, we have:
\displaystyle \operatorname{var}\left(Y_t\right) \displaystyle = \displaystyle \operatorname{var}\left(Y_t^\star\right) + \operatorname{var}\left(Z_t\widetilde{\gamma}\right) - \operatorname{var}\left(Z_t\gamma\right) - \operatorname{var}\left(\zeta_t\right) + \operatorname{var}\left(\varepsilon_t\right).  

For estimation and inference about  Y_t^\star and its relation to other variables, these systematic biases in  Y_t^\star lead to biased and inconsistent estimates, in general. However unless additional information is available about the nature of the "misses"  Z_t \widetilde{\gamma} - Z_t\gamma, the direction and magnitude of these biases is unclear. In some cases this additional information may be available, but in general, an important goal of all creators of data (government statistical agencies as well as other groups) is to avoid such systematic mismeasurement. Indeed, their ultimate goal is probably to produce estimates  Y_t that are as close as possible to  E\left(Y_t^\star \vert Z_t \right), with as broad an information set  Z_t as possible given resource constraints.5 As such, the generalized model (1) is a useful benchmark, and should approximate well the underlying mismeasurement in many situations. It also has the advantage of being mathematically tractable.

Before concluding this section, it is worth emphasizing that  Z_t need not be an exhaustive information set - i.e. it need not contain all available relevant pieces of information about  Y_t^\star. Resource and other constraints certainly preclude this from being the case, and the sections below considering the implications of LoSE allow for this possibility.

3 Data

3.1 Discussion of U.S. Macro Quantity Data

Each estimated growth rate of a macro quantity such as gross domestic product (GDP) is an attempt at measuring the change in the value of all relevant economic transactions, in the entire economy, from one fixed time period to the next. For an entity as large as the U.S. economy, this is a daunting, almost mind-boggling task, as the number of transactors and transactions is typically enormous, with little or no information recorded about many of them at high frequencies. Attempts to measure changes in these macro quantities are much more ambitious than attempts to measure similar changes for a single person, household, or even company. Simply due to their broad, universal nature, estimates of macro quantities are likely to miss more information - i.e. be contaminated with more LoSE - than are estimates of micro quantities (although some micro data sources may be contaminated with LoSE as well).6

Of course, the nature of the available source data determines the information content of macro quantity growth rates of interest, and frequency is important in this regard in the case of data from the U.S. National Income and Product Accounts (NIPA).7 The most comprehensive data on GDP and other major NIPA aggregates are only available at the quinquennial frequency (every five years), at the time of the major economic censuses. Even then, resource constraints make true census counts impossible. Many transactions in the underground economy remain unobserved and must be estimated, and some "above-ground" transactions are simply missed by any census.8 At the annual frequency, the GDP source data are typically samples drawn from the census universe. These samples can be quite large, capturing a sizeable fraction of the relevant value of transactions, but they are typically skewed towards measuring the transactions of larger businesses. As such, they may miss variation arising from the transactions of small companies and from businesses starting, shutting down, and operating in the underground economy. The lack of representation of these segments of the economy may add or substract variance to the official estimates, depending on the relative variance of the non-measured segments and their covariance with the measured segments, but this mismeasurement has the potential to add some LoSE to the data. At the annual frequency, and also at the quarterly frequency to some extent, government tax and administrative records are used as an additional source of information about the value of transactions, especially on the income (GDI) side of the accounts. These data can be informative, but underreporting makes them less than fully comprehensive.

At the quarterly and monthly frequency,9 reliance on samples is more pronounced, and the samples are less comprehensive. Smaller samples introduce larger sampling errors, which have traditionally been thought of as introducing CME into the estimates. The samples are typically random, after all, so part of the difference between the population and sample moments is likely random variation uncorrelated with the variation in the population moments. However smaller samples may introduce some LoSE as well, since smaller samples are simply less informative than larger samples: when different segments of the economy behave quite differently, small samples which are not fully representative may miss variation arising from some segments.10In addition, usable data on the value of transactions at a frequency higher than annual is simply unavailable for a substantial share of some NIPA aggregates such as GDP; many of the services categories of personal consumption expenditures (PCE) lack usable source data, for example.11 Quarterly and monthly growth rates are typically interpolated from annual totals, or estimated as "trend extrapolations." The lack of hard information for these categories must introduce some LoSE into the quarterly and monthly estimates.

3.2 Evidence of LoSE from GDP and GDI growth

Some users of quarterly or annual US NIPA data take the view that the variance of CME is negligible after the data have passed through its sequence of revisions, particularly for the more highly aggregated NIPA quantities like GDP growth where any remaining CME variance in its subcomponents may be diminished by averaging. Absent knowledge of the possible existence of LoSE, this view would imply that the variance of overall mismeasurement is close to zero, since  \operatorname{var}\left(Y_t - Y_t^\star\right) = \operatorname{var}\left(\varepsilon_t\right) in the pure CME model.

However there is ample evidence that the variance of overall mismeasurement in the most aggregated NIPA aggregate, GDP, is not close to zero, especially since the mid-1980s. GDI is an alternative estimate of the same quantity, so examining the relation between GDP and GDI provides some direct evidence on mismeasurement. Table 2 shows that, prior to 1984, the variance of each estimate is close to the covariance between the two, for both annual and annualized quarterly growth rates. The two estimates diverge very little, providing little direct evidence of mimeasurement. However after 1984, when the variance of both estimates drops dramatically (see McConnell and Perez-Quiros (2000)), the correlation between the estimates also falls, as the covariance falls relative to the variances on average. This is especially true for the quarterly growth rates, where the correlation falls from 0.93 to 0.68.12Interestingly, the variance of GDI growth also increases relative to the variance of GDP growth. Under the generalized CME model of section 2, this relatively large GDI variance may stem from some combination of two possible sources: (1) a relatively large amount of CME in GDI growth, boosting its variance, and (2) a relatively large amount of LoSE in GDP growth, damping its variance. The evidence favors the latter as the more important source of mismeasurement.

First, consider the results in Nalewaik (2007a), who estimates a two-state bivariate Markov switching model where mean GDP and GDI growth switch with the state; the low-growth states identified by the model encompass recessions as defined by the NBER. The conditional variance of GDI in that model, conditional on the estimated state of the world, is actually slightly lower than the conditional variance of GDP, despite the fact that the unconditional variance of GDI growth is higher. The higher unconditional variance stems from GDI growing faster than GDP in high-growth periods and slower than GDP in slow-growth periods in and around recessions. In other words, GDI growth appears to contain more signal about the state of the world than GDP growth: the larger spread between its high- and low-growth means implies greater informativeness about the state. Greater signal in GDI growth implies some LoSE in GDP growth, relatively more than in GDI growth.

Second, table 1 shows that the variance of GDI growth becomes relatively large only after the data pass through annual and benchmark revisions; in the earlier current quarterly estimates, the variance of GDP growth actually exceeds the variance of GDI growth. Since the revisions plausibly bring estimated GDI growth closer to the truth, they must either reduce LoSE, which would increase its variance, or reduce CME, which would decrease its variance. The increase in variance from the revisions, then, is likely a decrease in LoSE, suggesting the relatively large variance of revised GDI stems from relatively less LoSE. The annual and benchmark revisions appear to add more signal to GDI growth than GDP growth, increasing the variance of GDI growth relative to GDP growth.13 Fixler and Nalewaik (2007) discuss the revisions evidence in more detail, testing the hypothesis that the idiosyncratic variation of GDI growth is purely CME and rejecting at conventional significance levels. This again implies some LoSE in GDP growth.

To get a sense of the magnitude of the potential variance missing from GDP growth due to LoSE, assume that the CME variance in each estimate is negligible, so the differences between GDP and GDI growth stem entirely from differential LoSE:

\displaystyle \Delta Y_t^{GDP} \displaystyle = E\left(\Delta Y_t^\star \vert Z_t^{GDP} \right) \displaystyle = \Delta Y_t^\star - \zeta_t^{GDP},   and:  
\displaystyle \Delta Y_t^{GDI} \displaystyle = E\left(\Delta Y_t^\star \vert Z_t^{GDI} \right) \displaystyle = \Delta Y_t^\star - \zeta_t^{GDI}.  

Taking variances as in (4) yields:
\displaystyle \operatorname{var}\left(\Delta Y_t^{GDP}\right) \displaystyle = \displaystyle \operatorname{var}\left(\Delta Y_t^\star\right) - \operatorname{var}\left(\zeta_t^{GDP}\right),  
\displaystyle \operatorname{var}\left(\Delta Y_t^{GDI}\right) \displaystyle = \displaystyle \operatorname{var}\left(\Delta Y_t^\star\right) - \operatorname{var}\left(\zeta_t^{GDI}\right),   and the covariance is:  
\displaystyle \operatorname{cov}\left(\Delta Y_t^{GDP},\Delta Y_t^{GDI}\right) \displaystyle = \displaystyle \operatorname{var}\left(\Delta Y_t^\star\right) - \operatorname{var}\left(\zeta_t^{GDP}\right) - \operatorname{var}\left(\zeta_t^{GDI}\right) + \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right).  

The idiosyncratic variance of one estimate (its variance minus its covariance with the other estimate) is then proportional to the LoSE in the other estimate:
\displaystyle \operatorname{var}\left(\Delta Y_t^{GDP}\right) - \operatorname{cov}\left(\Delta Y_t^{GDP},\Delta Y_t^{GDI}\right) \displaystyle = \displaystyle \operatorname{var}\left(\zeta_t^{GDI}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right),   and:  
\displaystyle \operatorname{var}\left(\Delta Y_t^{GDI}\right) - \operatorname{cov}\left(\Delta Y_t^{GDP},\Delta Y_t^{GDI}\right) \displaystyle = \displaystyle \operatorname{var}\left(\zeta_t^{GDP}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right).  

The information missed by both estimates is  \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right); the idiosyncratic variance of GDI growth is then the variance of all the information about  \Delta Y_t^\star missing from measured GDP growth minus the part of that information also absent from GDI growth. Rearranging the covariance provides a lower bound on the variance of true GDP growth  \Delta Y_t^\star:
\displaystyle \operatorname{var}\left(\Delta Y_t^\star\right) \displaystyle = \displaystyle \operatorname{cov}\left(\Delta Y_t^{GDP},\Delta Y_t^{GDI}\right)  
    \displaystyle + \left(\operatorname{var}\left(\zeta_t^{GDI}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right)\right)  
    \displaystyle + \left(\operatorname{var}\left(\zeta_t^{GDP}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right)\right)  
    \displaystyle + \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right),   so:  
  \displaystyle > \displaystyle \operatorname{cov}\left(\Delta Y_t^{GDP},\Delta Y_t^{GDI}\right)  
    \displaystyle + \left(\operatorname{var}\left(\zeta_t^{GDI}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right)\right)  
    \displaystyle + \left(\operatorname{var}\left(\zeta_t^{GDP}\right) - \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right)\right).  

The last column of table 2 uses this equation to set an upper bound on the fraction of variance of  Y_t^\star captured by measured GDP growth:  \frac{\operatorname{var}\left(\Delta Y_t^{GDP}\right)}{\operatorname{var}\left(\Delta Y_t^\star\right)}. Measured GDP growth captures at most 70% of the variation in  \Delta Y_t^\star after 1984, under the assumption of negligible CME. Of course the assumption of no noise is an extreme one, particularly for the quarterly estimates. Indeed, the evidence in section 4.3.1 from regressions involving GDP growth, GDI growth, and stock prices indicate about a quarter of the variance of GDP growth is noise, but these results actually tighten the upper bound, decreasing it from 70% to 64%. And this is in fact an upper bound, since it does not account for  \operatorname{cov}\left(\zeta_t^{GDP},\zeta_t^{GDI}\right), the variation in  \Delta Y_t^\star missed by both measured GDP and GDI growth. The variation missed by both estimates at the quarterly frequency could be substantial.

Going forward, if these post-1984 variances and covariances are the norm, the implications of a potentially non-trivial amount of LoSE in macro data such as GDP growth should be taken seriously. For estimation and inference, the post-1984 portion of many samples will become increasingly large and important. The next section explores the implications of LoSE for estimation and inference using the most ubiquitous tool in econometrics: OLS regression.

4 Implications for OLS Estimation

Consider ordinary least squares estimation of the relation between a mismeasured variable  Y_t and a  \left(1 \times k \right) set of mismeasured explanatory variables  X_t, using a sample of length  T. When stacking together the observations, time subscripts are dropped for convenience:

\begin{displaymath}\begin{array}{cc} Y = \left(\begin{array}{c} Y_{1} Y_{2} \vdots Y_{T} \end{array}\right); & X = \left(\begin{array}{c} X_{1} X_{2} \vdots X_{T} \end{array}\right). \end{array}\end{displaymath}      

Our full set of assumptions is as follows:
Assumption 1    Y_t^\star = X_t^\star\beta +U_t^\star.  U_t^\star is i.i.d., mean zero, with  \operatorname{var}\left(U_t^\star\right) = \sigma^2_{U^\star} and  U_s^\star independent of  X_t^\star,  \forall t,s.  Y_t follows the generalized measurement error model of section 2:  Y_t = E\left(Y_t^\star \vert Z_t^y \right) + \varepsilon_t. The CME  \varepsilon_t is i.i.d., mean zero, and independent of all conditioning information sets, with  \operatorname{var}\left(\varepsilon_t\right) = \sigma^2_{\varepsilon}. The LoSE  \zeta_t = \left(X^\star_t - E\left(X^\star_t \vert Z_t^y \right)\right)\beta + U_t^\star - E\left(U_t^\star \vert Z_t^y \right) = \zeta_t^{xy}\beta + \zeta_t^u.  \zeta_t^u is i.i.d. and mean zero with  \operatorname{var}\left(\zeta_t^u\right) = \sigma^2_{\zeta,u}, and  \zeta_t^{xy} is i.i.d. and mean zero with  \operatorname{var}\left(\zeta_t^{xy}\right) = \sigma^2_{\zeta,xy}, a  k \times k matrix.  X_t follows the generalized measurement error model of section 2:  X_t = E\left(X_t^\star \vert Z_t^x \right) + \varepsilon_t^x. The CME  \varepsilon_t^x is i.i.d., mean zero, independent of  \varepsilon_t and all conditioning information sets, with  \operatorname{var}\left(\varepsilon_t^x\right) = \sigma^2_{\varepsilon,x}, a  k \times k matrix. The LoSE  \zeta_t^x = X_t^\star - E\left(X_t^\star \vert Z_t^x \right) is i.i.d. and mean zero with  \operatorname{var}\left(\zeta_t\right) = \sigma^2_{\zeta,x}, also a  k \times k matrix.  \frac{1}{T} \left(X^{\star}\right)^\prime X^{\star} \stackrel{p}{\longrightarrow} Q_{xx},  \frac{1}{T} \left(E\left( X^\star \vert Z^y \right)\right) ^\prime E\left( X^\star \vert Z^y \right)\stackrel{p}{\longrightarrow} Q_{xx} - \sigma^2_{\zeta,xy} = Q_{xx}^{zy},  \frac{1}{T} \left(E\left( X^\star \vert Z^x \right)\right) ^\prime E\left( X^\star \vert Z^x \right)\stackrel{p}{\longrightarrow} Q_{xx} - \sigma^2_{\zeta,x} = Q_{xx}^{zx},  \frac{1}{T} \left(E\left( X^\star \vert Z^y \right)\right) ^\prime E\left( X^\star \vert Z^x \right)\stackrel{p}{\longrightarrow} Q_{xx}^{zb}, and  \frac{1}{T} X^\prime X \stackrel{p}{\longrightarrow} = Q_{xx}^{zx} + \sigma^2_{\varepsilon,x}. All relevant fourth moments exist.
Then  Y_t can be written as:
\displaystyle Y_t \displaystyle = \displaystyle E\left(X_t^\star\vert Z_t^y \right) \beta + E\left(U_t^\star \vert Z_t^y \right) + \varepsilon_t (6)
  \displaystyle = \displaystyle X_t\beta + \left(E\left(X_t^\star \vert Z_t^y \right) - X_t\right)\beta + E\left(U_t^\star \vert Z_t^y \right) + \varepsilon_t  
  \displaystyle = \displaystyle X_t\beta + \left(E\left(X_t^\star \vert Z_t^y \right) - E\left(X_t^\star \vert Z_t^x \right) - \varepsilon_t^x\right)\beta + U_t^\star - \zeta_t^u + \varepsilon_t.  

The OLS regression estimator is:
\displaystyle \widehat{\beta} \displaystyle = \displaystyle \left( X^\prime X \right)^{-1} X^\prime Y  
  \displaystyle = \displaystyle \beta + \left( X^\prime X \right)^{-1} X^\prime \left(\left(E\left(X^\star \vert Z^y \right) - E\left(X^\star \vert Z^x \right) - \varepsilon^x\right)\beta + U^\star - \zeta^u + \varepsilon \right). (7)

Consider the sources of bias and inconsistency in this estimate. It is well known that the CME in  Y introduces no bias and inconsistency, since  \varepsilon is independent of  X. Interestingly, the LoSE in  U^\star introduces no bias or inconsistency either: the independence of  U^\star from  X^\star implies the independence of  E\left( U^\star \vert Z^y \right) = U^\star - \zeta^u from  E\left(X^\star \vert Z^x \right)14 and hence  X = E\left(X^\star \vert Z^x \right) + \varepsilon^x. The other components in the error of (6) do cause bias and inconsistency; taking expectations and probability limits of (6) yields:
\displaystyle E\left(\widehat{\beta}\right) \displaystyle = \displaystyle \beta + E\left(\left( X^\prime X \right)^{-1}X^\prime \left( E\left(X^\star \vert Z^y \right) - E\left(X^\star \vert Z^x \right) - \varepsilon^x \right) \right) \beta,   and: (8)
\displaystyle \widehat{\beta} \displaystyle \stackrel{p}{\longrightarrow} \displaystyle \beta + \left(Q_{xx}^{zx} + \sigma^2_{\varepsilon,x}\right)^{-1} \left( Q_{xx}^{zb} - Q_{xx}^{zx} - \sigma^2_{\varepsilon,x} \right)\beta. (9)

The usual attenuation bias and inconsistency from CME in  X is evident here. The additional inconsistency from LoSE depend on the difference between  Q_{xx}^{zb} and  Q_{xx}^{zx}. Like attenuation bias, these additional biases likely tend towards zero:  Q_{xx}^{zb} should be smaller than  Q_{xx}^{zx} if  Z^y and  Z^x contain a substantial amount of non-overlapping (independent) information.

The inconsistency of  \widehat{\beta} can be corrected by instrumenting with a  \left(1 \times m \right) set of instruments  W_t, with  m \geq k, if the instruments meet the following set of assumptions:

Assumption 2   With  P_W = W \left(W^\prime W \right)^{-1}W^\prime,  \frac{1}{T}X^\prime P_W X \stackrel{p}{\longrightarrow} Q_{xx}^w, a positive semi-definite matrix, and  \frac{1}{T} X^\prime P_W \left( \left(E\left( X^\star \vert Z^y \right) - E\left( X^\star \vert Z^x \right) - \varepsilon^x \right) \beta + U^\star - \zeta^u + \varepsilon \right)\stackrel{p}{\longrightarrow} 0. All relevant fourth moments exist.
Valid instruments must be asymptotically independent of the CME in  X, a standard condition. However, an additional condition must be met: the instruments must be asymptotically independent of  E\left( X^\star \vert Z^y \right) - E\left( X^\star \vert Z^x \right). This condition is met by instruments  W that are common to both information sets (if such information exists), so  W \subset Z^x and  W \subset Z^y, since  W^\prime E\left( X^\star \vert Z^y \right) and  W^\prime E\left( X^\star \vert Z^x \right) then have the same probability limit. With valid instruments, we have:
\displaystyle \widehat{\beta} \displaystyle = \displaystyle \left( X^\prime P_W X \right)^{-1} X^\prime P_W Y  
  \displaystyle = \displaystyle \beta + \left( X^\prime P_W X \right)^{-1}X^\prime P_W \left(\left(E\left( X^\star \vert Z^y \right) - E\left( X^\star \vert Z^x \right) - \varepsilon^x \right)\beta + U^\star - \zeta^u + \varepsilon\right), (10)

and  \widehat{\beta}\stackrel{p}{\longrightarrow} \beta. The asymptotic distribution of the estimator is:
\displaystyle \sqrt{T}\left(\widehat{\beta} - \beta\right)\stackrel{d}{\longrightarrow}N\left(0,\left(Q_{xx}^w\right)^{-1} \left(\sigma^2_{U^\star}-\sigma^2_{\zeta,u}+\sigma^2_{\varepsilon} +\beta^{\prime} \left(Q_{xx}^{zy} - 2Q_{xx}^{zb} + Q_{xx}^{zx} + \sigma^2_{\varepsilon,x}\right)\beta \right)\right).      

where  \stackrel{d}{\longrightarrow} denotes convergence in distribution as  T \longrightarrow \infty, and  N\left(a,b\right) is a Gaussian distribution with mean  a and variance  b. The usual estimator of the variance of the error term,  s^2 = \frac{1}{T} \left(Y - X\widehat{\beta}\right)^{\prime}\left(Y - X\widehat{\beta}\right), converges to the error variance in this asymptotic distribution:
\displaystyle s^2 \displaystyle = \displaystyle \frac{1}{T} \left( E\left(X \vert Z^y \right)\beta + E\left( U^\star \vert Z^y \right) + \varepsilon - \left(E\left( X^\star \vert Z^x \right) + \varepsilon^x \right)\widehat{\beta}\right)^{\prime}  
    \displaystyle *\left( E\left(X \vert Z^y \right)\beta + E\left( U^\star \vert Z^y \right) + \varepsilon - \left(E\left( X^\star \vert Z^x \right) + \varepsilon^x \right)\widehat{\beta}\right)  
  \displaystyle = \displaystyle \frac{1}{T} E\left( U^\star \vert Z^y \right)^{\prime} E\left( U^\star \vert Z^y \right) + \frac{1}{T}\varepsilon^{\prime} \varepsilon + \frac{1}{T} \beta^{\prime}E\left(X^\star \vert Z^y \right)^\prime E\left(X^\star \vert Z^y \right)\beta  
    \displaystyle - \frac{1}{T}\beta^{\prime}E\left(X^\star \vert Z^y \right)^\prime E\left(X^\star \vert Z^x \right)\widehat{\beta} - \frac{1}{T} \widehat{\beta}^{\prime}E\left(X^\star \vert Z^x \right)^{\prime}E\left(X^\star \vert Z^y \right)\beta  
    \displaystyle + \frac{1}{T} \widehat{\beta}^{\prime}E\left(X^\star \vert Z^x \right)^{\prime}E\left(X^\star \vert Z^x \right)\widehat{\beta} + \frac{1}{T} \widehat{\beta}^{\prime}\varepsilon^{x\prime}\varepsilon^x\widehat{\beta} + \frac{1}{T}cross terms\displaystyle .  

The first two terms converge in probability to  \sigma^2_{U^\star}-\sigma^2_{\zeta,u}+\sigma^2_{\varepsilon}; the terms involving  \beta and  \widehat{\beta} simplify in the limit since  \widehat{\beta}\stackrel{p}{\longrightarrow} \beta; and the cross terms converge in probability to zero. Then:  s^2 \stackrel{p}{\longrightarrow} \sigma^2_{U^\star}-\sigma^2_{\zeta,u}+\sigma^2_{\varepsilon} + \beta^{\prime} \left(Q_{xx}^{zy} - 2Q_{xx}^{zb} + Q_{xx}^{zx}+ \sigma^2_{\varepsilon,x}\right)\beta. The next four subsections discuss the most important implications of LoSE in  X and  Y for the parameter estimates and standard errors, examining some more specialized examples of this general model that highlight the implications of interest.

4.1 X Mismeasured, Y Not Mismeasured: No LoSE Problems

Given the traditional focus on mismeasurement in  X on regression estimation, we begin with this subsection making the following assumption, in addition to assumption 1:

Assumption 3    Y_t is not mismeasured:  Y_t = Y_t^\star.
Then (5) simplifies to:
\displaystyle Y_t^\star \displaystyle = \displaystyle X_t^\star\beta + U_t^\star  
  \displaystyle = \displaystyle X_t\beta +\left(X_t^\star - X_t\right)\beta + U_t^\star  
  \displaystyle = \displaystyle X_t\beta - \varepsilon_t^x\beta +\zeta_t^x\beta + U_t^\star.  

Not all of the true variation in  X_t^\star appears in  X_t due to LoSE, but all of that variation in  X_t^\star does appear in  Y_t^\star through  X_t^\star\beta. That variation in  Y_t^\star missing from  X_t is relegated to the error term of this equation.

The OLS regression estimator in this case is:

\displaystyle \widehat{\beta} \displaystyle = \displaystyle \left( X^\prime X \right)^{-1} X^\prime Y  
  \displaystyle = \displaystyle \beta + \left( X^\prime X \right)^{-1} X^\prime \left( -\varepsilon^x \beta + \zeta^x \beta + U^\star \right).  

Since  \zeta^x is independent of  E\left( X^\star \vert Z^x \right) + \varepsilon^x = X, the LoSE in  X introduces no bias into  \widehat{\beta} in this case. Given assumption 1,  \frac{1}{T} X^\prime \zeta^x \stackrel{p}{\longrightarrow} 0, and the LoSE introduces no inconsistency either. These results clearly hinge on the assumption that the LoSE is the difference between truth and a conditional expectation, and measurement error of a different form, such as the systematic biases discussed at the end of section 2, would lead to biased and inconsistent parameter estimates. The consistency result here also relies on all  k explanatory variables being conditioned on the same information set  Z^x. Kimball, Sahm, and Shapiro (2007) discuss a related example, where different elements of  X are conditioned on different information sets, causing bias and inconsistency.

Of course, the CME in  X produces the usual attenuation bias. By way of review, and for comparison with later results:

\displaystyle E\left(\widehat{\beta}\right) \displaystyle = \displaystyle \beta - E\left(\left( X^\prime X \right)^{-1}X^\prime \varepsilon^x \right) \beta,   and: (11)
\displaystyle \widehat{\beta} \displaystyle \stackrel{p}{\longrightarrow} \displaystyle \beta - \left(Q_{xx}^{zx} + \sigma^2_{\varepsilon,x}\right)^{-1} \sigma^2_{\varepsilon,x}\beta. (12)

Instruments uncorrelated with the CME in  X yield consistent estimates.

To focus more tightly on the implications of LoSE, the remainder of this subsection considers the case of no CME in  X:

Assumption 4    \operatorname{var}\left(\varepsilon_t^x\right) = 0.
Then  E\left(\widehat{\beta}\right) = \beta, and  \widehat{\beta}\stackrel{p}{\longrightarrow} \beta. The variation in  X_t^\star that appears in  Y_t^\star but is missing from  X_t shows up in the error term of the regression, increasing the variance of the parameter estimates. We have  \operatorname{var}\left(\widehat{\beta}\right) = E\left( \operatorname{var}\left( \widehat{\beta} \vert X \right) \right) + \operatorname{var}\left( E\left( \widehat{\beta} \vert X \right) \right), but  E\left( \widehat{\beta} \vert X \right) = \beta and  \operatorname{var}\left( \beta \right) = 0, so the second term vanishes. Then since  U^\star and  \zeta^x are independent, with both independent of  X,15 standard manipulations show:
\displaystyle \operatorname{var}\left(\widehat{\beta}\right) \displaystyle = \displaystyle E\left( \operatorname{var}\left( \widehat{\beta} \vert X \right) \right) = E\left( E\left( \left(\widehat{\beta}-\beta\right)\left(\widehat{\beta}-\beta\right)^{\prime}\vert X \right) \right)  
  \displaystyle = \displaystyle E\left( E\left( \left( X^\prime X \right)^{-1} X^\prime \left(U^\star + \zeta^x \beta \right) \left(U^\star + \zeta^x \beta \right)^{\prime} X \left( X^\prime X \right)^{-1} \vert X \right) \right)  
  \displaystyle = \displaystyle E\left( \left( X^\prime X \right)^{-1} X^\prime E\left( \left( U^\star U^{\star\prime} + \zeta^x \beta \beta^{\prime}\zeta^{x\prime} \right) \vert X \right) X \left( X^\prime X \right)^{-1} \right)  
  \displaystyle = \displaystyle E\left( \left( X^\prime X \right)^{-1} \right)\left( \sigma^2_{U^\star} + \beta^{\prime} \sigma^2_{\zeta,x}\beta \right).  

Asymptotically, the analogous distributional results hold, as:
\displaystyle \sqrt{T}\left(\widehat{\beta} - \beta\right) \displaystyle \stackrel{d}{\longrightarrow} \displaystyle N\left(0,\left(Q_{xx}^{zx}\right)^{-1}\left(\sigma^2_{U^\star} + \beta^{\prime} \sigma^2_{\zeta,x}\beta\right)\right),  

and  s^2 converges to this error variance  \sigma^2_{U^\star} + \beta^{\prime} \sigma^2_{\zeta,x}\beta. So the LoSE in  X increases the variance of the regression error, and since the power of hypothesis tests is typically decreasing in the variance of the regression error, this decreases power. Then in a regression situation such as that described in this subsection, if a hypothesis test passes at a prescribed level of statistical significance, that is in spite of the dimunition of power from the increased error variance.

4.1.1 Empirical Examples

Since nominal asset prices are measured with virtually no error, they provide a candidate  Y variable meeting the assumptions of this section.16 Given the evidence from the prior section indicating that macroeconomic quantities are measured with LoSE, a specification such as the human capital CAPM, essentially a regression of stock prices on labor income growth, may be well represented by the regression model developed here.

4.2 Y Mismeasured, X Not Mismeasured,  X_t \in Z_t^y: Potentially Misleading Standard Errors

In addition to assumption 1, this subsection makes the following assumptions:

Assumption 5    X_t is not mismeasured:  X_t = X_t^\star, and  X_t \in Z_t^y.
Then  Y_t^\star = X_t\beta + U_t^\star. The relation between  X_t and the information set  Z_t^y has an important effect on the properties of the OLS regression estimates; this subsection considers  X_t \in Z_t^y, and the next  X_t \not \in Z_t^y.

Since  E\left(X_t \vert Z_t^y \right) = X_t, we have:  Y_t = X_t\beta + E\left( U_t^\star \vert Z_t^y \right) + \varepsilon_t in this case. The LoSE impacts only  U_t^\star, so  \zeta_t = U_t^\star - E\left(U_t^\star \vert Z_t^y \right), and  \operatorname{var}\left(E\left(U_t^\star \vert Z_t^y \right)\right) = \sigma^2_{U^\star} - \sigma^2_{\zeta}. The OLS regression estimates  \widehat{\beta} as:

\displaystyle \widehat{\beta} \displaystyle = \displaystyle \left( X^\prime X \right)^{-1} X^\prime Y  
  \displaystyle = \displaystyle \beta + \left( X^\prime X \right)^{-1} X^\prime \left( E\left( U^\star \vert Z^y \right) + \varepsilon \right)  
  \displaystyle = \displaystyle \beta + \left( X^\prime X \right)^{-1} X^\prime \left( U^\star - \zeta + \varepsilon \right).  

LoSE in  U^\star introduces no bias or inconsistency,17 so the overall measurement error in  Y introduces no bias or inconsistency in this case. The assumption that  Y is a conditional expectation of  Y^\star plus noise again plays a critical role here; consistency and unbiasedness do not follow if the first component of  Y is something other than a conditional expectation.

The standard errors around the point estimates are more interesting. For the variance of the point estimates,  \operatorname{var}\left(\widehat{\beta}\right) = E\left( \operatorname{var}\left( \widehat{\beta} \vert X \right) \right), and:

\displaystyle E\left( \operatorname{var}\left( \widehat{\beta} \vert X \right) \right) \displaystyle = \displaystyle E\left( E\left( \left( X^\prime X \right)^{-1} X^\prime \left(E\left( U^\star \vert Z^y \right) + \varepsilon \right)\left(E\left( U^\star \vert Z^y \right) + \varepsilon \right)^{\prime} X \left( X^\prime X \right)^{-1} \vert X \right) \right)  
  \displaystyle = \displaystyle E\left( \left( X^\prime X \right)^{-1} \right)\left( \sigma^2_{U^\star} - \sigma^2_{\zeta} + \sigma^2_{\varepsilon}\right),  

since  E\left( U^\star \vert Z^y \right) and  \varepsilon are independent; the analogous asymptotic results hold. Consider first the impact of the CME  \varepsilon, which increases the variance of the regression residuals and parameter estimates, and reduces the power of hypothesis tests.18 This is a cost of mismeasurement of this type, as hypothesis tests must overcome this dimunition of power to meet conventional levels of statistical significance. Conducting hypothesis tests in the usual way is actually somewhat cautious, as CME in  Y leads to fewer rejections of hypotheses, on average. Perhaps because of this increased caution, this bias against rejecting hypotheses, econometricians have largely concluded that CME in the dependent variable  Y_t poses no real problems for standard OLS regression and inference.

LoSE in the dependent variable has an opposite effect, decreasing the variance of the regression residuals and parameter estimates. Measurement error of this type actually increases the power of hypothesis tests. Is more power a good thing? I would argue not. If a hypothesis is rejected at a prescribed significance level, an immediate concern is that the rejection was due to LoSE-induced power. If the data were free of mismeasurement, the variance of the parameter estimates would be larger and the rejection may not have occured.

Note that the parameter variances here are correct for the parameters governing the relation between mismeasured  Y and  X, and are not biased down in the same sense as they would be if, for example, we ignored positive autocorrelation in the residuals of the regression. However in most applications the parameters of interest are those governing the relation between true  Y^\star and  X, and the econometrician uses the mismeasured data to make inferences about that true relation out of necessity, because the mismeasured data are all that is available. Under the assumptions of this subsection, the parameters estimated using the mismeasured data are unbiased and consistent for the parameters governing the true relation, but the parameter variances are smaller than they would be without the LoSE in  Y. Standard errors computed using data mismeasured in this way then give a misleading sense of precision about the relation between true  Y^\star and  X, and the power of hypothesis tests is misleadingly large.

An example clarifies some of these issues. Imagine a situation where the econometrician has access to true  Y^\star,  X, and a list of other variables  Z^y that are orthogonal to  X but related to  Y^\star. Hypotheses about the relation between  Y^\star and  X are of interest. Testing hypotheses using parameter estimates and standard errors from a regression of  Y^\star on  X is the natural and correct way to procede. But consider the following two step procedure: (1) regress  Y^\star on  X and some subset of  Z^y, and compute predicted values which we call  Y, and then (2) regress  Y on  X, testing hypotheses about the relation between  Y^\star and  X using standard errors from this second regression. The parameter estimates in the second regression are the same as in a regression of  Y^\star on  X, but the standard errors are smaller and tests have greater power because the first stage generates LoSE in  Y. In fact, by making  Z^y arbitrarily small, the standard errors can be made arbitrarily small and any hypothesis may be rejected; setting  Z^y to the null set drives the standard errors to zero. Artificially generating LoSE in  Y in this fashion is clearly ridiculous, and makes hypothesis tests meaningless.

In reality, if the data meet the conditions of this section, the econometrician starts out in stage (2) of this two step procedure, sadly without access to  Y^\star. Something analogous to step (1) has already taken place with the creation of the data  Y, by a government statistical agency or some other entity. The main issue is not materially different if it is a government statistical agency rather than the econometrician who has generated the LoSE in  Y.

If the variance of LoSE  \sigma^2_{\zeta} were known, the most prudent course of action would be to increase the estimated variance of  \widehat{\beta} by  \sigma^2_{\zeta}. This essentially brings us back the pure CME case, where econometricians are comfortable using standard procedures for inference.19Unfortunately,  \sigma^2_{\zeta} is typically unknowable; the results in sections 3.2 and 4.3.1 for measured GDP growth provide only an approximate lower bound on  \sigma^2_{\zeta}. The bottom line is that inferences are fundamentally less definitive when the dependent variable is contaminated with LoSE.

In a forecasting context, it should be noted that the LoSE also shrinks the variance of out-of-sample forecast errors. The actual variance of the out-of-sample forecast error for the true variable of interest,  Y_{t+k}^\star - X_{t+k}\widehat{\beta}, with  \widehat{\beta} estimated using mismeasured  Y_{t}, is  \sigma^2_{U^\star} + \left(\sigma^2_{U^\star}-\sigma^2_{\zeta}+\sigma^2_{\varepsilon}\right)X_{t+k} E\left( \left(X^\prime X \right)^{-1} \right) X_{t+k}^{\prime}. However the LoSE reduces this variance by  \sigma^2_{\zeta}, and if the increase in variance from CME  \sigma^2_{\varepsilon} does not offset this increase, the forecast errors computed using mismeasured  Y_{t+k} give a misleading sense of precision: the deviations of  Y_{t+k}^\star from the forecasts are larger than those mismeasured forecast errors suggest. For example, table 1 shows that for GDP growth forecasts, forecast errors computed using the "advance" GDP growth estimates will be smaller, on average, than the true forecast errors.

4.2.1 Empirical Examples

Regressions of mismeasured macroeconomic quantities like GDP growth on time trends, dummies, and other deterministic variables certainly meet the conditions outlined in this subsection. These types of regressions are run most frequently in a forecasting context, where the primary goal is to forecast  Y^\star, not to estimate the deep structural parameters of an economic model relating