Forthcoming [American Economic Journal: Macroeconomics] doi:10.1257/mac.20240298

Identifying the Impact of Inflation Expectations

William A. Branch

What this paper finds — and why it matters

Layer 1: Overview

Branch (2022) asks whether subjective consumer inflation expectations causally raise the inflation rate — a question whose empirical answer has been elusive despite its central role in New Keynesian theory and central bank communication. The identification problem is acute: expectations are endogenous by construction, and the standard approach of estimating a Phillips curve with aggregate data produces estimates biased sharply downward by endogeneity. OLS regressions of regional inflation on regional mean expectations, controlling for unemployment, lagged inflation, and region and time fixed effects, yield a slope of only 0.069 (Table 2 context; Figure 1b), far below the theoretical prior of near-unity pass-through.

The paper’s empirical strategy exploits a key fact: different demographic groups consume heterogeneous bundles of goods, so their inflation expectations differ systematically and reflect their own basket’s price movements. Using roughly 273,000 individual responses from the University of Michigan Survey of Consumers spanning 1978:1–2022:5, Branch classifies respondents into 160 demographic groups defined by sex, age (five categories), education (four levels), marital status, and parental status. The panel covers four U.S. Census regions, producing dimensions T = 528 months, N = 4 regions, and G = 160 groups. Regional inflation is measured from BLS CPI series for all urban consumers.

The identification strategy is a shift-share (Bartik) instrument: for each region-month, the predicted regional inflation expectation is the population-weighted average of each demographic group’s national-level average inflation expectation, where the weights are the group’s share of the region’s population. Two share measures are used: (i) the January 1978 Current Population Survey (CPS78) distribution, which is time-invariant and plausibly exogenous to subsequent inflation shocks; and (ii) contemporaneous Michigan survey shares. The leave-one-out variant is the preferred construction. The instrument is relevant — first-stage F-statistic of 52.4 (significant at 0.1%) — and the Durbin-Wu-Hausman test rejects OLS consistency at the 1% level (statistic = 8.074).

Main 2SLS estimates: using Michigan survey shares, a 1 percentage point increase in a region’s expected inflation raises regional inflation by 0.33 percentage points (significant at 5%; Table 2). Using CPS78 shares, the estimate rises to 0.55 percentage points (significant at 1%; Table 2). After applying the split-sample jackknife bias correction for finite-sample bias in the small-N/large-T panel, the estimates increase slightly to 0.36 and 0.60 respectively (Table 3). The paper characterizes the 60 basis point estimate as its “preferred” figure. Both are substantially above the OLS estimate of 0.069 and represent a lower bound: because time fixed effects absorb cross-regional spillovers, the aggregate pass-through is likely stronger, with the paper arguing that after accounting for spillovers the effect is plausibly in the range of 1.0–1.6, consistent with the Calvo- and Taylor-model predictions of Werning (2022), who shows pass-through should lie in [1/2, 1] or above.

Sectoral decomposition reveals that the expectation effect is concentrated in non-durable goods prices (coefficient 1.74, significant at 1%; Table 7) and commodities more broadly (1.29, significant at 1%; Table 7), with no statistically meaningful effect on durables (−0.10, insignificant) and only marginal positive effects on services (0.22, marginally significant). Among services, the effect is somewhat larger when housing services are excluded.

A key finding on expectations horizons: when both one-year-ahead and five-to-ten-year-ahead expectations are simultaneously instrumented using their respective Bartik shift-shares, only the short-run (one-year) expectation retains a significant positive effect on inflation. The long-horizon coefficient is small in absolute value, negative in sign, and statistically insignificant in both the joint and standalone specifications (Tables 10 and 12). After conditioning on aggregate macroeconomic factors captured by time fixed effects, long-run inflation expectations have no independent causal role in the regional inflation rate.

Identification heterogeneity: using the Rotemberg weight decomposition of Goldsmith-Pinkham, Sorkin, and Swift (2020), the identifying variation derives primarily from younger, married consumers with at least a high school degree — specifically those aged 18–34 (Michigan instrument) or 25–49 (CPS78 instrument). The group-specific treatment effects (βg) for these heavily weighted groups are positive and significantly above 1. Temporally, the heaviest identification weights fall on the Great Inflation and Volcker disinflation (1978–82), the Great Recession (2007–09), and the post-pandemic inflation episode (2021–22). The impulse response function shows a significant contemporaneous positive effect of expectations on inflation that mean-reverts cyclically within approximately 12 months, though confidence bands are wide at longer horizons.

Layer 2: Deep Dive

What is the core identification strategy and what makes it plausible?

The strategy is a differential-exposure quasi-experiment using a Bartik (shift-share) instrument. For each Census region and month, the instrument is the population-weighted average of each demographic group’s national-level mean inflation expectation, with weights equal to that group’s share of the region’s population. The key identifying assumption has two parts: (1) demographic groups have heterogeneous consumption baskets, so their inflation expectations reflect the prices in their own basket; and (2) the distribution of demographic groups across regions is exogenous to unobserved shocks driving regional inflation (as opposed to being exogenous to regional price levels, which is a weaker and separately justified claim). Plausibility is supported by the CPS78 shares having no predictive power for the other covariates of inflation over the sample, and by using a leave-one-out instrument construction to avoid mechanical correlation.

What are the main threats to identification and how does the paper address them?

The principal threat is that regional demographic composition could be endogenous to regional inflation rather than merely to regional price levels. The paper argues identification requires only exogeneity to the change in prices (inflation), not to the level. The empirical check is that CPS78 beginning-of-period shares show no statistically or economically significant correlation with the other regressors that predict regional inflation. A second threat is that groups may sort into regions based on economic conditions correlated with inflation. The paper argues the channel runs through demand from heterogeneous baskets rather than supply-side sorting. A third threat is weak instruments: this is addressed by first-stage F = 52.4. Fourth, survey measurement concerns (re-interview selection bias, outliers, endogenous prompting thresholds) are addressed through a battery of alternative specifications (first-time respondents only, outlier removal, CPS vs. survey shares, lagged shares, alternative CPI measures).

Why are OLS estimates biased downward and by how much?

OLS is biased because inflation expectations are endogenous — they move with the same shocks driving inflation, so OLS conflates the causal effect with reverse causation and omitted-variable bias. The OLS estimate from the panel regression with region and time fixed effects is approximately 0.069 (Figure 1b). The 2SLS estimates using the Bartik instrument range from 0.33 to 0.55, roughly five to eight times larger than OLS, confirming substantial downward bias. The Durbin-Wu-Hausman test confirms OLS inconsistency at the 1% level.

What heterogeneity across demographic groups is documented?

Women consistently report higher inflation expectations than men, particularly outside the high-inflation 1970s episode. Older respondents (50+) receive small Rotemberg identification weights, meaning their expectations contribute little to the identifying variation. Younger groups (18–34 under Michigan shares; 25–49 under CPS78 shares), married, with at least a high school education are the groups whose expectations drive the regional cross-sectional identification. The group-specific causal effects (βg) for these heavily weighted groups are uniformly positive and significantly above 1.0, ranging roughly from 1.38 to 1.91 in the top-10 groups. College-educated groups receive higher weight under the CPS78 instrument, while the Michigan shares instrument weights high school and college groups more evenly.

What is the sectoral decomposition of the inflation expectations effect?

Table 7 estimates separate 2SLS regressions for components of the CPI. Non-durable goods prices respond most strongly (coefficient 1.74, significant at 1%). Commodities broadly (which include non-durables and durables) also show a large effect (1.29, significant at 1%). Durable goods prices show no meaningful effect (−0.10, statistically insignificant). Services show only a marginal positive effect (0.22, marginally significant at 10%). Among services, the effect is somewhat stronger when housing services are removed. These results are consistent with prior findings that consumer grocery and non-durable prices most directly influence and reflect household inflation expectations.

What do the long-run expectations results show and what is the interpretation?

The Michigan survey’s PX5 question elicits 5-to-10-year ahead inflation expectations. Constructing a shift-share Bartik instrument for these long-horizon expectations and including both short- and long-run instruments simultaneously, the second-stage coefficient on long-horizon expectations is small (−0.023 to −0.037 in the joint specification, Table 10), negative, and statistically insignificant in all specifications. When long-horizon expectations alone are instrumented, the second-stage coefficient is 0.005 to 0.034 (Table 12), positive but still insignificant. The interpretation is that, after controlling for time fixed effects (which capture aggregate macroeconomic factors), long-run expectations have no independent causal role in regional inflation outcomes. Only short-run (one-year ahead) expectations matter. The first stage confirms the long-run instrument is relevant for long-run expectations but orthogonal to short-run expectations.

What robustness checks are reported and what do they find?

Table 8 reports four alternative specifications, all using Michigan survey shares: (1) ‘small’ — removing survey responses with absolute values above 25% — gives a coefficient of 0.66 (significant at 1%), larger than baseline, though the paper does not prefer this because large expectations may have real behavioral effects; (2) ‘first-only’ — using only first-time respondents and dropping the 40% re-interviewed — yields a coefficient of 0.58, still positive though the standard error rises and significance falls; (3) ‘state-CPI’ — replacing the BLS regional CPI with state-level CPIs aggregated as in Hazell et al. (2022) — gives 0.33 (significant at 5%), very close to the Michigan-shares baseline; (4) ’lag Michigan shares’ — instrumenting with 12-month lagged survey shares — gives 0.53 (significant at 5%), bracketed between the two baseline estimates. The jackknife bias correction (Table 3) slightly raises estimates to 0.36 and 0.60 for the two instruments.

What does the impulse response function show?

Using local projections (Jordà 2005) to estimate a 2SLS impulse response function, a shock to inflation expectations produces a significant positive contemporaneous effect on regional inflation. The response is cyclical and mean-reverting, returning to near zero within approximately 12 months. Confidence intervals are wide in subsequent quarters, so the analysis cannot rule out lingering effects, but the central estimates suggest the impact dissipates within about a year. The paper notes that the lack of strong persistence may reflect the specific U.S. inflation history and suggests extending the analysis to countries with more volatile or persistent inflation histories.

How does this paper relate to the New Keynesian Phillips Curve literature?

The standard approach to measuring expectations’ impact on inflation is to estimate a NKPC with an instrument for expectations under rational expectations. Mavroeidis, Plagborg-Moller, and Stock (2014) document that this approach faces severe identification and weak-instrument problems. Branch’s approach avoids these issues by not assuming rational expectations, not requiring an explicit model of expectations formation, and using a shift-share instrument whose validity rests on cross-sectional demographic heterogeneity rather than time-series moment conditions. The theoretical model in Section 3.1 permits non-rational expectations and nests ‘anticipated utility’ or ‘steady-state learning’ (Evans and Honkapohja 2001; Woodford 2013) as the simplifying assumption. The estimated regional coefficients are below but potentially consistent with Werning’s (2022) theoretical range of [1/2, 1] for Calvo and Taylor pricing models once spillovers are accounted for.

How does the paper relate to the literature on household-level inflation heterogeneity?

The paper builds on Hobijn and Lagakos (2005), who show households consume different bundles, and Kaplan and Schulhofer-Wohl (2017), who find two-thirds of cross-household inflation variation stems from paying different prices for the same goods. D’Acunto, Malmendier, Ospina, and Weber (2021) establish that grocery store prices directly influence household inflation expectations. Branch takes these findings as given — they motivate the identifying assumption that expectations reflect basket-specific prices — and focuses on the downstream question of whether those expectations causally raise actual inflation outcomes. Earlier work on heterogeneous expectations by Branch (2004, 2007) using Michigan survey data, finding time-varying heterogeneity across forecasting rules, is also directly referenced.

What does the Rotemberg weight decomposition reveal about the source of identifying variation?

The Bartik estimate is a weighted average of 160 just-identified group-specific estimates. Goldsmith-Pinkham, Sorkin, and Swift (2020) show the weights (αg) measure each group’s contribution to the overall estimate and sensitivity to bias from that group’s potential endogeneity. Tables 4–5 list the top-10 weighted groups: under CPS78 shares, these are predominantly 25–49-year-olds, mostly college-educated, seven of ten married with children. Under Michigan shares, the top groups are even younger (mostly 18–24), with at least a high school degree, almost all married without children. Table 6 shows men receive slightly higher aggregate weight than women (0.53–0.57 vs. 0.43–0.47), and those aged 50+ contribute less than 15% of total weight. Figure 11 shows temporal variation: the heaviest-weighted periods are the late-1970s Great Inflation and Volcker disinflation, the Great Recession (2007–09), and the post-pandemic episode (2021–22).

What are the policy implications and their scope conditions?

The paper provides empirical support for central bank attention to short-run consumer inflation expectations: a 1 percentage point increase in one-year-ahead regional expectations causally raises regional inflation by 0.33–0.55 basis points (lower bound, since spillovers are excluded). Accounting for cross-regional aggregate effects raises the likely total pass-through to above one, validating the central bank emphasis on anchoring short-run expectations. However, the null finding for long-run (5-to-10-year) expectations — controlling for aggregate time effects — suggests that ‘anchoring long-run expectations’ may not independently prevent near-term inflation above and beyond its correlation with short-run beliefs. The scope conditions are important: the estimates come from U.S. Census regions over 1978–2022, so applicability to countries with persistently high or hyper-inflation is uncertain. The identifying variation is concentrated in high-volatility inflation episodes, suggesting potential nonlinearities in the expectations-to-inflation mapping. The empirical strategy also does not capture general equilibrium feedback from realized inflation back to expectations.

What are the data limitations and survey design concerns the paper acknowledges?

Five limitations of the Michigan survey are acknowledged: (1) whether surveys elicit genuine expectations rather than attitudes; (2) the rotating panel structure, with roughly 40% of respondents re-interviewed after six months, creates potential selection bias if more accurate forecasters are likelier to re-participate; (3) declining telephone response rates threaten representativeness; (4) the survey prompts respondents reporting ‘unreasonable’ expectations, with the threshold endogenously tied to recent inflation history; (5) the question wording asks about ‘prices going up’ rather than ‘aggregate U.S. inflation’, making the measure closer to consumption-basket-specific expectations — which the paper treats as a feature rather than a flaw for its identifying assumption. The paper addresses concerns (1)–(4) through alternative specifications (first-time-only respondents, outlier removal, CPS vs. survey shares). The geographic dimension is limited to four Census regions because finer location identifiers are unavailable for a long panel.

Key Concepts

Shift-share (Bartik) instrument for expectations: In this paper, the instrument for regional inflation expectations is constructed by interacting each demographic group’s national-level mean inflation expectation (the ‘shift’) with that group’s population share in the region (the ‘share’). The resulting weighted average predicts how much regional expectations would be elevated purely by the region’s demographic composition reacting to aggregate group-level expectation shocks, isolating variation plausibly orthogonal to region-specific inflation supply shocks.

Differential exposure quasi-experiment: The identification design exploits the fact that U.S. Census regions have different demographic compositions, giving them differential exposure to aggregate shocks in group-specific inflation expectations. Regions with a higher share of a group whose expectations are rising will see a larger predicted increase in regional expectations than regions with a lower share of that group, independent of region-specific factors — this cross-regional contrast is the source of causal identification.

Rotemberg weights: Following Goldsmith-Pinkham, Sorkin, and Swift (2020), the Bartik 2SLS estimate is decomposed as a weighted sum of 160 just-identified group-specific estimates, where the weight αg for group g measures the sensitivity of the overall estimate to potential endogeneity in group g’s share. Groups with large αg drive identification and are the groups most important to probe for exogeneity. In this paper, the heaviest-weighted groups are younger, married consumers with at least a high school degree.

Anticipated utility / steady-state learning: The paper’s theoretical model allows for non-rational subjective expectations. Firms and households are modeled as ‘anticipated utility’ maximizers (Woodford 2013) who adjust expectations over time (’learning’) but assume for current decisions that expected inflation will remain at its present rate — termed ‘steady-state learning’ by Evans and Honkapohja (2001). This assumption implies future prices evolve along a linear trend from current expectations, yielding a tractable closed-form link between current expectations and the sector-specific price-setting equation.

Heterogeneous consumption baskets as identification: The paper’s core identifying assumption is that different demographic groups consume different bundles of goods across sectors, so their inflation expectations reflect the price changes in their own basket rather than a common aggregate signal. This basket heterogeneity is what makes group-level expectations differ systematically and allows the shift-share instrument to generate exogenous variation in regional inflation expectations.

Lower bound interpretation of regional estimates: The 2SLS estimates capture only the regional (within-country, across-region) effect of expectations on inflation, because time fixed effects absorb cross-regional spillovers — if expectations rise in one region, the increased demand for traded goods spills into other regions and raises their prices too. The paper argues the regional estimates are therefore a lower bound on the aggregate pass-through from expectations to overall U.S. inflation, consistent with the stronger aggregate correlation seen in Figure 1a.

Long-run expectations nullity: The paper’s extension finds that 5-to-10 year inflation expectations, instrumented with their own shift-share Bartik and included alongside the one-year instrument, have no statistically or economically significant causal effect on regional inflation once time fixed effects control for aggregate factors. This result implies that, conditional on short-run expectations and macroeconomic controls, long-horizon expectations carry no independent causal information for the current inflation rate.

How this summary was made. Bibliographic fields are pulled from Crossref and OpenAlex and are not model-generated. The summary was drafted from the open-access manuscript , checked by a claim-grounding and calibration review pass, and approved before publishing. Found an error or a misrepresentation? Flag it here — corrections are welcome, especially from the authors.