Studying Generational Risk in a Large-Scale Life-Cycle Model
What this paper finds — and why it matters
Layer 1: Overview
Hasanhodzic and Kotlikoff ask a question prior work assumed away: how large is generational risk, and can pay-go Social Security actually mitigate it? Earlier studies (Diamond, Bohn, Krueger-Kubler, etc.) presumed generational risk is large enough to merit policy and showed Social Security can in principle share it, but did not directly measure its size. This paper measures it directly, with and without Social Security, in a realistically large overlapping-generations (OLG) model.
Model setup: an 80-period annual OLG model with aggregate shocks. Agents work 45 periods (retire at R=45) and live 80, have isoelastic (CRRA) preferences with risk aversion gamma=2 (gamma=5 under the extra-large shocks calibration), annual discount factor beta=0.96 (quarterly 0.99). Production is Cobb-Douglas; log TFP is trend-stationary AR(1) (quarterly rho=0.95, sigma=0.01; annualized rho=0.814, sigma=0.019). Two calibrations add a normal capital-depreciation shock. Households invest in risky capital or one-period safe bonds (zero net supply); “soft” increasing borrowing costs (Chen-Mangasarian function, slope b) shut down private risk-sharing to expose generational risk in its purest form while still delivering a realistic risk and growth premium. Policy is pay-go Social Security with a fixed payroll tax tau=15% (also tested at 1%). The model is solved to high precision via a projection method (building on Marcet 1988; Judd, Maliar, Maliar 2011) over an 81-variable state space (79 cohort cash-on-hand values plus the TFP and depreciation shocks). Generational risk measures are evaluated 300 years into the transition; cohort utility uses generations born after year 300 of a 750-year run. The U.S. data targets cover the return to national wealth and one-month Treasuries, 1947-2015, and detrended NNP/consumption, 1929-2020.
Four calibrations: (1) baseline (TFP shock only, matched to output/consumption variability); (2) larger shocks (adds depreciation shock to match variability of the return to national wealth); (3) extra-large shocks (bigger depreciation shock to match U.S. equity-market return variability, a la Krueger-Kubler); (4) negative risk-free-rate baseline (steeper borrowing costs giving a roughly negative 2% safe rate, to test Blanchard 2019).
Main findings (compensating-consumption differentials needed to reach long-run average lifetime utility): generational risk is 1.396% under baseline, 2.128% under larger shocks, and 15.303% under extra-large shocks (without Social Security). The authors view baseline 1.396% as small (on the order of a good-sized distortion) and prefer the baseline calibration. Social Security slightly WORSENS baseline generational risk (rising to 1.462%), but reduces it by 8% in the larger-shocks and 19% in the extra-large-shocks calibrations. So Social Security’s risk-pooling value depends on calibration. Contemporaneous risk (absolute consumption adjustment for full risk sharing among living cohorts) is tiny: 0.206% baseline, 0.933% larger shocks, 0.437% extra-large; Social Security raises it to 0.310% in baseline but lowers it under the other two.
On welfare and Blanchard’s conjecture: pay-go Social Security at a 15% tax cuts long-run expected utility by 18% in baseline and larger-shocks, and by 56% in extra-large shocks, via crowding out (long-run capital falls 28% baseline, 56% extra-large). Under the negative-safe-rate calibration there is still an 18% long-run welfare loss; the average growth rate is zero in all simulations. The authors find no support for Blanchard’s (2019) claim that deficits can be Pareto-improving when safe rates run below growth: even under Blanchard-favorable conditions, crowding out swamps risk sharing (e.g., 17.83% utility loss at 15% tax, 1.17% at 1% tax). Macro shocks are second-order for policy: the capital transition under Social Security with shocks closely tracks the no-shock (deterministic) path, echoing Lucas (1987).
Layer 2: Deep Dive
What exactly is the paper’s primary measure of generational risk?
It is the average absolute percentage adjustment to a cohort’s annual consumption needed to equate that cohort’s realized lifetime utility to the long-run cross-cohort average realized lifetime utility. Formally, for each generation born in period t they compute lambda_t = U-bar / U_t (U_t is realized lifetime utility, U-bar the average over generations born in years 301-750), then take the mean absolute deviation of lambda from 1. It captures both being born in a bad state and being hit by a bad sequence of lifetime shocks. A value near zero means birth date barely matters.
Why does annualizing to 80 periods matter relative to two-period models?
With one year per period, an agent experiences 45 annual wage shocks and 79 annual investment-return shocks that largely average out, and can self-insure by adjusting saving annually. In a two-period model a single negative TFP shock hits a worker’s entire lifetime earnings or a retiree’s whole old-age return. The authors note, however, that because TFP shocks are positively autocorrelated, amplifying multi-period shocks could in principle generate more risk, not less, so the result is not mechanical.
How is private risk-sharing handled, and why shut it down?
In three of four calibrations the authors impose ‘soft’ increasing borrowing costs (Chen-Mangasarian function, parameter b) calibrated so the marginal borrowing cost is 15-20 times the safe rate (b=28 baseline, 25 larger shocks, 45 for negative-safe-rate cases). This nearly closes the bond market, isolating generational risk with no private or public mitigation. The extra-large calibration omits borrowing costs because its large depreciation shock alone delivers a realistic risk premium (and to match Krueger-Kubler). Notably, adding borrowing constraints has little impact on key macro aggregates.
Why does Social Security INCREASE generational risk in the baseline (single-TFP-shock) case?
Five reasons given: (1) benefits depend on the prevailing wage, so autocorrelated TFP wage shocks now interact with capital-return shocks through retirement, extending nonlinear discounting past retirement; (2) crowding out lowers wages and raises risky returns, so the same percentage TFP shock is larger in absolute terms, making realized resources more variable; (3) Social Security is a random floor on old-age living standards, encouraging less risk-averse consumption and a higher propensity to consume; (4) positive TFP autocorrelation (high benefits today predict high benefits tomorrow) further raises the propensity to consume; (5) Social Security alters the stochastic distribution of the 79 cohort cash-on-hand state variables, producing complex consumption changes. This echoes Rios-Rull’s (1994) paradox that better micro insurance can amplify macro fluctuations.
How does the paper test Blanchard’s (2019) ‘deficits may be free’ conjecture and what does it find?
It uses Blanchard’s own ex-ante Pareto criterion but with 80 periods (vs his 2), realistic risk aversion, and dropping his assumption that half of wages are perfectly safe. Calibrations engineered with negative safe rates and large growth premiums (e.g. risky ~2%, safe ~negative 2%) still show Social Security reducing long-run expected utility: 17.83% loss at a 15% tax (1.17% at 1%) in the standard-premium case, falling to 12.51%/12.582% (15% tax) under even-larger growth premiums, but always negative. Crowding out dominates any risk-sharing gains. The authors find no support for the conjecture. They note Blanchard’s Pareto gains, when they arise, depend critically on his assumption that half of wages are certain, leaving workers ideally placed to insure the elderly.
What heterogeneity across cohorts is documented?
Baseline generational risk has mean 1.396%, s.d. 1.293%, max 4.949% (no Social Security). Decomposed: generations with worst luck need roughly +5.0% positive adjustment; those with best luck need roughly negative 5.1%. Extra-large shocks produce extreme spread: max positive adjustment 66.14%, max negative 44.10%. A separate exercise (Table 8) shows the cost of uncertainty depends on birth state due to mean reversion: those born with low capital actually prefer uncertainty (negative 1.482%) because capital and wages will rise, while those born with high capital would pay 2.374% to lock in their state.
What are the welfare-cost-of-uncertainty and precautionary-saving findings?
Under larger shocks, the compensating variation between the stochastic steady state and a no-shocks steady state is only 1.12% (newborns would need 1.12% more consumption each year to match a never-shocked long run), despite that calibration overstating macro variability. This is small because precautionary saving raises the stochastic economy’s average capital stock 18.4% above the no-shocks steady state: the uncertain long run is ‘riskier, but richer.’ A decomposition removing the 0.77% average age-specific consumption difference leaves a 0.34% residual (about one quarter of 1.12%) reflecting age-pattern and cohort-sequence heterogeneity.
How does this paper build on and differ from Krueger-Kubler (2006)?
Five differences: (1) many more periods (80 vs 9) permit better shock-averaging and more precise autocorrelation treatment plus more self-insurance opportunities; (2) two calibrations the authors view as more realistic than KK (who chose theirs partly to favor a Pareto improvement), using borrowing costs rather than excessively large depreciation shocks to get a realistic risk premium; (3) ex-ante rather than ex-interim expected utility; (4) explicit measurement of generational risk with and without Social Security; (5) testing whether a large growth premium can sustain an intergenerational Ponzi scheme at scale. Like KK, they find a negative net long-run welfare impact of pay-go Social Security.
What does the model deliberately omit, and why?
It is ‘intentionally bare bones to maximize the potential for generational risk’: no variable labor supply (which would help cohorts self-insure), no progressive income taxation (which redistributes from winning to losing generations), and no social insurance other than Social Security. It also omits capital-adjustment costs (which would raise asset-return volatility) because incomplete markets make firm investment policy ill-defined when differently-aged shareholders disagree; the depreciation shock is a crude proxy for adjustment-cost-driven asset-return shocks. The authors flag correlated idiosyncratic shocks (Harenberg-Ludwig) as important future work.
How well does each calibration match the data?
Baseline matches output (model 3.72% vs data 3.33%) and consumption (2.10% vs 1.75%) variability but understates the s.d. of the return to national wealth by an order of magnitude (0.14% vs 4.89%). Larger shocks reproduces the return-to-wealth s.d. (4.61-4.62% vs 4.89%) and a realistic wage/return correlation (negative 0.054) but overstates macro-aggregate variability. Extra-large shocks matches equity Sharpe ratio (model 0.333 vs target 0.286; risk premium 4.63%, return s.d. 13.92%) but overstates return-to-capital variability nearly three-fold and consumption variability sixteen-fold. The model’s overall risk premium ranges 3.55-6.03% vs 5.43% in data.
What is the role of the bond market across calibrations?
The one-period bond market only operates in the extra-large shocks calibration (borrowing costs close it in the others). There, the young short bonds and the old lend: because the young’s resources are mostly human capital (less risky than, and negatively correlated with, stock returns), the young use bonds to insure the old. Workers effectively borrow to hold equity, which the authors rationalize via student loans, credit cards, mortgages alongside 401(k) equity, or implicit long-term firm contracts.
What policy implications follow, and what are their scope conditions?
If macro shocks are calibrated to realistic macro-aggregate volatility (the authors’ preferred baseline), generational risk is small (about 1.4%) and pay-go Social Security slightly worsens it while imposing an 18% long-run welfare loss via crowding out; deterministic models (e.g. Auerbach-Kotlikoff 1987) then suffice to capture the long-run impact of intergenerational redistribution. Social Security’s risk-mitigation value emerges only under calibrations that overstate macro volatility (larger/extra-large shocks). The scope condition is decisive: the case for Social Security as generational insurance hinges on which calibration one finds realistic, and the authors’ preferred reading implies a weak case. They also caution the conclusions may not extend to models with correlated idiosyncratic risk.