Forthcoming [Review of Economic Dynamics] doi:10.1016/j.red.2026.101345

A Tractable Income Process for Business Cycle Analysis

Fatih Guvenen

Alisdair McKay

Conor Ryan

Canonical DOI Free to read · GREEN Open access ↗

What this paper finds — and why it matters

Layer 1: Overview

Guvenen, McKay, and Ryan estimate a stochastic income process for US male workers that simultaneously matches five empirical regularities from Social Security Administration administrative panel data covering 1978–2011: (i) flat and acyclical variance of income growth rates, (ii) volatile and procyclical Kelley skewness, (iii) very high kurtosis — targeted at 20 for one-year changes and 12 for five-year changes — (iv) a near-linear rise in cross-sectional log-income variance from age 25 to 55, and (v) a systematic factor structure in business cycle incidence whereby income losses during recessions are predictably related to a worker’s pre-recession income rank. All five facts are drawn from Guvenen et al. (2014) and Guvenen et al. (2021), which document them from SSA records on individual income histories.\n\nThe income process adds three key departures to the workhorse persistent-plus-transitory Gaussian specification. First, transitory “nonemployment” shocks — arriving annually with approximately 45% probability and drawn from an exponential distribution — create fat tails through their arrival (large income losses) and departure (large income gains), and leave a persistent “scarring” residue through a passthrough parameter ψ estimated at 9.4% in the baseline nonemployment model. Each year, roughly 8.6% of workers experience income declines of 50% or more from the nonemployment shock alone, and 1.8% fall to effectively zero income. The scarring mechanism makes the left tail of the income growth density fatter than the right tail, consistent with the data (left-tail log-density slope 1.4, right-tail slope –2.2). Second, innovations to the persistent AR(1) component are drawn from a time-varying three-component normal mixture — with the dominant central component realized with about 83% probability and near-zero standard deviation (~1%), flanked by left-tail and right-tail components with probabilities of ~10.9% and ~6.2% and standard deviations of ~16.4% and ~19.2% — whose means shift with contemporaneous aggregate wage income growth (xt = β·Δwt). This mean-shifting mechanism generates procyclical skewness under an acyclical variance, because it redistributes probability mass between the tails without altering mixture probabilities or component variances. Third, a piecewise-linear factor structure makes each individual’s income sensitivity to aggregate fluctuations depend on the persistent component of income (γi + zi,t), with a kink separating two slope regimes. In the Great Recession, workers at the 10th percentile of pre-recession income lost approximately 18 percentage points more than workers at the 90th percentile; both the bottom and top deciles were more exposed than the middle of the distribution, producing a V-shaped incidence pattern.\n\nEstimation uses simulated method of moments (SMM) with 360,000 simulated individuals per year, a 1947 burn-in start, and optimization via the TikTak global algorithm. Six models of increasing complexity are estimated, each requiring only one individual state variable (the persistent component z) — matching the parsimony of the standard model. The workhorse Gaussian model (Model 1) understates the variance of one-year log income changes by 60–80%; introducing nonemployment shocks (Model 2) largely resolves this, matching one-year variance exactly and narrowing the five-year shortfall to 30%. Adding the time-varying normal mixture (Model 3) generates procyclical skewness and acyclical variance. Adding the factor structure (Model 4) captures differential recession exposure. Models 5 and 6 introduce Heterogeneous Income Profiles (HIP, σκ = 0.015) and estimate AR(1) persistence freely, obtaining ρ ≈ 0.80, which better captures the right tail of the income growth distribution.\n\nThe paper recommends Model 5 as a general-purpose benchmark (without the factor structure), Model 4 when differential business cycle incidence is central, and Model 3 when maximum parsimony is needed. The richer income dynamics documented here have direct implications for quantifying the welfare cost of business cycles, the value of social insurance, the design of automatic stabilizers, the distribution of marginal propensities to consume, and asset pricing under heterogeneous agents.

Layer 2: Deep Dive

What is the estimation procedure and what data does it use?

The paper uses simulated method of moments (SMM), targeting approximately 120+ moments derived from Social Security Administration administrative panel data on individual income histories of US male workers over 1978–2011 (from Guvenen et al. 2014 and 2021). The simulation panel contains 360,000 individuals per year, initialized in 1947 with a burn-in period. Optimization uses the TikTak global algorithm (Arnoud et al., 2019). Moments targeted include the 10th, 50th, and 90th percentiles of one-, three-, and five-year income growth averaged across 1979–2011 (nine moments); kurtosis at one-year and five-year horizons (two moments); cross-sectional variance of log income at ages 25, 35, 45, and 55 (four moments); left- and right-tail mass and log-density slopes from the 1995–1996 income growth distribution (four moments); the full time series of Kelley skewness for one-, three-, and five-year changes (93 moments); and piecewise-linear slopes of the factor structure for seven business cycle episodes — four recessions and three expansions covering 1979–2010 (14 moments). Moments are weighted approximately equally, with skewness moments down-weighted collectively.

What are the three key departures from the workhorse Gaussian model and what feature does each address?

First, transitory ’nonemployment’ shocks drawn from an exponential distribution, arriving with ~45% annual probability, along with a scarring parameter ψ that loads a fraction of the transitory shock onto the persistent state — this generates the high kurtosis, thick tails, and asymmetry (steeper right than left tail) of the income growth distribution. Second, a three-component time-varying normal mixture for persistent innovations — the component means shift with the aggregate wage component xt = β·Δwt — producing procyclical skewness and acyclical variance simultaneously. Third, a piecewise-linear factor structure f(γi + zi,t) mediating each individual’s exposure to aggregate fluctuations, capturing the V-shaped relationship between pre-recession income rank and recession income loss.

What is the scarring mechanism and how large is it empirically?

Transitory nonemployment shocks ζi,t are assigned with probability (1 − pζ) each year and drawn from an exponential distribution with parameter λ, where ℓi,t ∈ [0,1] represents the income fraction lost. A fraction ψ of this transitory shock flows permanently into the persistent state zi,t via ˜ηi,t = ηi,t + ψζi,t. In Model 2, the annual probability of receiving a nonemployment shock is 45% (pζ ≈ 0.55), λ = 3.357 (mean income loss fraction ≈ 0.30), and ψ = 9.4%. Each year, 8.6% of workers experience income declines of 50% or more from the nonemployment shock alone, and 1.8% effectively lose all income (full-year nonemployment). The scarring makes the right tail steeper than the left tail in the income growth distribution, as re-employed workers do not return to their pre-shock income level.

How does the time-varying normal mixture generate procyclical skewness without changing variance?

The three normal mixture components for the persistent innovation η are: a central component (probability ~83%, standard deviation ~1%), a left-tail component (~10.9%, ~16.4% sd), and a right-tail component (~6.2%, ~19.2% sd). Their means shift via the latent variable xt = β·Δwt: the central and left-tail means move with xt while the right-tail mean does not. A normalization ensures xt has zero mean-income effect. In recessions (xt < 0, Δwt < 0), the left-tail component’s mean shifts down and the right-tail component’s mean shifts up relative to the central, generating more left-skewed draws without changing the probabilities or variances of the components — hence acyclical variance and procyclical skewness. Alternative designs (cyclical mixture probabilities or variances) did not generate both patterns simultaneously.

What is the factor structure and how non-monotonic is it?

In deep recessions the factor structure is broadly monotone decreasing over the bulk of the distribution (lower-income workers lose more), with the 10th percentile losing about 18 percentage points more than the 90th percentile in the Great Recession (2007–2010). However, the pattern reverses for the top 10% of the income distribution: high earners also face large losses in financial-market-driven recessions, producing a V-shape. The piecewise-linear model f(q) with a kink at q-bar and slopes α1 (below) and α2 (above) captures this. The model fits the Great Recession V-shape and the mild 1990–1992 and 2000–2002 recessions (where the pattern is flatter, consistent with smaller drops in wt), but struggles to fit the large top-income losses in 2000–2002 without an additional stock-market-correlated factor.

What is the levels-vs-differences puzzle and how is it resolved?

The canonical persistent-plus-transitory Gaussian model (Model 1) faces a fundamental tension: it can fit the cross-sectional variance of log income levels at each age, but it then understates the variance of one-year and five-year log income changes by 60–80% (squared standard deviations from Figures 8a and 9a). This tension was documented by Heathcote, Perri, and Violante (2010). Introducing the nonemployment shocks in Model 2 largely resolves it: the one-year variance of log income changes is matched exactly, and the five-year understatement narrows to about 30%. The nonemployment shock contributes high-frequency variance in income changes without requiring a comparably large increase in the variance of the persistent state, because it is mostly transitory.

What role does HIP play and what tensions does it create?

Heterogeneous Income Profiles (HIP, σκ = 0.015 from Baker 1997 and Guvenen et al. 2021) allow AR(1) persistence ρ to be estimated freely rather than restricted to 1. The estimated ρ falls to 0.80 in Models 5 and 6. HIP provides a convex component to the lifecycle variance profile (from dispersion in individual growth-rate slopes κi) that offsets the concave contribution of mean-reverting persistent shocks, maintaining a near-linear age-variance profile at ρ < 1. Lower persistence better fits the right tail of annual income growth and the standard deviation of five-year changes. However, in Model 6 HIP worsens the fit to the factor structure, because mean reversion at ρ < 1 already generates faster income growth for low-income workers in expansions, reducing the work the factor structure needs to do in booms while resisting the factor structure’s ability to generate large losses for low-income workers in recessions.

What robustness checks and alternative specifications are estimated?

The paper estimates two supplementary models reported in Appendix B. Model 2’ removes the scarring component (ψ ≡ 0) from Model 2, finding a worse fit particularly in the histogram, kurtosis, and lifecycle inequality moments. Model 3’ replaces the time-varying mixture with a static normal mixture (β ≡ 0), still improving over Model 2 (objective falls from 2.44 to 2.26) via better tail fit and average skewness, but without capturing the procyclical skewness time series. Model 4’ removes time variation from the innovation distribution (β ≡ 0) while retaining the factor structure, showing that the factor structure fit survives without time variation in skewness. Additionally, the paper discusses a special parsimony case: under ρ = 1, homothetic preferences, and no factor structure, z can be normalized away entirely, leaving no individual state variable.

How does this paper relate to and differ from prior work on non-Gaussian income processes?

Kaplan, Moll, and Violante (2018) capture leptokurtic income growth but include no business cycle variation and no factor structure. McKay (2017), McKay and Reis (2021), and Catherine (2021) allow for procyclical skewness in income risk but do not target high kurtosis or a factor structure. Bhandari, Evans, Golosov, and Sargent (2021) allow for a factor structure but do not match higher-moment properties of income risk. Other work documenting the relevant facts includes Guvenen, Ozkan, and Song (2014) for countercyclical skewness in US SSA data; Guvenen, Karahan, Ozkan, and Song (2021) for lifecycle earnings dynamics from the same source; Harmenberg (2021) and Kramarz, Nimier-David, and Delemotte (2021) for related European evidence; and Guvenen, Schulhofer-Wohl, Song, and Yogo (2017) for factor structure evidence labeled ‘worker betas.’ This paper is the first to jointly target and fit all four properties within a single tractable process that adds only one state variable.

What are the policy and structural implications highlighted by the paper?

Leptokurtic income risk (high kurtosis, fat tails) has quantitatively important effects on the value of social insurance and optimal redistribution (Saez, 2001; Golosov, Troshkin, and Tsyvinski, 2016) and interacts with borrowing constraints to shape the distribution of wealth and marginal propensities to consume (Kaplan, Moll, and Violante, 2018). Cyclical variation in income risk — the procyclical skewness feature — matters for the welfare cost of business cycles (Storesletten, Telmer, and Yaron, 2001; Krebs, 2003, 2007) and for the optimal design and welfare value of automatic stabilizers (McKay and Reis, 2021; Bhandari et al., 2021). The factor structure is relevant for cyclical variation in income inequality and for asset pricing under household heterogeneity (Mankiw, 1986; Constantinides and Duffie, 1996; Constantinides and Ghosh, 2016). The scope condition throughout is male US workers in the SSA administrative data; no direct results are provided for female workers, self-employed individuals, or other countries, though the modeling framework is general.

What practical guidance does the paper provide for incorporating the process into dynamic models?

The paper provides explicit Bellman equation structure: cash on hand m and the persistent income state z are the two endogenous individual state variables (z being the single income-process state variable), with individual parameters γ and κ treated as fixed effects. Income at each node requires evaluating a closed-form expression from Equation 1. Expectations over next-period z and ζ are handled via quadrature, with the time-varying mixture of normals requiring quadrature nodes that shift with the aggregate state S and S′ — following McKay and Reis (2021). Under the special case ρ = 1, homothetic preferences, and no factor structure, all variables can be normalized by exp(z + γ), eliminating z as a state variable and reducing the problem to one with no idiosyncratic income state. The authors note that a perpetual-youth demographic structure avoids tracking age as a state variable.

Key Concepts

Procyclical skewness: In the paper’s sense: the Kelley skewness of the cross-sectional distribution of one-year and five-year income growth rates falls significantly during every NBER recession (distribution shifts left — more large negative shocks, fewer large positive ones) and rises during expansions, while the standard deviation of that distribution shows no discernible cyclical pattern. This is a feature of the income shock distribution itself, not of average income levels.

Nonemployment shock with scarring: A transitory income loss event modeled as an exponential random variable ℓi,t ∈ [0,1] (representing the fraction of income lost) arriving with probability ~45% per year. A fraction ψ of this transitory shock is loaded permanently onto the persistent income state — the ‘scarring’ effect — so that re-employed workers do not fully return to their pre-shock income trajectory. In the paper’s model this single mechanism generates high kurtosis, thick double-Pareto tails, and asymmetric tail slopes.

Time-varying normal mixture for persistent innovations: A three-component mixture of normals for the AR(1) innovation η in which the component means (not probabilities or variances) shift proportionally to contemporaneous aggregate wage income growth via a loading parameter β. A mean-preserving normalization ensures no effect on average income. This mean-shifting mechanism moves probability mass between the central and tail components of the innovation distribution, generating procyclical skewness while keeping income growth variance acyclical.

Factor structure in business cycle incidence: A systematic, pre-determined relationship between a worker’s position in the persistent income distribution and the magnitude of income change experienced during a given recession or expansion. Modeled as a piecewise-linear function f(γi + zi,t) that multiplies the aggregate income component wt, with slopes that differ below and above an estimated kink point. Empirically, the factor structure produces a V-shaped incidence pattern: income losses in deep recessions are largest at both the bottom and top of the pre-recession income distribution, and smallest in the middle.

Income scarring parameter (ψ): The fraction of a transitory nonemployment shock ζi,t that is permanently loaded onto the persistent income state zi,t via the equation ˜ηi,t = ηi,t + ψζi,t. Estimated at 9.4% in Model 2 and 15.1% in Model 3. Controls the degree to which transitory shocks generate long-lasting income effects and determines the relative steepness of the left versus right tails of the annual income growth distribution.

Heterogeneous Income Profiles (HIP): Individual-specific linear deterministic growth-rate slopes κi distributed with standard deviation σκ = 0.015 (calibrated from Baker 1997 and Guvenen et al. 2021), representing permanent heterogeneity in the steepness of individual income trajectories over the lifecycle. Introducing HIP allows the AR(1) persistence parameter ρ to be estimated below 1 (≈0.80 in Models 5–6) while preserving the near-linear age-variance profile, because the convex variance contribution of heterogeneous slopes offsets the concavity induced by mean-reverting persistent shocks.

Kelley skewness: In the paper’s use: a robust, percentile-based measure of skewness defined as [(P90 − P50) − (P50 − P10)] / (P90 − P10), which the paper prefers for income growth distributions because it is less sensitive to extreme outliers than moment-based skewness. Used as the primary target for capturing business cycle variation in the shape of the income growth distribution.

How this summary was made. Bibliographic fields are pulled from Crossref and OpenAlex and are not model-generated. The summary was drafted from the open-access manuscript , checked by a claim-grounding and calibration review pass, and approved before publishing. Found an error or a misrepresentation? Flag it here — corrections are welcome, especially from the authors.